6,784 Matching Annotations
  1. Mar 2022
    1. One of the most challenging aspects of the Pandemic for dual-income parents is the school and daycare closures. (Note: Whereas the first support focused on gender roles, the second paragraph focuses on the particular challenges for parents during the Covid-19 epidemic.) These dual-earner parents should find a way to split children’s needs during the shelter-in-place. If they do not balance paid work and child care, both sides will feel the consequences. To emphasize these consequences, Lewis humorously says “Dual-income couples might suddenly be living like their grandparents, one homemaker, and one breadwinner.” (Note: Drawing on evidence from the text, this passage shows how gender roles relate to the challenges of Covid-19 for working parents and families.) Instead of splitting the housework, women take the role of “homemaker” so the author implies here that this regresses gender dynamics two generations backward. It obviously demonstrates that nothing much has changed over time and the mentality remains. While many couples are trying to find a middle way, others think that women have to suck it up and sacrifice their jobs. In reference to school closures, Lewis brings up the Ebola health crisis which occurred in West Africa in the time period of 2014-2016. (Note: The following paragraph cites a historical precedent for the Covid-19 outbreak as a basis for comparison.) According to Lewis, during this outbreak, many African girls lost their chance at education; moreover, many women died during childbirth because of a lack of medical care. Mentioning these elaborations proves once again that not only coronavirus but also many other outbreaks have caused a disaster for feminism. Pandemics, in other words, pile yet another problem on women who always face an uphill battle against patriarchal structures. (Note: This passage ties this observation about the Ebola outbreak in West Africa to a greater observation about Pandemics and gender roles overall.) I started reading her article with a feeling of frustration. While the main topic of the article is feminism, Lewis gives a couple of male examples from the past, such as William Shakespeare and Isaac Newton. (Note: The author makes a personal note here, marking an emotional connection and reaction to the text.) She seems at times to attribute their success to their masculinity. They both lived in times of plague, demonstrating that despite all our progress, the human species is still grappling with the same issues. According to Lewis, neither Newton nor Shakespeare had to worry about childcare or housework. Even though her comparison seemed odd to me, she managed to surprise me that in over 300 years many gender inequities remain the same. This is actually very tragic. It is hard to acknowledge that women are still facing gender inequality in almost every area even 300 years after the time of these great English thinkers. (Note: The author cites historical precedent again: this passage argues that the relationship between plagues and gender roles has not changed much in centuries.) Assuming housework is the natural place of women without asking women if they want to do it is asking for too big a sacrifice. Since couples have the option to split the housework and childcare, why should only women have to shoulder most of the burden? This is a question that I might never be able to answer, even if I search my whole life. It is unacceptable that there is pressure on women to conform to gender roles, such as cultural settings and expectations. (Note: The author uses a rhetorical question to segue into a new supporting argument.) Women should not have to sacrifice their leisure time completing unpaid work. I agree with Lewis when she mentions the “second shift” situation. When we consider women’s first shift as their paid work, the second shift represents the time that they spend working in the home. In this case, there is apparently no shift for leisure time. Lewis also supports this by saying “Across the world, women—including those with jobs—do more housework and have less leisure time than their male partners.” Additionally, it seems like economic recovery is going to be long-lasting because of the Coronavirus. As a solution, if men and women have equal housework responsibilities, women may spend more of their time completing paid work. (Note: The author makes a call to action near the end of the essay.) In this way, they can contribute to the economy while they are socializing. Especially after the Pandemic is over, we will need a greater workforce, so hopefully both men and women can equally participate in the economy. (Note: Much like the first sentence of the essay, the last sentence speaks to a greater, big-picture context: the need for equality in a post-pandemic world.)

      Many schools and daycares are sadly closed at the moment because of COVID19 pandemic.

    1. I think that the students’ voice is not always heard entirely, even through dialogue. I feel that by doing this journal we can make a difference with our personal experience and touch the heart of someone who is willing to stand by us. I also wanted to get the attention of other students who may be feel-ing the same frustration I have felt

      Rashida, as an SLA student, talked about the issue she met before, and explained why this process can help. Her letter is a great evidence to show that our action is effective.

    1. Author Response:

      Reviewer #1:

      The authors of this study carried out two carefully designed field and a glasshouse experiment simulating effects of rapid warming on soil carbon loss. They did this by transplanting alpine turfs from their cold environment to lowland warm environment. They found that when lowland plants were inserted into alpine turfs under these lowland climatic conditions (referred to as warming treatment combined with warm-adapted plant introduction) they rapidly increased soil microbial decomposition of carbon stocks due to root exudates feeding the microbes.

      The question is how well this experimental setup mimics what would happen if lowland plants would be inserted into alpine turfs in situ (which have already experienced considerable warming over the past decades), perhaps with an additional warming treatment there.

      The Reviewer alludes to two pertinent points here. The Reviewer’s first point considers whether lowland plants would function similarly (and, by extension, have the same effect on the soil system) if moved from the warmer lowland site to the cooler alpine site. This is a fascinating question in its own right, in that it raises questions about how migrations of non-adapted genotypes far beyond range edges (e.g. via human activity) impact recipient ecosystems. However, although we agree that alpine ecosystems have warmed considerably in recent decades, we cannot be confident that the high elevation sites in our study are already within the climate niche of the lowland focal species. As such, to address our research questions in situ at the high sites would have required additional warming treatments, which come with their own set of disadvantages (see our second point to this comment, below). We also refer the Reviewer to specific questions about adaptation below (see R6), although we see that we were not careful enough about the rationale for our design in the previous version of the manuscript. We have therefore added a clarifying sentence to the Main Text as follows:

      L101: “In short, the experiments used here examined how the arrival of warm-adapted lowland plants influences alpine ecosystems in a warmed climate matching lowland site conditions (i.e. turf transplantation to low elevation plus lowland plant addition) relative to warming-only (i.e. turf transplantation to low elevation) or control (i.e. turf transplantation within high elevation) scenarios.”

      Second, the Reviewer implicitly raises a point about whether our chosen approach of simulating warming plus lowland plant arrival (i.e. transplantation plus addition of lowland plants) is the most appropriate, specifically by suggesting an alternative option of adding lowland plants to (possibly experimentally-warmed) alpine turfs at the high elevation origin site. Here, it was essential to create a climate scenario in which lowland plants would survive and operate within their climatic niche (i.e. relative to their home conditions) once planted into alpine turfs, rather than perform sub-optimally (e.g. be in a potentially inferior competitive position) or be unable to persist at all. The most parsimonious and reliable way to ensure this was to transplant alpine turfs to a site with a lowland temperature regime, with transplantations also being shown to outperform other methods when novel species interactions are involved (Yang et al. 2018). Most importantly, it was crucial to select a method that warmed the entire plant-soil system rather than only the air (e.g. open-top chambers, IR lamps; Marion et al. 1997; Aronson et al. 2009) or soil (e.g. heating cables; Hanson et al. 2017), and did so realistically throughout the year regardless of the weather (e.g. open-top chambers only work on sunny days in the summer; Marion et al. 1997) or a power supply (e.g. IR lamps, heating cables). Transplantation remains the only way to achieve this (Hannah 2022; Shaver et al. 2000). We now clarify our logic in the manuscript as follows:

      L91: “Elevation-based transplant experiments are powerful tools for assessing climate warming effects on ecosystems because they expose plots to a real-world future temperature regime with natural diurnal and seasonal cycles while also warming both aboveground and belowground subsystems. This is especially true if they include rigorous disturbance controls (here, see Methods) and are performed in multiple locations where the common change from high to low elevation is temperature (here, warming of 2.8 ºC in the central Alps and 5.3 ºC in the western Alps). While factors other than temperature can co-vary with elevation, such factors either do not vary consistently with elevation among experiments (e.g. precipitation, wind), are not expected to strongly influence plant performance (e.g. UV radiation) or in any case form part of a realistic climate warming scenario (e.g. growing-season length, snow cover).”

      A further question is if alpine plants inserted in turfs at alpine climatic conditions would have a similar effect as lowland plants inserted in turfs at lowland climatic conditions.

      We interpret “turfs” to mean “lowland turfs” here, since we did insert lowland plants into alpine turfs under lowland climatic conditions (i.e. the WL treatment). We found that adding alpine plants to alpine turfs in alpine climatic conditions (i.e. planting disturbance control, see Methods) had no effect on alpine soil carbon content. By extension, we would expect that adding lowland plants to lowland turfs in lowland climatic conditions would have no effect on lowland soil carbon content. While not explicitly tested, including this treatment would not change our finding that adding lowland plants to alpine turfs causes a reduction in soil carbon content relative to adding alpine plants to alpine turfs. Given this, we have left the text as is, but are happy to revisit this issue based on further discussion with the Reviewer/Editor.

      I suggest that the authors consider these questions when they draw conclusions about the results from their experiments. It would also be interesting to discuss the relevance of sudden strong warming effects relative to slower warming, potentially allowing ecosystems to adjust via changes in genetic composition of species (i.e. evolution) or species composition of communities (i.e. community assembly).

      Thank you for this excellent suggestion. We absolutely agree that anything short of a decadal experiment is unable to detect the role of longer-term evolutionary or community processes on soil carbon dynamics. While this doesn’t eliminate the need for experiments that consider shorter timescales, it is important to explicitly state this limitation. As suggested, we have added a sentence discussing this possibility in the concluding paragraph:

      L387: “While our findings demonstrate that lowland plants affect the rate of soil carbon release in the short term, short-term experiments, such as ours, cannot resolve whether lowland plants will also affect the total amount of soil carbon lost in the long term. This includes whether processes such as genetic adaptation (in both alpine and lowland plants) or community change will moderate soil carbon responses to gradual or sustained warming.”

      We also agree that it is extremely challenging to undertake warming experiments that do not initially “shock” the system through a sudden change in temperature. Having said this, alpine ecosystems are adapted to rapid within- and between-season temperature changes, making such shocks less relevant here.

      Reviewer #2:

      The authors were trying to test whether the migration of lowland plants into alpine ecosystems affects the warming impact on soil carbon. To achieve this goal, the authors first did two field experiments (moving intact turf from high-elevation to low-elevation to simulate warming) in the Alps, and then did a greenhouse pot study to explore the potential mechanisms for the results observed in the field experiments.

      The main strenghs of this work are the combination of a field experiment (conducted at two sites) and a greenhouse pot experiment (to explore the detailed mechanisms). Moreover, a number of techniques were used to measure plant traits, soil DOM and microbial properties (e.g. CUE, growth) which help to find the potential mechanisms.

      We thank the Reviewer for this positive comment.

      The main weaknesses of this work are below:

      1) The two field experiments are very short-term (<1 year), but the results were that warming and/or warming+lowland plants led to very high amount of soil C loss (up to ~40%, Fig. 1). I was shocked to see these results as many field warming studies have shown undetectable change in SOC even after years or decades. The authors did not provide a good explanation for this rapid and large change in SOC.

      We apologise for the confusion. We’re unsure where “up to ~40%” comes from here, so we have taken the Reviewer’s later suggestion of changing the annotation on Fig. 1 to contrast C versus WL treatments (Western Alps = 25.6 ± 7.2 mg g-1; Central Alps = 25.3 ± 8.6 mg g-1) rather than W versus WL treatments.

      With regards to the magnitude of soil carbon loss observed, we express soil carbon content in mg g-1 (i.e. mass-based per-mil), not cg g-1 (i.e. mass-based percent). This is so that we could use percent changes in the text to highlight the numeric magnitude of differences between treatments without confusing them with mass-based percent soil carbon – although we appreciate that this also caused confusion. To clarify, converting the above C versus WL treatment contrasts from mg g-1 to mass-based percent yields 2.56% ± 0.72% for the Western Alps experiment and 2.53% ± 0.86% for the Central Alps experiment. While it is striking that the WL treatments lost ~2.5% (~25 mg g-1) soil carbon in one year, such a loss is not extraordinary. To avoid future confusion, we have clarified the units in the Fig. 1 caption as follows:

      L77: “Mean ± SE soil carbon content (mg C g-1 dry mass; i.e. mass-based per-mil) in alpine turfs transplanted to low elevation (warming, W; light grey), transplanted plus planted with lowland plants (warming plus lowland plant arrival, WL; dark grey) or replanted at high elevation (control, C; white). Data are displayed for two experiments in the western (left) and central (right) Alps, with letters indicating treatment differences (LMEs; N = 58).”

      2) The greenhouse experiment was used to explore the potential reasons for the amplified loss of soil C in the field experiment. However, a key result was based on incubation of disturbed soils (8 g) and a two-pool modeling of the respiration data from the short-term incubation. This may not provide a good estimate of the true turnover rate of SOC under different plant species (even in the greenhouse condition). If rhizosphere priming was the proposed mechanism (as hinted by the authors), a better approach (such as 13C labeling) is needed to measure microbial respiration from intact soils (with plant/root presence).

      We agree with the Reviewer that using an approach such as 13C-labelling would have provided more direct evidence that lowland plants cause a rhizosphere priming effect. However, although some of our evidence comes from disturbed soils (i.e. microbial respiration), some (i.e. soil pore water) also comes from intact pots prior to harvest and we now also include another line of evidence from plant root biomass. In short, we draw on multiple lines of evidence suggesting that root exudates were involved, and note that Reviewer #3 thought our approach and interpretation on this aspect of the study was robust.

      Having said this, we acknowledge that we were too confident in our interpretation here, so we have added caveats to the text as follows:

      L207: “While not directly measured here, a nine-day decay period corresponds to the time expected for newly photosynthesised CO2 to be released through root exudation and respired by soil microbes, suggesting that this carbon pool was mostly root exudates.”

      L215: “While further directed studies are required to resolve whether root exudates are truly involved, our findings collectively suggest that lowland plants have the capacity to increase total root exudation into alpine soil relative to resident alpine plants.”

      3) Some details of the sampling or measurement are very crucial and affect the results/interpretations. For example, in the field experiment, the soil core was only 1-cm diameter. Considering the spatial heterogeneity of soil carbon in field plots, this small volume may not well represent the true soil condition. Moreover, in the field plots, did soil bulk density change after planting of lowland plants or warming? This will affect the measured SOC concentration (mg/g) even the SOC stock (g/m2) did not change.

      We agree with the Reviewer that taking a single soil core of 1 cm diameter in each plot would not have been robust. We did not do this. While we used 1 cm diameter cores to minimise disturbance, we took three cores per plot to account for within-plot heterogeneity and combined them into a composite sample. This is stated in the Methods as follows:

      L523: “In each plot, we created a composite sample from three cores (ø = 1 cm, approx. d = 7 cm) no closer than 7 cm from a planted individual and from the same quarter of the plot used for ecosystem respiration measurements (see below; Supplementary Fig. S1).”

      We also agree that bulk density measurements were an important omission in the initial submission. We note that this point was fleshed out by Reviewer #3, below, so we refer the Reviewer to our response to that comment for further details.

      Reviewer #3:

      The authors investigated the effect of warming and herbaceous plant migration on soil carbon (C) content using an ecosystem monolith transplant experiment along an elevation gradient in the Swiss Alp mountains. They observed, approximately 1 year after the transplant, that warming alone had little effect on soil carbon content (monoliths transplanted to a lower elevation with higher temperature remained unchanged in C content) but that the presence of lowland (warm-adapted) herbaceous plants in combination with warming had a negative effect on soil C content. The authors then conducted a glasshouse experiment and used a series of field and laboratory measurements to explore potential mechanisms explaining the observed changes in soil C content in the field. They concluded that soil C losses under lowland plant migration were likely mediated via increased microbial activity and CO2 release from soil C decomposition.

      The research questions are extremely relevant to our understanding of the feedback between soil C dynamics and climate warming and remain an unexplored part of this debate. Moreover, both field and laboratory experimental designs are robust, with all the relevant and necessary validation checks needed for transplant experiments; the laboratory techniques employed to measure the range of microbial and plant variables potentially explaining soil C dynamics are adequate and modern; and the statistical analyses are appropriate. These elements make the present data set very relevant and valuable. The manuscript is also very well and clearly written.

      We thank the Reviewer, and are delighted that they think the study is extremely relevant, novel, experimentally robust, cutting-edge and valuable.

      However, I have two major concerns, casting doubt respectively on the main field results and on the proposed explanatory mechanisms.

      First, at no point is bulk density mentioned and it does not appear to have been measured. This is critical because changes in soil C concentration (which was measured and reported here, in mg C g-1 soil) does not necessarily indicate an actual change in the quantity of C present in the soil (C stock, in unit mass C per unit soil volume, or per unit surface area to a constant depth) if this is accompanied by a change in bulk density: if less C per unit mass of soil (lower C concentration) is concurrent with more mass of soil in a constant volume (higher bulk density), this could mean that no change in C stocks actually occurs (or that even an increase occurs). In the present study, it is possible that the presence of lowland plants increased bulk density as compared to only alpine plants, compensating the lower C concentration and resulting in no change in C stocks. This is perhaps not likely, but it is too critical an issue not to be quantified (or at the very least discussed).

      This is an excellent point, and one also raised by Reviewer #2. To clarify, we initially decided against measuring bulk density because it is destructive and the experiments were still being used for other studies. Having said this, we agree with the Reviewer that more consideration of soil bulk density was needed, so we have rectified this in three ways. First, although the western Alps experiment has now been taken-down, to address this comment we took new soil cores to measure bulk density in the central Alps experiment in 2021 to indirectly confirm that no changes occurred in the presence versus absence of lowland plants. They did not, and we now include these data in the Methods as follows:

      L539: “It was not possible to take widespread measurements of soil bulk density due to the destructive sampling required while other studies were underway (e.g. ref 28). Instead, we took additional soil cores (ø = 5 cm, d = 5 cm) from the central Alps experiment in 2021 once other studies were complete to indirectly explore whether lowland plant effects on soil carbon content in warmed alpine plots could have occurred due to changes in soil bulk density. We found that although transplantation to the warmer site increased alpine soil bulk density (LR = 7.18, P = 0.028, Tukey: P < 0.05), lowland plants had no effect (Tukey: P = 0.999). It is not possible to make direct inferences about the soil carbon stock using measurements made on different soil cores four years apart. Nevertheless, these results make it unlikely that lowland plant effects on soil carbon content in warmed alpine plots occurred simply due to a change in soil bulk density.”

      Second, in the Main Text we now caution readers against translating soil carbon content changes to soil carbon stock in absence of coupled measurements of soil bulk density as follows:

      L113: “We caution against equating changes to soil carbon content with changes to soil carbon stock in the absence of coupled measurements of soil bulk density (Methods). Nevertheless, these findings show that once warm-adapted lowland plants establish in warming alpine communities, they facilitate warming effects on soil carbon loss on a per gram basis.”

      Finally, we have altered the language throughout the manuscript (including the title) to make it clearer that we focussed on soil carbon content/concentration – not stock.

      Second, even assuming that no changes in bulk density occurred and that indeed soil C stocks decreased under warming combined with lowland plant migration, the interpretation of the results are, in my view, at least incomplete. Certainly, the results do not support the claim that soil C losses were mediated via increased microbial decomposition of soil C with the certainty suggested by the authors. Generally speaking, I see three issues with the interpretation:

      • Very schematically, increased microbial respiration and soil C losses from decomposition is only one of two equally likely pathways potentially explaining soil C losses (the other being decreased C inputs to the soil from the plant community). The possibility that decreased soil C content was simply mediated by decreased inputs of C to the soil is hardly explored at all in the study (there is a quick mention of it (L155), but differences in plant biomass are interpreted only for their correlations with microbial activity (L160-166), not as a component of the C balance. Plant traits are measured and analysed but not in a way that can be used to test the hypothesis of changing C inputs. The presence of "more productive traits" (L141) for the lowland plants does not directly relate to differences in the quantity of C inputs to the soil, nor is it interpreted in relation to inputs. Even the interpretation of changes in ecosystem respiration seem to omit the possibility of changes in plant respiration (L208): "depressed microbial respiration per unit of soil was also evident at the ecosystem scale in that warming accelerated total ecosystem respiration but its effect was dampened in plots containing lowland plants". This statement was made despite no significant differences in microbial respiration per unit soil in the field data, and disregards the possibility that the dampened effect in plots with lowland plants could be due to lower plant respiration.

      This is an excellent point. We have performed new analyses of the plant trait/biomass data from the field experiment, included additional measurements/analyses of NEE and GPP from the field experiments (originally omitted due to space, which was a mistake!) and have rewritten all relevant sections in the manuscript to change the focus to a shifting balance between soil carbon inputs and outputs. Importantly, our original interpretation remains robust – i.e. that lowland plants most likely operate by accelerating soil carbon outputs, not decelerating soil carbon inputs – but we are careful to present our conclusions with an appropriate level of caution.

      • For the glasshouse experiment, I agree that the results indicate that (L115); "lowland plants accelerated microbial activity by increasing the quantity of root exudates", but not that (L112): "these findings together imply that lowland plants accelerate alpine soil C loss" because stimulating microbial activity is not per se an indicator of soil C loss. It is now well-known that the activity of microbes is not only a motor for soil C losses, but also a key mechanism leading to transformation of C inputs from plants that leads to the subsequent stabilisation of C in the soil. This is actually clearly stated further down in the manuscript when interpreting the field microbial data (L190. Furthermore, there is no direct evidence that the pots with lowland plants were losing more C than those without. Therefore, results from the glasshouse experiment could be interpreted differently: a larger fast cycling pool of soil C constituted of recently photosynthetically fixed exudates associated with higher microbial activity could well be interpreted as an early indicator of more C stabilisation, particularly since the absorbance index seems to indicate more microbially derived product in the DOC. It would have been great to measure microbial biomass C over time (as well as CUE, and mass specific growth and respiration), to see if higher respiratory activity was associated with higher biomass. The lack of differences in microbial biomass between the plant community treatments at the end of the 6 weeks does not show that the quantity of microbial biomass produced over the whole incubation period remained constant. In a word, more respiration of a larger fast cycling pool is not an indicator of future soil C loss (in the presence of plants).

      We thank the Reviewer for raising this important point. On reflection, we agree that the previous version of the manuscript did not give sufficient consideration to the possibility for increased microbial activity (and, indeed, respiration) in the glasshouse experiment to signal soil carbon accumulation via increased microbial growth. Having said this, all pots began with the same soil and microbial biomass remained unchanged between alpine and lowland plant treatments at the end of the six-week experiment. By extension, no net microbial growth occurred during this timeframe, making it unlikely that the accelerated respiration observed under lowland plants was indicative of soil carbon accumulation. Sadly, while we can deduce that intrinsic rates of respiration were higher, we can only speculate that growth remained unchanged (no new measurements can be done since growth measurements require fresh soil). We have rewritten the respective section in the manuscript in light of this and the Reviewer’s other comments, which includes the following caveat:

      L181: “These findings support the hypothesis that lowland plants have the capacity to increase soil carbon outputs relative to alpine plants by stimulating soil microbial respiration and associated CO2 release. While accelerated microbial respiration can alternatively be a signal of soil carbon accumulation via greater microbial growth, such a mechanism is unlikely to have been responsible here because it would have led to an increase in microbial biomass carbon under lowland plants, which we did not observe.”

      • The interpretation of the microbial variables measured in the field line up better with current conceptualisations of the role of microbes in C cycling (but overall interpretation still lacks consideration for plant C inputs). However, interpreting those data measured once 1 year after the transplant to explain the changes that happened gradually over this whole year is a risky and difficult exercise. How do we know that CUE, Rmass, Gmass etc… measured then represent what they were a day, a week, a month before? There is an attempt to deal with this timing issue by comparison with the glasshouse experiment, but only Cmic and Rmass can really be compared and it only very partially fills in the gap in time. Besides, the interpretation of this comparison can be questioned: in the glasshouse, Rmass was higher for the lowland plant pots (as compared to alpine plant at constant temperature) but actually remained constant between the comparable treatments W and WL in the field (Fig 2m). The results from the field, therefore, do not "support observations from the glasshouse experiment" in this context (L197) and neither do they "confirm (…) that this persists for at least one season" (L199). Finally, the thinking around the pulsed nature of C losses seems misplaced because there are no evidence that soil C losses had stopped after a year in the field (no measurements of soil C content are presented after that year).

      With regards to plant carbon inputs, we refer the Reviewer to their previous comment for corresponding revisions. With regards to specific comparisons between the glasshouse and field experiments, we have now deleted the sentences in question and have interpreted our results as follows:

      L329: “Thus, despite lower rates of ecosystem respiration overall, alpine soil microbes still respired intrinsically faster in warmed plots containing lowland plants. Moreover, accelerated microbial respiration, but not growth, implies that alpine soils had a higher capacity to lose carbon under warming, but not to gain carbon via accumulation into microbial biomass, when lowland plants were present. These findings align with observations from the glasshouse experiment that lowland plants generally accelerated intrinsic rates of microbial respiration (Fig. 3), although in field conditions this effect occurred in tandem with warming.”

      With regards to soil carbon loss being pulsed, while there is support for such a mechanism, we agree that this is one of several hypotheses and with only two timepoints we were too confident about it in the original submission. We have now reshaped this section of the manuscript entirely to be more cautious about the temporal dynamics involved. For instance, the section title now reads “Lowland plant-induced soil carbon loss is temporally dynamic”. Some other notable changes are:

      L286: “Importantly, lowland plants had no significant bearing over net ecosystem exchange (Fig. 5a), implying that although lowland plants were associated with soil carbon loss from warmed alpine plots (Fig. 1), this must have occurred prior to carbon dioxide measurements being taken and was no longer actively occurring.”

      L293: “By contrast, ecosystem respiration in warmed alpine plots was depressed in the presence versus absence of lowland plants (Fig. 5c). These findings generally support the hypothesis that lowland plants affect the alpine soil system by changing carbon outputs. However, they contrast with expectations that lowland plants perpetually increase carbon outputs from the ecosystem and thus raise questions about how soil carbon was lost from warmed plots containing lowland plants (Fig. 1).”

      L320: “Carbon cycle processes are constrained by multiple feedbacks within the soil system, such as substrate availability and microbial acclimation, that over time can slow, or even arrest, soil carbon loss. We thus interrogated the state of the soil system in the field experiments in the western Alps experiment to explore whether such a feedback may be operating here, in particular to limit ecosystem respiration once soil carbon content had decreased in warmed alpine plots containing lowland plants.”

      L354: “Taken together, one interpretation of our findings is that the establishment of lowland plants in warming alpine ecosystems accelerates intrinsic rates of microbial respiration (Fig. 3, Fig. 6a), leading to soil carbon release at baseline levels of microbial biomass (Fig. 1, Fig. 3c), a coupled decline in microbial biomass (Fig. 6c) and a cessation of further carbon loss from the ecosystem (Fig. 5a, Fig. 6d).”

      L358: “Although such a mechanism has been reported in other ecosystems, applying it here is speculative without additional timepoints because field soil measurements came from a single sampling event after soil carbon had already been lost from the ecosystem. For instance, an alternative mechanism could be that soil microbes acclimate to the presence of lowland plants and this decelerated microbial processes over time.”

      L368: “Beyond the mechanism for lowland plant effects on alpine soil carbon loss, it is conceivable that soil carbon loss is not isolated to a single season, but will reoccur in the future even without further warming or lowland plant arrival. This is especially true in the western Alps experiment where warming yielded a net output of carbon dioxide from the ecosystem (Fig. 5a). Moreover, in our field experiments we simulated a single event of lowland plant establishment and at relatively low abundance in the community (mean ± SE relative cover: 4.7% ± 0.7%), raising the possibility that increases in lowland plant cover or repeated establishment events in the future could facilitate further decreases in alpine soil carbon content under warming.”

      Reviewer #4:

      This manuscript took alpine grasslands as a model system and investigated whether lowland herbaceous plants contributed to the short-term dynamics of soil carbon under the context of climate warming. The authors find that warming individually does not render significant changes in alpine soil carbon, but corporately causes ~52% of carbon loss with lowland herbaceous plants in two short periods of field experiments. They further show that alpine soil carbon loss is likely mediated by lowland herbaceous plants through root exudation, soil microbial respiration, and CO2 release. This work adds in an interesting way to the ongoing debate on whether a positive climate feedback will be mediated by plant uphill range expansion in alpine grasslands, where climate warming may lead to a rapid loss of soil carbon.

      The claims of this manuscript are well supported, but some aspects of background information in the studied alpine systems and field experiment design need to be clarified.

      1) There is an extremely high level of carbon stored in the alpine soils (Figure 1). Climate warming will certainly lead to a great loss of soil carbon in the study systems that could contribute to the positive climate feedback. However, it is unclear for me how the effects of climate warming on soil carbon are relevant to the ongoing climate change in the studied alpine grasslands. It is therefore reasonable to provide more background information about ongoing climate change, and whether the simulated climate warming (i.e., 2.8 oC in central alps and 5.3 oC in western alps, Line 328-329) is realized as real-world climate change in the local systems. In addition, it seems that the manuscript aims to address a question that is of global concern, but my concern is about how the findings could be generalized to other regions.

      We thank the Reviewer for pointing this out. With regards to the amount of soil carbon stored in the alpine soils, we refer the Reviewer to comments from Reviewer #2. With regards to the magnitude of warming expected in mountain regions, we agree with the Reviewer that the original submission lacked context. We have therefore added specific values as suggested:

      L59: “They are experiencing both rapid temperature change (0.4 to 0.6 ºC per decade) and rapid species immigration…”

      With regards to how findings could be generalised to other regions or ecosystems, this is an important point that requires further research – and which we raise in the concluding paragraph. However, we see that we could have been more explicit about validating our findings in other mountain regions, so we have amended the sentence in question as follows:

      L400: “Future work should focus on testing the conditions under which this feedback could occur in different mountain regions, as well as other ecosystems, experiencing influxes of range expanding plant species, on quantifying how deeply it occurs in shallow alpine soils, and on estimating the magnitude of the climate feedback given both ongoing warming and variation in rates of species range shifts.”

      2) I understand that the manuscript considers elevation as a natural gradient of climate change, which makes it possible to compare soil carbon dynamics in lowlands with alpine grasslands under climate warming. I also understand that the authors have done everything they can to control for the disturbances caused by transplanting that has been well justified by the supplementary data (e.g., Figure S6). However, it is unclear how the authors controlled for the influences of other factors given there are huge differences between lowlands and alpine grasslands, such as differences in wind, solar radiation, humidity, and the length of growing season.

      This is an excellent point. We note that Reviewer #1 also raised this point, so we refer the Reviewer to our response to that comment for further details.

      3) It is generally known that different species respond to climate warming differently. Some species may be sensitive to climate warming and have traits aiding to dispersion that could expand their living ranges to some degree, while others may adjust themselves to adapt to climate warming and may not migrate to alpine systems. It is therefore cautious to assume that all the lowland species have the same dispersal ability. In other words, it is unclear how lowland plant species are selected for the field transplanting experiment (Line 284-290). Do all the lowland plant species selected have the potential to migrate to alpine systems?

      This is an excellent question. In short, the specific dispersal abilities of lowland species used are currently unknown and will certainly vary. However, all are widespread and we assume have the capacity to migrate to higher elevations, given that horizontal distances between high and low elevation sites were in both cases less than 2 km. We now clarify this in the manuscript as follows:

      L433: “While exact dispersal distances for selected lowland species are unknown, all species are widespread and are expected to migrate uphill under warming and the horizontal distance between high and low sites in the field experiments was always less than 2 km.”

      4) The authors acknowledge that "we did not perform a reverse transplantation (that is, from low to high elevation), so we cannot entirely rule out the possibility that transplantation of any community to any new environment could yield a loss of soil carbon" (Line 318-320). When I read the title "lowland plant migrations into alpine grasslands …", I thought lowland plant species that were transplanted from low to high elevation. In fact, it is just the opposite to my thoughts. Without performing a reverse transplantation experiment, I am not sure the conclusion will stand that "lowland plant migrations into alpine grasslands amplify soil carbon loss under climate warming". In addition, it is unclear whether lowland plant effects stand alone or depend on climate warming based on the results in Figure 1 that lowland plant treatment is missing, and it is impossible to test the interactions between lowland plant and climate warming.

      We apologise for the confusion. This comment echoes other comments from Reviewer #1 asking us to be more explicit about the treatments used when interpreting findings, to caveat the step in logic from transplantation to warming and to acknowledge throughout the manuscript that lowland plant effects were dependent on transplantation in the field experiment. We therefore refer the Reviewer to our responses to those comments for details on how we resolved this. We have also modified the title and abstract to more accurately represent the experimental design, as follows:

      Title: “Lowland plant arrival in alpine ecosystems facilitates soil carbon loss under experimental climate warming”

      L30: “Here we used two whole-community transplant experiments and a follow-up glasshouse experiment to determine whether the establishment of herbaceous lowland plants in alpine ecosystems influences soil carbon content under warming. We found that warming (transplantation to low elevation) led to a negligible decrease in alpine soil carbon content, but its effects became significant and 52% ± 31% (mean ± 95% CIs) larger after lowland plants were introduced at low density into the ecosystem.”

      With regards to testing the interaction between warming and lowland plants, while we acknowledge that not performing a fully-factorial design limited our ability to explicitly separate lowland plant versus warming effects on alpine soil, both are occurring simultaneously due to climate warming and we thus focussed effort on simulating such a scenario with greater experimental replication and at multiple locations. We note that Reviewers #1, #2 and #3 thought that this approach was robust. Importantly, the statistical analyses performed are valid for such an experimental design, and we have clarified and nuanced our interpretation throughout to avoid reaching beyond it.

    1. Author Response:

      Reviewer #1 (Public Review):

      This paper uses a combination of confocal and electron microscopy to localize gap junctions in the outer retina. Electrical coupling between photoreceptors is an important aspect of retinal function, and past work provides (often indirect) evidence for rod-rod, rod-cone and cone-cone coupling. The work described here indicates that rod-cone coupling dominates. The combination of techniques is quite convincing and very elegant. My concerns are primarily about the appeal of the work to non-retina readers. Some of these concerns could be mitigated by a more accessible presentation of some of the results. Suggestions along these lines, and a few other minor issues, follow.

      Introduction:

      The introduction is a bit retina-centric. I think more needs to be done to explain how each type of coupling (rod-rod, rod-cone, cone-cone) could impact retinal processing, and why it is important to resolve which are present or dominant. One issue that could get emphasized is the difference between gap junctions between like cell types (presumably involved in lateral spread of signals, averaging, etc) and between unlike cells (potentially providing an alternate path for signal flow - as in the secondary rod pathway).

      We have included new text in the introduction to address this issue. We have tried to provide background material of a general nature and we have included some introductory text about different types of gap junctions, as requested. We thank reviewer 1 for this helpful suggestion.

      Cone-cone coupling:

      It would be helpful to put the conclusions about rod-cone and cone-cone coupling together. The paragraph starting on line 585 is a bit confusing that way. It starts by summarizing evidence that blue cones are not coupled with red/green cones. But then (in mouse) all the cones are coupled to rods, so that specific exclusion of blue cones seems unlikely to hold. You come back to this a bit later in the discussion, and there indicate that there appears to be weak cone-cone coupling. Merging the text in those two locations might help. It might also help to make the (seemingly clear) prediction that blue and green cone signals in mouse will get mixed.

      Thank you for pointing out that this section is not clear. It seems two different points are muddled: 1) Blue cones do not make gap junctions with other cones, perhaps to minimize spectral mixing: the evidence from primate and ground squirrel suggests that blue cones are not coupled to red/green cones or green cones. 2) In contrast, we find no evidence of color selectivity in rod/cone coupling: green cones and blue cones are both coupled to all nearby rods. Thus, rod signals can be injected into the downstream pathways of both blue and green cones.

      We have rewritten the text and separated these points into separate paragraphs for clarity, as below.

      Revised Text:

      Blue cone pedicles are also coupled to rods.

      In the cone networks of primate and ground squirrel retina, there is good evidence that blue cones are not coupled to neighboring red/green (primate) or green cones (ground squirrel) (Hornstein et al., 2004; Li and DeVries, 2004; O’Brien et al., 2012). In the primate retina, the telodendria of blue cones are few in number and too short to reach the neighboring red/green cones (O’Brien et al., 2012). Thus, blue cones appear to be electrically separated from other cones in these two species, perhaps to maintain spectral discrimination (Hsu et al., 2000). In the mouse retina, although the blue cones were identified by Behrens et al., (2016), we were unable to find any cone to cone gap junctions, regardless of color (see below).

      In contrast to the selective connections between cones in some species, rods were coupled to both blue and green cones indiscriminately in the mouse retina (present work) and in primate retina (O’Brien et al., 2012). Blue cones, identified in confocal work by the presence of S-cone opsin, and in SBF-SEM by their connections with blue cone bipolar cells (Behrens et al., 2016; Nadal-Nicolás et al., 2020), and green cones both made telodendrial contacts at Cx36 clusters with all nearby rod spherules (Fig. 4). Thus, we find no evidence for color specificity in rod/cone coupling. In fact, a single rod spherule may be coupled to both blue and green cones (Fig. 5, supplement 5). Therefore, rod signals can pass via the secondary rod pathway into both blue and green cones and their downstream pathways. Considering blue cone circuits specifically, rod input to blue cone bipolar cells and downstream circuits is predicted via the secondary rod pathway, in addition to the previously reported primary rod pathway inputs from AII amacrine cells to blue cone bipolar cells (Field et al., 2009; Whitaker et al., 2021).

      Relation to other circuits:

      Are there implications of the present results for gap junctional coupling in other circuits that could be emphasized? Things like the open probability how strongly it can be modulated seem like points of general interest - but I don't have enough expertise to know if those are established facts on other systems. Some of that is touched on in the Discussion, but quite briefly.

      In an effort to keep the discussion short, we have perhaps been too abrupt. We have added text to the discussion to include some general issues concerning gap junctions.

      Location of Cx36:

      Can you speculate on why Cx36 is generally located at the mouth of the synaptic opening in the rod spherule? This was a very clear result, but it was unclear (at least to me) if it was important.

      This is an interesting topic and we have expanded the discussion to consider potential functions and mechanisms.

      Added to discussion:

      The position of rod/cone gap junctions, at the base of the rod spherule, close to the opening of the post-synaptic cavity, appears to be systematic in that the vast majority of rod/cone gap junctions occur at this site. We may speculate that gap junctions are localized with some of the same scaffolding proteins that occur at the rod synaptic terminal, but the functional significance of this repeated motif is unknown. In mutant mouse lines, where Cx36 has been deleted from either rods or cones, cone telodendria are still present and they still reach out to contact nearby rod spherules in the absence of rod/cone gap junctions. Therefore, the specificity of synaptic connections is not determined or maintained by the presence of Cx36 gap junctions.

      Reviewer #2 (Public Review):

      Previous studies demonstrate that modulation of gap junctional coupling in the outer plexiform layer of the mouse retina regulates the balance between sensitivity and resolution. The authors use optical and electron microscopy to structurally characterize this coupling. They find that gap junctional coupling in mouse OPL is produced by a dense meshwork of cone photoreceptor telodendrions that selectively innervate the rim surrounding the synaptic openings of rod photoreceptor spherules. The density of this coupling network is such that each cone is coupled to dozens of rods and each rod is coupled to multiple cones. Rod/rod and cone/cone gap junctions were not detected.

      The combination of antibody labeling, reconstruction of the photoreceptor terminal network, and ultrastructural analysis provides a remarkably clear view of the gap junctional connectivity that constitutes the first stage of visual processing. A few results are only weakly supported due to sample size or technical limitations. However, the overall conclusions are well supported and the data is presented with unusual transparency. The map of the network organization of photoreceptor coupling generated here is an important contribution to visual science.

      Optical imaging:

      The quality of the confocal imaging is high and the images of the Cx36 distribution relative to rod spherules is convincing. There does seem to be a significant amount of processing in the images and a lack of background signal in antibody images. Whether this processing is due to the airy scan software or additional filtering and thresholding, it can be difficult to judge the distribution of signal in several images.

      In general, there was no filtering or processing of any confocal images, except for adjusting brightness and contrast. However, we may have been over-zealous in reducing the background. Therefore, we have adjusted Figures 1 and 2 to include more background as requested, to enable the reader to better judge the specificity of the immunolabeling. In addition, we have prepared supplementary figures to show the individual channels with background, as well as the combined images, to be absolutely clear and transparent. Finally, for each confocal image, the confocal series from which it was derived has been archived and is publicly accessible.

      Former Figure 1D, now Fig. 2D is an exception because it shows a 3D projection of the colocalization between a single EGFP labeled cone pedicle and Cx36. We have revised this figure, providing new 2D optical sections to show how the image was prepared, in addition to revising the final 3D projection, labeling it as a 3D projection with colocalized Cx36.

      Electron microscopy:

      The authors perform annotations on two previously acquired volume EM datasets. The first serial blockface EM dataset is relatively low resolution and lacks ultrastructural labeling but is used effectively to reconstruct the terminal morphology and points of contacts between photoreceptors. The second EM data set uses FIB SEM to obtain smaller voxel sizes from tissue stained in such a way that the darkened membranes of putative gap junctions are distinct from surrounding membrane. Most measures of gap junction number come from the ultrastructure free dataset. In isolation, counting of gap junctions in this type of image volume could be unreliable. However, comparing the putative gap junctions in this dataset to the morphology and distribution of Cx36 antibody clusters in the confocal imaging and the darkened plaques in the FIB SEM images greatly increases confidence that the network description of rod/cone gap junctional coupling is accurate.

      Quantification:

      Most quantification is presented with an unusually high degree of transparency, with scatterplots showing all data points, data source files showing the animals that data came from, and standard deviations being supplied in descriptive statistics. There are a few places where Ns are difficult to determine or the analysis is not quite clear. For several results, claims are made when the sample size is too small to be sufficiently confident. The reconstruction of 5 blue cones suggests that, overall, blue cones are not radically different from other cones in their terminal morphology or gap junctional coupling to rod spherules. Claims that the blue cones are identical to other cones in most measures or that their telodendrions are smaller, but not statistically smaller are not well supported by the sampling. Similarly, the fact that the 6 nearby cones closely analyzed for cone/cone gap junctions yield no junctions, strongly suggests that vast majority of gap junctions are cone/rod gap junctions. However, the sample is too small to argue that there could not be infrequent, atypical, or region-specific cone/cone gap junctions.

      We have addressed the issues of blue cones and cone/cone coupling to soften our conclusions and explicitly point out the small numbers.

      Estimate of open channels:

      The authors estimate that 89% of gap junction channels are open during times of maximum rod/cone coupling and point out that this number is surprisingly high relative to previous estimates. However, this estimate appears to be subject to many significant potential errors. The estimate combines previous freeze fracture studies of the density of gap junctions from various species and various parts of the retina the measurements of the length and width of the gap junctions in the current study. Differences in tissue processing, density variation within and between systems, reconstruction error, and variation and error in the inputs to the model could all contribute to an underestimate of the total number of channels linking mouse rods and cones. Moreover, without an accounting of these issues, the real error bars on the range of possible open channels would seem to include both surprising and less surprising estimates of open gap junction fractions.

      This is a major issue. In short, for the calculations of open probability, we have estimated the cumulative errors, added these numbers to the text and attached an appendix showing the statistical analysis. We have also added a section to the discussion to address the possible sources of error enumerated by reviewer 2.

      Reviewer #3 (Public Review):

      In the presented work, Ishibashi and colleagues combine immunohistochemistry, analysis of a publicly available large scale 3D EM dataset and smaller but more detailed newly acquired EM datasets to qualitatively and quantitatively study gap junctions of mouse rod and cone axon terminals. The existence of rod-to-cone gap junctions has been known before, but the use of larger 3D EM data allows to determine an average number of contacts as well as an estimate of the strength of gap junctions. This as well as the (very likely) exclusion of direct cone-to-cone coupling in the mouse as opposed to some other mammals are the main contributions of this paper and one more puzzle piece of the big picture of mouse retinal connectivity. However, while the findings are a valuable addition towards a complete picture of the connectivity in the mouse retina, the novelty of the findings is limited to the number of contacts per photoreceptor and gap junction sizes.

      In my opinion, while the authors present a thorough analysis of their data, the manuscript in its current state has stylistic flaws on the motivational side. To me, abstract and introduction lack a motivation or stronger statement of relevance for this analysis. Similarly, while each individual analysis is discussed one by one, I'm missing a broader discussion of the implications of the findings for the field and possible directions for future research to highlight relevance for a broader readership.

      Thank you for the positive comments. We have rewritten and added material to the Abstract, Introduction and Discussion in an attempt to explain the reasoning for this study and to explain the findings to a broader audience.

    1. Backward design (or backward planning/mapping) is about designing with the end in mind. Where do you want students to end up after a lesson? What knowledge and skills do they need to showcase? What are the desired results of the lesson?

      I believe these questions are so important to consider when creating any lesson! As someone who needs time adjusting to new tools or apps, I have experience many times that I have had to focus more on how to use the tool than the content we were using it for. I think when you consider these questions, it helps make sure the students will all benefit from the tool. It may take some extra time during planning, but it will be more beneficial in the long run.

    1. Author Response:

      First we would like to thank the reviewers for their very kind words regarding our manuscript and for their helpful suggestions for how to improve our paper. We believe their suggestions have helped to strength the paper as a whole. We will address below the specific weaknesses that the reviewers have brought up and describe how we have modified our manuscript in response to these suggestions.

      Reviewer #1:

      This is an interesting study of the relation between vividness of visual imagery and the pupillary light response that can result from it. The authors collected data in two experimental paradigms, which they ran in two independent samples. One of these samples was a larger group of psychology students; the other a self-reported group of people with aphantasia. In a first paradigm, the authors show that a lack of vivid imagery is associated with a smaller (or even absent) pupillary light response. Using a second paradigm, binocular rivalry, they show that the degree to which imagery primes binocular rivalry is correlated (to a degree that is quite striking) with the magnitude of the pupillary light response to imagined stimuli. These results were obtained both for low-scoring individuals in the large sample as well as for the aphantasics. The study provides objective evidence for the absence of imagery in individuals that self-report as aphantasic.

      The paper is well written and all the necessary controls for potentially confounding variables are in place. For instance, age or visual persistence are discussed and excluded as alternative explanations based on convincing analyses. A particular strength of the manuscript is that the authors report positive results for pupillary responses in the group with aphantasia. That is, these individuals show regular pupillary responses to changes in physical stimulus brightness as well as to cognitive load. Another strength is that the group of aphantasics was invited separately and not determined post-hoc in the initial sample.

      In summary, there is a lot to like about this paper. I have three comments / questions that I think should be addressed, however.

      1. A point that I would like to see analyzed and discussed is the role of eye movements. The authors do not report any analyses of fixation behavior or the frequency of saccades in the two groups. These should be analyzed and reported. The only mention of fixation control is in lines 423-424, but the authors remain at a very superficial level, stating that footage from this scene camera of the pupil labs eye tracker was "assessed to ensure fixation on the computer monitor". Does this mean that participants could look anywhere provided they looked at the monitor?



      We have now analysed the eye-movements of participants to assess whether or not they might be driving some of our findings, which we agree is a very important additional analysis to add to this paper to confirm our findings are not being driven by eye-movements. When analysing both eccentricity and the number of saccades participants made there was no differences between the two groups when imagining the triangles (see supplementary figures s7 and s11). There was also no correlation between eccentricity data and either the imagery pupillary light reflex or binocular rivalry priming. Taken together it seems unlikely that the observed pupillary light response during imagery is being driven by eye-movements.

      1. In Figure 1D (also lines 120-124), the authors show a correlation between vividness ratings and the pupillary light response. I assume that participants differ substantially in their distributions of responses. So these correlations could be a consequence of individual differences or they could provide evidence for trial-by-trial variation. There might be ways to find out. For instance, is there evidence for these correlations at the level of individuals? Does the correlation persist if individual vividness-response distributions are normalized to span the same range for each observer?

      We would like to clarify the analysis we ran. Figure 1D is the results of 2 x 4 linear mixed-effects analysis, not correlations. This model included subject identity as a random effect (see Methods section of our paper) and therefore the effects reported were computed at the subject level. We report in the text, effects that are significant at the level of the sample. This does not exclude the possibility of inter-individual differences, but we are not sure how interpretable a single-subject analysis is in the current study.

      1. In lines 314-315, the authors state that the pupillary light response to imagined stimuli may serve as an objective indicator of aphantasia. I think this is taking the interpretation of the data too far, mainly for two reasons. First, the authors haven't shown that low pupillary light response predicts aphantasia in a group of people that does not self-report as aphantasics before the test. Second, the absence of a pupillary light response (in a new sample with no additional controls) could also indicate a lack of motivation to engage in imagery. The authors should thus clarify that such tests would always have to be combined with positive tests that show the commitment of participants to the task instructions.

      We agree that it is very important to include positive controls in not only pupillary light response imagery tasks, but any task that measures imagery or any other internal experience. We have now expanded on this point in our discussion as well as reporting on the mock binocular rivalry trials that were included in the priming imagery task as a control for potential response biases.

      Reviewer #2:

      Kay et al. investigated visual mental imagery in the general population and the lack thereof in individuals with aphantasia by measuring the pupillary light response to imagined light and dark shapes. Their findings are twofold. First, they show a link between pupil size change and perceived vividness of imagery and corroborate this finding using another established objective measure of vividness. Secondly, they found a lack of such a pupillary light response in a group of individuals who maintain no visual experience of imagery. This demonstrates the usefulness of using the pupillary light response as a measure of subjective vividness of imagery and potentially demonstrates the first physiological finding in aphantasia.

      Strengths

      The experiment incorporates several different dimensions into a single clean design that is useful for isolating and tracking multiple relevant measures. First, by having the brightness of the perceived and imagined shapes vary across trials, the authors could show that changes in the pupillary light response correspond to changes in imagined brightness. The authors also added in an independent number-of-objects dimension since pupil size also varies with cognitive effort. This provided evidence that aphantasic subjects were attempting to imagine, since the pupil size did change with set size, even when it didn't change with brightness. Finally, by having subjects report the perceived vividness of each imagined image, the authors could link subjective experience of imagery to the pupillary light response.

      The authors also strengthen their findings by comparing changes in pupil size to an objective measure of imagery vividness. By leveraging the fact that imagery mimics vision's ability to bias a perception during binocular rivalry, the authors avoid the severe limitations present in measures that rely on introspection only.

      Weaknesses

      Due to the inherently private nature of mental imagery, ruling out fabrication or demand characteristics is extremely difficult. This is especially true in aphantasia research, as we are often looking for the absence of an effect rather than an enhancement. Readers should keep in mind that, while the authors made some effort to confirm that the aphantasic subjects were attempting to imagine, the potential for this and other biases were not ruled out. Without the use of probes to test subjects on the remembered/imagined objects and reporting the outcomes of catch trials, it is difficult to tell whether subjects were fully engaging with the stimuli.

      Readers should also take the pupillary light response as a tool to add to the battery of assessments for aphantasia, not as one that a diagnosis can be based on alone. While the authors do show a group level difference in pupil size in response to imagined shapes and claim it as a "new low-cost objective measure for aphantasia", it should be remembered that this manuscript does not demonstrate the tool's efficacy in identifying individual subjects with aphantasia. The absence/presence of an imagery pupillary light response does not confirm/rule out aphantasia.

      Overall, the manuscript helps characterize an intriguing condition that until relatively recently received little empirical attention. These findings support the internal experiences described by aphantasic individuals, experiences that are often met with skepticism. Importantly, the authors have also offered the field a new objective physiological approximation of imagery vividness which can be incorporated into a number of study designs examining changes in imagery. The majority of previous measures relied on self-report alone and often suffered from the limitations of language (e.g., what it means for something to be "like vision" can be very different for different people). This manuscript also adds to the growing body of evidence of the power of internally generated signals, which can apparently reach all the way down the visual hierarchy to the eyes themselves.

      We are in full agreement that when we investigate the internal contents of the mind we need to be mindful of the many caveats that exist when relying on people’s ability to introspect. We agree that future studies should expand on our research by adding in further controls, such as having participants report what item they were asked to remember at the end of the trial. However researchers should also keep in mind that changing the demands of a task can alter how participants undertake a given task. For example by emphasising remembering the items, rather than creating detailed vivid images in mind, participants may revert to a non-visual imagery strategy to remember the items, such as labelling the items. This may be particularly easy to do in the current study as the items being imagined are simple geometric shapes. Indeed it was important to avoid this potential pitfall here with our aphantasic population as we have previously shown that aphantasic individuals can perform a wide array of visual working memory tasks despite their lack of visual imagery. We believe that the addition of a set-size/cognitive load condition, plus our added reporting on mock trails helps to answer some of these potential response bias issues, but future research can and should further investigate these potential biases in greater detail.

      The second point Reviewer #2 brings up is a very good one, that no one singular measure in isolation, at this point in time, can be used to ‘diagnose’ aphantasia. The field is very young and we are still in the process of understanding exactly what aphantasia is. For example there may be many subtypes of aphantasia, with previous work from our group and others showing that aphantasic individuals are heterogenous in their reporting of how other imagery modalities are affected. We agree with Reviewer #2’s point that a battery of tests, potentially comprising questionnaires (e.g. VVIQ), psychophysical tasks (e.g. binocular rivalry paradigm) and physiological (e.g. skin conductance, pupillometry) should be aimed for where possible in testing aphantasic populations. The pupillary light response is a new tool that can be added to this arsenal.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): *

      In this manuscript by Wang and colleagues, the authors analyse single-cell RNA-seq (scRNAseq) data by applying transition path theory to infer gene regulatory network (GRN) changes along the transition (reaction coordinate, trajectory) between free energy stable states (i.e. cell types). The work aims to understand how stable cell types, and their regulatory programs (combination of active and repressed genes) switches during differentiation/reprogramming/response (i.e. cell phenotypic transition/CPT). The premise of the work is to assess whether genes within GRNs undergo step-wise repression, state-change and activation (& vice-versa; analogous to SN1) or concurrently regulate gene expression (analogous to SN2). The GRNs are inferred based on highly variable genes and their expression dynamics from RNA velocity over CPT, across 3 scRNA-seq datasets.

      The authors first analyse public scRNA-seq dataset of 3003 human A549 adenocarcinomic basal epithelial cells treated with TGF-b for 0hrs, 8hrs, 1 day and 3 days (4 timepoints). The authors select two stable states (Day0-untreated; Epithelial and Day 3-treatment; Mesenchymal) using local kernel densities and set transition paths using Dijkstra shortest path, dividing state space into Voronoi cells (i.e. reaction coordinate value), and constructed single-cell GRNs based on RNA velocity differences (n=500 genes) and a linear model (from Qiu et al). This GRN is based on expression and velocity estimates, and does not distinguish direct from indirect regulation. Calculating interaction frequency (edges) across two stable states over 4 louvain clusters, the authors find global increase in effective edges that correlates with increased active genes; but with variable trend within inter-cluster edges. To quantify the concerted GRN changes between clusters, the authors utilise a "frustration" score (from Tripathi et al 2020). The average frustration score increases and peaks at day 1 treatment, followed by a decline over terminal stable state (day 3-treatment); similar to interaction frequency trends. The author also separately measure network heterogeneity and repeat analysis using alternative transition matrix. The authors conclude that EMT proceeds through concerted regulation of multiple genes first with an increase in inter-cluster edges, frustration and heterogeneity followed by a decrease into final stable state. The authors apply the analysis to scRNA-seq data from (i) pancreatic endocrine differentiation where Ngn3-low progenitors give rise to Ngn3-high, then Fev-high and into glucagon producing a-endocrine cells; (ii) dendate gyrus; radial glial cell differentiation into nIPCs, neuroblast, immature granule and mature granule cells. In both cases, the authors observe concerted regulation with initial increase in inter-community edges, heterogeneity during differentiation followed by decrease towards final stable state. **

      The study and ideas in the manuscript are interesting and the methods would be potentially be useful. However, there are a few specific and general comments stated below, which the authors should try to address.

      1 • P4: "RC increases first and reaches a peak when cells were treated with TGF-β for about one day, then decreases (Fig. 1G)". It would be better to label the figure with the treatment information. *

      Reply: Thanks for your advice. In the revised manuscript, we analyzed two additional datasets, and moved the EMT result in the supplemental Fig. EV8. In the new Fig. 1d, we marked the cell types along the reaction coordinate.

      *2 • Fig. 1G and EV1D: Why are the trends different? *

      Reply: In the original figures, ____Fig____.1g is the frustration score and EV1D shows the variation of pseudo-Hamiltonian along the reaction coordinate. The frustration score is the focus of this work. We also calculated the pseudo-Hamiltonian since it has been used in the literature. However, we realized that showing both of the results might lead to confusion, so we deleted all pseudo-Hamiltonian results in the revised manuscript.

      * 3 • How is the appropriate community/cluster/Louvain resolution selected? This can have a major impact on number of cell states, types and transition path from initial to final state. *

      Reply: The number of cell states, types and transition path from initial to final state____ are not determined from the community/cluster/Louvain analyses. For the EMT data, we assume most cells in the initial treatment time are epithelial cells, and those in the final time point are mesenchymal cells. For other datasets, we followed the original publications to assign cell types based on known marker expression.

      The Louvain method was applied to coarse grain the gene regulation network, and it does not affect the number of cell states, types and transition path, which were determined separately. To address the reviewer’s question, we also use the Leiden method to adjust the resolution ____(1)____. The resolution does not affect the result. The results are added to Fig. EV12. We tried three different resolution values 0.8,1.0 and 1.2. The number of inter-community edges consistently shows the trend that it increases first then decreases.

      Figure EV12 Cell-specific variation of the number of effective inter-community edges between communities calculated with different resolution parameter values for dentate gyrus neurogenesis (a), pancreatic endocrinogenesis (b), and bone marrow marrow hematopoiesis (c). Each dot represents a cell and the color represents the number of inter-community edges____.

      • * What effect does the Louvain resolution have on e.g. frustration scores? * Reply: The resolution of community division algorithm doesn’t affect the frustration scores, since the frustration score is based on the gene-gene interactions instead of community assignment.

      • * The authors match resolution to samples/timepoints/known prior cell types i.e. 3-4 communities. However it is unclear whether this is enough to describe entire differentiation/transition process. * Reply: This is a good question. In one above reply we have explained how the cell types were determined____. We also agree with the reviewer that these coarse-grained communities cannot reflect the overall heterogeneity and dynamics of the whole process. Notice in most of our analyses (e.g., reaction coordinate and transition paths), we treated the transition as continuous and the distribution of single cell data points in all datasets cover the whole space that involved in cell phenotype transition. The coarse-grained analyses are for further mechanistic insights on how gene regulatory networks are reorganized during the transition process.

      • * Gene selection: The selection based on minimum 20 counts as highly expressed genes is arbitrary and dependent on sequencing depth. Perhaps the authors could show distribution of gene counts for the datasets and have a data-driven filtering criteria * Reply: Thanks for the advice. The number 20 is a default value suggested in the package (scVelo) we use, and in another package dynamo the default number is 30. Following the reviewer’s suggestion (together with the next question on the influence of all highly variable genes), we looked for a data-drive filtering criterion. The method has been described in different tools ____(2-4)____. We first grouped the genes into 20 bins by their mean expression values, and____ scaled their dispersions by subtracting the mean of dispersions and dividing standard deviation of dispersions____. Figure EV9 shows the distribution of the minimum shared counts. ____As one can see, most genes counts are larger than 10, and using a smaller value causes error in the following velocity analysis. Therefore we set the minimum shared counts as 10 in the new results.

      Figure EV9 Shared counts distribution of the datasets. (a) Dentate gyrus neurogenesis; (b) Pancreatic endocrinogenesis; (c) Bone marrow hematopoiesis.

      • * The choice of 500 variable genes (for human A549 cells) is also quite arbitrary. Perhaps, the authors could compare how additional genes (all highly variable genes) affects their analysis and interpretation. * Reply: ____Thanks. Following previous question on shared counts and ____data-driven filtering criteria____,____ we take all the highly variable genes into consideration. The details of gene selection and binarization are given in the Materialss and Methods (Materials and Methods 2) section.

      • * How are other factors (sequencing depth, genes detected, #of cell types, multiple branches) affects the connectivity between communities at different phases of transition/development? * Reply: This is a good question. The A549 EMT dataset has a sequence depth of 40000-50000. The ____dentate gyrus neurogenesis dataset____ has a sequence depth of 56,700 reads. A saturation depth would be close to 1,000,000, but there is a compromise between cell number and depth. There are genes that are not detected even under the saturation reads setting. That is why the preprocessing is needed. On the other hand, the network we inferred include both direct and indirect interaction, so the influence of sequence depth and gene number detected can be reduced to a certain extent. We used a random subset of the selected gene and performed the same analyses. The results are consistent with what we obtained using all the genes (Fig. EV11b). With the new gene selection criteria (Materials and Method 2), our analyses are not related with the number of cell types.

      We did analysis on another beta branch of pancreatic endocrinogenesis data. The other branches show the same results (Fig. EV4). There are two additional branches in the pancreatic endocrinogenesis dataset. It has been reported that the RNA velocity estimation for the epsilon branch is incorrect ____(3)____. There are too few cells in the delta branch for reliable analyses. Therefore we didn’t present results for these two branches.

      Figure EV4 Analyses on the branch of glucagon producing β-cells in pancreatic endocrinogenesis.

      (a) Transition graph based on RNA velocity.

      (b) The RCs and corresponding Voronoi cells. The large colored dots represent the RC points (start from blue and ends in red). The small dots represent cells with color as cell type.

      (c) Frustration score along the RCs.

      (d) Cell-specific variation of effective intercommunity regulation. Each dot represents a cell. Color represents the number of effective intercommunity edges within each cell in the GRN.

        • Are the velocity graph, transition matrix and further shortest path estimation derived in a reduced latent space, and if so, how much (nPCs) and what impact does it have. Presumably, the density estimation is not performed in expression space. Reply: Yes. ____The calculation of transition matrix is based on neighbor information. The calculation of neighbors was in the reduced latent space in scVelo and Dynamo. We performed the same analysis by varying number of principal components. The results are similar because the first several components account for large proportion of variance. Figure R1 shows the results of dentate gyrus neurogenesis with the number of principal components being 10, 20 and 30, respectively. In the revised manuscript, we delete the step of using density estimation constrain to simplify the procedure. __Figure R1 Frustration scorer along RCs (left) and cell specific variation of number of effective intercommunity edges (Each dot represents a cell and color represents the number of effective intercommunity edges) in the GRN within each cell (right) when using different number of PCs in analyses (dentate gyrus neurogenesis): (a) number of PCs is 10.*__

      (b) number of PCs is 20. (c) number of PCs is 30

      * - The figure legends and labels were hard to read. These should be improved for better readability. *

      Reply: Thanks. We modified the figure legends and labels.

      * - A suggestion would be move the initial results section to methods and highlight the biological interpretation. *

      Reply: Thanks for your advice. We moved large part of this section to the Materials and Methods.

      *The authors could highly which GRN and representative genes/edge pairs are highest ranked within inter-community and to overall final stable states. *

      Reply: Thanks. We list some representative gene pairs in the Table. EV 2&EV 3 &EV 4 for different datasets. And we performed gene enrichment analysis for each community.

      * - How does the GRN inference compare to current state-of-the-art GRN inference scRNA-seq methods? *

      Reply: we used the method GRISLI to perform the same analysis ____(5)____. The results are similar to what obtained with our current method (Figure EV6). We want to emphasize that the focus of this work is not on another GRN inference method, but discussing some general principles of GRN reorganization during a cell phenotypic transition process.

      Figure EV6 Analyses of datasets of dentate gyrus neurogenesis (a), pancreatic endocrinogenesis (b), and hematopoiesis (c) based on GRN inferred with GRISLI.

      (a) Frustration score along the RCs of dentate gyrus neurogenesis (left) and cell-specific variation of the number of inter-community edges (right). Each dot represents a cell and color represents the number of inter-community edges in GRN within each cell.

      (b) Same as in panel (a), except for pancreatic endocrinogenesis.

      (c) Same as in panel (a), except for hematopoiesis.

      * - How do extremely noisy/stochastic genes vary in metrics between final stable states? How are the metrics affected by number of cells and stochasticity of expression within a given cluster/community. *

      Reply: To address this question, we selected two genes, Id2 and Cdkn1c, with high variance and compare their distributions in the initial and final states. ____The gene distributions show significant shift between the Ngn3 low EP cells and Alpha cells (Fig. R2 a &b left).____ Then we randomly selected a subset (half) of cells and compared the distributions of these high-variance genes in the sub-population (Fig. R2 a&b right). The results are similar to the full-set results.

      Fig. R2 Comparison of gene distribution in the initial and final states in pancreatic endocrinogenesis. (a) Comparison of the distribution of gene Id2 at the initial and final states (left), and in the randomly selected sub-population at the initial and final states (right). (b) Comparison of the distribution of Cdkn1c at the initial and final states (left), and in the randomly selected sub-population at the initial and final states (right).

      * - Given that the author's approach includes both direct and indirect genes effects, the authors could further prune genes based on existing TF databases or protein-protein validated networks. *Reply: This is a good suggestion. We will work on this idea in future work. As we mentioned, due to constrains of data quality, only tens of transcription factors can be analyzed in these dataset. We list some regulations of transcription factors inferred with current method in Table EV1.

      • *It is unclear which GRNs are already known and which ones are novel and biologically relevant * Reply: We compare some regulations inferred with the method and compare these interactions w____ith some references in Table. EV1____.

      * - It would be good for authors to comment when there are multiple bifurcations instead of A-B transitions. Particularly in datasets with multiple discrete stable states. *Reply: This is a good question.____ In our analysis, we focus on the transition from one stable state to another stable state. For transition process with multiple bifurcations like____ the pancreatic endocrinogenesis, the results are similar across different branches. For the transition that goes through multiple discrete stable states, for example, a transition from state A____à____B____à____C, we expect to observe two peaks in the frustration score and the number of inter-community edges. We added some discussions in the Discussion section.

      • *Another suggestion would be to highlight gene expression of selected markers based on f-regression and mi over the trajectory * Reply: As we modified the criteria of gene selection, we plotted trajectories of some high-variance genes versus the reaction coordinate obtained with different datasets in Fig. EV10 based on current criteria.

      Figure EV10 ____Typical trajectories of high variance genes versus RCs of dentate gyrus neurogenesis (a), pancreatic endocrinogenesis (b) and bone marrow ____hematopoiesis ____(c).

      * - If possible, a proof of principle could be re-analysis of a perturbation scRNA-seq dataset (e.g. where one path/transition path is stalled) *

      Reply: Thanks. This is a really a good suggestion. We will perform more systematic studies in future work.

      * Reviewer #1 (Significance (Required)): Nature and significance of advance: The study and ideas in the manuscript are interesting and the methods would be potentially be useful to community. Compare to existing published knowledge: *

      *Audience: Predominantly computational audience *

      *Your Expertise: PI with background in experimental, computational biology and expertise in single-cell genomic tools and developmental biology *

      *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Understanding the cellular and molecular basis of cell type or cell state transitions occurring during development or reprogramming is a fundamental challenge. scRNA-seq has provided a window into gene expression programs across thousands of cells undergoing such transitions. Wang and colleagues leverage scRNA-seq and develop an approach to reverse engineer gene regulatory network underlying cells along a path from one cell type/state to another, and characterize community-level properties of this network associated with various stages of the cell phenotype transition. The study is innovative and rigorous, and their results point to how intercommunity interactions increase and then decrease, indicating a concerted regulatory rewiring that orchestrates transitions. Application of their approach to three different datasets also shows that this trend is consistent across three different transitions and maybe a general trend. However, there are some major and minor concerns that need to be addressed.

      **Major comments and questions**

      1. The analogy to SN1 and SN2 mechanisms of chemical bond formation is very nice.
      2. What is the basis for the two statements made in paragraph 3 of Introduction (beginning with "A question arises ...") about transitions being sequential or concurrent? Please *Reply: Thanks. We added references in this paragraph.

      * 2.1. Provide references to previous experimental and computational studies that have investigated developmental and reprogramming gene expression programs. *

      Reply: Thanks. We added a paragraph in the Introduction.

      *

      2.2. Describe specific examples of findings that support the two possible transitions highlighted here. Why couldn't transitions happen through an entirely gradual process involving changes to overlapping subsets of genes. *

      Reply: Thanks. In the review paper of Naomi Moris et. al., they proposed the hypothesis that cell phenotype transition is similar to a chemical reaction ____(6)____. Thus we extrapolate this hypothesis and test it in our study. For the example of SN1 mechanism, ____Kalkan et al. showed that mouse embryonic stem cells can exit from ____naïve pluripotency____ but remain uncommitted ____(7)____.

      Just like the SN1 and SN2 mechanisms are two extremes in chemical reactions and there are cases lie in between, for cell phenotypic transitions we agree with the reviewer that such gradual process may exist. Actually the result in Fig. EV4d shows that the frustration score remains flat for the Fev+ ____à____ Beta transition, suggesting a possible gradual process. With the analyses provided in this work, such as the reaction coordinate, frustration score, heterogeneity, and inter-/intra- community edges, one may perform more systematic studies on a larger number of datasets and enumerate/classify possible patterns of transitions.

      • Please make plots of the number of effective intra-community edges vs. number of active genes to support the statement that these two numbers are correlated. *

      Reply: We plotted the corresponding intra-community active genes and calculated its correlation coefficient with the number of effective intra-community edges in dentate gyrus neurogenesis (Fig. EV1d). ____The correlation coefficients are 0.91,0.96, 0.99 and 0.96 for community 0, 1, 2 and 3 separately.

      * A bunch of notations are not clear:

      4.1. What is the "r" in "strongest intercommunity interactions at r = 10 (Fig. 1F)"? Is it the same as the "r" mentioned in the Methods section? *

      Reply: r____ is the index number of the discretized reaction coordinate. We added it when we define the reaction coordinate. We modified the conflict usage of r in Materials and Method 4.

      4.2. What is "s_i" in "cell-specific effective matrix, Fbar_ij = (2*s_i - 1)*F_ij"? Also, that description of F_ij, f_ij, and H should be moved to the Methods section, and a more high-level, intuitive description should instead be included in this Results paragraph. Reply: represent the binarized gene expression state. is 0 for when gene is in low expression level (silence) and is 1 when gene is in high express level (active). We modified this part following your advice.

      * How were the h_f and h_m thresholds chosen? *

      Reply: and are based on the distribution of each dataset. Following suggestions from another reviewer, we modified this part. All the highly variable genes were selected and the genes were binarized with the Silverman’s bandwidth method and ____K____means (Materials and Methods 2).

      * What is the "density of each single cell" ("_t")? The formulation of the penalty of the distance between cells i and j (the expression with -logP_ij...) is unclear. What is the intuition behind it? What is r? How were the values of r (0.5 and 0.8) chosen? *

      Reply: The probability density of cells in the expression space is based on the kernel density estimation. Intuitively, a region in the expression space with more cells is more likely passed by more cell trajectories. The values are based on the distribution of kernel density estimation in different datasets.

      In the modified manuscript, we used trajectory simulation and deleted this assumption for simplification.

      * One of the reasons the authors state to justify the choice of PLSR is "In the scRNA dataset, the number of genes is often comparable to or larger than the number of cells." This is not true most of the time. In nearly all recent studies, the number of cells is way larger than the number of genes measured. *

      Reply: The PLSR method definitely can be used for the data whose number of cells is larger than the number of genes. Also the PLSR method was applied on cells that are the k nearest neighbors of each reaction coordinate, which are a subset of the whole dataset (Materials and Methods 5). While we mainly presented results with the PLSR method, in this revised manuscript we also added results with another method of GRISLI (Materials and Methods 9). The results are similar with what we obtained with PLSR.

      * There is a fleeting reference to a nice previous finding that supports their observations: "several lines of evidence support that EMT proceeds through a concerted mechanism. Indeed, both in vivo and in vitro studies have identified intermediate states of EMT that have co-expressed epithelial and mesenchymal genes (Pastushenko et al, 2018; Zhang et al, 2014)". The authors should thoroughly survey the literature related to EMT transition, development of pancreatic endocrine cells, and development of the granule cell lineage in dentate gyrus, to find more previously identified molecular/cellular features relevant to cell state/type transitions, compared and contrasted with findings from this study. *

      Reply: Thanks. We added references on these cell phenotype transitions and modified the corresponding part. We do want to point out that the main focus of this work is that all these processes share a common feature of transient increase of intercommunity interactions.

      * What is the "dynamo" package, which is supposed to contain a Python notebook? As of now, the code and data have not been made available. Both need to be released along with thorough documentation on how to run the code to reproduce the analyses described here. *Reply: Thanks. Dynamo is a python package accompanying our recent publication ____(8)____. We uploaded the code on Github and added the link of Dynamo.

      * **Minor comments and questions**

      1. Replace "confliction" throughout the manuscript with "conflict" or "conflicting" as appropriate. *

      Reply: Thanks. We modified them.

      * Paragraph two of the Introduction (beginning with "Another example of transitions ...") is missing multiple references, esp. for the last four sentences. *

      Reply: Thanks. We added references.

      * There are direct quotes from previous papers like "predicts the future state of individual cells on a timescale of hours". The authors are highly encouraged to check for usage of exact phrasing using available text software such as iThenticate. *

      Reply____: ____Thanks a lot for pointing out this severe mistake. We re-edited the manuscript and checked with iThenticate. *

      *

      • "Each community contains both E and M genes": what does this mean? *

      Reply: The E (M) genes are defined as those genes that are active or have high expression levels in epithelial (mesenchymal) state or sample. As we reorganized the manuscript, we add this explanation for all datasets in the caption of Fig.1i.*

      *

      • Reference to Qui 2021 is missing in the "Path analysis" subsection under Methods. *

      Reply: We added it in the Methods.

      * Fix: "transition between the cells that their sample time points are successive" in Methods. *

      Reply: Thanks. ____We modified it.

      * In Methods, under "Network inference", it is "partial least square regression" (not *least* s square). *

      Reply: Thanks. We modified it.

      * Figure 1: The cyan, magenta, and lime in 1C are very hard to see and, perhaps, the grey of the points can be made lighter. Also, change the red and green colors for the arrows in 1I to something else. These colors are not colorblind-friendly. *

      Reply: Thanks. We re-plotted the figures and changed the colormap.*

      *

      • Periods and commas are missing at several places. Reply: Thanks. We modify these and re-edit the manuscript.

      Reviewer #2 (Significance (Required)):

      The study uses RNA-velocity calculated from scRNA-seq data in an inventive way to characterize paths that reflect cell phenotype transitions. Then, a sparse gene regulatory network is reverse engineered from the data and the community structure within this network is examined at various stages along the transition to make observations about inter- and intra-community regulation and network "frustration". However, the study lacks the context of existing literature in terms of previous work studying cell transitions both experimentally and computationally. Adding this context (as suggested in the comments) will considerably improve the utility and significance of the findings. Overall, this study will be of broad interest to researchers interested in development and reprogramming as well as computational scientists developing and applying methods for scRNA-seq data analysis, trajectory inference, and network reconstruction. All the comments and questions raised here are based on my background and expertise in omics data (including scRNA-seq) analysis and network biology.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The authors analyze three datasets of Single cell RNA velocity measured during phenotypic transition. They infer the gene regulatory network in each case and characterize the transition between the initial and final expression states (in which different sets of genes are expressed). Their motivating question was to find whether during such transitions first genes characterizing the initial state are no longer expressed and only then the genes associated with the final state start expressing or alternatively there is gradual transition through an intermediate state in which subsets of both initial and final state genes are transiently expressed.

      They define a measure of regulatory frustration representing the mismatch between regulatory signals a gene receives and its current expression state. They conclude that phenotypic transitions involve transient interactions between otherwise non-interacting gene modules and a temporary increase of gene frustration, which is relaxed once the final expression state is reached.

      The study uses of advanced inference and machine learning methods.

      I find the question studied in this manuscript interesting, opening avenue to further questions and studies and relevant to different scientific communities. Personally I think that the focus of the paper should be the exposition of the methods used this manuscript would benefit from a longer format, but that depends of course on the journal they are aiming at. *

      *

      Statistical analysis is missing. Especially since the authors mention the potential of over-fitting due to large number of genes (on the order of the number of cells) - I think the authors should provide a sensitivity analysis testing how sensitive are the conclusions to the choice of cells or genes by applying the methods to subsets of the cells / genes. *

      Reply: Thanks. For the subset of cells, we randomly selected cells from the dataset and performed the analyses (Fig. EV11a). For the subset of genes, we selected a subset of genes randomly and performed the analyses (Fig. EV 11b). We found the results are not affected. We also perform another statistical analysis by varying the value of resolution in community detection algorithm. And we found that the conclusion on variation of inter-community edges is not affected (Fig. EV12).

      Figure EV11 Statistical analyses of dentate gyrus neurogenesis. Each dot represents a cell and color represents the number of inter-community edges.

      (a) Frustration score along the RCs (left) and cell-specific variation of the number of inter-community edges (right) of a randomly selected sub-population of 2000 cells (from a total of 3184 cells);

      (b) Frustration score along the RCs (left) and cell-specific variation of the number of inter-community edges) (right) of cells on the space of 400 randomly selected genes (from a total of 678 genes).

      *What is the meaning of the distribution in the frustration plots? *

      Reply: For each cell we calculated a frustration score. Therefore for cells in each Voronoi cell (which is a geometric cell, don’t be confused with the biological “cells”) along the reaction coordinate (Fig.1d, Fig. 2b &2g), we obtained a distribution of the frustration scores.*

      In general, the conclusions are well-justified, but I think some statements in the discussion are inaccurate: "intercommunity interactions of a GRN are indeed minimized' - are they minimal or are they only lower at the stable states? There are two stable states - for which of them is intercommunity interaction lower? *

      Reply: Thank. We agree with the reviewer and modified the writing. Comparing with the transition state, the number of intercommunity interactions is less for the stable states. ____The datasets' quality are not high enough for us to investigate whether ____"intercommunity interactions of a GRN are indeed minimized”.*

      It is written in the discussion that 'for all three datasets frustration decreases with differentiation', but then Fig. 1g shows the opposite (final state is more frustrated than initial state). It is interesting to discuss the differences between the datasets analyzed in that respect and what could cause transition to a more frustrated state. I suggest that the authors also refer in the discussion to related questions and possible follow-up studies, such as: what determines the duration of the phenotypic transition? A relevant number is the switching time of a single gene. *

      Reply: Good suggestion. Compared to other datasets, we found that the result of EMT shows larger variances. The relative difference of the frustration score is also affected by the GRN inference algorithm. For example, the difference between initial and final frustration scores of the pancreatic endocrinogenesis is more significant when using the GRISLI method (Figure EV6b). Given these, the trend that the frustration scores in the transition states transiently increase keep consistent.

      Our conclusion is limited by the quality of the data. So we delete this part of discussion in the manuscript.

      Qiu et al. have shown that splicing-based ____RNA velocities are relative, while metabolic-labeling-based RNA velocities are more quantitative and accurate____(8)____. We will re-analyze this problem if data with metabolic labeling becomes available.

      * The authors mention at the end that the networks can often reach multiple final states from a common initial states. Do such transitions share some of their path (and in particular the intermediate frustrated state)? Given the intermediate connected state, it would be interesting to characterize the network stability to perturbations. *

      Reply: This is a very important question. To reliably address these questions, we need higher quality data. We plan to characterize the network stability to perturbations in future studies, while in our recent paper using a full nonlinear modeling framework____(8)____, we performed in silico perturbations.

      * While interesting, the manuscript itself is unfortunately hard to read and would benefit from major editing, including better exposition of the science and language editing. *

      Reply: Thanks. We revised the manuscript extensively.*

      Methods: Description of PCA and 'revised finite temperature string method' are missing in the Methods section. *

      Reply:____ Thanks. PCA is used in RNA velocity analysis for dimension reduction. We added this in Materials and Methods 3. The revised string method is in Materials and Methods ____4.

      *

      Some examples:

      Figure captions are very short and often non-informative. Some variables are not defined (or only defined later on) and the reader then needs to guess their meaning: it took me a while to understand what is 'r' in Fig. 1f and what 'r=10' (p. 4) means. *

      Reply: Thanks. ____r____ represents the index number of reaction coordinates. We added this in the manuscript where we define reaction coordinates.*

      p. 4: what are 'f' (as opposed to F) and 's_ij' and 's_j' (expression states?) Or is fs_ij one variable? What does a Hamiltonian of a cell mean (p. 4, bottom)? *

      Reply: is the regulation of gene ____j on gene i, and is the expression state of gene i (0 for silence, and 1 for active expression). is the frustration value of regulation from gene j to gene i.

      The pseudo Hamiltonian value is proposed in the literature as an analogy of ____the magnetic systems following the work of Boolean model in EMT ____(9)____. A high Hamiltonian value indicates that the cell is in an unstable state. In the original manuscript we included this quantity since it has been discussed in the literature. However we found it causes confusion and is not necessary for our discussions, so we removed the pseudo-Hamiltonian results in the revised manuscript. * P. 4: how are 'E and M genes' defined? *

      Reply: The E (M) genes are defined as those genes that are active or have high expression levels at the epithelial (mesenchymal) state or sample. We explained our general strategy in the caption of Fig.1i . * What does 'network heterogeneity' (p. 5) mean? *

      Reply: Network heterogeneity measures how homogenously the connections are distributed among the genes____(10)____. A high heterogeneity ____means that some genes have high degree of connectivity (the so-called hubs), while some have low degree of connectivity.

      *

      Fig. 1 is too tiny and hard to read and details are missing. *

      Reply: Thanks. We modified this figure and caption.*

      A glossary for all the acronyms used would be very helpful. *

      Reply: Thanks. We added glossary in the manuscript.*

      Language (some examples):

      p. 5 bottom: Another system is on development... invitro -> in vitro

      p. 6: 'measure on developmental potential' -> measure of... *

      Reply: Thanks. We modified these and re-edited the whole manuscript.*

      Reviewer #3 (Significance (Required)):

      This study presents a methodological advance in demonstrating the application of data analysis methods to study developmental phenotypic transitions. High throughput measurements and computation power available today enable putting to test theoretical conjectures, as made by Waddington. I think this is a promising line of research, which could be used to further develop the computational methods as well as to further our understanding of developmental transitions and potentially develop associated mathematical modeling frameworks.

      This study should be of interest to a diverse readership composed of developmental biologists as well as to quantitative biologists and CS researchers applying optimization techniques and data analysis methods to high-throughput biological data.

      I am not an expert on the computational methods applied in this manuscript and hence cannot assess their correct use and statistical analysis.

      *

      1. Traag VA, Waltman L, & van Eck NJ (2019) From Louvain to Leiden: guaranteeing well-connected communities. Scientific Reports 9(1):5233.
      2. Stuart T, et al. (2019) Comprehensive Integration of Single-Cell Data. Cell 177(7):1888-1902.e1821.
      3. Bergen V, Lange M, Peidli S, Wolf FA, & Theis FJ (2020) Generalizing RNA velocity to transient cell states through dynamical modeling. Nature Biotechnology 38(12):1408-1414.
      4. Wolf FA, Angerer P, & Theis FJ (2018) SCANPY: large-scale single-cell gene expression data analysis. Genome Biology 19(1):15.
      5. Aubin-Frankowski P-C & Vert J-P (2020) Gene regulation inference from single-cell RNA-seq data with linear differential equations and velocity inference. Bioinformatics (Oxford, England) 36(18):4774-4780.
      6. Moris N, Pina C, & Arias AM (2016) Transition states and cell fate decisions in epigenetic landscapes. Nature reviews. Genetics 17(11):693-703.
      7. Kalkan T, et al. (2017) Tracking the embryonic stem cell transition from ground state pluripotency. Development 144(7):1221-1234.
      8. Qiu X, et al. (2022) Mapping Transcriptomic Vector Fields of Single Cells. Cell 185(4):690-711.
      9. Font-Clos F, Zapperi S, & La Porta CAM (2018) Topography of epithelial–mesenchymal plasticity. Proceedings of the National Academy of Sciences 115(23):5902-5907.
      10. Gao J, Barzel B, & Barabási A-L (2016) Universal resilience patterns in complex networks. Nature 530(7590):307-312.
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We are very grateful to the three referees for their constructive comments and suggestions which have helped improve the quality of our manuscript.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In the publication HAT-field: a very cheap, robust and quantitative point-of-care serological test for Covid-19 by Joly and Ribes the authors describe an adaption and an improved protocol to their previously published haemagglutination based test to detect antibodies to SARS-CoV-2 in patient blood (Towsend et al., 2021). In detail, they analyzed the effect of several adaptions including buffer optimization, plate coating, usage of patient whole blood instead of washed RBCs and plasma. Additionally they tested different temperatures and stability of the reagents, namely the nanobody-RBD construct IH4-RBD. For validation they compared their optimized HAT-field assay with Jurkat-S&R as a FACS-based assay.

      Major comments:

      Introduction: This section is rather short and could benefit from a broader overview of currently established methods and assays to detect appropriate immune responses against SARS-CoV-2. The author are advised to summarize the current literature in the field more comprehensively and not only focus on their own work.

      Response: Hundreds of different tests to monitor immune responses against SARS-CoV-2 have been described to date, and the literature on these various tests is vast, with new articles coming out almost on a daily basis. We would not feel either that the introduction of our rather technical paper would benefit from being lengthened by such a review of the current literature, or even competent to carry out such a summary. Following the referee’s suggestion, we have, however, introduced a new sentence and given three references providing relatively recent overviews on the subject of immune-monitoring.

      Cross-reactivity with IH4-RBD. In Figure 6, the authors highlight the samples in red and orange that showed cross-reactivity with IH4-RBD. In their discussion, however, the authors state that only 2 of 60 (3%) were cross-reactive. In making this statement, they ignore the proportion of cross-reactive samples that were also positive in the Jurkat S&R assay. Therefore, the authors should acknowledge in the discussion that the actual number of cross-reactive samples was higher.

      Response*: The statement in the discussion about 2 cross reactive samples out of 60 concerns the results obtained after an incubation of one hour under normal gravity, and not the two red dots in each of the three graphs of figure 6, which correspond to the two negative samples which gave false-positive results in HAT plasma titrations after spinning (Figure 6C), for which we correctly state in the discussion that 12 samples showed cross-reactivity on IH4 alone. The data presented in Figure 6B corresponds to HAT-field after spinning, for which we correctly state in the discussion that 5 out of 60 showed cross-reactivity (4 orange dots + 1 red dot, the second red dot having a score of 0, in accordance with the fact that this sample showed no cross reaction on IH4 alone in HAT-field after spinning). *

      *To try to prevent this possible confusion, we have now clarified what data we are referring to at the start of that paragraph in the discussion. *

      Quantitative Assay. Since the HAT assay does not allow determination of the absolute number of antibodies reactive to SARS-CoV-2 in the blood samples, the authors should refrain from claiming that the HAT-field is a quantitative assay.

      Response*: Since immune sera are inherently polyclonal, they contain a multitude of different types of antibodies of different affinities and avidities, and we are not aware of any technique that allows to determine the “absolute number” of antibodies directed against a given antigen in such samples. *

      *For many serological tests, including ELISA and the initial protocol of HAT, serum or plasma titrations are used as a means to obtain what is widely considered as a quantitative evaluation of the amounts of antibodies in blood samples. Even FACS-based assays such as the Jurkat-S&R-flow test we have used, are commonly considered as quantitative but those only provide relative results and not absolute numbers. *

      We perceive that the close correlations we find between the results of the HAT-field protocol and those of the Jurkat-S&R-flow test as well as with serum titrations using the standard HAT protocol warrants considering the results of HAT-field as being as quantitative as those obtained with all those other tests.

      Morphological read out For field application, the morphological description of the observed deposits ("teardrop" vs. "button") could be problematic and might lead to bias depending on the user. Thus, the authors should provide a clearer description for phenotype classification.

      Response: We have now introduced a specific paragraph detailing how to score HAT assays in the Methods section, as well as a new figure providing a graphic description of positive, partial and negative RBCs deposits.

      Minor comments: Title: the authors should remove "very"

      Response*: We have now removed the word ‘very’ from the title, and thank the referee for this helpful suggestion. *

      By the way: What are the costs of IH4-RBD for a 96 well plate? Who will produce this reagent? Is the sequence of the IH4 fully disclosed?

      Response*: As specified in our original paper (see Townsend et al. 2021), the plasmid coding for the IH4-RBD is available upon request from Alain Townsend (Oxford, UK). Furthermore, his laboratory funded the production of 1 gram of the IH4-RBD reagent by a commercial company, and professor Townsend has been graciously sending aliquots of 1 mg of this reagent, which suffice for several thousand tests, to all the laboratories that have requested it from him. *

      *In its initial format, HAT only required 100 ng of IH4-RBD per well, corresponding to a cost of 0.0027 £ per well. For the HAT-field protocol, 5 times more reagent is needed, thus bringing the cost of the reagent to 1.5 cts per test, to which one would have to add a similar cost for the IH4-reagent alone. This would thus bring the cost of the two reagents to approximately 3 cts, which is still lower than the price of any of the cheap disposable plasticware necessary for the test (lancet, pipet, plastic tube and portion of a plate). *

      The sequence of the IH4 nanobody is indeed fully disclosed (see figure 1 of Townsend et al. 2021), and has actually been protected by a patent ( US9879090B2 ). Whilst IH4 can be used freely for research purposes, licensing rights would have to be taken into consideration by any health authority wishing to use the technique broadly, or for any commercial distribution.

      The usage of the CR3022 as positive control for neutralizing antibodies should be reconsidered since this antibody does not confer viral neutralization. Other well describe antibodies blocking the ACE2:RBD interface might be better suited.

      Response*: CR3022 was the one that we had at our disposal, but other mAbs can certainly be used instead of as positive controls, and this is actually indicated in the detailed HAT-field protocol provided. Since the use of a positive control is only to ensure that the IH4-RBD has not been degraded and works as well as expected, and that any negative samples are not due to a very rare glycophorin mutation that could prevent IH4 from binding to it at the surface of RBCs, we are not sure why using a mAb with neutralizing activity would necessarily be better than the CR3022 mAb. *

      Figure 2: Please state the concentration of IH4-RBD used. As stated in the figure legends for Figure 2 B, the authors should show the result all 4 replicates (incl. SD)

      Response: The concentration of IH4-RBD was 1 m*g/ml, i.e. the normal concentration for standard HAT tests. This was already indicated in the Methods section, but has now been added to the legend of Figure 2. *

      Whilst 4 experiments were indeed carried out, which all gave similar results, i.e. showed that using PBS-N3 or PBN did not hinder HAT performance, but could instead result in a slight increase in HAT sensitivity, those various experiments were not all exact replicates of the experiment shown on figure 2. Furthermore, performing of those various experiments was spread over a period of over a year, using different reagents, thus precluding numerical comparisons between the various results. We have clarified this issue by rewording the final statement to “Comparable results were obtained in four similar experiments.”

      Figure 3: Although the authors showed stability of IH4-RBD at 2 µg/ml they do not provide data for the stabilities at higher dilutions. As the authors suggest to predistribute the IH4-RBD in plates they should at least discuss this issue.

      We thank the referee for raising this valid point, which has now been discussed in the paragraph entitled “Practical considerations for performing HAT assays” in the Methods section: “One aspect that will have to be considered for the design and use of such individual strips of wells will be to ensure that, upon storage, the various dilutions of IH4-RBD are as stable in such strips as the working stocks of IH4-RBD (2 mg/ml) tested in Figure 3.”

      Figure 6/Supplementary Figure 1 and 3 The presentation of the data is not accurate, as many of the points (samples) are obviously identically positioned in the graph. The authors should choose a different representation of their data. E.g. they could adjust the size of the points to the number of overlapping samples.

      Response: We thank the referee for raising this issue, which was also pointed to by referee #2. This apparent inaccuracy is due to the fact that, on these plots, the scales for both x and Y axes used discrete values, which indeed results in multiple points overlapping on top of one another. This was resolved by adding numbers next to the positions where several dots overlapped

      Wording / text length In the current manuscript the text is very long. Thus, the authors should shorten it to report the essential findings more appropriately. Additionally they should check for correct English wording.

      Response*: We thank the referee for this remark, which helped us realize that the excessive length of the manuscript was mostly due to an extensive discussion of highly technical and practical points. The corresponding paragraphs were indeed out of place in the general discussion, and have not been deleted but have been moved to the Methods section since we feel that they contain very important information for people who would actually start to performing HAT assays. *

      Reviewer #1 (Significance (Required)):

      In summary, the authors describe the HAT-field test as a simple PoC test for the detection of SARS-CoV-2 antibodies in patients. Because of its ease of use and robustness, the test appears to be particularly well suited for use in countries with underdeveloped health care or limited testing facilities, as also reported previously. The value of this manuscript lies mainly in the detailed description of the protocol and its validation. In this context, the adaptations described are certainly useful and helpful from a practical point of view, but do not provide significant new scientific insights. In light of these considerations, we recommend that this work be submitted to an appropriate journal specializing in the publication of such methods

      Expertise The reviewers have established and published different serological assays to monitor immune responses against SARS-CoV-2

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this paper, the authors developed a feasible protocol for an affordable point-of-care serological test for SARS-CoV-2. This method was adapted from the HAT plasma titration test that the authors previously published. Specifically, the test utilizes a 96-well plate pre-coated with the RBD of SARS-CoV-2 spike glycoprotein fusing to a red blood cell targeting nanobody (IH4). By adding microliters amount of the blood or plasma samples to the plate, it allows the detection of antibodies against RBD by measuring the level of hemagglutination. In the current upgraded protocol (so called HAT-field), the authors made major modifications including optimizations of buffer and experimental protocol and the use of pre-titrated IH4-RBD on the plate, which collectively helped to lower the sample consumptions, improved the stability and the sensitivity of detection, and made the test more user-friendly under non-clinical settings.

      Major comments: My major concerns are related to the robustness and quantitative capability of this approach. Specifically: It seems that multiple variables may impact the results. These include volume of droplets, the presence/absence of serum IH4 or BSA cross-reactive antibodies, and the amount (%) of red blood cells which may vary substantially among samples. Could you find a way to normalize the results (e.g., the discrepancy shown in Figure 6) instead of only leaving them as false-positives or false-negative?

      Response*: Regarding the volume of the droplets, in other words, the amount of blood collected and used in an assay, two sentences in the manuscript underline the fact that this is not a critical variable: *

      In the Results section “the precise volume of blood collected is not critical; it may vary by as much as 30% with no detectable influence on the results.”

      In the discussion: “On this subject, we have found that increasing the amount of whole blood per well (in other words using blood that is less dilute) has very little influence over the HAT-field results, and, if anything, adding more blood can sometimes reduce the sensitivity, albeit never by more than 1 dilution.”

      Consequently the % of RBCs in samples seem unlikely to influence the HAT-field scores significantly. This is supported by the fact that, although men tend to have higher hematocrits than women, we have not noticed any detectable difference between men and women in the correlation of the HAT-field scores with those of the Jurkat-S&R-flow test.

      We are not sure that we fully understand what discrepancy shown in Figure 6 the referee is pointing to, but if it is about the increase in the number of samples found to be cross reacting on IH4 alone when the sensitivity increases, in the discussion, we propose to perform tests using titrations of the IH4 nanobody alone simultaneously to using the IH4-RBD reagent, so as to minimize the number of samples that would be identified as false positives if only one concentration of IH4 alone was used as negative control. Comparing the titers obtained with IH4-RBD and IH4 alone will then provide some level of normalization for the samples cross reacting on IH4. As for the hypothetical presence of antibodies cross reacting on BSA alluded to by the referee, since such antibodies would not bind to RBCs, we do not think they would affect the HAT results.

      Second, the score of the HAT-field ranges from 0 - 8. However, based on the current manuscript, it is not clear how the scoring and scaling works. How is the noise (non-specific antibody signal) defined here?

      Response: We have now introduced a specific paragraph and a new figure detailing how to score HAT assays in the Methods section.

      In addition, it is unclear how to translate the HAT-field score into a meaningful measure of protection by serum antibodies.

      Response*: Documenting the correlation between HAT-field scores and levels of protection against SARS-CoV-2 infections and/or Covid-19 severity would indeed be extremely interesting. This would, however, require setting up a large scale clinical trial carried out over several months. This type of work could only be carried out by a large consortium including clinicians or even preferably a national health agency. This was, however, far beyond the reach of this initial project, which was based on the work of a single person on a shoestring budget. *

      Can you provide more evidence to demonstrate that the test is quantitative? For example, performing additional orthogonal experiments to better validate the scoring and generate a correlation function?

      Response*: Inasmuch as it would have been very interesting to perform additional serological tests from commercial sources on the samples of our cohort, such tests are all very expensive (e.g. ca. 500 € for one ELISA plate). This was in fact the main reason for developing the Jurkat-S&R-flow test in the first place, since it is much cheaper, more modular, and at least as sensitive as ELISA (see Maurel Ribes et al. 2021). The funds for this whole project came from a single 15 k€ grant obtained from the ANR, and we simply did not have access to the funds, or to the human resources to carry out such experiments based on commercial serological tests. *

      Minor comments: Figure 6: are all results included? To me, it does not seem that all 60 samples data were included in the plot.

      Response: We thank the referee for raising this issue, which was also pointed to by referee #1. This apparent inaccuracy is due to the fact that the scales for both x and Y axes used discrete values, which results in multiple points overlapping on top of one another. This was resolved by adding numbers next to the positions where several dots overlapped.

      There are several redundant statements in the discussion and results section. Please make the text more concise.

      Response: The discussion has now been shortened considerably, mostly by moving the paragraphs pertaining to technical considerations to the Methods section.

      Reviewer #2 (Significance (Required)):

      The current paper is built upon the improvement of previous published work. In addition, there are similar approaches that have been published. It was unclear if the current method is superior to other works.

      Response: Whilst we have made no statement regarding whether the method we describe is superior to other methods, we are pretty confident that very few alternatives will be as frugal and simple as the HAT-field protocol described here. As alluded to in the final paragraph of the discussion, two recent reports have described that HAT could be performed on cards rather than in V-shaped wells, with semi-quantitative results being obtained in minutes. If such card-based approaches turn out to provide sensitivity and reliability comparable to those of the HAT-field protocol, they will certainly represent very interesting alternatives. As stated in our manuscript, we would be very interested if the comparative evaluation of the two approaches could be carried out by one or several independent third party.

      My research involves the development of antiviral antibody therapeutics. This method may be used as a point-of-care tool for the measurement of serologic response to RBD in less developed countries. However, due to the high vaccination rate and large infected populations, the overall needs for such detection drastically decrease. The significance of the work and utilities of the test may expand with more experiments related to the variants.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This paper describes a low-cost robust and quantitative serological test based on haemgglutination, which could be used in resource limited settings for evaluating population-based and vaccine induced immunity. Neutralising antibodies to the receptor binding domain (RBD) on the SARS CoV-2 spike protein are an immunological correlate of protection. The HAT has a single reagent the RBD domain of SARS CoV-2 linked to a monomeric anti-erythrocyte single domain nanobody. When human polyclonal serum antibodies bind to the RBD they cross-link and agglutinate human red blood cells, resulting in haemagglutination which can be read visually.

      This paper thoroughly evaluate the stability of the HAT reagents used to measure human and monoclonal antibodies examining the robustness of the HAT reagent. It provides a comprehensive protocol for conducting field based HAT with limited reagents. The test can evaluate is subjects have been infected using a simple finger prick to detect RBD specific antibodies. The field HAT can also be used to define people that can be susceptible to reinfection or in need of vaccination, With the use of RBDs from the variants of concern the test can be rapidly adapted to evaluate antibodies as new variants arise to evaluate surrogate correlates of protection to allow timely evaluation of vaccine effectiveness and predict the need for vaccine booster doses. The data are very comprehensively presented with good figures demonstrating the most appropriate buffer to store the IH4-RBD reagent and the robustness of the HAT over time at different temperatures. No additional experiments are needed and suitable numbers of replicates are included. All data, methods and reagents are comprehensively described.

      Minor comments: The paper is well written but rather long in places and may have benefited from being more succinct.

      Response: The excessive length of the manuscript was mostly due to an extensive discussion of highly technical and practical points. The discussion has now been shortened considerably, mostly by moving the paragraphs pertaining to technical considerations to the Methods section.

      Panels in figures could be labelled as A, B, C etc to help in identifying the correct panel..

      Response: We thank the referee for this helpful suggestion, which we have followed.

      I would avoid the use of experiment and project and refer to next we confirmed... or in this paper or our results show Please make sure all abbreviation are defined upon first use. Perhaps include early in the paper that most of the work was conducted with the Wuhan RBD

      Response: We thank the referee for these helpful suggestions, which we have followed to the best of our abilities. The abstract now contains a mention of the fact the work on optimizing the protocol was carried out with the IH4-RBD carrying the Wuhan version.

      Figure 2: I would suggest placing either a solid line between the two halves of the plates to make it easier for the reader to differentiate between the two antibodies. It also would have been easier to read if the bottom PBS, PBS-N3 and PBN were at 45 degree angle. In B include the serum name (e.g. serum 197).

      Response: We thank the referee for these helpful suggestions, which we have followed.

      Legend to figure 4: please include the serum numbers after covid-19 patients. Perhaps include arrows to demonstrate the dilutions of serum and IH4-RBD in the figure.

      Page 6 it might be easiest to use the same times as in figure 6 and use for example more than one year in the discussion

      Response: We thank the referee for these helpful suggestions, which we have all followed.

      Legend figure 6 perhaps replace dots with circles page 10 include the R values from figure 6 in the description of results.

      Response: We are grateful to the referee for these helpful suggestions, but have not followed them since we do not feel that these changes would be real improvements.

      Page 12 of note perhaps this can be moved to the methods ?

      Response: This, and several other paragraphs of the Discussion, have now been moved to the Methods section.

      Supplementary figure 2 A can be seen, is something missing here?

      Response: An s was indeed missing : “A can be seen” corrected to “As can be seen “

      *

      Reviewer #3 (Significance (Required)):

      This paper describes a simple rapid field test for evaluating antibodies to the receptor binding domain of the spike of SARS CoV-2 using the Wuhan and delta variant. Whilst high income countries can provide booster doses and extensive testing (either lateral flow or RT-PCR based) and contact racing to control the waves of the pandemic, low income countries have had limited access to Covid vaccine and the extent of previous waves of the pandemic in the populations are unknown.

      This paper describes a robust and simple test for investigating human antibodies to SARS-CoV-2 which could be performed in resource limited settings providing a very useful tool for monitoring infection in the community and potentially for prioritising this scarce COVID-19 vaccines available.

      This study builds upon the work conducted on the HAT and has extensively studied and optimised the test so that it could be used globally. This paper provides a comprehensive protocol and has simplified the test to ensure it could be used in LMICs.

      This paper would be of great interest to a wide scientific audience who are interested in a rapid low-cost test to evaluate population based and vaccine induced immunity.

      Reviewer: serological assays for use in virology and vaccinology. Suitable competence to review the whole paper *

    1. Author Response:

      Evaluation Summary

      Challa and Ryu et al. systematically evaluated various combinations of ADP-ribose-binding modules to make sensors detecting poly(ADP-ribose). They developed and tested two indicator designs optimized for analyses in cell culture (dimerization-dependent GFP-based) or intact tissues (split Nano luciferase-based). Overall, with further experimental controls and quantification, this timely set of cell biology probes will be useful to study the biological functions of ADP-ribosylation in cultured cells and whole organisms.

      We appreciate the positive and encouraging words from the reviewer. We also appreciate the helpful comments, criticisms, and suggestions, which we have endeavored to address fully.

      Reviewer 1 (Public Review):

      While these tools are more sensitive than existing tools, it is unclear whether a dynamic range of 6-fold (GFP) and 3-fold (luciferase) provide sufficient sensitivity for properly understanding the PAR dynamics (which was thought to increase as much as 100-fold in DNA damage settings). In addition, it is unclear whether the fold increases in both fluorescence and luminescence linearly correlate with the traditional measures by western blot.

      We are pleased that the reviewer found our sensors to potentially useful. The reviewer provided a number of excellent comments and suggestions that have served as a useful guide for improving our paper. We have carefully considered all of the comments, insights, and suggestions from the reviewer and revised the manuscript accordingly. We think this has strengthened our conclusions and improved the paper considerably. We thank the reviewer for the careful and thorough review of our paper.

      Figure 1F indicates on the western blot that there was a precipitous drop of PARylation after 5 min, but the GFP signal indicated a linear drop. It will be important to quantify the signals on western blots and test how correlate their data with the GFP/luciferase data in scatter plots for their various sets of data. Would this system under-estimate the changes and be not sensitive enough to subtle changes that may be 1-2 fold measured by traditional means

      We agree with the reviewer that a comparison with existing PAR detection technologies will improve the manuscript. We now performed a comparative analysis of ELISA, Western blot, and immunofluorescence assays with live cell imaging using PAR-T GFP (Figures 6A, 6C, 6D). The results indicate that the detection range of PAR-T ddGFP is comparable to the established PAR detection assays. In addition, we also compared the live cell luciferase assays using PAR-T NanoLuc to Western blotting (Figure 6B) and found that these two assays are able to detect PAR changes at comparable levels. We would also like to emphasize that these sensors were developed to improve our ability to detect PAR changes in living cells and animals, which the existing techniques are not capable of doing.

      Similarly, how is their quantitation in Figure 2 compared with traditional immunofluorescence?

      We performed this comparison and observed that the changes in PAR levels as detected by live cell imaging using PAR-T ddGFP are comparable to the changes detected in immunofluorescence assays using the WWE-Fc reagent (Figure 6D and 6E).

      Lastly, for the luciferase signal in Figure 3B and C, the corresponding signal in western blots are missing. Therefore, it is difficult to estimate the background signal. If Niraparib, as in other figures, eliminates PAR signals on western blot, these data would indicate half of the basal signal are background, which is rather high. Having said that, tool development is an evolution process. These tools will provide a good foundation for future development. Therefore, understanding these limitations (dynamic range, quantitative sensitivity correlation, and background) will provide a better assessment of the utility of these new tools for investigating PAR biology.

      We appreciate the reviewer’s concern about the high background signal in Niraparibtreated samples. To answer this concern, we compared the dynamic range of PAR-T NanoLuc to Western blotting (Figure 6B) and found that the results from live cell luciferase assays using PAR-T NanoLuc are comparable to Western blotting using WWE-Fc. Of note, we were able to detect decreases in PAR levels with Niraparib using live cell luciferase assays using PAR-T NanoLuc, but not Western blotting. Based on these analyses, we can conclude that the changes in PAR levels at the basal level are very minimal, leading to only 50% decrease in PAR-T NanoLuc signal with Niraparib treatment (Figure 6B, Figure 5A-5C). Note that the decrease in PAR-T NanoLuc signal is greater when UV-treated cells were pre-treated with Niraparib, which is consistent with the results from Western blot analysis (Figure 5A).

      Reviewer 2 (Public Review):

      In this study, the authors attempted to extend their own work and that of others in the field in developing probes to detect the signaling molecule, poly-ADPribose (PAR) that can be used in the test tube, in cells and in tumor models. Major strengths include the development of a set of probes with data demonstrating utility and efficacy. Further, the authors show the assay to be useful in cell models and tumor models. Some weaknesses include what appears to be a high level of background in the assay. Further, regarding methods, the exact probes (sequences) being evaluated are not defined. This is one of several new PAR probes being developed over the last few years but may have widespread utility due to the quantitative nature of the bioluminescent assay.

      We thank the reviewer for these thoughtful and encouraging comments, as well as the interesting, thought-provoking, and constructive criticisms that have prompted us to dig deeper and provide more evidence to support our claims

      Reviewer 3 (Public review):

      The major drawback is that, while the authors demonstrated some applications of these PAR trackers (PAR-T) in both culture cells and in animals, the data of PAR-T ddGFP on cancer cells and the data of PAR-T Nano luciferase may not be sufficient to support the authors' claim that the new tool can detect spatial and temporal dynamics of PAR in cells and in animals. That said, the new tools can potentially expand the capability of cell biologists to visualize and study the PAR production process in both normal and disease states with improved sensitivity and tissue compatibility.

      We thank the reviewer for appreciating the potential utility of the PAR-T sensors, as well as the detailed and constructive criticisms that have prompted us to provide more evidence to support our claims. Addressing these comments has helped us to improve the paper.

      One of the major issues of this manuscript is the lack of time-course data for PAR-T luminescent sensors to demonstrate temporal monitoring of PAR levels in animals. If the binding of two split Nano Luciferase parts is irreversible, the application might be limited. However, according to the literature (Scientific Reports volume 11, Article number: 12535 (2021)), the split Nanoluc technology should be able to detect dynamic changes. Either way, a set of time-course data would be necessary. The authors need to provide evidence to support their statement "The high sensitivity and low signal to noise ratios of the PAR-Trackers described here enable spatial and temporal monitoring of PAR levels in cells and in animals.

      We agree with the reviewer’s comment that the original manuscript did not demonstrate that the PAR-T sensors can be used to detect spatio-temporal changes in PAR. To demonstrate that PAR-T NanoLuc can be used to detect time-dependent changes in PAR levels, we performed a time course of UV-mediated PARP-1 activation (Figure 5D). The results from this assay demonstrated that the dynamic changes in PAR in live cells, in response to DNA damage, can be recaptured using the PAR-T NanoLuc sensors. In addition, we also measured PARGi-mediated PAR accumulation in vivo in xenograft tumors (Figure 8 - figure supplement 1B-1D). We found that PAR can be detected readily in breast cancer cells when injected into mice. Upon treatment with PARGi, the luminescence from PAR-T NanoLuc increased significantly by 6 hours and then diminished by 24 hours. These data demonstrate that PAR-T NanoLuc can be used to track dynamic changes in PAR levels both in cells and in animals. While not in vivo, our work with spheroids also addresses this concern. See our response to the next comment below.

      Figure 2- figure supplement 2. For the detection of spatial dynamics of PAR signals in cancer spheroids, the authors did not provide sufficient evidence as only static images of different spheroids in different conditions were provided. And 2 out of 3 fields of view only include one spheroid. In addition, there is no time-course image data showing the spatial patterns of PAR in cancer cells are dynamic.

      We have now performed a quantitative analysis of multiple spheroids. As indicated in Figure 3B, we observed a significantly higher GFP fluorescence signal in spheroids derived Challa et al. (Kraus) – Rebuttal February 2, 2022 10 from PAR-T ddGFP expressing cells compared to those expressing ddGFP or those treated with Niraparib. To address the reviewer’s concern about using PAR-T ddGFP for spatio-temporal changes in cells, we included a video for live cell imaging of H2O2-mediated increase in PAR-T ddGFP (Figure 2 - figure supplement 2, video). We also developed an analysis approach that allows us to quantify the signals from the core of the spheroids separately from the periphery of the spheroids. We also performed a time course in 3D cancer spheroids to visualize the spatio-temporal changes in PAR levels (Figure 3C and 3D). The results from this experiment demonstrate that the PAR levels in cells at the core of the spheroids are relatively resistant to Niraparib treatment, as the PAR levels in cells at the core of the spheroid decrease at a lower rate when compared to PAR in the cells at the outer layer of the spheroid.

      In the caption of Figure 2 -figure supplement 1 (B and C), it states "Immunofluorescence assay to track PAR formation in response to H2O2.", but there is no evidence showing any antibodies were used there.

      We thank the reviewer for pointing out this error. It should have been written as live cell imaging, not immunofluorescence assay. We made this correction.

      It seems that Figure 3 B and C does not support the statement "we observed specific detection of firefly luciferase with D-Luciferin and NanoLuc with furimazine with no cross-reactivity" And it is unclear why the authors refer Fig. 3B and C after that statement as those data seems not supporting this claim. Similarly, the statement "Moreover, the luminescence of PAR-T Luc is only 30-fold lower than intact firefly luciferase." Was not supported by Fig. 3B. In fact, the differences between PAR-T Luc and intact firefly luciferase were ~1000 fold in vivo, judging from Fig 5B. It is also unclear which data of the construct was used to plot Fig. 3C.

      We thank the reviewer for this comment. We changed the scale bar to represent the true scale for the luminescence from Nano luciferase and Firefly luciferase. This indicates that the brightness of PAR-T NanoLuc is 30-fold lower than intact firefly luciferase. In Figure 3C, we plotted the ratio of PAR-T NanoLuc to firefly luciferase.

      Fig. 4C, it seems that Firefly luciferase was consistently brighter with PARGi, and I wonder if such difference is statistically significant. The authors did not perform a twoway ANOVA test for the firefly luciferase dataset.

      We included the statistics to indicate that these changes were not significant.

      The statement "Moreover, none of these sensors can detect PAR accumulation in vivo." seems to lack support. Have the authors proved that with evidence? I would recommend using the following statement instead: "Moreover, none of these sensors has yet demonstrated detection of PAR accumulation in vivo

      We made this change.

      For the in vivo experiment, it is unclear about the benefits of normalizing the PAR-T radiance to the Firefly luciferase since the signals from Firefly luciferase did not overlap well with that from the PAR-T nano luciferase, which may cause bigger variations.

      We thank the reviewer for raising this point. We normalize the luminescence from PAR-T NanoLuc to that from firefly luciferase to account for the variability in tumor size between the mice. We think this is an important control in the analysis. The luminescence from firefly luciferase represents the differences in tumor size between the mice. Hence, that signal is greater than the signal from PAR-T NanoLuc and is spread over a larger area.

      Judging from the data of Fig 3 supplement 1E, the signal intensity from the split firefly luciferase-based PAR-T sensors was ~10000 fold less than intact firefly luciferase, not ~1000 fold. It makes more sense to give up the split firefly luciferase for ~10000 fold differences since the signal intensity from the split nano luciferase was ~1000 fold less than intact firefly luciferase (Fig 5B).

      We noted the reviewers concern about the split firefly luciferase PAR-T. We agree with the reviewer that the split nano luciferase is brighter than the split firefly luciferase (Figure 4C and Figure 4 - figure supplement 1E). Although split nano luciferase is 1000-fold dimmer than the intact firefly luciferase in vivo (Figure 8B and Figure 8 - figure supplement 1A), this difference is only 30-fold in in vitro assays (Figure 4C). Hence, the comparison of sensors based on split firefly luciferase to split nano luciferase highlights our efforts to make a brighter sensor. Moreover, we included the split firefly luciferase data to compare the performance of WWE vs macrodomain in the development of the PAR-T NanoLuc sensor. Since firefly luciferase is frequently used for sensor development, we believe that it is important to include the results obtained from this sensor.

      Therefore, developing tools to measure ADPR dynamics in cells and in vivo is critical for better understating the various biological processes mediated by ADPR". "understating" should be "understanding".

      We corrected this error.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Estrach and colleagues seek to identify the ECM components that are key to regulating hair follicle stem cell (HFSC) activation using the highly-characterized mouse hair follicle as a model. They first use a targeted approach to examine key ECM components expressed by HFSC and find that Fibronectin (FN) is highly expressed. Further, wholemount analysis of the hair follicle reveals a meshwork of FN enveloping the hair follicle. They hypothesize that FN is a fundamental regulator of hair follicle (HF) cycling and then proceed to carry out longterm studies required to examine hair follicle cycling and knockout FN with two different HFSC Cre lines (Lrig1 and Krt19), as well as integrin coreceptor SLC3A2. They clearly show that absence of Fibronectin (FN) and SLC3A2 is detrimental to hair follicle stem cell activation and cycling (FN) and hair follicle identity (SLC3A2).

      Overall comments:

      The authors use the tail hair follicles as a model similarly to the highly-characterized, synchronous back skin hair follicles. However, the tail hair follicles are asynchronous (Braun et al. 2003, PMID: 12954714), thus reporting the age of the mouse from which the tail whole mounts came from is not sufficient to claim a HF cycle disorder - HF should be imaged in an unbiased manner and subsequently quantified for phase. The manuscript would greatly benefit from including more information in the figure legends, such as age of mice, number of mice and HF quantified, as well as what the error bars represent. Further, in samples where many HF were counted per mouse, these should be averaged and then the average per mouse displayed; super plots would be great to use here.

      Major comments:

      1. In Figure 1, the use of tail whole mount images indeed provides striking display of the fibronectin meshwork that envelops the hair follicle. However, addition of a marker of the regenerative phase (e.g. proliferation) and resting phase would provide more convincing evidence that this is the particular phase of the hair cycle that you have captured, especially given my overall comment regarding the asynchronous nature of the tail HF cycle.
      2. The authors show that FN is expressed in early-mid anagen and conclude that FN is a regenerative signal. This claim should be substantiated with FN staining on more time points across the HF cycle to substantiate the argument that it is a regeneration-specific signal, found only in the telogen-anagen transition.
      3. Lrig1-cre and K19-cre-mediated FN knockout result in HF that are thinner at D158 - this is not immediately apparent from histological sections. Can you use your thick sections to give better perspective?
      4. The authors measure the width of the infundibulum from lightsheet microscope images. It is a bit difficult to position whole tissues using this technique, and the images that are shown are not from the same perspective, and thus measurement of the width is not accurate from these images. I suggest either removing this analysis or using more comparable images. Further, if this is a true phenotype, can you speculate on what the thickened infundibulum might mean?
      5. The authors then show mislocalization of Lrig1+ cells to the infundibulum in absence of FN. Are other stem markers localized to the infundibulum or outside of the bulge? Further, what might the mislocalization of Lrig1+ cells might mean?
      6. Please explain your conclusion after Figure 3i and at the end of the manuscript that states that FN is required for stem cell anchorage. I think that a very plausible explanation is that FN is required for stem cell function and identity, but anchorage of the SC lacks sufficient evidence. Further, your only evidence to support the anchoring theory only comes from expression of Lrig1 in FN knockout and no other markers. Are they also mislocalized? Please either tone down this conclusion on SC anchorage or provide stainings for more SC markers to show mislocalization in absence of FN.
      7. In Figure 3l-o, you examine proliferation on the control vs the conditional deletion of FN in D30 and D158. However, in D30, these tissues are not at all directly comparable since one is obviously in anagen and the knockout in telogen. You must compare the anagen knockout sample, although this occurs a bit later than the control. Further, how was the infundibulum distinguished from the bulb in these control images?
      8. In Figure 3P, you carry out RT-qPCR on whole skin to detect HFSC markers. This should have been carried out on sorted epithelial cells as isolation of whole back skin introduces bias to the system in that the number of stem cells may artificially look different in skin that is in anagen vs skin that is in telogen as the anagen skin has a different proportion of SC to progenitor cells to dermal cells. This concern is also similar to point 9 - the control and FN knockout at D30 are not comparable given that they are in different phases of the hair cycle.
      9. Figure 4a these images need to be of the whole mouse - it is not possible to determine what we are looking at or where - there is not even a scale bar.
      10. After Figure 4, you argue that because fibronectin expression resolves from healing dermis is the reason that hair follicles do not form, and site Dekonick and Blanpain (PMID: 30602767) - however this review makes no mention of the dynamics of fibronectin in wound healing. Further, evidence from Driskell et al (2013, PMID: 30602767) would suggest that it is the fibroblast population that responds to the wound that determines whether HF regenerate. And further, very large wounds do regenerate HF (Ito et al PMID: 30602767). In addition, this would all be fibroblast-derived FN, as opposed to the current study which examines keratinocyte-derived FN. Please reconsider this argument.
      11. The authors knockout SLC3A2, an integrin coreceptor that is localized to the plasma membrane. They show a very similar, yet more severe phenotype to the Lrig1- and K19- mediated knockout of FN. Given the bidirectional communication that SLC3A2 is responsible for, can you reconcile whether the defects in the HF cycle and the HFSC are a result of outside-in or inside-out signaling? Further, is it possible that integrin function regulated by SLC3A2 is necessary for more than FN assembly? This could be especially relevant given that your targeted screen also identified Col17A1, which is well known to be required for HFSC function (Matsumura et al., PMID: 26912707)
      12. It is intriguing that in the absence of HFSC-derived SLC3A2 that no FN network forms. Is FN expressed or is the assembly perturbed in the absence of properly functioning integrins? The authors conclude that the signaling cascade flows from fibronectin to integrin to SLC3A2, but do not test where the FN phenotype arises in the SLC3A2 knockout - is it due to aberrant assembly of the FN meshwork or a change in transcriptional or translational levels?
      13. In the grafting assay in Supplemental Figure 3, keratinocytes undergo a de novo hair follicle morphogenesis - is Lrig1 expression maintained in order to carry out cre-mediated deletion? Further, the fibroblasts in this assay may adopt a wound-like phenotype, expressing FN, which you earlier claim to be required for hair follicle production in wounds. Yet in the absence of epithelial FN, no HF form. Can the authors reconcile this?

      Minor comments:

      1. In Figure 1a, the two populations are Lgr5+ and basal; please define what the basal population is in this experiment.
      2. Significative is not a word.
      3. In Figure 4 figure legend, there is reference to a grafting experiment but no experiment shown.
      4. The authors delete FN in Lrig1+ or K19+ cells starting D19 and harvest at D30, and conclude that the hair follicles do not enter anagen after the second telogen, can you please include the data supporting the statement that mutant HF did not reenter the hair cycle after D65.

      Significance

      The authors show for the first time that fibronectin is expressed during cutaneous homeostasis and that it is required for normal function of the hair follicle stem cells. This is significant conceptual advance for the field of skin biology because fibronectin is thought to only be present in wounds: derived first from infiltrating serum and second from fibroblasts to act as provisional dermal ECM to support epithelialization during wound-response, which is ultimately resolved upon the conclusion of wound healing (reviewed in: Singer and Clark, PMID: 10471461). Further, FN has also been characterized as an EMT marker during cancerous progression (Lamouille et al, PMID: 24556840). Estrach and colleagues show that fibronectin is actually expressed by hair follicle stem cell keratinocytes and then is assembled into a meshwork that envelops the hair follicle and is in fact necessary for hair follicle stem cell homeostasis. This work would be broadly interesting to the field of stem cell biology as well as those working on extra cellular matrix signaling. My field is epithelial stem cells and more specifically hair follicle development and cycling.

      Referee Cross-commenting

      I have no disagreement with any of the points raised by the other reviewers. In fact, we seem to agree on the majority of the concerns. This includes the use of the tail wholemount model, the use of Lrig1-cre, selection of timepoint vs phase of the hair cycle, the appropriateness of the link between Fibronectin and SLC3A2, and further significant issues related to display of data and their reproducibility. Further, all of the major comments raised need to be addressed in order to properly evaluate the conclusions that the authors make. In my opinion, none of the comments raised here are unreasonable.

    1. Author Response:

      Reviewer #1:

      The paper uses a microfluidic-based method of cell volume measurement to examine single cell volume dynamics during cell spreading and osmotic shocks. The paper successfully shows that the cell volume is largely maintained during cell spreading, but small volume changes depend on the rate of cell deformation during spreading, and cell ionic homeostasis. Specifically, the major conclusion that there is a mechano-osmotic coupling between cell shape and cell osmotic regulation, I think, is correct. Moreover, the observation that fast deforming cell has a larger volume change is informative.

      The authors examined a large number of conditions and variables. It's a paper rich in data and general insights. The detailed mathematical model, and specific conclusions regarding the roles of ion channels and cytoskeleton, I believe, could be improved with further considerations.

      We thank the referee for the nice comment on our work and for the detailed suggestions for improving it.

      Major points of consideration are below.

      1) It would be very helpful if there is a discussion or validation of the FXm method accuracy. During spreading, the cell volume change is at most 10%. Is the method sufficiently accurate to consider 5-10% change? Some discussion about this would be useful for the reader.

      This is an important point and we are sorry if it was not made clear in our initial manuscript. We have now made it more clear in the text (p. 4 and Figure S1E and S1F).

      The important point is that the absolute accuracy of the volume measure is indeed in the 5 to 10% range, but the relative precision (repeated measures on the same cell) is much higher, rather in the 1% range, as detailed below based on experimental measures.

      1) Accuracy of absolute volume measurements. The accuracy of the absolute measure of the volume depends on several parameters which can vary from one experiment to the other: the exact height of the chamber, and the biological variability form one batch of cell to another (we found that the distribution of volumes in a population of cultured cells depends strongly on the details of the culture – seeding density, substrate, etc... - which we normalized as much as possible to reduce this variability, as described in previous articles, e.g. see2). To estimate this variability overall, the simplest is to compare the average volume of the cell population in different experiments, carried out in different chambers and on different days.

      Graph showing the initial average volume of cells +/- STD for 7 spreading experiments and 27 osmotic shock experiments, expressed as a % deviation from the average volume over all the experiments.

      The average deviation is of 10.9 +/- 8%

      2) Precision of relative volume measurements. When the same cell is imaged several times in a time-lapse experiment, as it is spreading on a substrate, or as it is swelling or shrinking during an osmotic shock, most of the variability occurring from one experiment to another does not apply. To experimentally assess the precision of the measure, we performed high time resolution (one image every 30 ms) volume measurements of 44 spread cells during 9 s. During this period of time, the volume of the cell should not change significantly, thus giving the precision of the measure.

      Graph showing the coefficient of variation of the volume (STD/mean) for each individual cell (n=44) across the almost 300 frames of the movie. This shows that on average the precision of volume measurements for the same cell is 0.97±0.21%. In addition, if more precision was needed, averaging several consecutive measures can further reduce the noise, a method which is very commonly used but that we did not have to apply to our dataset.

      We have included these results in the revised manuscript, since they might help the reader to estimate what can be obtained from this method of volume measurement. We also point the reviewer to previous research articles using this method and showing both population averages and time-lapse data2–8 . Another validation of our volume measurement method comes from the relative volume changes in response to osmotic shock (Ponder’s relation) measured with FXm, which gave results very similar to the numbers of previously published studies. We actually performed these experiments to validate our method, since the results are not novel.

      2) The role of cell active contraction (myosin dynamics) is completely neglected. The membrane tether tension results, LatA and Y-compound results all indicate that there is a large influence of myosin contraction during cell spreading. I think most would not be surprised by this. But the model has no contribution from cortical/cytoskeletal active stress. The authors are correct that the osmotic pressure is much larger than hydraulic pressure, which is related to active contraction. But near steady state volume, the osmotic pressure difference must be equal to hydraulic pressure difference, as demanded by thermodynamics. Therefore, near equilibrium they must be close to each other in magnitude. During cell spreading, water dynamics is near equilibrium (given the magnitude of volume change), and therefore is it conceptually correct to neglect myosin active contraction? BTW, 1 solute model does not imply equal osmolarity between cytoplasm and external media. 1 solute model with active contraction was considered before, e.g., ref. 17 and Tao, et al, Biophys. J. 2015, and the steady state solution gives hydraulic pressure difference equal to osmotic pressure difference.

      This is an excellent point raised by the referee. We have two types of answers for this. First an answer from an experimental point of view, which shows that acto-myosin contractility does not seem to play a direct role in the control of the cell volume, at least in the cells we used here. Based on these results we then propose a theoretical reason why this is the case. It contrasts with the view proposed in the articles mentioned by the referee for a reason which is not coming from the physical principles, with which we fully agree, but from the actual numbers, available in the literature, of the amount of the various types of osmolytes inside the cell. We give these points in more details below and we hope they will convince the referee. We also now mention them explicitly in the main text of the article (p. 6-7, Figure S3F) and in the Supplementary file with the model.

      A. Experimental results

      To test the effect of acto-myosin contraction on cell volume, we performed two experiments:

      1) We measured the volume of same cell before and after treatment with the Rho kinase ROCK inhibitor Y-27632, which decreases cortical contractility. The experiment was performed on cells plated on poly-L-Lysin (PLL), like osmotic shock experiments, a substrate on which cells adhere, allowing the change of solution, but do not spread and remain rounded. This allowed us to evaluate the effect of the drug. Cells were plated on PLL-coated glass. The change of medium itself (with control medium) induced a change of volume of less than 2%, similar to control osmotic shock experiments (maybe due to shear stress). When the cells were treated with Y-27, the change of volume was similar to the change with the control medium (now commented in the text p. 6-7, Figure S3F). To make the analysis more complete, we distinguished the cells that remained round throughout the experiment from the cells which slightly spread, since spreading could have an effect on volume. Indeed we observed that treatment with Y-27 induced more cells to spread (Figure S3F), probably because the cortex was less tensed, allowing the adhesive forces on PLL to induce more spreading9. Nevertheless, the spreading remained rather slow and the volume change of cells treated or not with Y-27 was not significantly different. This shows that, in the absence of fast spreading induced by Y-27, the reduction of contractility per se does not have any effect on the cell volume.

      Graphs showing proportion of cells that spread during the experiments (left); average relative volume of round (middle) and spread (right) control (N=3, n=77) and Y-27 treated cells (N=4, N=297).

      2) To evaluate the impact of a reduction of contractility in the total absence of adhesion, we measured the average volume of control cells versus cells which have been pretreated with Y-27, plated on a non-adhesive substrate (PLL-PEG treatment). This experiment showed that the volume of the cells evolved similarly in time for both conditions, proving that contractility per se has no effect on the cell volume or cell growth, in the absence of spreading.

      Graphs showing average relative volume of control (N=5, n=354) and Y-27 (N=3, n=292) treated cells plated on PLL-PEG (left); distributions of initial volume for control (middle) and Y-27 treated cells (right) represented on the left graph.

      Taken together these results show that inhibition of contractility per se does not significantly affect cell volume. It thus confirms our interpretation of our results on cell spreading that reduction of contractility has an effect on cell volume, specifically in the context of cell spreading, primarily because it affects the spreading speed.

      B. Theoretical interpretation

      In accordance with our experiments, in our model, the effect of contractility is implicitly included in the model because it modulates the spreading dynamics, which is an input to the model, i.e. through the parameters tau_a and A_0.

      We do not include the effect of contractility directly in the water transport equation because our quantitative estimates support that the contribution of the hydrostatic pressure to the volume (or the volume change) is negligible in comparison to the osmotic pressure, and this even for small variation near the steady-state volume. The main important point is that the concentration of ions inside the cell is actually much lower than outside of the cell10,11. The difference is about 100 mM and corresponds mostly to nonionic small trapped osmolytes, such as metabolites12. The osmotic pressure corresponding to this is about 10^5 Pa. Taking the cortical tension to be of order of 1 mN/m and cell size to be about ten microns we get a hydrostatic pressure difference of about 100 Pa due to cortical tension. A significant change in cell volume, of the order observed during cell spreading (let’s consider a ten percent decrease) will increase the osmotic pressure of the trapped nonionic osmolytes by 10^4 Pa (their number in the cell remaining identical). For this osmotic pressure to be balanced by an increase in the hydrostatic pressure, the cortical tension would need to increase by a factor of 100, which we consider to be unrealistic. Therefore, we find it reasonable to ignore the contribution of the hydrostatic pressure difference in the water flux equation. It is also consistent with the novel experiments presented above which show that inhibition of cortical contractility changes the cells volume below what can be detected by our measures (thus likely at maximum in the 1% range). This is now explained in the main text and Supplementary file.

      Regarding our minimal model required to define cell volume, the reason why we believe one solute model is not sufficient is fundamentally the same as above: the concentration of trapped osmolytes is comparable to the total osmolarity, which means that their contribution to the total osmotic pressure cannot be discarded. Secondly, within the simplest one solute model, the pump and leak dynamics fixes in inner osmolytes concentration but does not involve the actual cell size. The most natural term that depends on the size is the Laplace pressure (inversely proportional to the cell size in a spherical cell model). But as discussed above, this term may only permit osmotic pressure differences of the order of 100 Pa, corresponding to an osmolytes concentration difference of the order of 0.1 mM. That is only a tiny fraction of the external medium osmolarity, which is about 300 mM. Such a model could thus only work for extremely fine tuning of the pump and leak rates to values with less than about 1% variation. Furthermore, such a model could not explain finite volume changes upon osmotic shocks without involving huge (100-fold) cell surface tension variations, as discussed above. For these reasons, we believe that the one-solute model is not appropriate to describe our experiments, and we feel that a trapped population of nonionic osmolytes is needed to balance the osmolarity difference created by the solute pump and leak.

      In the revised version of the manuscript, we have now added a section in Supplementary file and in the main text, explaining in more detail this approximation.

      3) The authors considered the role of Na, K, and Cl in the model, and used pharmacological inhibitors of NHE exchanger. I think this part of the experiments and model are somewhat weak. I am not sure the conclusions drawn are robust. First there are many ion channels/pumps in regulating Na, K and Cl. The most important of which is NaK exchanger. NHE also involves H, and this is not in the model. The ion flux expressions in the model are also problematic. The authors correctly includes voltage and concentration dependences, but used a constant active term S_i in SM eq. 3 for active pumping. I am not sure this is correct. Ion pump fluxes have been studied and proposed expressions based on experimental data exist. A study of Na, K, Cl dynamics, and membrane voltage on cell volume dynamics was published in Yellen et al, Biophys. J. 2018. In that paper, they used different expressions based on previously proposed flux expressions. It might be correct that in small concentration differences, their expressions can be linearized or approximated to achieve similar expressions as here. But this point should be considered more carefully.

      We thank the reviewer for this comment. Indeed, we have not well justified our use of the NHE inhibitor EIPA. Our aim was not to directly affect the major ion pumps involved in volume regulation (which would indeed rather be the Na+/K+ exchanger), because that would likely strongly impact the initial volume of the cell and not only the volume response to spreading, making the interpretation more difficult. We based our choice on previous publication, e.g.13, showing that EIPA inhibited the main fast volume changes previously reported for cultured cells: it was shown to inhibit volume loss in spreading cells, as well as mitotic cell swelling14,15. Using EIPA, we also found that, while the initial volume was only slightly affected, the volume loss was completely abolished even in fast spreading cells (Y-27 and EIPA combined treatment, Figure S5H). This clearly proves that the volume loss behavior can be abolished, without changing the speed of spreading, which was our main aim with this experiment.

      The most direct effect of inhibiting NHE exchangers is to change the cell pH16,17, which, given the low number of H protons in the cell (negligible contribution to cells osmotic pressure), cannot affect the cell volume directly. A well-studied mechanism through which proton transport can have indirect effect on cell volume is through the effect of pH on ion transporters or due to the coupling between NHE and HCO3/Cl exchanger. The latter case is well studied in the literature18. In brief, the flux of proton out of the cell through the NHE due to Na gradient leads to an outflux of HC03 and an influx of Cl. The change in Cl concentration will have an effect on the osmolarity and cell volume.

      We thus performed hyperosmotic shocks with this drug and we found that, as expected, it had no effect on the immediate volume change (the Ponder’s relation), but affected the rate of volume recovery (combined with cell growth). Overall, the cells treated with EIPA showed a faster volume increase, which is what is expected if active pumping rate is reduced. This is in contrast with the above mentioned mechanism of volume regulation which will to lead to a reduced volume recovery of EIPA treated cells. This leads us to conclude that there is potentially another effect of NHE perturbation. Changing the pH will have a large impact on the functioning of many other processes, in particular, it can have an effect on ion transport16. Overall, the cells treated with EIPA showed a faster volume increase, which is what is expected if active pumping rate is reduced.

      On the model side, the referee correctly points out that there are many ion transporters that are known to play a role in volume regulation which are not included in Eq. 3. In the revised manuscript we now start with a more general ion transport equation. We show that the main equation (Eq.1 - or Supplementary file Eq.13) relating volume change to tension is not affected by this generalization. This is because we consider only the linear relation between the small changes in volume and tension. We note that the generic description of the PML (Supplementary file Eqs.1-6) can be seen as general and does not require the pump and channel rates to be constant; both \Lambda_i and S_i can be a function of potential and ion concentration along with membrane tension. It is only later in the analysis that we do make the assumption that these parameters only depend on tension. This point is now made clear in the Supplementary file.

      There is a huge body of work both theoretical and experimental in which the effect of different ion transporters on cell volume is analyzed. The aim of this work is not to provide an analysis of cell volume and the effect of various co-transporters but is rather limited to understanding the coupling between cell spreading, surface tension and cell volume.

      To analytically estimate the sign of the mechano-osmotic coupling parameter alpha we use a minimal model. For this we indeed take the pumps and channels to be constant. As it is again a perturbative expansion around the steady state concentration, electric potential, and volume, the expression of alpha can be easily computed for a model with more general ion transporters. This generalization will come at the cost of additional parameters in the alpha expression. We decided to keep the simpler transport model, the goal of this estimate is merely to show that the sign of alpha is not a given and depends on relative values of parameters. Even for the simple model we present, the sign of alpha could be changed by varying parameters within reasonable ranges.

      Given these points, and the clarification of the reasons to use EIPA in our experiments, a full mechanistic explanation of the effect of this drug is beyond the scope of this work. Because of this we are not analyzing the effect of EIPA on the model parameter alpha in detail. We now clarified our interpretation of these results in the main text of the article.

      Reviewer #2:

      The work by Venkova et al. addresses the role of plasma membrane tension in cell volume regulation. The authors study how different processes that exert mechanical stress on cells affect cell volume regulation, including cell spreading, cell confinement and osmotic shock experiments. They use live cell imaging, FXm (cell volume) and AFM measurements and perform a comparative approach using different cell lines. As a key result the authors find that volume regulation is associated with cell spreading rate rather than absolute spreading area. Pharmacological assays further identified Arp2/3 and NHE1 as molecular regulators of volume loss during cell spreading. The authors present a modified mechano-osmotic pump and leak model (PLM) based on the assumption of a mechanosensitive regulation of ion flux that controls cell volume.

      This work presents interesting data and theoretical modelling that contribute new insight into the mechanisms of cell volume regulation.

      We thank the referee for the nice comments on our work. We really appreciate the effort (s)he made to help us improve our article, including the careful inspection of the figures. We think our work is much improved thanks to his/her input.

      Reviewer #3:

      The study by Venkova and co-workers studies the coupling between cell volume and the osmotic balance of the cell. Of course, a lot of work as already been done on this subject, but the main specific contribution of this work is to study the fast dynamics of volume changes after several types of perturbations (osmotic shocks, cell spreading, and cell compression). The combination of volume dynamics at very high time resolution, and the robust fits obtained from an adapted Pump and Leak Model (PLM) makes the article a step-forward in our understanding of how cell volume is regulated during cell deformations. The authors clearly show that:

      -The rate at which cell deforms directly impacts the volume change

      -Below a certain deformation rate (either by cell spreading or external compression), the cells adapt fast enough not to change their volume. The plot dV/dt vs dA/dt shows a clear proportionality relation.

      -The theoretical description of volume change dynamics with the extended PLM makes the overall conclusions very solid.

      Overall the paper is very well written, contains an impressive amount of quantitative data, comparing several cell types and physiological and artificial conditions.

      We thank the referee for the positive comment on our work.

      My main concern about this study is related to the role of membrane tension. In the PLM model, the coupling of cell osmosis to cell deformation is made through the membrane-tension dependent activity of ion channels. While the role of ion channels is extensively tested, it brings some surprising results. Moreover, the tension is measured only at fixed time points, and the comparison to theoretical predictions is not always as convincing as expected: when comparing fig 6I and 6J, I see that predictions shows that EIPA (+ or - Y27), CK-666 (+ or - Y27) and Y27 alone should have lower tension than in the control conditions, and this is clearly not the case in fig 6J. But I would not like to emphasize too much on those discrepancies, as the drugs in the real case must have broad effects that may not be directly comparable to the theory.

      We apologize for the mislabeling of the Figure 6I (now Figure 5I). This plot shows the theoretical estimate for the difference in tension (in the units of homeostatic tension) between the case when the cell loses its volume upon spreading (as observed in experiments) compared to the hypothetical situation when the cell does not lose volume upon spreading (alpha = 0). The positive value of the tension difference predicts that the cell tension would have been higher if the cell were not losing volume upon spreading, which is the case for the treatments with EIPA and CK-666 (+ Y27) and corresponds to what we found experimentally.

      It thus matches our experimental observations for drug treatments which reduce or abolish the volume loss during spreading and correspond to higher tether force only at short time.

      We have corrected the figure and figure legend and explained it better in the text.

      But I wonder if the authors would have a better time showing that the dynamics of tension are as predicted by theory in the first place, as comparing theoretical predictions with experiments using drugs with pleiotropic effects may be hazardous.

      Actually, a recent publication (https://doi.org/10.1101/2021.01.22.427801) shows that tension follows volume changes during osmotic shocks, and overall find the same dynamics of volume changes than in this manuscript. I am thus wondering if the authors could use the same technique than describe in this paper (FLIM of flipper probe) in order to study the dynamics of tension in their system, or at least refer to this paper in order to support their claim that tension is the coupling factor between volume and deformation.

      As was suggested by the referee, we tried to use the FLIPPER probe. We first tried to reproduce osmotic shock experiments adding to the HeLa cells 4% of PEG400 (+~200 mOsm) or 50% of H20 (-~170 mOsm) and measuring the average probe lifetime before and after the shock. We found significantly lower probe lifetime for hyperosmotic condition compared with control, and non-significant, but slightly higher lifetime for hypoosmotic shock. The magnitude of lifetime changes was comparable with the study cited by the reviewer, but the quality of our measures did not allow us to have a better resolution. Next we measured average lifetime for control and CK-666+Y-27 treated cells 30 min and 3 h after plating, because we have highest tether force values for CK-666+Y-27 at 30 min. We did not see a change in lifetime in control cells between 30 min and 3 h (which also did not see with the tether pulling). Cells treated with CK-666+Y-27 showed a slightly lower lifetime values than control cells, but both 30 min and 3 h after plating, which means that it did not correspond to the transient effect of fast spreading but probably rather to the effect of the drugs on the measure.

      Graph showing FLIPPER lifetime before and after osmotic shock for HeLa cells plated on PLL- coated substrate. Left: control (N=3, n=119) and hyperosmotic shock (N=3, n=115); Right: control (N=3, n=101) and hypoosmotic shock (N=3, n=80). p-value are obtained by t-test.

      Graph showing FLIPPER lifetime for control just after the plating on PLL-coated glass (the same data for control shown at the previous graph), 30 min (control: N=3, n=88; Y-27+CK-666: N=3, n=130) and 3 h (control: N=3, n=78; Y-27+CK-666: N=3, n=142) after plating on fibronectin-coated glass. p-value are obtained by t-test.

      Because the cell to cell variability might mask the trend of single cell changes in lifetime during spreading, we also tried to follow the lifetime of individual cells every 5 min along the spreading. Most illuminated cells did not spread, while cells in non-illuminated fields of view spread well, suggesting that even with an image every 5 minutes and the lowest possible illumination, the imaging was too toxic to follow cell spreading in time. We could obtain measures for a few cells, which did not show any particular trend, but their spreading was not normal. So we cannot really conclude much from these experiments.

      Graph showing FLIPPER lifetime changes for 3 individual cells plated on fibronectin-coated glass (shown in blue, magenta and green) and average lifetime of cells from non-illuminated field (cyan, n=7)

      Our conclusions are the following:

      1) We are able to visualize some change in the lifetime of the probe for osmotic shock experiments, similar to the published results, but with a rather large cell to cell variability.

      2) The spreading experiments comparing 30 minutes and 3 hours, in control or drug treated cells did not reproduce the results we observed with tether pulling, with a global effect of the drugs on the measures at both 30 min and 3 hours.

      3) Following single cells in time led to too much toxicity and prevented normal spreading.

      We think that this technology, which is still in its early developments, especially in terms of the microscope setting that has to be used (and we do not have it in our Institute, so we had to go on a platform in another institute with limited time to experiment), cannot be implemented in the frame of the revision of this article to provide reliable results. We thus consider that these experiments are for further development of the work and are out of the scope of this study. It would be very interesting to study in details the comparison between the oldest and more established method of tether pulling and the novel method of the FLIPPER probe, during cell spreading and in other contexts. To our knowledge this has never been done so far, so it is not in the frame of this study that we can do it. It is not clear from the literature that the two methods would measure the same thing in all conditions even if they might match in some.

    1. Authors Response:

      Reviewer #2 (Public Review):

      The authors use representational similarity analysis on a combination of behavioral similarity ratings and EEG responses to investigate the representation of actions. They specifically explore the role of visual, action-related, and social-affective features in explaining the similarity ratings and brain responses. They find that social-affective features best explain the similarity ratings, and that visual, action-related, and social-affective features each explain some of the variance in the EEG responses in a temporal progression (from visual to action-related to social-affective).

      The stimulus set is nicely constructed, broadly sampled from a large set of naturalistic stimuli to minimize correlations between features of interest. I'd like to acknowledge and appreciate the work that went into this in particular.

      The analyses of the behavioral similarity judgments are well executed and interesting. The subject exclusion criteria and catch trials for online workers are smart choices, and the authors have tested a good range of models drawn from different categories. I find the case that the authors make for social features as determinants of behavioral similarity ratings to be compelling.

      I have a few questions and requests for additional detail about the EEG analyses. I appreciate that the authors have provided the code they used for all the analyses, and I'm sure that the answers to many if not all of my questions are there, but I don't have access to a Matlab license to run the code. Also, since the code requires familiarity with not just Matlab but with specific libraries to understand, I think that more description of the analysis in the paper would be appropriate.

      Some more detail is needed in the description of the multivariate classifier analysis. The authors write (line 597-599): "The two pseudotrials were used to train and test the classifier separately at each timepoint, and multivariate noise normalization was performed using the covariance matrix of the training data (Guggenmos et al., 2018). "

      I suspect I'm missing something here, because as written this sounds as if there was only one trial on which to train the classifier, which does not seem compatible with SVM classification. If only one trial was used to train the classifier, that sounds more like nearest-neighbor classification (or something else). Alternatively, if all different pseudo-trial averages - each incorporating a different subset of trials - were used for training, then that would seem to mean that some of the training pseudo-trials contained information from trials that were also averaged into the pseudo-trials used for testing. I don't know if this was done (probably not) but if it was it would constitute contamination of the test set. I think this part of the methods needs more detail so we can evaluate it. How many trials were used to train and to test for each iteration?

      Thank you for raising this issue; we agree that our Methods section was unclear on this point. We used split-half cross-validation. There was one pseudotrial for training per condition (which was obtained by averaging trials). There was no contamination between the training and test sets, because the data was first divided into separate training and test sets, and only afterwards averaged into pseudotrials for classification. This procedure was repeated 10 times with different data splits to obtain more reliable estimates of the classification performance. We rewrote the corresponding section to make this clearer:

      “Split-half cross-validation was used to classify each pair of videos in each participant’s data. To do this, the single-trial data was divided into two halves for training and testing, whilst ensuring that each condition was represented equally. To improve SNR, we combined multiple trials corresponding to the same video into pseudotrials via averaging. The creation of pseudotrials was performed separately within the training and test sets. As each video was shown 10 times, this resulted in a maximum of 5 trials being averaged to create a pseudotrial. Multivariate noise normalization was performed using the covariance matrix of the training data (Guggenmos et al., 2018). Classification between all pairs of videos was performed separately for each time-point. […] The entire procedure, from dataset splitting to classification, was repeated 10 times with different data splits.”

      We also performed the decoding procedure with a higher number of cross-validation folds and found very similar results.

      I think a bit more detail is also necessary to clarify the features used for the classification. My understanding is that each timepoint was classified as one action vs each other action on the basis of all the electrodes in the EEG for a given temporal window. Is this correct? (I'm guessing / inferring more than a little here.)

      This is correct, and we agree that further clarification was needed in text. We have added this:

      “Classification between all pairs of videos was performed separately for each time-point. Data were sampled at 500 Hz and so each time point corresponded to non-overlapping 2 ms of data. Voltage values from all EEG channels were entered as features to the classification model.

      The entire procedure, from dataset splitting to classification, was repeated 10 times with different data splits. The average decoding accuracies between all pairs of videos were then used to generate a neural RDM at each time point for each participant. To generate the RDM, the dissimilarity between each pair of videos was determined by their decoding accuracy (increased accuracy representing increased dissimilarity at that time point).”

      It would be useful to know how many features constituted each feature space. For example, was motion energy reduced to one summary feature (total optic flow for whole sequence?) For "pixel value", is that luminance? (I suspect so, since hue is quantified separately, but I don't think this was specified).

      For motion energy, we used the magnitude of the optic flow, and calculated Euclidean distances between the vectorized magnitude maps rather than reducing it to summary features. We have included the dimensionality of each feature in Supplementary File 1b and we now refer to it in text:

      “These features were vectorized prior to computing Euclidean distances between them (see Supplementary File 1b for the dimensionality of each feature).”

      Pixel value was indeed the luminance, and we have clarified this in text.

      More broadly, I would appreciate a bit more discussion of the role of time in these analyses. Each clip unfolds over half a second, so what should we make of the temporal progression of RDM correlations? Are the social and affective features correlated with later responses because they take more time to compute (neurally speaking), or because they depend on longer temporal integration of information? These two are not even exactly mutually exclusive, and I realize that it may be difficult to say with certainty based on this data, but I think some discussion of this issue would be appropriate.

      This is a great point, although it is difficult to speculate based on this data. One way to get at this would be to examine how much social-affective processing relies on previously extracted features. Future work could look at the causality between early and later-stage EEG features (unfortunately our post-hoc attempts to address this via Granger-causal analysis were unsuccessful, likely due to insufficient SNR with our specific experimental design). Alternatively, this could be investigated in a follow-up experiment that varies how social information unfolds over time (e.g., images vs. videos or varying video duration). We now discuss this possibility in the manuscript:

      “Given the short duration of our videos and the relatively long timescale of neural feature processing, it is possible that social-affective features are the result of ongoing processing relying on temporal integration of the previously extracted features. However, more research is needed to understand how these temporal dynamics change with continuous visual input (e.g. a natural movie), and whether social-affective features rely on previously extracted information.”

    2. Reviewer #2 (Public Review):

      The authors use representational similarity analysis on a combination of behavioral similarity ratings and EEG responses to investigate the representation of actions. They specifically explore the role of visual, action-related, and social-affective features in explaining the similarity ratings and brain responses. They find that social-affective features best explain the similarity ratings, and that visual, action-related, and social-affective features each explain some of the variance in the EEG responses in a temporal progression (from visual to action-related to social-affective).

      The stimulus set is nicely constructed, broadly sampled from a large set of naturalistic stimuli to minimize correlations between features of interest. I'd like to acknowledge and appreciate the work that went into this in particular.

      The analyses of the behavioral similarity judgments are well executed and interesting. The subject exclusion criteria and catch trials for online workers are smart choices, and the authors have tested a good range of models drawn from different categories. I find the case that the authors make for social features as determinants of behavioral similarity ratings to be compelling.

      I have a few questions and requests for additional detail about the EEG analyses. I appreciate that the authors have provided the code they used for all the analyses, and I'm sure that the answers to many if not all of my questions are there, but I don't have access to a Matlab license to run the code. Also, since the code requires familiarity with not just Matlab but with specific libraries to understand, I think that more description of the analysis in the paper would be appropriate.

      Some more detail is needed in the description of the multivariate classifier analysis. The authors write (line 597-599): "The two pseudotrials were used to train and test the classifier separately at each timepoint, and multivariate noise normalization was performed using the covariance matrix of the training data (Guggenmos et al., 2018). "

      I suspect I'm missing something here, because as written this sounds as if there was only one trial on which to train the classifier, which does not seem compatible with SVM classification. If only one trial was used to train the classifier, that sounds more like nearest-neighbor classification (or something else). Alternatively, if all different pseudo-trial averages - each incorporating a different subset of trials - were used for training, then that would seem to mean that some of the training pseudo-trials contained information from trials that were also averaged into the pseudo-trials used for testing. I don't know if this was done (probably not) but if it was it would constitute contamination of the test set. I think this part of the methods needs more detail so we can evaluate it. How many trials were used to train and to test for each iteration?

      I think a bit more detail is also necessary to clarify the features used for the classification. My understanding is that each timepoint was classified as one action vs each other action on the basis of all the electrodes in the EEG for a given temporal window. Is this correct? (I'm guessing / inferring more than a little here.)

      It would be useful to know how many features constituted each feature space. For example, was motion energy reduced to one summary feature (total optic flow for whole sequence?) For "pixel value", is that luminance? (I suspect so, since hue is quantified separately, but I don't think this was specified).

      More broadly, I would appreciate a bit more discussion of the role of time in these analyses. Each clip unfolds over half a second, so what should we make of the temporal progression of RDM correlations? Are the social and affective features correlated with later responses because they take more time to compute (neurally speaking), or because they depend on longer temporal integration of information? These two are not even exactly mutually exclusive, and I realize that it may be difficult to say with certainty based on this data, but I think some discussion of this issue would be appropriate.

    1. To treat the clot postpartum, the doctors wanted to prescribe an FDA Category X drug to treat the clot -- it's so dangerous for pregnancy that women often choose to be sterilized before they take it. They told me that my clotting disorder means I should not have any more children, because of the risk that pregnancy poses to my health. I didn't want them to think I was religious for fear of what they'd think of me, but when I hinted at the question of using Natural Family Planning (a method for spacing children that the Church deems morally acceptable), they laughed. Someone with my condition had to use contraception, they said. There was no choice. Fatigued by the constant pain, overwhelmed by medical bills that were piling up by the thousands, I began to slide back away from this religion, tumbling down a slope that ended back in atheism. I hadn't minded changing in the sense of not using the f-word so much, but this was a whole different ballgame. To stick with the Church now would be to lose my life as I knew it, and to set out down an unfamiliar, frightening path. Not knowing what else to do, I went back to the basics of the way I'd been taught to work through problems since childhood. My dad, my parent from whom I got my religious views (or lack thereof), had not raised me to be an atheist as much as he'd raised me to seek truth fearlessly. "Never believe something because it's convenient or it makes you feel good," he'd always say. "Ask yourself: 'Is this true?'" And so I set everything else aside, and clung to the simple question: What is true? I quickly realized then that that was not in question, and hadn't been for a while. For weeks now, I had known on an intellectual level that I believed what the Church taught. What stalled me had not been a hesitation of whether or not it was true; it had been a hesitation of not wanting to sacrifice too much. I had no idea how things would work out. I thought there was a fair chance that this step would lead us to financial ruin, and may even take a serious toll on my health. But I decided, for the first time in a long time, to choose what was true instead of what was comfortable. Joe and I signed up to begin the formation process at our parish church. And, in the first statement of faith I'd ever made, I told my doctors that I would not use contraception, because I was Catholic. ### After that moment, a bunch of fortuitous events occurred that smoothed the way for us to become Catholic. A series of windfalls gave us the money we needed to manage our medical bills. After they got over their initial shock at encountering someone who wouldn't contracept, my doctors came up with creative solutions to keep me healthy.

      What "creative solutions" did her doctors come up with?

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We are grateful to the reviewers for their honest opinion regarding this work and plan to address the majority of the comments in a revised version either through new analysis or revision to the text, as we believe these will improve the manuscript by making some of the details clearer. There were few suggestions that will lead to substantiative changes to the findings. Here, we address the most salient critiques, the primary one being related to novelty.

      We respectfully disagree, as our detailed analysis of the DNA methylome in Octopus bimaculoides represents a significant advance to understanding how the epigenome is patterned in non-model invertebrates in general, and cephalopods in particular. We acknowledge that the previous report that the octopus methylome resembles the few other invertebrates where low DNA methylation has been found, the finding was part of a multi-organism study last year (de Mendoza et al., 2021), which lacked any detailed investigation. Our study provides the first in depth analysis on methylation patterning, the relationship with transposons and gene expression, and reports the finding of other key epigenetic marks in O. bimaculoides, and in other cephalopods.

      In short, we believe our study to be highly novel and that it represent the first analysis of this kind in cephalopods and one of the few existing in non-model invertebrate organisms. In addition, we identify the conservation of the histone code in cephalopods. While this may be expected, this is the first experimental evidence in this class and represents an important step forward to understand the epigenetic regulation of genes and transposons in invertebrates. Finally, we plan to provide an updated transcriptome annotation for O. bimaculoides that will be available for the scientific community as a new valuable resource. We believe these features will make this study highly cited.

      We believe that findings like ours will complement several recent studies that extend the epigenetics field out of the current narrow focus on model organisms to understand how epigenetic mechanisms function in diverse animals. This provides new insights regarding the epigenetic mechanism of gene regulation in an emerging invertebrate model.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer 1 raised the following points that we are planning to address:

      *- It is unclear why the authors did not use the original gene models of O. bimaculoides or tried to improve them. By only relying on adult tissue (but the relatively late hatchling stage), they would have omitted most developmentally expressed genes, that are incidentally also the ones that are subjected to extensive spatiotemporal gene regulation (which is also a problem to assess the role of methylation). I think more comparisons with existing gene models and how the newly generated stringtie models should be provided. *

      We agree that using as many tissues and developmental stages as possible will expand the octopus transcriptome.

      We plan to:

      • Add RNA-seq data from stage 15 embryos to improve this.
      • Compare the gene model used in the original version of the manuscript (Stringtie model to use in Trinotate for improving the annotation of the genes) to the existing annotation model and report on which has superior performance for annotating the * bimaculoides* transcriptome.
      • Extend the annotation of the transcriptome which we undertook in a focused fashion in the first iteration of this manuscript. Reviewer 2 raised the following points that we are planning to address:

      *- It is not exactly clear to me why the authors look for expression clusters in the first part of the manuscript? This information, while interesting, does not seem to be used in the methylation analysis. It is also somewhat contradictory because the authors first claim that, based on their GO-term enrichment analysis, that different expression clusters are associated with "complex regulatory mechanisms, potentially based in the epigenome". Yet at the end they conclude that, due to the global and tissue-overarching nature of methylation, this "argues against this epigenetic modification as a player in the dynamic regulation of gene expression". *

      We thank the reviewer for pointing out this issue and we plan to clarify the point through changing the text and additional analysis. Since we found that the methylation pattern was stable across tissues, and that it corresponded to gene expression levels regardless of tissues, we concluded that the methylation pattern is not likely relevant for the tissue-specific gene expression pattern reported in Figure 1.

      We plan to:

      • Ask whether there is a correlation between the gene clusters generated in Figure 1 and the DNA methylation patterns identified in Figure 4. *- At least for the trees that are shown in the main figures it would be great to show support values. *

      We thank the reviewer for this request.

      We plan to:

      • Add full Supplementary information regarding the support values in Supplemental Files for all the trees present in the main Figures. Reviewer 3 raised the following points that we are planning to address:

      *- It would be great to see more data on cephalopod TET and MBD structure. For example, it would be interesting to know whether octopus TETs have a CxxC domain or whether MBD proteins harbor functional 5mC - binding domains. *

      We agree that it would be of interest to examine the conservation of TET genes to expand upon the initial analysis by Planques et al 2021 showing that O. bimaculoides have one TET homolog, one MBD4 homolog and one MBD1/2/3 homolog. Detailed analysis of MBD4 protein has been already performed in de Mendoza et al. 2021 by using the protein sequence of O. vulgaris, as the MBD4 gene in the O. bimaculoides genome appears truncated.

      We plan to:

      • add the PFAM domain analysis for TET proteins This will be added as a new figure panel.
      • Update the text to include the reference to the identification of MBD4/MECP2 as the invertebrate homologs of vertebrate MBD4. *- Even though RRBS provides limited insight into DNA methylation patterns, the authors could have done more to explore read-level 5mC information. For example, by studying single reads, the authors could deduce the numbers of fully methylated, unmethylated or partially methylated reads. Such analyses might provide valuable insight into potentially different modes of epigenetic inheritance in different tissues i.e are there tissues that favor fully methylated or unmethylated stretches of DNA vs tissues that favor partial methylation? *

      We think this is a really interesting point. This has been partially addressed in a previous work (de Mendoza et al., 2021) which found limited to no partially methylated reads in whole-genome bisulfite sequencing from O. bimaculoides brain.

      We plan to:

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      Reviewer 1 raised the following points that we have already addressed:

      We addressed all the comments raised by this Reviewer by revising the text, fixing references, typos and improving clarity.

      Reviewer 2 raised the following points that we have already addressed:

      We addressed all the minor Comments raised by this Reviewer regarding spelling errors and Supplementary Figures.

      - The finding that less than 10% of all possible sites are methylated is surprising. I could not (easily) find statistics of RRBS experiment read mapping to the genome.

      We have now provided this data and new Supplemental Table 1 (refereed in the text as Table S1).

      *- It is very exciting to see methylation of gene bodies and some correlation to their expression levels, but the authors may need to include a disclaimer that the methylation of TEs may go undetected due to the gapness of the genome. In fact, the authors may try to map their data onto a somewhat closely related Octopus sinensis genome sequenced with long reads available at NCBI to confirm overall pattern. It is likely though that due the evolutionary distance only gene bodies will have mapping. *

      The thank the reviewer for this suggestion and we included a sentence in the Result session indicating that methylation of TEs may go undetected due to the poor annotation of the octopus genome.

      *- The statistical reasoning (and methodology) behind how clusters in Figures 1 and 4 were defined is unclear. In particular, in Figure 4, it seems that the authors had asked the program to give four clusters in total - why was this number chosen? It seems that using the same generic clustering approach as in Figure 1 may benefit or confirm the results in Figure 4. *

      We clarified the rationale in the Material and Methods session to describe the bioinformatic analysis. We will put the full code used in the manuscript in our GitHub page (https://github.com/SadlerEdepli-NYUAD/) to have a more comprehensive understanding of the Method used.

      Reviewer 3 raised the following points that we have already addressed:

      We addressed all the minor comments in the text and figures raised by this reviewer regarding typos and clarity.

      *- There is little info on the generated 5mC data. To bolster its value as a resource, the manuscript should have a link to the table describing RRBS metrics. This should include: non-conversion rates, numbers of sequenced and mapped reads, read length and other info that the authors deem useful. *

      We have now provided this data in a new Supplemental Table 1 (refereed in the text as Table S1).

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      Reviewer 1 raised the following points that we are not planning to address:

      *- The newly sequence RNA-seq samples are using a ribodepletion protocol (RiboZero) while the other ones are using a polyA selection. This might be a slight problem to compare them quantitatively. Actually in the Figure 1, all 4 newly generated samples group together in the hierarchical clustering. *

      We acknowledge the reviewer’s point here and agree that heterogeneity in library prep and batch is a common issue when comparing public available with newly generated datasets. This could account for the clustering of the Ribosomal RNA depleted (i.e. RiboZero) from polyA selected RNA libraries. While this could potentially introduce bias, we do not believe that it substantially alters any of the main findings or the interpretations of this data. Our purpose for carrying out the cluster analysis of transcriptomic data from multiple tissues was to identify distinct gene patterns that defined different tissue types. This was accomplished regardless of the potential confounding variable introduced by different library preparations. In addition, we used TMP which seems to help in the comparison across different samples when used for qualitative analysis such as PCA and cluster analysis (Zhao et al. 2020; DOI: http://www.rnajournal.org/cgi/doi/10.1261/rna.074922.120). Therefore, even if not ideal we think that this approach is still valuable.

      *- I am not so sure about the way the authors used z-score normalized logTPMs and applied hierarchical clusters, this most likely would not fully alleviate the impact of expression level on the outcome compared to more advanced form of normalization and clustering. *

      We agree with the reviewer that applying z-score or a logTPMs normalization would not fully resolve the technical variance in the direct comparison of libraries generataed with different RNA selection methods. We did not apply z-score on logTPMs but these 2 methods were applied separately: z-score on TPMs in Figure 1B to define the gene clusters and log2(TPM+1) in Figure 4E. We have clarified the text to reflect this.

      *- I am not convinced that differences in western blot for histone modification could really provide a clear insight into their regulatory role. *

      We agree with the reviewer that Western blotting for histone modifications does not provide deep insight into their regulatory role. However, this is the first description of these marks in any cephalopod, and we believe that reporting a finding from experimental evidence is important, even if the result is aligned with the existing paradigm. Moreover, the marked difference in levels of distinct histone marks across tissues supports the hypothesis that they play a regulatory role. We observed this in mice where difference abundance in western blot correspond to different abundance and enrichment also by ChIP-seq (Zhang et al., 2021 DOI: https://doi.org/10.1038/s41467-021-24466-1). Considering the limited tools available in this species, we still consider this an important finding.

      Reviewer 2 raised the following points that we are not planning to address:

      *- The finding that less than 10% of all possible sites are methylated is surprising. I could not (easily) find statistics of RRBS experiment read mapping to the genome. I also wonder how much the gap-richness of the genome may affect the overall methylation estimate. If assembly permits, would it make sense to limit the sampled sites to areas where no flanking gaps are present (and sufficient scaffold length is available, maybe excluding very short scaffolds)? *

      We added all the statistical values regarding the RRBS in a NEW Supplemental Table 1. We used a single base pair analysis approach (not tiling windows), so the data we extracted is not biased by the length of the scaffolds. This is confirmed by the fact that the DNA methylation value obtained in our RRBS data matches the findings observed in Whole Genome Bisulfite Sequencing (WGBS). Moreover, global DNA methylation values assessed by Slot blot analysis as a technique independent from genome assembly confirmed what observed with RRBS.

  2. Feb 2022
    1. Well-paid and well-treated non-tenure track faculty are more probably to have the necessary time and be granted the backing required to lecture a first-rate class (Edmonds, 2015Edmonds, D. (2015). More than half of college faculty are adjuncts: Should you care? Retrieved from http://www.forbes.com/sites/noodleeducation/2015/05/28/more-than-half-of-college-faculty-are-adjuncts-should-you-care/#6ff634541d9b, May 28 [Google Scholar]).

      I think that the is possible and we could see it at more progressive schools. It is just like the banking industry, one bank got rid of overdraft fees and slowly we are seeing major banks do away with the fees as well. All it takes is for one brave institution and another to change the landscape of things.

    1. Fast Car
      • Fast Car is a song written and performed by American folk rock singer Tracy Chapman in 1986.
      • As quoted by Tracy Chapman in an interview in 1986 with Canadian radio station CIDR, “I think that it was a song about my parents…..a new life together and my mother was anxious to leave home. [They]tried to make a life for themselves and it was very difficult going….my mother didn’t have a high school diploma and my father was a few years older….hard for him to create the ideal life that he wanted…in a sense I think they came together thinking that they would have a better chance of making it”
      • The song became popular around the 1980s after Tracy Chapman performed it at the 70th birthday Nelson Mandela tribute.
      • The song is narrative, and tells the story from the point of view of a woman who wants to get out of her hometown to leave her problems behind, such as taking care of her alcoholic father, after her mother left the family. She finds the addressee, a lover with the titular fast car, and sees it as an opportunity to get out of her hometown at last and start life anew. However, the reality is far from that, as the song continues, and reveals how even after moving to the city, she is still working a dead-end job and living in a homeless shelter, while her lover is unemployed. It parallels her own parents’ relationship, as her lover becomes a father and an alcoholic, and their relationship grows rocky. Eventually she decides that she does not need him anymore, and tells him to either clean up his act or leave.
      • Hope and resilience are major themes in this song, as the persona continues to persevere despite the many challenges that are thrown at her in life, from her upbringing to her newfound problems.However, she remains optimistic and presses on, and we find hope in how she eventually achieves independence and supposedly leaves her problems behind.
      • One can interpret the title ‘Fast Car’ as a symbol of escapism. However, as the song goes on, it is revealed that getting out of a situation is far from easy and even if you start anew, there will always be problems and challenges in life to overcome. Additionally, sometimes, you will find that people who are close to you make irresponsible and selfish decisions which affect you .In order to carry on a meaningful and purposeful life, you may have to give up on them. To prevent this from happening, it is essential to become independent and not place all your faith in someone else, just as the persona has done in Fast Car.
    1. As a result, teachers who choose a variety of assistive technologies for their classrooms may want to ensure that students and parents are fully aware of any privacy or security issues that could arise.

      I think that this is so important. Being aware of the different privacy or security issues that may arise when using different technologies is something that everyone should be aware of in order to be safe. We have learned that not all tools are safe and there are many times where passwords and personal information can be stolen. Also, students may not actually fully research the privacy of a website so having a parent also research the tool/site can help ensure that all information will be safe.

    1. Even if you as an individual user may be okay with sharing your data for “free” tools, when you assign a tool to students you are asking them to share their data, whether they want to or not.

      It is so easy to click "accept" online which may be sharing your information and we often don't even think twice and should be paying closer attention to this.

    2. However, it is important to note, even when there is an accessibility statement or VPAT, these are often self-reported by the company and can be limited by the knowledge of accessibility of the person(s) creating it.

      I think this is a huge "however" when looking at how helpful an accessibility statement is. If there are huge gaps then the statement is not as helpful but also if it is quite extensive you have to remember that it is still the company writing it up, and they may be lacking in knowledge. I think one day regulations will be put in place for the VPAT but that is not where we are yet.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Formation of tubes in a developing organism may arise from the closure of a pre-existing polarized epithelium or from de novo polarization and cavity formation in group of dividing cells. The concept of apical membrane initiation site (AMIS) refers to the fact that polarity proteins as PAR3 accumulate at a point where the apical membrane will be created. This accumulation occurs as early as the two cell stage. Previous reports have demonstrated the importance of the division process in defining this AMIS, however, in the present work the authors in vitro 3D cultures of mESC to report a mitosis independent mechanism that creates an AMIS, induces the polarization of groups of two or more cells, and permits the formation of a central cavity. The report shows that the mechanism is fully dependent on the polarized accumulation of E-cadherin at the cell membrane in contact with the other cells. Moreover, the mechanism does not require mitosis or interaction with the extracellular matrix.

      Major comments:

      The main objective of the work is to demonstrate that AMIS creation and cavity formation can be mitosis independent and that it is dependent on the accumulation of E-cadherin at the midline between two cells in contact. To demonstrate these objectives, the authors perform 3D cultures of mESC. To rule out the requirement of mitosis the authors perform cultures that are treated with mitomycin C and the purify single cells that are cultured again. The authors show time-laps experiments demonstrating that individual cells that do not dived create an AMIS when they contact one to each other. With this cultures they demonstrate that the process does not require an interaction with ECM (provided by the matrigel) but requires E-cadherin, to demonstrate, that they use E-cadherin KO cells (the same line where E-cadherin has been deleted). The work is well written and the objectives very clear. The technology used and the experiments done are adequate and sufficient to accomplish the proposed objectives and the results obtained clearly support the conclusions reached. The methods are well explained and transparent to be reproduced elsewhere and the number of replicas and the statistical methods applied seem corrects to me, although I am just a biologist, not a mathematician. Although the objectives of the work, that are: to demonstrate that AMIS formation can be independent of mitosis and that AMIS requires E-cadherin, there are parts of the results that could be farther studied or at least discussed more thoroughly. Firstly, the authors show that in non-dividing cells an AMIS is formed at the first contact site between the two cells, they also show that in the absence of E-cadherin the cell maintains the polarization of centrioles and Golgi apparatus, in spite that no AMIS is formed, this indicates that the deposition of E-cadherin at the midline membrane is part of a more global polarization event that most likely is initiated by the a directional activity of the Golgi apparatus that may direct the delivery of mature E-cadherin in that particular direction, initiating or maintaining the basis for an AMIS, since recent work (already cited in the manuscript) has demonstrated the importance of cadherin maturation for polarity establishment and maintenance (Herrera et al, 2021), the actual results should be farther discussed in this context. Secondly, it was previously shown that in different epithelia, upon cell-cell contact, the aPKC complex (that includes Par3 and Par6) is recruited early to the contact site where with the participation of Cdc42, aPKC is activated generating an initial spot-like adherent junction (AJs) (Suzuki et al., 2002). In that case it is thought to be mediated by a direct interaction between the first PDZ domain of PAR-3 and the C-terminal PDZdomain-binding sequences of immunoglobulin-like cell adhesion molecules: JAM-1 and nectin-1/3 (Fig. 3) (Ebnet et al., 2001; Itoh et al., 2001; Takekuni et al., 2003). Thus it wold be interesting to know if AMIS formation in absence of cell division depends on JAM-1 or nectin and whether JAM-/Nectin signalling is sufficient to initiate the Golgi and centriole polarization and which is the mechanism governing it.

      Minor comments:

      As I mentioned before, the paper is well presented and very clear, yes it is simple, but simple is always better, no complicated graphics or letterings, thank you. Although in my opinion the work is very well written, I have to admit that I am not qualified to evaluate the literary style of the work since English is not my mother tongue, also I have not reviewed typographical errors since I think that is the work of the editorial, not of scientific reviewers. Please include the full reference of all the antibodies used, including the company and not just the catalog number

      Quoted references:

      Ebnet, K., Suzuki, A., Horikoshi, Y., Hirose, T., Meyer Zu Brickwedde, M. K., Ohno, S. and Vestweber, D. (2001). The cell polarity protein ASIP/PAR-3 directly associates with junctional adhesion molecule (JAM). EMBO J. 20, 3738-3748.

      Itoh, M., Sasaki, H., Furuse, M., Ozaki, H., Kita, T. and Tsukita, S. (2001). Junctional adhesion molecule (JAM) binds to PAR-3: a possible mechanism for the recruitment of PAR-3 to tight junctions. J. Cell Biol. 154, 491-497.

      Takekuni, K., Ikeda, W., Fujito, T., Morimoto, K., Takeuchi, M., Monden, M. and Takai, Y. (2003). Direct binding of cell polarity protein PAR-3 to cell-cell adhesion molecule nectin at neuroepithelial cells of developing mouse. J. Biol. Chem. 278, 5497-5500

      Suzuki, A., Ishiyama, C., Hashiba, K., Shimizu, M., Ebnet, K. and Ohno, S. (2002). aPKC kinase activity is required for the asymmetric differentiation of the premature junctional complex during epithelial cell polarization. J. Cell Sci. 115, 3565-3573.

      Significance

      The paper describes for the first time that contrary to what was previously believed an AMIS can be generated without a cell division. This is very important because it opens the possibility that the mechanisms that originate the biologic cavities are in fact not really how we believed. The work is of interest of all cell biology scientists, specially working in developmental biology, cancer research.

      My particular field of expertise is cell biology and signaling, always applied to particular events as nervous system development or cancer, in particular I am interested in Wnt/b-catenin and Sonic Hedgehog pathways.

    1. I think that we both know the way that this story ends

      The speaker foreshadows about how their relationship would undoubtedly end. The word ‘story’ symbolises their relationship, which both may have a start and an end. The different parts of a story can resemble the life in a relationship with someone you love. The ‘introduction’ resembles how they meet, the ‘rising action’ resemble how they have taken things further and started to date, the ‘climax’ can resemble the fights or arguments that start to take place inferred from the previous line. And it carries on until the ‘conclusion’, where they separate. From this, I can reflect on the theme of relationship as how it may only last for a brief moment, and shatter, which can hurt so much so it is hard to let go. I feel proud for the speaker even though he fails repetitively to let go but tries his best to do so

    1. Reviewer #1 (Public Review):

      In this manuscript, Yang et al. trained monkeys to play the classic video game Pac-Man and fit their behavior with a hierarchical decision making model. Adapting a complex behavior paradigm, like Pac-Man, in the testing of NHP is novel. The task was well-designed to help the monkeys understand the task elements step-by-step, which was confirmed by the monkeys' behavior. The authors reported that the monkeys adopted different strategies in different situations, and their decisions can be described by the model. The model predicted their behavior with over 90% accuracy for both monkeys. Hence, the conclusions are mostly supported by the data. As the authors claimed, the model can help quantify the complex behavior paradigm, providing a new approach to understanding advanced cognition in non-human primates. However, several aspects deserve clarification or modification.

      1. The results showed that the monkeys adopted different strategies in different situations, which is also well described by the model. However, the authors haven't tested whether the strategy was optimal in a given situation.

      Our approach to analyze monkeys’ behavior is not based on optimality. Instead, we centered around the strategies and showed that they described the monkeys’ behavior well. The model and its fitting process does not assume the monkeys were optimizing for something. Nevertheless, the fitting results suggested that the strategies that the monkeys chose were rational, which suggests validity of our model. As we have pointed out above, optimality is hard to define in such a complex game. In particular, most of the game is about collecting pellets, strategies that are only used in a small portion of the game can be ignored when searching for optimal solutions. We feel that further analyses on the issue of optimality would dilute the center message of the paper and choose not to include them here.

      According to the results, the monkeys didn't always perform the task in an optimal way, as well. Most of the time, the monkeys didn't actively adopt strategies in a long-term view. They were "passively" foraging in the task: chasing benefit and avoiding harm when they were approached. This "benefit-tending, harm-avoiding" instinct belongs to most of the creatures in the world, even in single-cell organisms. When a Paramecium is placed in a complex environment with multiple attractants and repellents, it may also behave dynamically by adopting a linear combination of basic tending/avoiding strategies, although in a simpler way. In other words, the monkeys were responding to the change of environment but not actively optimizing their strategy to achieve larger benefits with fewer efforts. The only exception is the suicides. Monkeys were proactively taking short-term harms to achieve large benefits in the future.

      One possible reason is that the monkeys didn't have enough pressure to optimize their choices since they will eventually get all the rewards no matter how many attempts they make. The only variable is the ghosts. Most of the time, the monkeys didn't really choose between different targets/ strategies. They were making choices between the chasing order of the options, but not the options themselves. It is similar to asking a monkey to choose either to eat a piece of grape or cucumber first, but not to choose one and give up the other one. A possible way to avoid this is to stop the game once the ghost catches the Pac-Man or limit each game's time.

      The game is designed to force the players to make decisions quickly to clear the pellets, otherwise the ghosts would catch Pac-Man. Even in the monkey version of the game where the monkeys always get another chance, Pac-Man deaths lead to long delays with no rewards. They will not be able to complete the game if they do not actively plan their route, especially in the late stage when they must reach the scarcely placed dots while escaping from the ghosts. In addition, we provided additional rewards when a maze is cleared in fewer rounds (20 drops if in 1 to 3 rounds; 10 drops if in 4 to 5 rounds; and 5 drops if in more than 5 rounds), which added motivation for the monkeys to complete a game quickly.

      The monkeys’ behavior also suggested that they did not just adopt a passive strategy. Our analyses of the planned attack and suicide behavior clearly demonstrated that the monkeys actively made plans to change the game into more desirable states. Such behavior cannot be explained with a passive foraging strategy.

      2. It is well known that the value of an element is discounted by time and distance. However, in the model, the authors didn't consider it. A relevant problem will be the utility of the bonus elements, including the fruits and scared ghosts. Their utilities were affected not only by their value defined by the authors but also by effects, including their novelty and sense of achievement when they were captured, as the ghosts attracted relatively much more attention than the other elements (considering the number is 2 for them, see in figure 3E).

      These are good points, and our strategies could be built with more complexity to account for other potential factors. However, we focused our investigation on how to account for monkeys’ behavior with a set of strategies. A set of simple strategies with a small number of parameters would make a strong argument.

      Using a complex game such as Pac-Man allows us to investigate all of these interesting cognitive processes. We can certainly look at them in the future.

      3. The strategies are not independent. They are somehow correlated to each other. It may result in, in some conditions, false alarming of more strategies than the real, as shown in figure 2A.

      We have computed the Pearson correlations between the action sequences chosen with each basis strategy within each coarse-grained segment determined by the two-pass fitting procedure. As a control, we computed the correlation between each basis strategy and a random strategy, which generates action randomly, as a baseline. Most strategy pairs' correlations were lower than the random baseline. The results were now included in Supplementary (Appendix Figure 3).

      Sometimes two strategies may give exactly the same action sequence in a game segment. To deal with this problem, now we include an extra step when we fit the model to the behavior, which was described in Methods:

      “To ensure that the fitted weights are unique (Buja et al., 1989) in each time window, we combine utilities of any strategies that give exactly the same action sequence and reduce multiple strategy terms (e.g., local and energizer) to one hybrid strategy (e.g., local+energizer). After MLE fitting, we divide the fitted weight for this hybrid strategy equally among the strategies that give the same actions in the time segments.“

      Moreover, as the reviewer correctly reasoned, correlations between the strategies would yield possibly more strategies. However, our finding is that the monkeys were using a single strategy most of the time. This possible false alarm would go against our claim. Our conclusions stand despite the strategy correlations.

      It is hard to believe that a monkey can maintain several strategies simultaneously since it is out of our working memory/attention capacity.

      Exactly, and we are among the first to quantitatively demonstrate that the monkeys’ mostly relied on single strategies to play the game.

      Reviewer #2 (Public Review):

      In this intriguing paper, Yang et al. examine the behaviors of two rhesus monkeys playing a modified version of the well-known Pac-Man video game. The game poses an interesting challenge, since it requires flexible, context-dependent decisions in an environment with adversaries that change in real time. Using a modeling framework in which simple "basic" strategies are ensembled in a time-dependent fashion, the authors show that the animals' choices follow some sensible rules, including some counterintuitive strategies (running into ghosts for a teleport when most remaining pellets are far away).

      I like the motivation and findings of this study, which are likely to be interesting to many researchers in decision neuroscience and animal behavior. Many of the conclusions seem reasonable, and the results are detailed clearly. The key weakness of the paper is that it is primarily descriptive: it's hard to tell what new generalizable knowledge we take away from this model or these particular findings. In some ways, the paper reads as a promissory note for future studies (neural or behavioral or both) that might make use of this paradigm.

      I have two broad concerns, one mostly technical, one conceptual:

      First, the modeling framework, while adequate, is a bit ad hoc and seems to rely on many decisions that are specific to exactly this task. While I like the idea of modeling monkeys' choices using ensembling, the particular approach taken to segment time and the two-pass strategy for smoothing ensemble weights is only one of many possible approaches, and these decisions aren't particularly well-motivated. They appear to be reasonable and successful, but there is not much in the paper to connect them with better-known approaches in reinforcement learning (or, perhaps surprisingly, hierarchical reinforcement learning) that could link this work to other modeling approaches. In some ways, however, this is a question of taste, and nothing here is unreasonable.

      Thanks for the suggestion. In the new revision, we include a linear approximate reinforcement learning model (LARL) (Sutton, 1988; Tsitsiklis & Van Roy, 1997). The LARL model shared the same structure with a standard Q-learning algorithm but used the monkeys’ actual joystick movements as the fitting target. The model, although computationally more complex than the hierarchical mode, achieves a worse fitting performance.

      Second, there is an elision here of the distinction between how one models monkeys' behavior and what monkeys can be said to be "doing." That is, a model may be successful at making predictions while not being in any way a good description of the underlying cognitive or neuroscientific operations. More concretely: when we claim that a particular model of behavior is what agents "actually do," what we are usually saying is that (a) novel predictions from this model are born out by the data in ways that predictions from competing models are not (b) this model gives a better quantitative account of existing data than competitors. Since the present study is not designed as a test of the ensembling model (a), then it needs to demonstrate better quantitative predictions (b).

      We concede to the point that our model, while fitting to the behavior well, does not directly prove that the monkeys actually solved the task in this way. The eye movement and pupil dilation analyses partly addressed this issue, as their results were consistent with what one would expect from the model. We also hope future recording experiments will provide neural evidence to support the model.

      But the baselines used in this study are both limited and weak. A model crafted by the authors to use only a single, fixed ensemble strategy correctly predicts 80% of choices, while the model with time-varying ensembling predicts roughly 90%. This is a clear improvement and some evidence that *if* the animals are ensembling strategies, they are changing the ensemble weights in time. But there is little here in the way of non-ensemble competitors. What about a standard Q-learning model with an inferred reward function (that is, trained to replicate monkeys' data, not optimal performance). The perceptron baseline as detailed seems very poor as a control given how shallow it is. That is, I'm not convinced that the authors have successfully ruled out "flat" models as explanations of this behavior, only found that an ensembled model offers a reasonable explanation.

      We hope the new LARL model provides a better baseline control as a flat model. It performs better than the perceptron, yet much worse than our hierarchical model. Yet, we must point out that any hierarchical models can be matched in performance with a flat model in theory (Ribas-Fernandes et al., 2011). The advantage of hierarchical models mainly lies in their smaller computational cost for efficient planning. Even in a much simpler task such as a four-room navigation task, a hierarchical model can plan much faster than a flat model, especially under conditions with limited working memory (M. Botvinick & Weinstein, 2014). Our Pac-Man task contains an extensive feature space while requiring real-time decision-making. The result is that a reasonably performing flat model would go beyond the limits of the cognitive resources available in the brain. Even for a complex flat model such as Deep Q-Network (it can be considered to be similar a flat model since it does not explicitly plan with temporal extended strategies (Mnih et al., 2015)), the game performance is much worse than a hierarchical model (Van Seijen et al., 2017). The performance of the monkeys was unlikely to be achieved with a flat model. In addition, we trained the monkeys by introducing the game concepts gradually, with each training stage focusing on certain game aspects. The training procedure may have encouraged the monkeys to generalize the skills acquired in the early stages and use them as the basis strategies in the later training stages when the monkeys faced the complete version of the Pac-Man task.

      Reviewer #3 (Public Review):

      Yang and colleagues present a tour de force paper demonstrating non-human primates playing a full on pac-man video game. The authors reason that using a highly complex, yet semi controlled video game allows for the analysis of heuristic strategies in an animal model. The authors perform a set of well motivated computational modeling approaches to demonstrate the utility of the experimental model.

      First, I would like to congratulate the authors on training non-human primates to perform such a complex and demanding task and demonstrating that NHP perform this task well. From previous papers we know that even complex AI systems have difficulty with this task and extrapolating from my own failings in playing pac-man it is a difficult game to play.

      Overall the analysis approach used in the paper is extremely well reasoned and executed but what I am missing (and I must add is not needed for the paper to be impactful on its own) is a more exhaustive model search. The deduction the authors follow is logically sound but builds very much on assumptions of the basic strategy stratification performed first. This means that part of the hierarchical aspect of the behavioral strategies used can be attributed to the heuristic stratification nature of the approach. I am not trying to imply that I do not think that the behavior is hierarchically organized but I am implying that there is a missed opportunity to characterize that hierchical'ness (maybe in a graph theoretical way, think Dasgupta scores) further.

      All in all this paper is wonderful. Congratulations to the authors.

      We thank the reviewer for the encouraging comments. We have included a new flat model in the new revision for comparison against our hierarchical model and discussed other experimental evidence to support our claim.

    1. To create trails  When we are studying a text we need to take the time to understand more than just the storyline. During your second reading, any comments made during the first reading (marginal comments or summaries) will quickly give you the gist of your first reading, so that you can take advantage of your second.

      While multiple readings of a text in antiquity may have been rarer, due to the cheap proliferation of books, one can more easily "blaze a trail" through their reading to make it easier or quicker to rebuild context on subsequent readings.


      Look at history of reading to see which books would have been more likely re-read, particularly outside of one's primary "area" of expertise.

      Link to the trails mentioned by Vannevar Bush in As We May Think.

    1. Author Response:

      Reviewer #1:

      In this work Warneford-Thomson et al. developed an approach for surveillance screening for SARS-CoV-2, which involves the isothermic amplification of a region of the SARS-CoV-2 nucleocapsid gene using RT-LAMP, followed by detection with deep sequencing. High-throughput and cost effectiveness is achieved by two sets of barcodes that allow up to about 37,000 samples to be combined into one deep sequencing run. Moreover, the authors demonstrate they can do the detection from saliva collected on paper, which should make sample collection easier.

      The main strength of the work lies in the technical aspects, including setting up multiple controls such as a detection of a human gene, and multiplexing with detection of the influenza virus.

      The main weakness is that there are multiple other papers either published or archived that use RT-LAMP for SARS-CoV-2 detection, deep sequencing for SARS-CoV-2 detection, or both. These are cited in the current work, which is very well written and presented. Whether this method is better than the others which have the same aim of developing cost-effective and high-throughput detection is not conclusively demonstrated as only 8 clinical saliva samples are examined.

      We do not wish to claim that our method is better than the others. We think it has advantages and disadvantages and certainly it should be further optimized before scaling it up to population level. We have added these considerations to the text (lines 376–80).

      Furthermore, the requirement for deep sequencing and batching many samples for cost-effectiveness will, in most situations, greatly increase turn-around time. This will make surveillance much less effective, since by the time results are fed back, the asymptomatically infected individual would have had more opportunity to transmit the infection to others.

      We argue that time from sample to result is a mostly a function of logistics and not of the method. With proper set ups the time from sample collection to results could be < 16 hours, which would be compatible with population-level surveillance. We added these considerations to the text.

      However, the deep sequencing step may be very useful for surveillance of circulating SARS-CoV-2 spike sequences to detect emerging variants within a population, provided this method can be modified to do it.

      We agree and we mention this possibility in the discussion.

      Reviewer #2:

      In 'COV-ID: A LAMP sequencing approach for high-throughput co-detection of SARS-CoV-2 and influenza virus in human saliva', Warneford-Thomson et al. present a novel methodology to perform large numbers of COVID-19 tests in parallel. Their approach takes unprocessed saliva and requires only a small number of experimental steps before the results are sequenced overnight to generate many thousands of results. This straightforward experimental design should allow the protocol to be expanded to a number of settings where population-level monitoring is required in order to contain outbreaks and reduce transmission. In this paper, the authors demonstrate the efficacy of their approach and perform a large number of benchmarking experiments to quantify its sensitivity, specificity and limitations of detection. They are able to detect artificially created infections (spike-ins) with as low as 5 virions per µL and all clinically available samples agreed with the standard RT-qPCR test. This method can detect both SARS-CoV-2 and Influenza infection and can also be applied to saliva samples which have been collected on filter paper, a strategy which will further simplify the testing regime.

      The authors have spent much time testing this approach but these have largely been limited to analysing artificially created infections. The only results which were obtained were from eight clinically derived samples which are presented in Figure 2E. Although all results from this approach agreed with the standard clinical test this is a small number of tests compared to the total number of tests which are reported in this paper. It is also only a small proof-of-principle experiment to justify a quick rollout of this technology.

      We have now performed COV-ID on 120 additional patient samples (new Figure 2-figure supplement 2). These new results are described in the text.

      The potential for this technology to perform rapid, high-throughput SARS-CoV-2 testing alongside the potential for very low sequencing costs (Figure 4G) is impressive. It is noted in the manuscript that this will require 96 unique barcodes but only 32 are tested here. All but three of these 32 work for the SARS-CoV-2 N2 primers and required STATH control but how will the remaining 67 primers be derived (i.e. is it realistic that this can be made to work to deliver the promise of this approach)?

      The current COV-ID patient barcodes are 5 base pairs long. This allows for 4^5 = 1,024 combinations. Out of an abundance of caution, we excluded barcodes with homology to the reverse complement of the RT-LAMP primers used in any of the experiments (i.e. primers for SARS-CoV-2 N2, STATHERIN, ACTIN, and influenza virus) and then selected a set of 32 with Hamming distances of at least 2 from each other. This is now described more in detail in the methods.

      Regarding the numbers, out of 1,024 5-bp barcodes, 404 were removed due to homology, leaving 620. Of these, we could find at least 163 with Hamming distance ≥ 2 from each other. Even with a substantial failure rate, this should allow for 96 working barcodes. If we had only considered clashes with N2 and STATHERIN primers, the number of available barcodes would be substantially higher.

      Overall, this is an interesting paper which has very clear real-world application to helping to defeat the ongoing COVID-19 pandemic, but some extra validations are needed to fully demonstrate its performance in clinical and/or public health settings.

    1. Author Response:

      Reviewer #1:

      The experiments are well designed, generally well controlled, and carefully conducted, and are thoughtfully and appropriately discussed. The authors make conclusions that are well supported by their results.

      When describing the aptamer knockdown of the PPS, the authors explain that the western blot was too noisy for monitoring the knockdown, which is frustrating for the reader and must have been frustrating for the authors. The authors instead counter-intuitively use qRT-PCR to monitor the transcript abundance of the PPS transcript in the aptamer system - this aptamer system is thought to be a modifier of protein, not transcription or transcript abundance. The authors describe that this has been seen once before (using aptamer knockdown of PfFis1), and the authors of that study speculate that the TetR-DOZI aptamer might be degrading the target mRNA. This is a plausible explanation, but it isn't quite clear from the description how this experiment was performed. The authors explain that the knockdown parasites grew normally for three days, but the parasites may be becoming sicker over this period. It's therefore possible that the decrease in PPS mRNA abundance is a product, rather than a cause of the growth defect. Sick or dying parasites could plausibly impact the PPS differently to the two chosen controls, particularly since both control genes chosen have substantially longer half-lives than the PPS mRNA (according to the Shock and DeRisi datasets). I therefore I suggest that this experiment be performed in an IPP rescue scenario (where the parasites aren't dying) with biological replicates. There is no explanation of the replicates here, but the error bars in 6C are implausibly small for real biological replicates.

      To address these concerns, we have added western blot data showing down-regulation of PPS expression in -aTc +IPP conditions, relative to a loading control. We have also repeated the growth assay and RT-qPCR experiment (in biological triplicate) under IPP-rescue conditions. Parasites samples harvested on day 3 of the IPP-rescue assay were analyzed by RT-qPCR and show reduced PPS mRNA abundance that is similar to (and slightly lower than) that observed without IPP supplementation. This similarity is not surprising to us, since the day 3 harvest in the original growth assay (without IPP) was 3 days before observing a parasite growth defect in -aTc conditions. With respect to the mechanism of transcript loss in the aptamer/TetR-DOZI system, the fate of transcripts in this system has not been investigated in depth. However, DOZI is believed to target bound mRNA to P-bodies, which are a known site of mRNA degradation in cells. We have unpublished data with multiple parasite proteins tagged with the aptamer/TetR-DOZI system. In all cases, we see strong reductions in mRNA abundance in -aTc conditions, suggesting that such decreases are a general property of this knockdown system.

      Line 342 "These results directly suggest that apicoplast biogenesis specifically requires synthesis of linear polyprenols containing three or more prenyl groups." - I think that this might be overinterpreting those results - there could be a number of different reasons why polyprenols of different sizes do or don't rescue, including different solubility, diffusion, availability of transporters, predisposition to break down to useable subunits. Perhaps this needs a caveat.

      We have modified the text here to remove “directly” and to acknowledge uncertainty in beta-carotene uptake: “Although it is possible that β-carotene is not taken up efficiently into the apicoplast, rescue by decaprenol, which is similar in size and hydrophobicity to β-carotene, suggests that apicoplast biogenesis specifically requires synthesis of linear polyprenols containing three or more prenyl groups.” We have also added the statement that “this hypothesis is further supported by additional results described in the next two sections”, referring to our identification of an apicoplast-targeted polyprenyl synthase.

      Line 361 " the cytosolic enzyme, PF3D7_1128400" - I don't think we know the localisation of this protein based on the published data. The Gabriel et al study makes it clear the protein isn't apicoplast or mitochondrial, but it is punctate at stages in a pattern that doesn't look to me to be a straightforward cytosolic localisation (and the original authors don't describe it as cytosolic).

      We agree that the localization of PF3D7_1128400 requires further investigation. The Gabriel study, which (surprisingly) is the only study we found that has examined localization of this protein by microscopy, observed diffuse signal in trophozoites consistent with cytoplasmic localization, in additional to focal, punctate signals in schizonts that were distinct from the apicoplast or mitochondrion. The authors described their results as, “Analysis by fluorescence microscopy of live parasites confirms expression along the intra-erythrocytic cycle and shows FPPS/GGPPS localization throughout the cytoplasm and also forming spots, which increase in number as parasites mature from trophozoite to schizont stages.” For simplicity we referred to FPPS/GGPPS localization as cytoplasmic but agree that available data suggest more a complex localization that requires further studies to understand. We have modified the text to indicate that available data suggests a complex cellular distribution that includes both the cytoplasm and additional sub-cellular foci outside the apicoplast and mitochondrion.

      Line 423 "with strong prediction of an apicoplast-targeting transit peptide but uncertainty in the presence of a signal peptide". I don't think this describes well the bioinformatic analysis of the N-terminus. Although the experimental data are convincing that this is an apicoplast-targeted protein, bioinformatically this would not be predicted as an apicoplast protein. There is no obvious signal peptide, and "uncertainty" is too vague a descriptor. None of the versions of signalP, nor psort, predict this as possessing a signal peptide (which by definition means that PlasmoAP absolutely rejects it), and there is no obvious hydrophobic segment at the N-terminus that we would normally expect of a signal peptide. The toxoplasma hyperlopit doesn't suggest that the Toxoplasma orthologue is apicoplast, and the protein isn't found in the Boucher et al apicoplast proteome. This is somewhat of a mystery. It doesn't diminish the solid localisation data, with the excellent complementary data from IFA as well as the doxycycline+IPP experiment, but it should be pointed out clearly that this localisation isn't to be expected from the sequence analysis.

      We thank the reviewer for this perspective and agree that SignalP is unable to identify a signal peptide at the N-terminus of PPS. We have modified the text to remove our description of “uncertainty” and explicitly state that SignalP is unable to identify a canonical signal peptide at the N-terminus of PPS.

      We note that multiple proteins detected in the Boucher et al. apicoplast proteome also lack an identifiable signal peptide by SignalP yet are clearly imported into the apicoplast. These proteins include the key MEP pathway enzymes DXR (PF3D7_1467300) and IspD (PF3D7_0106900), holo ACP synthase (PF3D7_0420200), FabB/F (PF3D7_0626300), and the E1 subunit of pyruvate dehydrogenase (PF3D7_1446400). Thus, apicoplast import despite lack of identifiable signal peptide by SignalP is not unique to PPS but general to multiple (if not numerous) apicoplast-targeted proteins. These observations suggest to us that protein N-termini in Plasmodium can have sequence properties compatible with ER targeting that are broader and more heterogenous than other eukaryotic organisms that comprise the training sets upon which SignalP is currently based. It remains a future challenge to fully understand these properties.

      With respect to the lack of PPS detection in the Boucher et al. apicoplast proteome, PPS appears to have a very low expression level and unusual solubility properties that require overnight extraction of parasite pellets in 2% SDS (or LDS) for detection. In our experience, the RIPA extraction conditions (which contained 0.1% SDS) used in the Boucher et al. study are insufficient to solubilize PPS, which may explain lack of PPS detection in their study.

      To explicitly address these questions regarding PPS targeting to the apicoplast, we have added a new section to the Discussion to explore PPS targeting in the absence of a recognizable signal peptide, its unusual solubility properties and lack of detection in the Boucher et al. proteome, and planned future studies to further test, refine, and understand targeting determinants.

      With respect to Toxoplasma, T. gondii appears to also express two polyprenyl synthase homologs, TGME49_224490 and TGME49_269430, that are ~30% identical (in homologous regions) to PF3D7_1128400 (FPPS/GGPPS) and PF3D7_0202700 (PPS), respectively. TGME49_224490 appears to be targeted to the mitochondrion in T. gondii (based on MitoProt and HyperLOPIT analysis), in contrast to its P. falciparum homolog, PF3D7_1128400, which localizes to the cytoplasm and other cellular foci outside the mitochondrion. TGME49_269430 does not appear to target the apicoplast in T. gondii (based on HyperLOPIT data), which contrasts with our determination of apicoplast targeting for the P. falciparum homolog, PF3D7_0202700. These differing localizations may suggest distinct cellular roles for these homologs in T. gondii compared to P. falciparum. We are also aware of a recent study (Pubmed 34896149) showing that loss of MEP pathway activity in T. gondii (due to loss of apicoplast ferredoxin) does not impact apicoplast biogenesis, in contrast to our observations in P. falciparum based on FOS treatment, DXS deletion, and PPS knockdown. These distinct phenotypes further suggest differences in isoprenoid utilization and metabolism between T. gondii and P. falciparum that remain to be understood. We have added a new section to the Discussion to address these considerations.

      The section after line 344 "Iterative condensation of DMAPP with IP…", up until line 377 doesn't sit well within the section that has the heading "Apicoplast biogenesis requires polyprenyl isoprenoid synthesis". I suggest either creating a separate subheading for this material, or moving it into the start of the subsequent section "Localization of an annotated polyprenyl synthase to the apicoplast.".

      We thank the reviewer for this suggestion, which we have followed. We have moved the referenced text to the beginning of the subsequent section to better align the text with that section heading.

      Reviewer #2:

      Minor comments:

      The authors emphasize that this study reveals a previously unnoted interconnection between apicoplast maintenance and pathways that produce an output from the apicoplast to serve the cell. But is the prevailing view really that these two are separate? Isn't the interconnection already clear from many other studies and observations? E.g., the fatty acids produced inside the apicoplast provide membrane- and lipid- precursors for the rest of the cell as well as for the apicoplast itself (Botte et al., PNAS, 2013) (although not essential in Plasmodium blood stages). Other pathways that function inside the apicoplast such as the Fe-S cluster synthesis are critical to support enzymes that provide exported metabolites (e.g., IPP synthesis, IspG/H) and function in maintenance (e.g., MiaB) (Gisselberg et al., PLoSPath, 2013). Perhaps the authors could tone this conclusion down and acknowledge that maintenance and output are interconnected in other cases, which have been acknowledged in the literature.

      We thank the reviewer for this perspective and agree that in Toxoplasma as well as in mosquito- and liver-stage Plasmodium there are multiple apicoplast outputs (i.e., metabolic products exported from the apicoplast) that contribute to parasite fitness, including IPP, fatty acids, and coproporphyrinogen III. To clarify, we are specifically referring to blood-stage Plasmodium in our manuscript, when heme and fatty acid synthesis are dispensable and where the prior literature has intensely focused on IPP as the key essential output of the blood-stage apicoplast and consistently stated that IPP is not required for organelle maintenance.

      We agree that prior work has firmly established that apicoplast housekeeping functions (e.g., synthesis of proteins and Fe-S clusters) are required for organelle maintenance and to support IPP synthesis. However, our work is the first to demonstrate in blood-stage Plasmodium that the reverse is also true- that IPP as an essential apicoplast output is also required for organelle maintenance and that apicoplast maintenance and IPP synthesis are thus reciprocally dependent. We have modified the Discussion section to clarify these points and to explicitly acknowledge that apicoplast maintenance and other metabolic outputs may also be interdependent in Toxoplasma and other Plasmodium stages.

      Could the authors elaborate more on the leader sequence predicting apicoplast localization for the PPS characterized here and discuss why it might have been missed in previous detailed study of apicoplast localised proteins (Boucher et al., PlosBiol, 2018)?

      Please see our response above to Reviewer #1.

      Could the authors discuss conservation of the PPS gene(s) in other Apicomplexa with (e.g., T. gondii) and without (e.g., Cryptosporidium spp.) an apicoplast? This could be relevant for other people in the field and could give further insights into the enzyme's role in apicoplast maintenance.

      Please see our response above to Reviewer #1. Polyprenyl synthases are diverse enzymes that perform a variety of cellular functions, whose specific roles can differ between organisms. Although the two Plasmodium prenyl synthases show preferential homology with each of two different prenyl synthase homologs in Toxoplasma and Cryptosporidium (CPATCC_003578 and CPATCC_001801), the differing localizations of these homologs in each parasite suggest differing cellular roles. The differing dependence of apicoplast biogenesis on MEP pathway activity in T. gondii and P. falciparum and the absence of an apicoplast in Cryptosporidium further support differences in isoprenoid utilization and metabolism in these organisms. We have added a new section to the Discussion to address these considerations.

      Reviewer #3:

      The paper is very nicely written and was a true pleasure to read. The introduction is concise yet dense with all relevant background of our current understanding of functioning of the apicoplast in relation to IPP production and utilization. The rational of the experiments and the interpretation of the results are presented clearly and everything is discussed well in the context of the current understanding of the field. The main conclusion of the paper that isoprenoid is not solely essential for critical functions elsewhere in the cell, such as prenylation-dependent vesicular trafficking but also for apicoplast biogenesis via its processing by an essential polyprenyl synthase conserved with plants and bacteria is well substantiated and very exciting. The authors demonstrate an equally beautiful and clever use of available and newly generated genetic mutants in combination with complementary pharmacological interventions and metabolic supplementation. There are no true major weaknesses that could jeopardize the conclusions or change the interpretation of the results. However, the authors do consistently perform statistical analyses on data obtained from individual cells obtained in no more than two independent experiments, which in my humble opinion does not qualify for statistical analysis. That said, the results are so clear-cut that no statistics are required to convince me, or to quote Ernest Rutherford: '"If your experiment needs statistics, you ought to have done a better experiment."

      We thank the reviewer for these positive comments and suggestions. For growth assays, we have performed a third biological replicate and updated those figures and the indicated statistical analyses. For microscopy experiments, we have removed p values.

    2. Reviewer #1 (Public Review): 

      The experiments are well designed, generally well controlled, and carefully conducted, and are thoughtfully and appropriately discussed. The authors make conclusions that are well supported by their results. 

      When describing the aptamer knockdown of the PPS, the authors explain that the western blot was too noisy for monitoring the knockdown, which is frustrating for the reader and must have been frustrating for the authors. The authors instead counter-intuitively use qRT-PCR to monitor the transcript abundance of the PPS transcript in the aptamer system - this aptamer system is thought to be a modifier of protein, not transcription or transcript abundance. The authors describe that this has been seen once before (using aptamer knockdown of PfFis1), and the authors of that study speculate that the TetR-DOZI aptamer might be degrading the target mRNA. This is a plausible explanation, but it isn't quite clear from the description how this experiment was performed. The authors explain that the knockdown parasites grew normally for three days, but the parasites may be becoming sicker over this period. It's therefore possible that the decrease in PPS mRNA abundance is a product, rather than a cause of the growth defect. Sick or dying parasites could plausibly impact the PPS differently to the two chosen controls, particularly since both control genes chosen have substantially longer half-lives than the PPS mRNA (according to the Shock and DeRisi datasets). I therefore I suggest that this experiment be performed in an IPP rescue scenario (where the parasites aren't dying) with biological replicates. There is no explanation of the replicates here, but the error bars in 6C are implausibly small for real biological replicates. 

      Line 342 "These results directly suggest that apicoplast biogenesis specifically requires synthesis of linear polyprenols containing three or more prenyl groups." - I think that this might be overinterpreting those results - there could be a number of different reasons why polyprenols of different sizes do or don't rescue, including different solubility, diffusion, availability of transporters, predisposition to break down to useable subunits. Perhaps this needs a caveat. 

      Line 361 " the cytosolic enzyme, PF3D7_1128400" - I don't think we know the localisation of this protein based on the published data. The Gabriel et al study makes it clear the protein isn't apicoplast or mitochondrial, but it is punctate at stages in a pattern that doesn't look to me to be a straightforward cytosolic localisation (and the original authors don't describe it as cytosolic). 

      Line 423 "with strong prediction of an apicoplast-targeting transit peptide but uncertainty in the presence of a signal peptide". I don't think this describes well the bioinformatic analysis of the N-terminus. Although the experimental data are convincing that this is an apicoplast-targeted protein, bioinformatically this would not be predicted as an apicoplast protein. There is no obvious signal peptide, and "uncertainty" is too vague a descriptor. None of the versions of signalP, nor psort, predict this as possessing a signal peptide (which by definition means that PlasmoAP absolutely rejects it), and there is no obvious hydrophobic segment at the N-terminus that we would normally expect of a signal peptide. The toxoplasma hyperlopit doesn't suggest that the Toxoplasma orthologue is apicoplast, and the protein isn't found in the Boucher et al apicoplast proteome. This is somewhat of a mystery. It doesn't diminish the solid localisation data, with the excellent complementary data from IFA as well as the doxycycline+IPP experiment, but it should be pointed out clearly that this localisation isn't to be expected from the sequence analysis. 

      The section after line 344 "Iterative condensation of DMAPP with IP...", up until line 377 doesn't sit well within the section that has the heading "Apicoplast biogenesis requires polyprenyl isoprenoid synthesis". I suggest either creating a separate subheading for this material, or moving it into the start of the subsequent section "Localization of an annotated polyprenyl synthase to the apicoplast.".

    1. Author Response:

      Reviewer #1:

      Significance: A central puzzle in evolutionary biology (and philosophy of biology) is the evolution of new (collective) entities that can evolve on their own right (e.g. the evolution of multicellular organisms from single cells). These evolutionary transitions are often conceptualized in terms of fitness decoupling (a fitness increase of the collective even as the fitness of the component particles decreases). Using a life-history model, the authors show that fitness decoupling is not possible when the conditions for fitness are the same. Thus, this paper has the potential to change how we think about the evolution of new collective entities.

      Strengths: This paper is conceptually rich and the overall argument is clear. Re-analyzing previous data/models using their new framework highlights new patterns of fitness change in these transitions of individuality, and as such, it provides novel and exciting avenues of research.

      Weaknesses: While the overall argument is clear, some of the details can be hard to follow (even as someone familiar with the literature). The initial description of their model is fairly clear, but given its conceptual novelty, the paper does not spend enough time developing the different concepts of fitness at the particle level.

      Moreover, it is not entirely clear what is at stake: what is the role of fitness decoupling in our understanding of fitness transitions? And how does the proposed mechanistic ("trade-off breaking") model serve as a replacement? It seems to me like trade-off breaking is a characteristic of many evolutionary innovations, not only of major transitions. It seems even possible to envision groups that allow for an escape in a trade-off without leading to the evolution of a new "Darwinian" individual.

      For example, one could conceive of a trade-off in zebras between time spent foraging and protection against predators. Coming together temporarily as a group is likely to allow for values outside this trade-off space (similar to those in Fig. 6). One could even imagine a new mutation that makes zebras switch activities (foraging/watching) depending on their position within the group. This mutation is only available to zebras that form groups (the phenotype does not exist in the absence of a group). But I would still want to argue that there is more to the evolution of new levels of individuality. Trade-off breaking seems (potentially) a necessary, but not sufficient step in these transitions.

      And while the language of the authors is careful to not suggest sufficiency, it is not entirely clear how this approach helps us understand the particularity of these transitions.

      Reviewer #1 asks first to clarify the stakes: what is the role of fitness decoupling in the explanation and how does tradeoff-breaking replace or supplement it? Second, they requested us to make a statement about the necessary or sufficient nature of tradeoff-breaking.

      With respect to the second point, we argue that tradeoff breaking is not sufficient, but is probably necessary for an ETI to occur.

      Let us now clarify the role of fitness decoupling and tradeoff breaking in the explanation of ETIs. It must be stressed that tradeoff breaking does not “replace” fitness decoupling; rather, tradeoff breaking is an event that cannot be understood readily in the framework of fitness decoupling. Thus, we claim that ETIs are better understood when seen through the lenses of traits and the evolutionary constraints that link them (i.e., tradeoffs) than via the export-of-fitness model (i.e., fitness decoupling). To illustrate this, we use the zebra herd example proposed by the reviewer. Coming together temporarily as a collective does not, in itself, constitute a tradeoff-breaking event, but rather simply a collective-formation event (similar to the first ace2 mutation in snowflake yeast or the first WS mutation in the Pseudomonas system). From this starting point, a number of mutations (i.e., change in traits values) can be fixed in the population that improve the performance of zebras within this environment. This is the “fast” part of the evolutionary trajectory that occurs on the ancestral tradeoff, which we called “low hanging fruit mutations” in the manuscript. As a consequence, “optimal herds within the ancestral tradeoff” evolve. As stated in the manuscript, if we assume that the tradeoff on traits is identical for lone zebra and zebra herd and also assume that the ancestral lone zebra exhibit trait values that are optimal (within these constraints) for lone zebras, it follows that the low-hanging fruit mutations that improve the zebra herd will probably reduce counterfactual fitness. This lowering of counterfactual fitness is not due to a “transfer” between real and counterfactual fitness (because there is nothing to transfer between real and counterfactual worlds), but is a consequence of the differential contribution of the traits involved in the tradeoff to the two fitness quantities. However, this specificity of the tradeoff might be significant because it could lead to stabilisation of the new collectives through ratchetting.

      There is, indeed, “more to the evolution of new levels of individuality,” as pointed out by Reviewer #1. We claim that it involves rare mutations that would overcome the ancestral constraint and call them “tradeoff breaking mutations”. Tradeoff-breaking mutations are not bound by ancestral tradeoff; therefore, there is no a priori theoretical or biological reason to think they would have any positive or negative effect on counterfactual fitness. Here, we must stop using the zebra herd example because no tradeoff-breaking mutation occurred. However, the tradeoff-breaking lineages in the Pseudomonas example exhibit an improvement of both counterfactual and within-collective fitness. This observation does not fit within an export-of-fitness framework, but makes perfect sense in a traits-based view of ETIs—as a tradeoff-breaking mutation.

      Reviewer #2:

      This work reviews the influential "fitness decoupling" heuristic for understanding evolutionary transitions in individuality (ETIs), describes some of its limitations, and clarifies its interpretation. The review of the fitness decoupling account capably describes an interpretation of this framework that has frequently occurred in the literature, for example in Okasha 2006, Godfrey-Smith 2011, Hammerschmidt et al. 2014, Black et al. 2019, and Rose et al. 2020. However, it does not address the interpretation advanced by its authors, Richard Michod and colleagues, which they have clarified in several papers cited in the present work. Michod and colleagues have argued that the fitness decoupling account describes a changing relationship between the fitness of groups and the "counterfactual" fitness of their component cells, that is, the fitness the cells would have if they were removed from the group. This point is made explicitly in Shelton & Michod 2104 and Shelton & Michod 2020 and was present (though perhaps not as obvious) in Michod 2005 and later works, in contrast to the claim in the Glossary that this is a "relatively recent development of the fitness decoupling literature." The interpretation that Michod embraces is similar to what is here described as f2, the fitness of a "theoretical mono-particle collective", but that interpretation is not mentioned in the present work until Section 2.3. It is possible that an argument could be made that Michod and colleagues have not consistently interpreted fitness decoupling this way, or have made statements inconsistent with this interpretation, but no such argument is present in this work. Thus the impression conveyed is that Michod and colleagues consider decoupling of "commensurably computed fitnesses" possible, which is counter to their explicit statements on the topic.

      The description of the limitations of the fitness decoupling heuristic (Section 2) is useful and goes a considerable distance toward clarifying the ways in which fitness decoupling can rigorously be interpreted. However, the final assessment (Section 2.3) does not make a compelling case for its central argument, the lack of utility of the fitness decoupling concept. Elsewhere in the work, the ratcheting model of Libby and colleagues is referenced in comparison to the tradeoff-breaking approach, but Section 2.3 does not acknowledge the relationship between Libby and colleagues' model and the counterfactual interpretation of the fitness decoupling heuristic. For example, the argument in Libby and Ratcliff 2014 that "If any of the yeast that evolved high rates of apoptosis within clusters were to leave the group and revert to a unicellular lifestyle, they would find themselves at a competitive disadvantage relative to other, low-apoptosis unicellular strains." and in Libby et al. 2016 that "…if G cells were to revert to unicellular I cells, they would be quickly outcompeted" are counterfactual fitness arguments essentially similar to that of Shelton and Michod 2020 that "the fitness a cell would have on its own declines as the transition progresses." Section 2 makes a convincing case that commensurable fitnesses cannot be decoupled, but by fixating on commensurability, which is not relevant to the counterfactual interpretation of fitness decoupling, Section 2.4 fails to make a convincing case that "fitness-decoupling observations do little to clarify the process of an ETI." That is, "because they are not commensurable" does little to explain why the counterfactual interpretation of fitness decoupling "does little on its own to clarify the process of an ETI," since commensurability is not a claim that the the counterfactual interpretation of fitness decoupling makes.

      We agree with the reviewer on two essential points: (1) the decoupling of commensurably computed fitness is impossible when collectives have a finite size and (2) counterfactual fitness is not commensurable to particle or collective fitness.

      While we recognise that Michod and collaborators did clarify that fitness decoupling referred to counterfactual fitness (although, to us, this becomes clear from 2015 onward), we argue that the fitness transfer (or export-of-fitness) metaphor implies (by its wording) a commensurability of fitnesses that undermine this welcome clarification.

      Indeed, for a quantity to be transferred from one place—or component—to another, the source and destination must be commensurable. It is incorrect to talk about a transfer between counterfactual and actual quantities. A better choice of words to discuss the relative change of counterfactual and actual quantities would avoid the physical transfer metaphor and focus instead on the correlation of the two quantities. It must be noted that, despite the clarification of counterfactual fitness, the word “transfer” continues to be used in recent work (Davison & Michod, 2021).

      This may seem like nitpicking; however, there is a real advantage in being careful about this. We do agree that, under some assumptions, counterfactual fitness would decrease while whole–life cycle particle fitness (or collective fitness) increases. From there, one might ask: what needs explaining? If one assumes an export-of-fitness framework, the transfer of fitness explains why it cannot be otherwise. If fitness decreases on one side, it must increase on the other. In other words, the existence of a tradeoff is taken for granted based on the improper physical metaphor. While there are strong reasons to think that such tradeoffs exist, they should be assessed in their own right and on a case-by-case basis rather than being assumed to hold. Otherwise, there is no way to make sense of the tradeoff-breaking scenario described in Section 4.

      By the same token, the metaphor of “decoupling” often associated with the export-of-fitness model is misleading because it is used to describe a part of the evolutionary dynamics where counterfactual particle fitness and whole–life cycle particle fitness are strongly dependant on one another (even if their changes are anticorrelated), through the existence of the tradeoff.

      Nevertheless, we welcome the reviewer’s urge to clarify our position and how this relates to Michod and colleagues’ counterfactual fitness proposal.

      The model based on trade-offs and trade-off breaking is useful and likely to be of interest to theorists interested in ETIs. The observation that this model can reproduce the (counterfactual) fitness-decoupling observation is a useful in showing the how the two models relate. The result that counterfactual fitness decoupling is a consequence rather than a cause of the evolutionary dynamics is an important point (though perhaps obvious in retrospect, since counterfactuals, things to do not happen, can't be the causes of anything).

      The caution in Section 3.3 that "the same [counterfactual fitness decoupling] observation will be made in any situation in which short-term costs are compensated by long-term benefits, not solely during ETIs" is a good point, and it sets up the argument that trade-off breaking is a "genuine marker for an ETI". However, no convincing case is made that the same criticism, that the observed phenomenon is not unique to ETIs, is not equally true of trade-off breaking. Some nice examples of trade-off breaking in the context of ETIs are given, but these do not amount to an argument that trade-off breaking is only observed during ETIs. The life history literature includes examples of trade-off breaking that are not related to ETIs, so it is not clear that trade-off breaking is either a reliable indicator of ETIs or superior in this respect to counterfactual fitness decoupling.

      This point is in line with one of the points made by Reviewer #1. We have now clarified our position with respect to the generality of the tradeoff-breaking approach.

      In the Discussion, the "inconveniences" associated with the fitness decoupling are cogent limitations of this heuristic. The "impossibility of decoupling between commensurable measures of fitness" is an important result, but it is not new and should thus probably not be presented as "[o]ur first main finding". Shelton and Michod 2014 includes a mathematical proof in the appendix that, given the model assumptions, "consideration of the births and deaths of colonies gives us exactly the same bottom line (fitness) as consideration of the births and deaths of lone cells." The second main finding, that "fitness decoupling observations cannot be reliably used as a marker for ETIs," is valid, but as described above, a convincing case is not made that trade-off breaking can be reliably used in this manner, either. Trade-off breaking may, however, be a useful way to think about ETIs in the other ways that are suggested, for example as key events and as stepping stones to new hypotheses.

      We have now clarified our position.

    1. A node may have several applications running on it (several sensors) each of which is an application. These application instances on a node are said to be endpoints, where messages can originate and terminate.

      I think here we are calling sensors/acuators/transducers as "applications". But this may be saying that the endpoint is where the data comes from, and not the hardware device. The question is, can application endpoints be at coordinator and router devices?

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The paper tackles an important problem regarding the effect of demographic dependent vaccination protocols on the reduction in the number of deaths with respect to the situation of no vaccination (say J). A compartmental SIRD model with reinfection Y is proposed, stratified in two (age dependent) groups, based on a binary reduction of a given contact map, and given infection fatality risk (IFR). Several countries are then analyzed.

      As far as I understand we have a control variable v, parameters of the stratified model (i=1,2) tuned to match IFRi, and a control objective, i.e. minimization of J over one year.

      The paper is well written. The final message and some theoretical passages are not completely clear, at least to me. I have the following observations that the authors may want to consider.

      We thank the referee for the revision and are very glad that the overall evaluation is positive. Comments and suggestions have been thoroughly addressed, as we discuss in the following.

      1) The study of stability of infection free and endemic equilibria should be better developed. The 5 equations can be reduced to 4 (neglecting D) and the characteristic of the reduced Jacobian used to characterize the local asymptotic stability of equilibria, instability, bifurcation points etc... Alternatively, one can use a co-positive Lyapunov function (LF). For instance, if we take the LF V=S+I+Y+R, we get $\dot V=-\mu_I I-\mu_Y Y \le 0$. If $\mu_I$ and $\mu_y$ are strictly positive all equilibria are characterized by (S*,0 0,R*) and D=1-S*-R*. So, I don't understand the phrase after (7,8), notice that Y cannot be zero in finite time. For $\mu_y=0$ then Y* can be nonzero. I guess that closed-form computation of S* and R* is possible as function of the parameters at least in the case v=0. The stability result should be cast in function of the current reproduction number (not explicitated) wrt to S and R.

      The authors are invited to have a look at

      1.1) Pagliara et al, "Bistability and Resurgent Epidemics in Reinfection Models", IEEE CSLetters, 2018,

      for a theoretical analysis of stability on a similar (just a little bit simpler) model.

      We appreciate the suggestions of the referee for improvement of this material. We have carried out an in-depth revision of the stability analysis and significantly extended it. The major addition has been, as suggested, a section relating the current reproductive number at equilibrium (we call it the asymptotic reproductive number in the text) to the fixed points of the dynamics for three different scenarios: general model, no vaccination, and zero mortality of reinfected individuals. As Pagliara et al. show in their paper, the connection between the fixed points and the reproductive number is not trivial, but it is possible to derive it through the next-generation matrix technique, as we now do. Additional references regarding this technique have been added. We have included a Table summarizing the stability analysis (page 2 in SI 3) at the end of this new section.

      Other modifications include the reduction of 5 equations to 4 for the stability analysis and a clarification of possible equilibria (page 1 of SI 3), rephrasing and correcting our sentence after eqs. (7) and (8). We also attempted to obtain a closed-form computation of S* and R* but, to the best of our knowledge, concluded that it is not possible. We would be happy to pursue any insight in this respect the referee may have.

      What said before should be also extended to the stratified model, where a "network" Rt could be defined, see for instance

      1.2) L. Stella et al, "The Role of Asymptomatic Infections in the COVID-19 Epidemic via Complex Networks and Stability Analysis", SIAM J Cont. Opt., 2021, (arxiv.org/pdf/2009.03649.pdf)

      We thank the referee for pointing out this reference. Following the analysis in Stella et al., we have carried out a stability analysis for the stratified model as well. The results are included in a new section (pages 7-10 in the SI 3).

      2) It is not clear whether the free contagion parameters of the model have been fitted on real data (identification from infection and reinfection data). Notice that the interplay between vaccination strategies and NPI is important, see e.g.

      *2.1) Giordano et al, Modeling vaccination rollouts, SARS-CoV-2 variants and the requirement for non-pharmaceutical interventions in Italy", Nature Medicine 2021, *

      where progressive vaccination in reverse age order is considered together with different enforced NPI countermeasures.

      In the first part of our study, parameters are intendedly left free because we aim at describing the generic behavior of the model. Still, we derive several inequalities and relationships between parameter ratios that seem to be sensible attending to what the different classes in the model stand for. This is as described in sections regarding model parameters when the two generic models (SIYRD and S2IYRD) are introduced. The aim is to represent both the generic dependence with some variables and a broad class of contagious diseases, so parameters are mostly free. In agreement with this approach, parameters can be also freely varied in the companion webpage.

      In the second part of our study, the model is applied to COVID-19. In that case, we have used parameter values in agreement with observations, as (admittedly poorly) explained in pages 9-10 of the main text. Indeed, not enough information on parameter estimation was provided in the main text, and the SI 2 also needed some additional information. This has been amended. Let us explicitly mention that we have not fitted the dynamics of the model to any actual data set to fix specific values, as Giordano et al. do. In our case, we have first used different demographic data sets to evaluate contact rates and IFRs of the two population groups (these are parameters Mij and Ni in eqs. (7-10)). Secondly, recovery and death rates are estimated through the IFRi values for each age group i and the infectious period of COVID-19, that we fix at dI=13 days. Third, infection rate βSI=R0/dI has been estimated fixing R0=1, since the reproductive number of COVID-19 all over the world fluctuates around this value (Arroyo-Marioli et al. (2020) Tracking R of COVID-19: A new real-time estimation using the Kalman filter, PLoS ONE 16(1):e0244474). The reinfection rate is defined through its relationship with the infection rate, βRI= α1 βSI, where α1 was in the range 0-0.011 at early COVID-19 stages (Murchu et al. (2022), Quantifying the risk of SARS‐CoV‐2 reinfection over time, Rev Med Virol 32:e2260) and seems to be about 3-4 fold larger for the omicron variant (Pulliam et al., Increased risk of SARS-CoV-2 reinfection associated with emergence of the Omicron variant in South Africa, www.medrxiv.org/content/10.1101/2021.11.11.21266068v2). Given the relationships derived among parameters, our only free parameter was α2RY= α2 βRI, and we fixed it to α2=0.5 (i.e., reinfected individuals recover twice as fast as individuals infected for the first time).

      Once more, it was not our goal to precisely recover specific trajectories of COVID-19 or to point at possible future scenarios, but to illustrate the dependence of major trends with model parameters. Also, the appearance of new variants requires the reevaluation of parameters. For example, omicron has different IFR (therefore different mortality and recovery rates), a different infectious period, and higher infection and reinfection rates. In this context, the interactive webpage (where we will update demographic profiles and IFR data as they become available) is a useful resource to simulate any situation different from current or past ones.

      3) In the model the immunity waning is not explicitly considered (flux from R to S or better from a vaccinated compartment to S). It is clear that this complicates the model. Please discuss why the indirect way the waning is considered here is justified.

      3.1) Batistela et al, "SIRSi compartmental model for COVID-19 pandemic with immunity loss", Chaos Soliton and fractals, 2021.

      3.2) McMahon et al, "Reinfection with SARS-CoV-2: Discrete SIR (Susceptible, Infected,Recovered) Modeling Using Empirical Infection Data", JMIR Public health and surveillance, 2020.

      Though the model does not consider an incoming flux of individuals to compartment S, the existence of a "backward" flux from R to Y yields a transient phenomenology analogous to models with increases in the S class. Indeed, it is these fluxes that cause persistent endemic states; otherwise, the S class is monotonously depleted until infection extinction.

      In Batistela's et al. work, the possibility that individuals become reinfected is effectively implemented through a flux between the R and S classes, since only one class of infected individuals is considered and recovered individuals cannot be infected again. In our case, feeding back to S would mean that previous immunity is completely lost or that vaccines are not effective at all for some individuals. This is neither what McMahon et al. conclude when evaluating real data nor what more recent surveys indicate (see for instance the Science Brief published in October 2021 by the CDC, SARS-CoV-2 Infection-induced and Vaccine-induced Immunity, https://www.cdc.gov/coronavirus/2019-ncov/science/science-briefs/vaccine-induced-immunity.html).

      This nonetheless, complete immunity waning (feedback to the S class) and reinfections (feedback to a partly immune class experiencing overall lower severity of the disease) are equivalent to a large extent: the trend of COVID-19 seems to indicate that our Y class will be the "new S", and that fully naive individuals would arrive mostly due to demographic dynamics (birth and death processes, as also implemented by Batistela et al.). Summarizing, complete immunity waning is rare in the time scales considered in our simulations, while partial immunity that decreases the severity of the disease (after infection or vaccination) is the rule, in agreement with our choices.

      4) Reduction of deaths wrt no vaccination is of course important, but also reduction of stress in hospitals. This is particularly important now with the advent in Europe of the omicron variant. Please discuss on the real message you want to convey to policy makers in the actual scenario of the pandemic.

      The model in this work is deliberately simple. Our main goal was to explore the qualitative effects of demographic structure and disease parameters in protocols for vaccine administration. This was the reason to consider a mean-field model in a population structured into two groups. The main conclusion is that optimal vaccination protocols are demography- and disease-dependent. If this is so in our streamlined model, the more it will be in more realistic models, where one should include a finer stratification and, in all likelihood, heterogeneity in contagions. Our main message, therefore, is that there is no unique protocol for vaccine roll-out, valid for all populations and diseases. The abstract has been modified to highlight this conclusion.

      Some qualitative considerations also allow us to draw preliminary conclusions on the reduction of stress in hospitals. Since the number of hospital admissions is proportional to the incidence of the disease, the number H of hospitalized individuals can be represented as H=a I + b Y, with a>>b due to the partial immunity of vaccinated or recovered individuals (which belong to class Y upon (secondary) contagion). Therefore, minimizing the burden on the healthcare system amounts to minimizing the number of individuals in the I class. Beyond non-pharmaceutical measures, I is minimized when individuals are transferred as fast as possible to the Y class, that is, maximizing vaccine supply and acceptance. In terms of our model parameters, this entails maximizing v and also θ (the maximum fraction of individuals eventually vaccinated), for instace through devoted awareness campaigns. These ideas have been included in the Discussion section.

      Reviewer #1 (Significance (Required)):

      The final message and some theoretical passages are not completely clear, at least to me.

      Please discuss on the real message you want to convey to policy makers in the actual scenario of the pandemic.

      As discussed above, we have modified the manuscript following the advice given by the Reviewer. We think that both the presentation and the theory are clearer now.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this paper, a compartmental model of the propagation of an infection with vaccination and reinfection is studied. The impact that changes in the rates of these two processes have on disease progression and on the number of deaths is analyzed. In order to highlight the overall effect of the demographic structure of populations and the propagation of a given disease among different groups, the population is divided into two subpopulations and the model is extended to the two-dimensional case. In addition to the study of equilibria and their relative stability, the model is then applied in the case of COVID-19. Different vaccination strategies are studied using real demographic data and with a population split between under 80 and over 80 individuals. It is observed that for low vaccination rates, the advisable strategy is to vaccinate the most vulnerable group first, in contrast to the case of sufficiently high rates, where it is appropriate to vaccinate the most connected group first. The simulations show also that with a low fatality ratio, the strategy that yields the greatest reduction in deaths is vaccination of the group with the most contacts, while the situation is reversed for higher fatality ratio.

      The model and simulations presented are interesting and valuable. The comparison of the behavior of the model in the 4 different countries is very interesting, as well as the webpage created by the authors.

      We thank the referee for the very positive evaluation and are very glad that the study is found interesting and valuable.

      As minor comment, I think that the introduction of the model needs a more extensive literature review. For example, there is no mention of the classic SIR model of Kermack and McKendrick (1927) and other works on the introduction to epidemic models, which form the basis of the model presented by the authors.

      The referee is right. There is a long history of extensions and applications since Kermack & McKendrick introduced the SIR model that we obviated. This has been amended by adding an introductory paragraph with several new references at the beginning of the Models section, page 3 in the main text.

      Reviewer #2 (Significance (Required)):

      The model presented by the authors is quite original and simple enough to be suitable to different contexts and scenarios.

      Compared to previous work, this paper makes a twofold contribution, as explained by the authors. First, the introduction of reinfections shows the existence of long transients (or quasi-endemic states) that may precede the transition to a truly endemic state predicted for COVID-19. Second, the simplicity of model allows the characterization of systematic effects due to, at least, group size, demographic composition, and IFRs.

      I am involved in the study and analysis of epidemic models accompanied by network effects. I think this paper is a good contribution, although preliminary, in the analysis of the vaccination process and in the search for the optimal strategy.

      We thank the Reviewer and are glad that our goal, offering a model as simple as possible to obtain meaningful conclusions, is appreciated.

    1. Author Response:

      Reviewer #1 (Public Review):

      In this paper, Qin et al. investigated the molecular mechanism of phospholamban (PLN) linked dilated cardiomyopathy (DCM), using structural approaches combined with biophysical measurements. Structures of the catalytic domain of protein kinase A (PKAc) in complex with PLN peptides (both wild-type and the R9C and A11E DCM mutants) provide insights into the mechanism of substrate recruitment and how it is perturbed in the disease state. Qin et al. show convincingly that the mutant peptides all have lower affinity for PKA than the wild-type peptide, suggesting models in which heterozygous DCM mutations act via sequestering PKA and thereby preventing phosphorylation of the wild-type peptide may be incorrect.

      The authors highlight significant differences between their structure of the WT-PLN:PKAc complex, which has a 1:1 stoichiometry, and a previous structure of the complex (PDB 3O7L), which has 1 PLN bound between two PKAc monomers (a 1:2 complex). The authors posit that the stoichiometry observed in 3O7L is an artifact of the crystal lattice, and does not occur in solution, supporting this with analysis of the elution volumes of the peptide complexes on size exclusion chromatography compared to PKAc alone. They further suggest that the AMP-PNP ligand included in the 3O7L structure is not bound, based on analysis of Fo-Fc maps calculated from the deposited coordinates. Inspecting 3O7L I am not convinced of this last point - it seems more likely that a technical error was made in assigning or refining the B-factor of the ligand in 3O7L, because there is clearly density present in SA-omit maps for the nucleotide.

      Taking these results together, the authors suggest a mechanism for DCM, whereby mutations in PLN result in lower affinity for PKA, and consequently reduced phosphorylation. This seems plausible and well supported by the data, although in the ADP-Glo assay used here, the reductions in phosphorylation observed for some of the mutant peptides are rather modest. However, as the authors state, it is plausible that even relatively subtle changes in PLN phosphorylation could have substantial effects on Ca2+ homeostasis via increasing SERCA inhibition.

      We thank the reviewer for the appreciation of our work.

      Reviewer #2 (Public Review):

      Strengths:

      The authors presented new high-resolution 3D crystal structures of the PKA catalytic domain (PKAc) in complex with PLN WT or mutant peptides (residues 8-22) containing the DCM-associated PLN mutations (R9C or A11E). These are novel and important data given that the present structures are dramatically different from those reported previously. The authors made convincing argument that the 3D model reported previously may result from a crystallization artifact.

      By characterizing the interactions between the PKAc domain and PLN WT or DCM-associated mutant peptides using surface plasmon resonance (SPR) analysis, the authors convincingly showed that the DCM-associated PLN mutations at positions 9, 14, and 18 alter the conformation of the PLN peptide and reduce the binding affinity of the PLN peptide with PKAc. These data provide an explanation how some DCM-associated PLN mutations at these positions reduce the level of PKA-dependent phosphorylation of PLN.

      The authors also performed nuclear magnetic resonance (NMR) to determine the structural dynamics of PLN WT, R9C, P-Ser16, and P-Thr-17 peptides. These NMR structures combined with the SPR analysis also support their conclusion that PLN phosphorylation and DCM-associated PLN mutations have an impact on its conformation.

      We thank the reviewer for the comments.

      Weakness:

      The present study used PLN-derived peptides (aa 8-22). Although technically challenging, it is important to consider if the full-length WT or mutant PLN will behave the same as those observed with the peptides. This is especially crucial in light of the prior work showing substantially different structures using a different segment of PLN.

      We are fully aware of the potential risk to draw conclusion from an isolated peptide instead of the full-length PLN as a transmembrane protein. In the previous study, people showed that the PLN peptide could be used as a good model substrate that gets phosphorylated as efficiently as the full-length PLN protein (L. R. Masterson et al., Dynamics connect substrate recognition to catalysis in protein kinase A. Nat Chem Biol 6, 821-828 (2010); D. K. Ceholski, C. A. Trieber, C. F. Holmes, H. S. Young, Lethal, hereditary mutants of phospholamban elude phosphorylation by protein kinase A. The Journal of biological chemistry 287, 26596-26605 (2012)). These results together with our biochemistry results suggest the tail peptides are indeed active substrates of PKA. Due to the technical difficulty, we were not able to crystallize PKAc in complex with the full-length PLN. To explain the potential difference between the peptides and the full-length PLNs, we added more text in the discussion section “Additionally, the trend of the reduced phosphorylation by DCM mutations can be significantly affected by the oligomerization state of PLN. Ceholski et al. showed that R9C severely inhibits PKA phosphorylation in the context of full-length pentameric PLN, but has a much milder effect in the context of full-length monomeric PLN or an isolated tail peptide [41].”

      Although it is convincing that DCM-associated PLN mutations likely reduce the interaction between PKAc and PLN (assuming that the peptides behave the same as the full-length PLN with respect to interaction with PKA) and, as a result, the PKA dependent phosphorylation of the mutant PLN, it is unclear how this impaired interaction between PKA and PLN mutant could explain the effects of the DCM-associated PLN mutations on SERCA function (either reduced or enhanced PLN-dependent inhibition of SERCA, as proposed previously). In this regard, can the authors predict if the DCM-associated PLN R9C mutation reduces or increases SERCA inhibition based on the results of their present study?

      It is indeed controversial how PLN mutations cause DCM. Previous studies have shown that the DCM mutations in PLN might change this regulation in either a phosphorylation-dependent or phosphorylation-independent manner. Our results show that the mutations may act through both manners: 1) the mutations reduce the phosphorylation level of PLN, which has been shown to enhance the inhibition of SERCA and inhibit the uptake of Ca2+; 2) the mutations change the conformation of PLN before binding to PKA or SERCA, which could have additional consequences, such as altered assembly state of PLN, phosphorylation of PLN by CaMKII, or changes in interactions of PLN with the lipid membrane. This could impact in either directions, reducing or increasing SERCA inhibition, which is difficult to predict based on our data. We added the explanation in the discussion “While decreased PLN phosphorylation is likely an important contributor to the physiological dysfunction associated with familial DCM, disease-causing mutations in PLN may have additional consequences, such as altered assembly state of PLN, phosphorylation of PLN by CaMKII, or changes in interactions of PLN with the lipid membrane. The influence of such factors on SERCA inhibition are unclear. In principle, they might further increase inhibition of SERCA and act in conjunction with lower PKA-mediated phosphorylation to manifest the disease symptoms. Conversely, it is possible that these factors could decrease the inhibition of SERCA, partially compensating for the decreased phosphorylation level, and mitigating the symptoms.”

      It is also unclear how reduced PKA phosphorylation of mutant PLN could lead to DCM. PLN is unlikely to be significantly phosphorylated by PKA at rest (in other words, PLN is likely to be phosphorylated by PKA during stress, i.e. during the adrenergic fight-or-flight response). Therefore, it is puzzling how such reduced PKA-dependent phosphorylation of PLN would significantly affect the PLN function during the absence of flight-or-flight response.

      As explained above, we think that this regulation could be through both phosphorylation-dependent and phosphorylation-independent manner. Even only considering the phosphorylation-dependent manner, the DCM phenotype could be due to an accumulation of the Ca2+ imbalance in the cell over repeated cycles of cardiac muscle contraction upon chronic accumulation of the sporadic phosphorylation events. It is also possible that the mutations affect the CaMKII-dependent regulation of PLN, which leads to DCM.

      Given that the DCM-associated PLN mutations have significant effects on the conformation of PLN itself, at least in the form of short-peptides, it is possible that these mutations could affect the folding, oligomerization, trafficking, degradation, etc., in addition to PKA-dependent phosphorylation. The relevance and contribution of reduced PKA-dependent PLN phosphorylation to DCM remain unresolved.

      We agree with the reviewers that both phosphorylation-dependent and phosphorylation-independent manners could contribute to the DCM disease phenotype. It remains unresolved which factor is the major contributor. We have added a statement in the discussion (see point above).

      Reviewer #3 (Public Review):

      This manuscript describes an elegant study utilizing the crystal structures for the elucidation of the disease mechanism of familial dilated cardiomyopathy. It has been known for decades that the mutations in PLN are associated with DCM, but the underlying mechanism remains controversial. In my opinion, Prof Yuchi and co-authors did excellent job on revealing the high-resolution crystal structures of PKA-phospholamban complexes, representing both the native and diseased states. Combined with various of biophysical and biochemical methods, including SPR, ADP-glo, thermal melts, NMR, etc, the authors systematically investigated the correlations between the PLN conformation, the binding affinity, and the phosphorylation level. The mechanism of PKA phosphorylation on another related substrate, ALN, was also convincingly revealed. The results are very helpful for understanding the pathological mechanism of PLN-related DCM. More importantly, the atomic structures of PKA-phospholamban complexes lay a solid foundation for the structure-based rational design of therapeutic molecules that can reverse the effects of the DCM-causing mutations in the future, e.g. by stabilizing the interactions between PLN and PKA.

      We thank the reviewer for the appreciation of our work.

    1. Author response:

      Reviewer #2 (Public Review):

      This work by Castledine et al. addresses the important question of whether results from in vitro (laboratory-based) evolution studies may be useful for predicting evolution during phage therapy in a clinical setting. In order to explore this question, the authors cultured a set of bacterial isolates from a patient pre- and during phage therapy, as well as phages from several time points during therapy. They then experimentally evolved (in vitro) a mixture of the bacterial isolates from the patient in the absence of phage, or in the presence of phage using two different treatments (phage added once or added repeatedly). Overall, they observed similarities between the evolutionary outcomes (genomic and phenotypic) in vitro and in the patient. Resistance evolved rapidly in the patient and in vitro under phage selection, and similar genomic changes were observed in both environments. The approach of using bacterial isolates directly from the patient (as well as the phages used for therapy) in vitro is clever, and the observed similarities are compelling.

      We thank the reviewer for appreciating the novelty in our results and methodology.

      However, I think there are some limitations with the study that should be addressed in the text.

      In particular, (1) While the similarities in vitro and in the patient are quite interesting, there are some differences that were dismissed as being minor without justification. Calling the results "highly parallel" is a bit subjective - in vitro in the repeated phage treatment (which is suggested to be most similar to the clinical context), there did appear to be phage coevolution that was not observed in vivo. The tradeoffs/relationships between traits (as shown in Fig. 3) also differed to some extent.

      We agree this could have been more objectively phrased at the start of the discussion – this has been edited to reflect this. We have highlighted the differences between in vivo and in vitro treatments with respect to phage evolution. Moreover, we have also highlighted that the observed trade-offs had different underlying mechanisms which may not always result in parallel evolutionary changes between in vivo and in vitro environments.

      Additionally, for the genomic results only a subset of variants were plotted (those in genes of known function), but there were far more significant variants in genes of unknown function that were not included. It is difficult to assess whether the genomic findings are truly similar across environments if only a fraction of those results were presented in the manuscript.

      We chose to concentrate on genes of only known function so that we could better understand their potential significance, and also because the figures and analyses (Figures 4 and 5) would become extremely complex and large and uninterpretable with genes of unknown function included. This is especially true for Figure 5, which would have required us to show 284 rows if all genes would have been included. Ultimately, whichever way we do this exploratory analysis, it is going to be difficult to see if findings are truly similar across environments because we only have a single patient who had phage therapy.

      However, we have redone the analysis with all of the significant genetic changes (SNPs and indels from both known and unknown genes) included.

      Figure 5 has been recreated and is now included as "Figure 5 - Figure supplement 1". All of the statistical analysis on (a) the number of SNP/indels seen (b) genetic distance from ancestor and (c) alpha diversity give quantitatively similar results. That is, although all the estimates are generally much higher after including many more genetic variants, all of the significant results from both the overall model fit and post-hoc multiple comparisons remain the same. One interesting result that came out of looking at all the genetic changes was that for genetic variants occurring in a gene of known function, 56% (28 out of 50) were de novo mutations, whereas this value was only 42% (98 out of 234) for variants in genes of unknown function.

      We then looked at the proportion of genetic variants (both in known and unknown genes) found in vitro that were also found in vivo. For genes of known function, 62% of genetic variants were found in vivo (31 of 50) and this was comparable to the 65% of genetic variants in genes of unknown function (153 of 234). Of the 26 genes of known function with differences identified in the in vitro analysis, 16 (61%) were also found to have genetic changes in vivo. The equivalent metric for genes of unknown function was 86% (85 of 99). Similar to in vitro, variants occurring in a gene of known function were more likely to be de novo mutations (77%) compared to variants occurring in a gene of unknown function (46%).

      While these patterns and exploratory analyses are interesting, they have extremely limited statistical power and therefore do not alter the conclusions or results of the work presented. For these reasons, we have chosen not to include these results in the already long manuscript. We have added a line to say we have done it both way:

      “We performed all downstream statistical analyses on (a) only genetic variants in genes of known function and (b) all genetic variants.”

      And we also added a line at the beginning of the genomic analysis results section:

      “Results were not affected whether we included only genetic variants occurring in genes of known function or all genetic variants (Figure 5-Figure supplement 1). As we were interested in attributing potential functions to the variants identified, we only present the results for genetic variants occurring in genes of known function.”

      (2) Much of the text is framed around whether in vitro outcomes are predictive of those in vivo, but this study only included results from a single patient. Thus, it is impossible to know whether these findings are by chance or representative of a more general relationship between in vitro and in vivo evolution.

      We agree that having a single patient for our in vivo comparison limits the generalisability of our results. We have highlighted this in the revised manuscript. However, that our replicated in vitro experiments agreed broadly with our in vivo results and that of other studies (finding resistance-virulence trade-offs) suggests that at least in some circumstance in vitro dynamics are predictive of in vivo dynamics. Further studies are clearly needed (and hopefully will arise as a consequence of this work) to determine the generalisability of this finding and the circumstances where this parallelism might break down.

      (3) Although the evolutionary outcomes appear to be similar, the pathogen was successfully cleared from the patient but persisted throughout experimental evolution. Whether the pathogen is successfully eliminated or not is presumably the most important clinical outcome, and while this difference is not surprising, it is an important one to point out to the reader. Essentially, evolution was similar to some extent but the consequences of evolution for bacterial persistence in each environment were quite different.

      We have now highlighted this difference to the reader in the revised manuscript.

    1. Author Response:

      Reviewer #1 (Public Review):

      • Line 141: It would be beneficial to better understand how the sequenced sample of the population corresponds to the PCR confirmed sample of the population, in order to understand possible selection biases in the sequence data. Could you elaborate on how the composition of sequence PCR confirmed cases matches the composition of PCR confirmed cases, by the demographic characteristics listed in Table 1.

      Early in the pandemic (March-April), we tried to sequence every SARS-CoV-2 positive case diagnosed in our KWTRP laboratory from Coastal Kenya. However, with the sharp increase in the number of identified cases from the month of May 2020 onwards, and a limited in-house sequencing capacity, we changed strategy to sequence only a sub-sample of the identified positives. The criteria for sub-sampling included having a cycle threshold of < 30.0, spatial representation (at county level) and temporal representation (at month level). The consequent number and proportion of samples sequenced across the study period months and across the counties is summarized in Fig. 2C-E with the sample flow provided in Figure 2-figure supplement 1.

      In the revised manuscript we have provided a comparison of the demographic characteristics of the sequenced cases versus non-sequenced cases (shown as Table 2). The participants providing the sequenced and non-sequenced positive samples had a similar gender distribution and similar probabilities of being from either from Wave one or Wave two. However, the distribution of sequenced vs non-sequenced cases differed significantly in age distribution, nationality and travel history. Specifically in the sequenced sample, there were more participants in 30–39 years age bracket compared to the non-sequenced samples, a disproportionately representation of non-Kenyan nationals and persons with a recent international travel history in the sequenced sample.

      • Line 283: I am particularly interested in the observed inter county flows, but it is hard to interpret the numbers. Considering population sizes in each county, what are the phylogenetically observed import rates per 100,000? What are the rate ratios? Based on the observed data, is there any evidence that imports into coastal Kenya occurred statistically significantly through Mombasa?

      We thank the reviewer for these comments.

      In the revised manuscript we have added two new tables (1 & 4) which detail the population size in each of the six Coastal Kenya counties, population density and estimated import/export rates (per 100,000) for the counties.

      The alluvial plots are descriptive regarding genome flows. The underlying data on the pattern of virus movement is inferred using the ancestral state reconstruction which an established phylogenetic approach that has been applied elsewhere to infer SARS-CoV-2 local and global movement (Wilkinson et al, Science 2021, Tegally et al, Nature, 2021).

      The results we obtained from ancestral state reconstruction of Mombasa being a major gateway for variants entering the coastal region of Kenya is consistent with (a) the county showing the highest number circulating of lineages (n=28) compared to the other five remaining counties of Coastal Kenya, (b) approximately half (n=21, 49%) of the detected lineages in coastal Kenya had their first case identified in Mombasa and (c) Mombasa had an early wave of infections compared to the other Coastal counties.

      We are not aware of an approach to consider statistical significance on these plots. The graphical display is based on the observed number events, and we would argue this is more appropriate than presenting absolute rates which would be susceptible to sampling bias.

      Is it possible to account for potential bias in sequence sampling in these calculations, perhaps as done in Bezemer et al AIDS 2021? It should be possible to adjust for the proportion of sequenced individuals in PCR confirmed individuals, and it might also be possible to back calculate infected cases from cumulative reported deaths and to adjust for the proportion of sequenced individuals in infected individuals?

      The reviewer suggests helpful methods to examine sampling bias, but we found this beyond scope here. Our method was based on ancestral location state reconstruction of the dated phylogeny. The approach has been used elsewhere to answer similar questions (Wilkinson et al, Science 2021, Tegally et al, Nature, 2021). The Bezemer paper uses maximum parsimony ancestral state reconstruction algorithm implemented in phyloscanner, and the Bayesian method applied to impute incomplete sampling is applicable to chains of transmission which we have not tried to reconstruct in our analysis.

      Considering my earlier recommendation to document sequence sampling representativeness in Table 1, if Mombasa is found to be oversampled relative to infections, then it might also be helpful to perform sensitivity analyses in which sequences from over-represented locations are down-sampled. Another option might be to consider the approaches considered in de Maio PLOS Comp Bio 2015, or Lemey Nat Comms 2020. Thank you for investigating potential caveats and substantiating your findings in more detail.

      In the revised manuscript we have clarified that our sequenced sample was proportional the number of positive cases reported in the respective Coastal Kenya counties (see-Fig.2E and Table 1).

      The De Maio method uses BASTA (BAyesian STructured coalescent Approximation) into BEAST for purposes of phylogeographic analysis to compare ability to discriminate a zoonotic reservoir vs the implausible alternative cryptic human transmission. Analyses developed from these methods would be valid and interesting to apply to our dataset but would be a major new analysis and beyond the scope of the present paper. We have therefore taken the approach of: a) more clearly acknowledging sampling bias (see below) and b) undertaking sensitivity analyses (Supplementary File 5, see below). Using the larger global background sequence sets selected in a different way (more geographically balanced relative to the first round that was random), we still find that most of the virus introductions into coastal Kenya occurred via Mombasa consistent with our previous analysis.

      The results are consistent with the case numbers in that (i) Mombasa experienced an earlier peak during wave one relative to other counties and (ii) had in total more cases than all the other five counties, and (iii) was commonly the first county of detection for many of the identified lineages in the region. However relative to its population, the border county of Taita Taveta had a higher import rate (13.5. per 100,000 people) compared to that of Mombasa (11.6 per 100,000 people), Table 4

      Observations from our sensitivity analyses (Supplementary File 5) are included in the revised manuscript. We found that the absolute number of estimated viral imports/exports and intercounty transmission events fluctuated depending on the number of Coastal Kenya sequences and size of global comparison dataset but with a clear pattern of (a) counted events increasing with sample size (b) with Mombasa County consistently leading in the number of events; imports or exports.

      • Line 292: The results are of course subject to differences in sequencing rates in each of the countries listed, and differences in reporting of these data.

      This is a valid concern; to mitigate the bias that arises with these differences, unlike in the previous comparison dataset where we randomly selected a specified number of samples per month for each continent, in the revised analysis we have done the selection at country level. We limited the comparison data to maximum of 30 genomes per country per month per year. In this way, countries with high sequencing rates do not become overrepresented in our comparison dataset.

      Some of these biases could be elicited through comparison to international travel data. For example, are the US and England also the top two countries from which most travellers arrive into Kenya? If such additional analyses are out of scope, it seems warranted to either strongly point to the substantial limitations of this analysis, or remove it altogether.

      We concur with the reviewer on the potential bias that could exist in conclusions that arise from inferring sources of importations based on genomic data alone, available from only a few countries. However, vital quality and curated international travel data into Kenya during the study period was not available to us at the time of this analysis. We have therefore agreed to remove the previous analysis on potential origins and destinations of observed Kenya lineages from the revised manuscript.

      What is perhaps striking is that Tanzania is entirely missing from this list, given extensive spread there. Another analysis that could be useful is a comparison of country specific lineage compositions, which might bypass some of the difficulties associated with substantial differences in sequence sampling/reporting rates.

      SARS-CoV-2 genomic data from Tanzania has not been publicly shared to date, and hence is not included. And as indicated above, we have removed the analysis that was trying to infer sources of SARS-CoV-2 importations into Kenya.

      To hypothesize on the potential lineages circulating in Tanzania, we have added a sentence detailing that 5 Pango lineages were identified among the 34 Tanzanian nationals who provided samples that were sequenced: B.1 (n=10), B.1.1 (n=10), B.1.351 (n=8), A (n=5) and A.23.1 (n=1)

      • Line 536: it seems problematic that the data used in the import/export analysis did not contain all available African sequences. Can these be included in the corresponding analysis please.

      In the revised manuscript we have included all accessible, good quality and contemporaneous Africa genomes in the revised manuscript (n=21,150). However due to the huge computational processing power need to process the phylogenetics for such large sequence data sets, we split the analysis into two parts, each with approximately 10,000 genomes (see Figure 3-figure supplement 1).

      Notably with the increased sample size (including the analysis of 390 more genomes from coastal Kenya), we detected far more imports of SARS-CoV-2 into Coastal Kenya compared to our previous analysis (n=280 vs n=69) but only a modest change in exports (n=95 vs n=105) and inter-county virus movement events (239 vs 190).

      Reviewer #2 (Public Review):

      Agoti et al. analyzed SARS-CoV-2 samples collected from infected patients in coastal Kenya, collected between March 2020 and February 2021. This period spans the first two waves of COVID-19 in Kenya, and the authors aimed to understand the lineages circulating throughout the region, in comparison to the virus circulating elsewhere in Kenya and in the world. The manuscript is clearly written, and the figures and results are thorough and well described throughout. These data add to our understanding of COVID-19 in Kenya and in East Africa, and the discussion of how different lineages spread in Kenya (single clusters versus dispersed over several regions) is both interesting and potentially useful for informing public health measures.

      The analyses are well done and excellently presented, but this paper is significantly lacking in a discussion of how sampling bias may affect the stated conclusions. Additionally, the paper focuses almost exclusively on genomic data and fails to closely examine epidemiological factors that may better contextualize the results presented.

      We thank the reviewer for bringing this to our attention, we have added the paragraph below to the revised manuscript.

      “Sampling bias is a potential limitation of this study arising from the fact that (a) demographic characteristics (age distribution, travel history and nationality) of the sequenced versus non-sequenced sub-sample differed significantly, (b) <10% of confirmed SARS-CoV-2 infections in Coastal Kenya were sequenced, prioritizing samples with a Ct value of <30.0 (Table 1); (c) the Ministry of Health case identification protocols were repeatedly altered as the pandemic progressed (Githinji et al., 2021) and (d) sampling intensity across the six Coastal counties differed, probably in part due to varied accessibility of our testing center that is located in Kilifi County (Figure 1A and Table 1). This may have skewed the observed lineage and phylogenetic patterns. To better contextualize the genomic analysis results, close examination of the case metadata is important, but unfortunately there was a lot of the metadata was missing (e.g., travel history, nationality, Table 2) which made it hard to integrate genomic and epidemiological data in an analysis. Although all analyzed genomes had > 80% coverage, very few were complete or near complete (>97.5%, n=344) due to amplicon drop-off or low sample quality and this may have reduced the overall phylogenetic signal.”

      Specifically:

      1) The authors do not discuss the potential effects of sampling on their import/export analyses. For example, they find that the USA and England are in the top six country sources of SARS-CoV-2 importation into coastal Kenya, as well as in the top six country destinations of viral export from the region. These two countries have generated huge numbers of sequences compared to the rest of the world, which may clearly bias these findings. While the authors do evaluate the sensitivity of their analyses by repeating them with different global subsamples, it is unclear if these subsamples corrected for large discrepancies in available data from different parts of the world.

      We concur and appreciate that sampling bias is indeed a common limitation in the type of analysis we have undertaken given the variation in data collection across geographies. Some of the approaches we took to correct for this have been highlighted in our responses to reviewer #1.

      In the revised manuscript, we have undertaken a reanalysis with a larger and more representative dataset at all scales of observation (Figure 3-figure supplement 1). Specifically, for the global dataset, we have revised our sub-sampling script to pick up the comparison dataset uniformly across months and countries for non-African countries. All the available African genomes have been included in our analysis including 605 collected in Kenya outside the coastal regional.

      Similarly, the authors find that new variant introductions were mainly through Mombasa city, but most of the Kenyan sequences were from this region, so it is perhaps unsurprising that more lineages were found there. The authors should repeat their analyses with a more representative global subsample, or at the very least discuss these caveats in the discussion and discuss what other evidence there may be to support their findings.

      Our sequencing rate by county is approximately proportional to the total number of cases seen in the county (Table 1 and Figure 2E). For Coastal Kenya, the revised manuscript included 389 additional genomes from coastal Kenya that became available while the manuscript was under review.

      Thus, in the revised manuscript, we have addressed the valid sampling bias concerns of the reviewers and editor by: (i) increasing the number of analyzed genomes in our dataset for previously under-represented periods and regions, (ii) including contemporaneous Kenyan genomes from outside the coastal counties in our import/export analysis, (iii) including all available Africa genomes into the analysis and selecting a balanced global sub-sample for inclusion into the analysis. In addition, were have also provided a paragraph in the discussion section highlighting sampling bias as a caveat to interpretation of the findings of the current study:

      “The accuracy of the inferred patterns of virus importations to and exportations from coastal Kenya are in part dependent on both the representativeness of our sequenced samples for Coastal Kenya and the comprehensiveness of the comparison data from outside Coastal Kenya. Our sequenced sample was proportional the number of positive cases reported in the respective Coastal Kenya counties (Figure 2E and Table 1). Also, we carefully selected comparison data to optimize chances of observing introductions occurring into the coastal region (e.g. by using all Africa data). But still there remained some important gaps e.g. non-coastal Kenya genomic data was limited (n=605). Despite this, we think the results from ancestral state reconstruction indicating that Mombasa is a major gateway for variants entering coastal Kenya is consistent with (a) the county showing the highest number lineages circulating (n=28) during the study period compared to the other five remaining Coastal counties Kenya, (b) approximately half (n=21, 49%) of the detected lineages in coastal Kenya had their first case identified in Mombasa and (c) Mombasa had an early wave of infections compared to the other Coastal counties and (d) is the most well connected county in the region to the rest of the world (large international seaport and airport and major railway terminus and several bus terminus).”

      2) Restriction measures enforced by the Kenyan government are briefly introduced at the very beginning of the manuscript and then mentioned at the very end as a possible explanation for observed transmission patterns. However, there is very limited discussion of the potential effect of restriction measures throughout, and no formal analyses are presented using this kind of epidemiological information. Adding formal analyses to back up the hypothesis that relaxation of interventions may have driven the second wave of infections would make this paper much stronger and potentially more interesting.

      In the revised manuscript, we have detailed the restriction measures the government of Kenya put in place in the introduction, methods, and results sections and discussed where appropriate on how we think they impacted the observed transmission patterns. We have added Supplementary Table 1 that provides the dates the various measures took effect or were relaxed.

      In a separate piece of work (Brand et al, 2021 published in Science journal, 10.1126/science.abk0414), we investigated the potential drivers of the first three waves of infection observed in Kenya and we have appropriately referenced this in the revised manuscript.

      We feel that additional analyses on the impact of the restriction measures on SARS-CoV-2 epidemiology and the lineage patterns observed are beyond the scope of this work whose focus was primarily genomic epidemiology.

      3) Generally, the text of the manuscript focused on waves of SARS-CoV-2 transmission, while the analyses presented data aggregated by month. A clearer connection between month and wave (particularly visually, on the figures themselves) would aid in interpretation of the data presented.

      This is a valid concern and a good suggestion. In the revised manuscript, for all temporal plots, we have added a line to demarcate when we switched from wave one to wave two period. Similarly, for several analyses, we have provided aggregations by wave period rather than by month.

      4) One of the strengths of this manuscript is the depth to which the authors discuss the detection of specific lineages in coastal Kenya. However, there is limited discussion of these results in the context of when various lineages appeared or disappeared globally, though these details are presented in a table. Discussing the appearance of the various lineages (was it surprising to see a particular lineage at a certain time or in a certain place?) would also improve this manuscript.

      In the revised manuscript, we have compared the patterns of lineage detection locally compared to all Kenya and to all continents in the newly added Figure 3. We have also discussed this aspect for the most frequent 4 lineages in both Wave one and Wave two.

    1. Author Response:

      Reviewer #1:

      Hauser et al, analyze two large datasets of GPCR-G protein interactions/couplings ("Inoue" and "Bouvier"), comparing and combining them with the widely-used literature-based Guide to Pharmacology (GtP) database. As the Inoue and Bouvier datasets were based on different experimental setups, this enables the identification of which couplings are supported by more than one method. The authors also establish a normalization protocol that enables to move from qualitative to quantitative comparisons and identify couplings that might be either below are above a rigid threshold. Overall, the paper describes a new resource and the methodologies used to build this resource. The resulting coupling map is available through the GPCRdb website, a widely used resource in the field.

      The authors have thus improved the ability of researchers to assess prior results and compare them to their own new data. This resource clearly and significantly upgrades options currently available and will likely be of interest and prove quite useful to scientists both in academia and in industry.

      We thank the reviewer for so nicely describing the study and its prospective application.

      Weaknesses include:

      • The data is described mostly by broad numbers, such as the number of receptors or coupling in a subset, or percentages. While this is helpful to understand the data, this reviewer found it hard to follow the mountain of numbers. A suggestion would be to add a section where the authors pick selected examples of particular experimental data and show how their combine database can resolve previously unanswered (or wrongly answered) questions of GPCR/G protein coupling.

      We have removed numbers in several places throughout Results where we had included multiple measures e.g., absolute numbers and percentages. Furthermore, where an overall number has been broken down into distributions, e.g., across different G proteins of families thereof, we moved other numbers to parentheses.

      The different sections of Results that answer questions of GPCR-G protein coupling have now been presented more clearly by updating their headings and grouping them all in a subsection of part of Results called “Research Advances – Insights on GPCR-G protein selectivity”. These sections are all based on our “combined database”/coupling map. In each such section, we start at the overall level – covering all GPCRs and/or G proteins – but then give selected examples thereof that are weaved into and exemplifies the text. This approach has also been used in the new Results section “Differential tissue expression gives G proteins in the same family large spatial selectivity”, which gives selected examples of G proteins with specific tissue expression profiles.

      Given that the paper has already exceeded the maximum of 5,000 words by quite a bit, we think that this approach of weaving selected examples into each selectivity insight section is the most appropriate, and that it brings most clarity. Furthermore, we hope that readers will be inspired to use our coupling map to generate additional questions for future experiments.

      • The paper does not reveal new biological findings. For example, while some emphasis is placed on new data on G15, it would be helpful to take the extra step and use this to suggest new biological insights.

      eLife’s author guidelines (https://reviewer.elifesciences.org/author-guide/types) state that “Tools and Resources articles do not have to report major new biological insights or mechanisms, but it must be clear that they will enable such advances to take place, for example, through exploratory or proof-of-concept experiments.” In case this manuscript is published as a Tools and Resources paper, it may therefore be sufficient to provide the foundation for future studies to reveal new biological findings.

      Nevertheless, the coupling map led to biological findings relating to patterns and mechanisms of GPCR-G protein selectivity that were not described in the original studies. I.e., while this study did not generate new data, it arrived at new insights based on published data. This seems to be in line with eLife’s publication format “Research Advances” (https://reviewer.elifesciences.org/author-guide/types), and the Analysis format of several other journals. Some insights described herein have not been presented before while others have been updated in scope and precision. Furthermore, we have added a new section of Results with insights on G protein expression profiles and co-expression.

      We have clarified this by updating the headings of the sections that present these insights, and grouped them under a common subheading of Results termed “Research Advances – Insights on GPCR-G protein selectivity”. However, in case we have overlooked very recent studies describing some of the same biological insights, we would please like to ask for their references and would be more than willing to revise the manuscript again to incorporate them. Furthermore, if the Reviewer is missing a particular analysis that is critical to understand GPCR-G protein coupling, please let us know.

      • The authors cautiously label couplings supported by only one dataset as "unsupported". It would seem more helpful to grade couplings by a reliability scale, providing users with a wider set of data. Perhaps only couplings that are directly conflicted by negative data should be labeled as unsupported?

      We understand that the term “unsupported” has been used in a confusing way. We have now replaced this term with “unique” and explained all terms in Table 1 of the revised manuscript.

      To address the need for a means to grade or filter couplings by reliability, we have added the following paragraph to the manuscript:

      “To enable any researcher to use the coupling map, we have availed a “G protein couplings” browser (https://gproteindb.org/signprot/couplings) in GproteinDb (2). By default, this browser only shows “supported” couplings with evidence from two datasets, but there is an option (first blue button) to changes the level of support to only one (for most complete coverage of GPCRs) or to three (for the highest confidence) sources. We propose a standardized terminology to describe couplings based on their level of experimental support from independent groups (Table 1). The criterion of supporting independent data, and the terms “proposed” and “supported”, are already used by the Nomenclature Committee of the International Union of Basic and Clinical Pharmacology (NC-IUPHAR) for GPCR deorphanization. Furthermore, the online coupling browser allows any researcher to use only a subset of datasets, or to apply filters to the Log(Emax/EC50), Emax, and EC50 values. Finally, users can filter datapoints based on a statistical reliability score in the form of the number of SDs from basal response."

      Furthermore, we have added references to the online G protein coupling browser in the:

      (1) Introduction ending: “On this basis, we develop a unified map of GPCR-G protein couplings that can be filtered or intersected in GproteinDb …”, (2) Fig. 2 legend ending: “Note: Researchers wishing to use this coupling map, optionally after applying own reliability criteria or cut-offs, can do so for any set of couplings in GproteinDb (1).” (3) Fig. S2 ending: “Unique couplings are hidden by default in the online G protein couplings browser in GproteinDb, as they await the independent support by a second group.”

      To many scientists the most reliable option is to involve NC-IUPHAR. Gloriam is a corresponding member of NC-IUPHAR, which has mentioned the possibility of involving its many worldwide pharmacological experts to update GtP on a case-by-case basis for receptors. For example, many of the “novel” couplings jointly supported by Bouvier and Inoue may be added. This option is advantageous as it involves experts in each receptor system (often with knowledge of other relevant studies) and is backed by the authoritative organization.

      • Given that this manuscript includes authors from both the Inoue and Bouvier studies, I can understand why they are not directly assessing which of the two datasets (in relation to the GtP) might be more accurate. Nevertheless, I believe this assessment should be done and that the advantages and disadvantages of the two experimental systems discussed clearly.

      We believe that the three-way intersection of couplings is the most informative and therefore preferred over individual comparison of each of the Inoue and Bouvier datasets to GtP. GtP is unfortunately not suitable as a stand-alone resource – neither to contradict nor support couplings (on the G protein subtype level). This is because GtP is incomplete (especially for G12/13) and does not provide any information on the level of G protein subtypes, only families. The three-way interactions will always use GtP but adds a second dataset on top of this when validating a third dataset. Our manuscript already included a three-way intersection of datasets, allowing readers to conclude which dataset might be more accurate (then Fig. 3 and Spreadsheet 3) on a per-G protein basis.

      In the revised manuscript, we have rewritten this section, which now has the heading “Bouvier’s and Inoue’s biosensors appear more sensitive for G15 and, Gs and G12, respectively. We have also made a completely new figure, Fig. 7, which more clearly illustrates for which G proteins that Bouvier and Inoue may have overrepresented or underrepresented couplings. This section specifically investigates the question of whether differential sensitivity can explain “unique” couplings. However, such unique couplings can either be due to overrepresentation or instead be true positives that are missing in GtP because of incompleteness and in the other biosensor due to lower sensitivity. Unfortunately, we will not be able to distinguish these possibilities until the research community has gained additional datasets from independent biosensors with as high sensitivity.

      Whereas our study compares datasets rather than experimental systems, we have added a paragraph in the Discussion describing which aspects should be considered when choosing a biosensor. There, we reference a review from last year dedicated to biosensors and describing their pros and cons (3), and the accompanying paper by Bouvier et al. (4), comparing several aspects of the experimental system used by Inoue et al. It is also important to note that the most advantageous biosensor may be one of the two for which data is analyzed in our paper. For many studies, researchers may instead be better off with another biosensor, for example those from Lambert/Mamyrbekov (5), Roth (2) (Gαβγ sensors first described in (6-11)) or Inoue (unpublished dissociation assays using wt G proteins fused with LgBit and HiBit). These are all referenced in the Discussion.

      References:

      1. Pandy-Szekeres G, Esguerra M, Hauser AS, Caroli J, Munk C, Pilger S, et al. The G protein database, GproteinDb. Nucleic Acids Res. 2022;50(D1):D518-D25. 10.1093/nar/gkab852
      2. Olsen RHJ, DiBerto JF, English JG, Glaudin AM, Krumm BE, Slocum ST, et al. TRUPATH, an open-source biosensor platform for interrogating the GPCR transducerome. Nat Chem Biol. 2020;16(8):841-9. 10.1038/s41589-020-0535-8
      3. Wright SC, Bouvier M. Illuminating the complexity of GPCR pathway selectivity – advances in biosensor development. Curr Opin Struct Biol. 2021;69:142-9. https://doi.org/10.1016/j.sbi.2021.04.006
      4. Avet C, Mancini A, Breton B, Gouill CL, Hauser AS, Normand C, et al. Effector membrane translocation biosensors reveal G protein and B-arrestin profiles of 100 therapeutically relevant GPCRs. bioRxiv. 2021:2020.04.20.052027. 10.1101/2020.04.20.052027
      5. Masuho I, Martemyanov KA, Lambert NA. Monitoring G Protein Activation in Cells with BRET. Methods Mol Biol. 2015;1335:107-13. 10.1007/978-1-4939-2914-6_8
      6. Gales C, Rebois RV, Hogue M, Trieu P, Breit A, Hebert TE, et al. Real-time monitoring of receptor and G-protein interactions in living cells. Nat Methods. 2005;2(3):177-84. 10.1038/nmeth743
      7. Gales C, Van Durm JJ, Schaak S, Pontier S, Percherancier Y, Audet M, et al. Probing the activation-promoted structural rearrangements in preassembled receptor-G protein complexes. Nat Struct Mol Biol. 2006;13(9):778-86. 10.1038/nsmb1134
      8. Schrage R, Schmitz AL, Gaffal E, Annala S, Kehraus S, Wenzel D, et al. The experimental power of FR900359 to study Gq-regulated biological processes. Nat Commun. 2015;6:10156. 10.1038/ncomms10156
      9. Breton B, Sauvageau E, Zhou J, Bonin H, Le Gouill C, Bouvier M. Multiplexing of multicolor bioluminescence resonance energy transfer. Biophys J. 2010;99(12):4037-46. 10.1016/j.bpj.2010.10.025
      10. Bunemann M, Frank M, Lohse MJ. Gi protein activation in intact cells involves subunit rearrangement rather than dissociation. Proceedings of the National Academy of Sciences of the United States of America. 2003;100(26):16077-82. 10.1073/pnas.2536719100
      11. Janetopoulos C, Jin T, Devreotes P. Receptor-mediated activation of heterotrimeric G-proteins in living cells. Science. 2001;291(5512):2408-11. 10.1126/science.1055835

      Reviewer #2:

      This study is a meta-analysis of previously reported studies on G protein-coupled receptor (GPCR) coupling to G proteins. The data sets are from three distinct sources: a compendium compiled by the International Union of Basic & Clinical Pharmacology (IUPHAR), and two data sets compiled by two separate laboratories. Each of these data sets describes the coupling of members of the superfamily of non-sensory GPCRs (~200 genes) to the large family of G protein alpha subunits (~20 genes). The authors try to arrive at a consensus for receptor-G protein coupling from the three data sets, as well as identify and highlight differences or incongruencies. Compiling these vast data sets into a unified format will be extremely useful for investigators to understand receptor and effector relationships. The meta-analysis will help to deconvolute the complex physiology and pharmacology underlying hormone or drug actions acting on receptor superfamilies. A better understanding of receptor-G protein selectivity and/or promiscuity will ultimately help in identifying safer therapeutics.

      We appreciate the summary and the explanation of the usefulness of our meta-analysis and its potential impact.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This article focuses on one possible outcome of protein sequence evolution after duplication, in which the residue distribution at specific positions of a multiple sequence alignment becomes uncoupled from the distribution expected from the phylogeny of the protein family. The authors call these events "residue inversions" and interpret them as the result of functional pressures on family members with diverging cellular roles. Based on a theoretical model of residue evolution after duplication of the coding gene, the authors describe the criteria for categorizing a particular position in a protein as a "residue inversion" and develop an algorithm to identify such events in a multiple alignment. They then apply their approach to the family of Epidermal Growth Factor Receptors in Teleost fishes and identify 19 EGFR positions in a dataset of 88 fish genomes, which satisfy the criteria of "residues inversions". They provide support to the scoring scheme used in their approach through a simulated evolution run and conclude from a comparison of their positions to the ones predicted by SPEER to represent Specificity Determining Sites that the two are largely orthogonal and may therefore complement each other in sequence-based function prediction.

      Major comments: 1. Throughout the paper, the functional involvement of positions subject to "residue inversions" is indirect, inferred from the literature, and in parts sparse and tenuous. It therefore remains unclear to what extent the interpretation that "residue inversions" represent functional adaptations is correct. The authors acknowledge this uncertainty in several places, including the Conclusions.

      We agree with the reviewer that without experimental validation an uncertainty about the data interpretation remains, however testing protein function on a large scale and in non-model organisms is extremely challenging. Since we were aware of this obstacle, we validate our conclusions in different ways: 1. the theoretical model and the simulated MSA both show a lower chance of observing residue inversions than what we detected in the teleost fish EGFR example. 2. previous literature highlighted an identified inverted residue as the possible cause of sub-functionalization of teleost fish EGFR. 3 We generated the alpha fold models of teleost fish EGFR and performed molecular dynamic simulation of the two copies, in complex with the ligand. In our simulations, we see the same trend that we observe with the inter-paralog inversions at the functional level. The new results have been integrated in line 692-706.

      "Residue inversion" is a very unintuitive term, which took me several readings to penetrate and made reading the article difficult. The authors may wish to reconsider this term. Naively, a residue inversion would be the swapping of residues between two positions, such that a residue expected in position A is found in position B, while the residue expected in B is found in A. That is what I suspect most readers will think.

      We acknowledged that the terminology might be confusing. We therefore decided to define it as inter-paralog inversion of amino acids throughout all the text.

      Is the phenomenon described here just a curiosity, or an important aspect of divergent evolution after duplication? The authors seem to be of two minds about it, calling the phenomenon "rare" in the Abstract, but an "important and understudied outcome of gene duplication" in the Introduction, then hedging again that it "might be rare" in the Conclusions. The benefits of recognizing such positions are also formulated with great caution, for example in lines 309-311: "In summary, the identification of residue inversion event has the potential to improve functional residue predictions".

      We agree with the reviewer that we did not yet test the recurrence of this event on a large scale, however this does not exclude that this event is frequent. This work is focused on the observation, characterization, and implications of this event. Considering this comment and the one below we decided to perform a further analysis (see below for more details).

      Additionally, the analysis of the frequency of this event at the whole-organism scale on multiple organisms, while interesting, would be out of the scope of this paper, if not just because it requires a totally different (large-scale) approach compared to the one used in here. This type of analysis is also limited by the absence of a database collecting intermediate knowledge that would speed up the initial part of ortholog classification at a broad range.

      Finally, by rarity we mean the statistical chance of the event, not considering the effective chance of observing it from the real data. In fact, we rectified in the text using the reviewer’s observation.

      OLD VERSION (ppXX):

      Our work uncovers a rare event of protein divergence that has direct implications in protein functional annotation and sequence evolution as a whole.

      NEW VERSION:

      Our analysis shows a new way to investigate an important and understudied outcome of gene duplication.

      It would probably strengthen the article substantially if the authors would (I) use their program to scan a large number of multiple alignments in order to establish more reliably how frequent this phenomenon actually is, and whether it is universal or a specifc aspect of eukaryotic, maybe even only vertebrate evolution; and then (II) mapped the positions identified on structural models for the proteins, obtained by homology modeling or AlfaFold prediction, in order to substantiate their potential origin as functional adaptations.

      We thank the reviewer for the thoughtful suggestions. (I) we tested the inter-paralog inversion score at the proteome level using a reduced dataset (70) of reference teleost fish proteomes from Uniprot. We obtained 54 proteins that duplicated in the teleost specific whole genome duplication, then we run our pipeline on it. We found that the overall distribution of scores is more similar to the simulated evolution experiment rather than to the EGFR test case. We integrated the new results and discussion in a new paragraph and new figure in line 708-716.

      (II) We considered also the analysis requested in the second point. Unfortunately, we could not extract any meaningful data from the AlphaFold models.

      Reviewer #1 (Significance (Required)):

      A method to improve the functional annotation of proteins in a paralogous family would be very useful, given the abundance of sequence data.

      We thank the reviewer for acknowledging the importance of the question that we have addressed.

      I am knowledgeable in varios aspects of molecular evolution and functional annotation. I am neither a mathematician, nor a developer of phylogenetic methods, so I cannot judge these aspects of the paper.


      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Review of Pascarelli and Laurino titled “Identification of residue inversions in large phylogenies of duplicated proteins”

      I find the topic of the paper very exciting and long overdue. Indeed, I was under the impression that the question of parallel evolution in paralogous copies must have been addressed long ago: to my surprise, having looked in depth at the literature, that is only partially so. The manuscript, therefore, addresses a relatively novel and fundamental question of broad interest.

      We thank the reviewer for his positive comment.

      Having said this, I also found the manuscript to suffer from an identity problem, which in many places encroaches on the underlying quality of the science. I will structure my review into three concerns: the identity issues, the novelty issue and the emergent quality issues from the two.

      Identity issues:

      The manuscript is primarily dealing with an evolutionary issue – or I am biased to see it this way as an evolutionary researcher myself. Nevertheless, much of the language and terminology of the paper either misuses evolutionary terms or invents new ones in its place with a bias towards a protein chemistry perspective. Specifically, what the authors call “residue inversions” is called “parallel evolution” or “convergent evolution” in the literature. Also, "residues" are typically used for physical amino acids in a structure. If we are talking about sequence level “amon acid” would be a better term. The issue is further confounded by the meaning of “inversion” in genetics as a single mutation that inverts the position of nucleotides (i.e. an “AT” becomes “TA”).

      I strongly recommend for the authors to become familiarized with the common usage of existing and widely used terms in evolutionary biology that describe the phylogenetic patterns they see: parallel evolution, convergent evolution, homoplasy, etc, and to use them consistently throughout the manuscript.

      The same goes for "mutation", which the authors confuse on two levels: evolutionary and biochemical. Sometimes the authors refer to “mutation” of amino acids (which can be entertained at some level, but from a genetic perspective only nucleotides mutate – in the protein biochemistry field this term is frequently applied to amino acid residues, which is the basis of the identity issue). However, since the authors also use “mutation” to refer to a “substitution” (which is what we call a mutation that has become fixed in evolution) this creates another level of confusion. I urge the authors to change this aspect of the language of the manuscript to better reflect evolutionary concepts.

      As part of the language issues I am not sure how meta-functionalization in the author’s view differs either from neofunctionalization or specialization of duplicated genes.

      We thank the reviewer to point out the terminology issue, this will also help reaching a broader audience. We clarify the confusion surrounding the terms “mutation” and “residue inversion” by changing the former to “substitution”, while the latter to “inter-paralog inversions” (see also other reviewer comments).

      We understand the importance of the usage of the correct term to talk about this event of protein sequences evolution. Therefore, we used convergent and parallel evolution accordingly when we discussed the nuances between Metafunctionalization and parallel evolution in the text, in lines 188 and 399.

      Novelty issues:

      As I mentioned, the issue of parallel evolution of gene duplications is an extremely interesting topic. I was sure that the people who studied parallel evolution, or those interested in gene duplications, must have published extensively on this. However, my search of the literature revealed only a modest pre-existing effort. Nevertheless, previous efforts are not entirely non-existent and should be cited and discussed in this paper too. The most pertinent example is

      https://bmcecolevol.biomedcentral.com/articles/10.1186/s12862-020-01660-1

      which has an identical setup from what I can tell (compare Figure 1 in each paper).

      This paper was not hard to find using "parallel evolution", thus my focus on the language issues in the previous section.

      We thank the reviewer for his suggestion, we included the relevant papers in the text in lines 520-523. Interestingly, the cited paper shows that a comprehensive analysis of the fate of duplicated genes at the sequence level was done. However, in this paper, the ‘fate’ of a paralog is determined by counting the number of sites that support one or the other fate, independently of the orthologous relationship. In our study, we start from the orthologous relationship to pre-determine the fate of the paralogous protein, then we identify the sites that break this assumption. Our type of analysis is deemed to work only where the orthologous relationship is unequivocal. That is the reason why we chose an example with relatively short branch lengths after duplication (the teleost specific duplication). Our rationale is that with a higher genome coverage across organisms, resolving the orthologous relationship will get easier in time. However, our study focuses on a distinct case (asymmetric divergence) where the diverging paralogs converge to the same phenotype. In such a case, neutral substitutions related to the ancestral relationship of a protein can be filtered out to better search for functional adaptations.

      Content issues:

      The lack of attention to evolutionary concepts, in my opinion, provided some missed opportunities for the authors to attack the problem in a more convincing fashion. Specifically, in the setup to distinguish between parallel evolution of paralogues versus orthologues ("inversion" versus "species-specific adaptation" in the author's text) one must be able to distinguish between the two copies and assign true evolutionary relationship. In practice, that is not always possible based on tree lengths or topologies alone because of confounding factors such as independent duplications or gene conversion events.

      I would feel better about the results of this study if the following two things were integrated.

      The use of synteny to better determine homologous relationships (declare copies to be true paralogues if they occupy the same syntenic region). To compare the frequency or parallel evolution of paralogues versus orthologues as a null model of the expected number of parallel events in paralogous copies.

      We agree that a synteny analysis has to be included. We tested it for the EGFR proteins in fish and the results support the orthologous relationship of EGFRa and EGFRb in the two groups compared (Cypriniformes versus other teleosts). The results were included in the text and in the Supplementary figure in lines 303-305.

      The second point targets the way the model derives the expectations: at the author's own admission the model makes a number of unrealistic assumptions, ") equal branch length between the two paralogs; 2) only zero to one mutation can occur in each of the six branches; 3) after a mutation, each residue is equiprobable; 4) no selective pressure; 5) the probability of a mutation on a branch solely depends on the branch length (mutation rate). The authors do not really test the resulting tree on deviation from these assumptions (I am sure that it does not conform) but essentially comparing the occurrence of parallel events in paralogues versus orthologues may solve the problem with a less restrictive set of assumptions (that one expects an equal number of parallel events in paralogues and orthologues unless there is some paralogue-specific selection pressure, which is what the authors are looking for.

      We compared the occurrence of the two outcomes in both the simulation and in the real data. In all cases, the two score distributions have a very similar shape, with a 99th percentile score of respectively 0.062 and 0.113. Most sites in an alignment (>99%) are not expected to be inverted and will have scores very close to 0, making the identification of inversions a quest for outliers. Furthermore, in case of the real data, each distribution can be independently affected by different selective pressures that might bias the background distribution. While the inversion in paralogs is expectedly involving few, functional, residues, the inversion in orthologs is expected to have a broad effect. For example, a temperature adaptation might shift the number of polar residues on the protein surface (see for example: https://academic.oup.com/peds/article/13/3/179/1466666). Also, a different protein chosen for analysis might generate a different background distribution of the two events. In the larger dataset, the similarity of the two distributions is even more (99th percentile of 0.07 and 0.08). Because of the shown similarity of the two event distributions, and the possible issues with different selective pressures, we leave the analysis suggested by the reviewer as a post-processing possibly performed by the user. We report a summary of this result born from the reviewer’s observation in line 478.

      In summary, I believe that the topic is very interesting, the authors potentially found a new aspect of evolution of a specific gene family. However, in my opinion a major revision is needed to unite this text with the terms in the field, the previous publication and to integrate the two additional analyses I suggested.

      Minor Comments:

      I started adding these specific comments before generalizing the broader deviation from the common evolutionary language. There are more further along in the manuscript, but in the interest of time I will not articulate them here hoping that the authors will first try a major revision targeting these issues.

      Line 64: While neutral mutations help to determine the phylogenetic position of a protein, mutations of functional residues are a signal of functional shifts that might occur independently of the phylogeny. - this is quite misleading. All substitutions (neutral or beneficial) have a phylogenetic signal. In any case, this is discussed here in phylogenetic terms: https://pubmed.ncbi.nlm.nih.gov/10742039/

      We corrected the sentence to refer to divergence time instead of phylogenetic signal.

      OLD VERSION:

      While neutral mutations help to determine the phylogenetic position of a protein, mutations of functional residues are a signal of functional shifts that might occur independently of the phylogeny.

      NEW VERSION:

      While neutral substitutions are directly proportional to the time of divergence, a change in functional residues could be a signal of a functional shift that might occur independently of the divergence time.

      Line 107: "under high evolutionary pressure" - I do not know what evolutionary pressure is nor why it can be high or low.

      We corrected the term to “selective pressure”.

      OLD VERSION:

      Lorin et al. showed that both copies of EGFR might have been retained because they are involved in the complex process of skin pigmentation (40), which is under high evolutionary pressure in most fish.

      NEW VERSION:

      Lorin et al. showed that both copies of EGFR might have been retained because they are involved in the complex process of skin pigmentation (40), a trait that is under selective pressure in most fish

      Line 112 "linearly inherited across orthologs" - linear is a poor choice of a word here. The first thing that comes to my mind is quadratic inheritance as an alternative. Perhaps the authors are looking for "vertical" versus "horizontal" - these are established terms in phylogenetics (think "horizontal gene transfer").

      We corrected the term to “vertically inherited”.

      OLD VERSION

      Therefore, the power to predict functional residues is limited by our ability to track protein function on the phylogenetic tree when it is not linearly inherited by orthologs.

      NEW VERSION

      Therefore, the power to predict functional residues is limited by our ability to track protein function on the phylogenetic tree when it is not vertically inherited by orthologs.

      It is my invariant practice to reveal my identity to the authors,

      Fyodor Kondrashov

      Reviewer #2 (Significance (Required)):

      Addressed in the above

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review of Pascarelli and Laurino titled "Identification of residue inversions in large phylogenies of duplicated proteins"

      I find the topic of the paper very exciting and long overdue. Indeed, I was under the impression that the question of parallel evolution in paralogous copies must have been addressed long ago: to my surprise, having looked in depth at the literature, that is only partially so. The manuscript, therefore, addresses a relatively novel and fundamental question of broad interest.

      Having said this, I also found the manuscript to suffer from an identity problem, which in many places encroaches on the underlying quality of the science. I will structure my review into three concerns: the identity issues, the novelty issue and the emergent quality issues from the two.

      Identity issues:

      The manuscript is primarily dealing with an evolutionary issue - or I am biased to see it this way as an evolutionary researcher myself. Nevertheless, much of the language and terminology of the paper either misuses evolutionary terms or invents new ones in its place with a bias towards a protein chemistry perspective. Specifically, what the authors call "residue inversions" is called "parallel evolution" or "convergent evolution" in the literature. Also, "residues" are typically used for physical amino acids in a structure. If we are talking about sequence level "amon acid" would be a better term. The issue is further confounded by the meaning of "inversion" in genetics as a single mutation that inverts the position of nucleotides (i.e. an "AT" becomes "TA").

      I strongly recommend for the authors to become familiarized with the common usage of existing and widely used terms in evolutionary biology that describe the phylogenetic patterns they see: parallel evolution, convergent evolution, homoplasy, etc, and to use them consistently throughout the manuscript.

      The same goes for "mutation", which the authors confuse on two levels: evolutionary and biochemical. Sometimes the authors refer to "mutation" of amino acids (which can be entertained at some level, but from a genetic perspective only nucleotides mutate - in the protein biochemistry field this term is frequently applied to amino acid residues, which is the basis of the identity issue). However, since the authors also use "mutation" to refer to a "substitution" (which is what we call a mutation that has become fixed in evolution) this creates another level of confusion. I urge the authors to change this aspect of the language of the manuscript to better reflect evolutionary concepts.

      As part of the language issues I am not sure how meta-functionalization in the author's view differs either from neofunctionalization or specialization of duplicated genes.

      Novelty issues:

      As I mentioned, the issue of parallel evolution of gene duplications is an extremely interesting topic. I was sure that the people who studied parallel evolution, or those interested in gene duplications, must have published extensively on this. However, my search of the literature revealed only a modest pre-existing effort. Nevertheless, previous efforts are not entirely non-existent and should be cited and discussed in this paper too. The most pertinent example is

      https://bmcecolevol.biomedcentral.com/articles/10.1186/s12862-020-01660-1

      which has an identical setup from what I can tell (compare Figure 1 in each paper).

      This paper was not hard to find using "parallel evolution", thus my focus on the language issues in the previous section.

      Content issues:

      The lack of attention to evolutionary concepts, in my opinion, provided some missed opportunities for the authors to attack the problem in a more convincing fashion. Specifically, in the setup to distinguish between parallel evolution of paralogues versus orthologues ("inversion" versus "species-specific adaptation" in the author's text) one must be able to distinguish between the two copies and assign true evolutionary relationship. In practice, that is not always possible based on tree lengths or topologies alone because of confounding factors such as independent duplications or gene conversion events.

      I would feel better about the results of this study if the following two things were integrated.

      The use of synteny to better determine homologous relationships (declare copies to be true paralogues if they occupy the same syntenic region). To compare the frequency or parallel evolution of paralogues versus orthologues as a null model of the expected number of parallel events in paralogous copies.

      The second point targets the way the model derives the expectations: at the author's own admission the model makes a number of unrealistic assumptions, ") equal branch length between the two paralogs; 2) only zero to one mutation can occur in each of the six branches; 3) after a mutation, each residue is equiprobable; 4) no selective pressure; 5) the probability of a mutation on a branch solely depends on the branch length (mutation rate). The authors do not really test the resulting tree on deviation from these assumptions (I am sure that it does not conform) but essentially comparing the occurrence of parallel events in paralogues versus orthologues may solve the problem with a less restrictive set of assumptions (that one expects an equal number of parallel events in paralogues and orthologues unless there is some paralogue-specific selection pressure, which is what the authors are looking for.

      In summary, I believe that the topic is very interesting, the authors potentially found a new aspect of evolution of a specific gene family. However, in my opinion a major revision is needed to unite this text with the terms in the field, the previous publication and to integrate the two additional analyses I suggested.

      Minor Comments:

      I started adding these specific comments before generalizing the broader deviation from the common evolutionary language. There are more further along in the manuscript, but in the interest of time I will not articulate them here hoping that the authors will first try a major revision targeting these issues.

      Line 64: While neutral mutations help to determine the phylogenetic position of a protein, mutations of functional residues are a signal of functional shifts that might occur independently of the phylogeny. - this is quite misleading. All substitutions (neutral or beneficial) have a phylogenetic signal. In any case, this is discussed here in phylogenetic terms: https://pubmed.ncbi.nlm.nih.gov/10742039/

      Line 107: "under high evolutionary pressure" - I do not know what evolutionary pressure is nor why it can be high or low.

      Line 112 "linearly inherited across orthologs" - linear is a poor choice of a word here. The first thing that comes to my mind is quadratic inheritance as an alternative. Perhaps the authors are looking for "vertical" versus "horizontal" - these are established terms in phylogenetics (think "horizontal gene transfer").

      It is my invariant practice to reveal my identity to the authors,

      Fyodor Kondrashov

      Significance

      Addressed in the above

    1. Is this all right, or are there other people that, in this case, you would rather be paired with for whatever reason—even if that reason is only for breaking up the appearance of possible racism; since the appearance of possible racism can be just as much a factor in reproducing and promoting racism as anything else: Racism is as much about accustoming people to becoming used to certain racial configurations so that they are specifically not used to others, as it is about anything else. Indeed, we have to remember that what we are combatting is called prejudice: prejudice is pre-judgment—in this case, the prejudgment that the way things just happen to fall out are “all right,” when there well may be reasons for setting them up otherwise.

      i guess in a SF world the thought of having a diverse appearance is important in order to combat prejudice but in the world we live in today i don't know if thats always the case. How many ads do you see today with people of color for companies who in the past haven't always been the most diverse (the film industry). Its like white people have found away to hide their red hands behind our colored faces for the perception of diversity. who is diversity really helping if white people are the ones cashing in. Hers is a list of companies that appear to promote diversity and inclusion but actually don't. Amazon, Apple, Bank of america, Cisco, Facebook, KFC, Lego, L'Oreal, Loius Vuitton. all of these companies have spent a lot of money to appear one way but when you look at who is really profiting on their board of directors, they are all white and mostly men. To be clear i think Delany's point is valid and i agree with it but a lot has happened since and we cant continued to be fooled at face value. Forget having a seat at the table, if it's only for perception.

    1. Reviewer #1 (Public Review):

      The premise of this paper is that a significant amount of microbial diversity might be maintained not purely through resource partitioning, as has been the thrust of multiple recent papers over the last few years, but perhaps also through "physical" differences between organisms---here manifested by the detachment rate of heterotrophic bacteria from resources in the form of particulate matter. I completely agree with that premise, and agree that this is an underexplored niche axis that is important to account for when seeking to understand coexistence and diversity.

      As with any mathematical model, the assumptions made are critical to get right, and different assumptions about the details of resource uptake, dispersal, and competition may lead to different conclusions. So my comments primarily relate to some of these mathematical choices, as well as to their explanation in the text.

      -- In framing the paper, I think the authors are right to focus on dispersal and detachment as under-explored mechanisms. But readers will benefit from reference to other work (even on particle-associated microbes) related to resource diversity, succession, and crossfeeding. That can only help put the current study in context with other mechanisms for the maintenance of microbial diversity.

      -- There is a population growth process when a cell settles on a new particle. This is assumed to be logistic growth, though in the end, it seems likely that the precise dynamics of the growth process don't matter so much as the final abundance (carrying capacity). However, this seemed subtle to me for three reasons:

      (i) Will detachment rate directly affect carrying capacity?

      (ii) Is carrying capacity occurring when microbes fill out the surface of a particle, or when they have eaten the entire volume of a particle?

      (iii) If the former, will particles continue to be shed from the particle as growth continues approximately linearly?

      It's possible that none of this matters too much if all that's important is a final population size. However, it might help to clarify the process for readers if we have a conceptual picture of what this final population size represents (surface of particle being filled? or volume of particle entirely eaten up) and if there is a truer picture of the dynamics than logistic growth.

      -- The relationship between the trade-off (between different detachment rates) derived in Eq 2 versus the optimal detachment rate (derived in the methods) is framed a little confusingly. If I understand correctly, the "trade-off" actually comes from the condition that a population will have net non-negative growth rate in the absence of other populations with different strategies. So it may be reasonable to frame this as a threshold---a necessary condition rather than a sufficient condition for a given population to persist. The reason I say this is that it is a bit confusing to have a trade-off that suggests a range of detachment rates can coexist so long as they differ in their carrying capacities, since it is then stated that the optimal detachment rate outcompetes all the others. Maybe I misunderstood something important being assumed about the carrying capacity for the optimal case, but a trade-off that also has an optimum is an odd outcome.

      -- In the end, it seems critical that for multiple strategies to be maintained in the population that there is not only whole-particle mortality (which in effect is highly correlated catastrophic dynamics for an individual microbial population), but that the inflow of resources itself fluctuates. Did I interpret that correctly? Readers may appreciate a slightly clearer description of how this environmental stochasticity differs from the previous possibility of whole-cell mortality, and this also left me wondering how to quantity the kind of environmental stochasticity that will generally lead to multiple strategies coexisting.

      -- In summary, I think this is a terrific idea and promising analysis that will bear fruit. But I also wanted to understand how robust is the outcome of coexistence to the various assumptions in the model.

    1. As we research, we may find ourselves returning to and changing our question, or we may near the end of a project and think we’re done but discover we need to go back to find more or better sources. The messiness of research requires us to be flexible,

      This passage shows how it is acceptable and even accepted to have to change your research as you go along. This makes research feel more free and creative rather than strict and boring.

    2. Like a daisy’s petals, research is described as cyclical and fluid. As we research, we may find ourselves returning to and changing our question, or we may near the end of a project and think we’re done but discover we need to go back to find more or better sources. The messiness of research requires us to be flexible, often modifying our approaches along the way.

      I can attest to this as many times before I have changed my original question or approach after uncovering some bits of research.

    1. Author Response

      Reviewer #1 (Public Review):

      This is a very solid and exciting study.

      We thank the reviewer for finding our study to be very solid and exciting.

      I have several suggestions, comments and questions:

      1. The authors focused on examining the role of C129 as a regulator of PTPN22 redox sensitivity based on a published crystal structure of the catalytic domain. It would be great if they could demonstrate the existence of the disulfide bond between C129 and C227 also experimentally (in T cells).

      As we understand it, it is requested that the disulfide bond between C227 and C129, as previously suggested by Tsai et al. (2009) (1) with pure protein, should be documented to actually occur in the activated T cells. We fully agree that this would improve the study and we have therefore made several attempts to demonstrate this oxidation, or the oxidation state of the active site Cys residue in PTPN22 in situ. However, as we had also expected, it has proven to be technically very challenging. Nevertheless, as the functional consequence of the PTPN22 oxidation and the effect of the C129S mutation is clearly documented in the mouse, using in vivo experiments, we still think it is valid to conclude that the reversible oxidation state of PTPN22 as well as the involvement of the Cys129 residue regulates the function of PTPN22 in vivo, which is the main conclusion of our study.

      1. To this end, there are other cysteine residues in the vicinity of C227 such as the C231 that might be involved in the redox regulation PTPN22. The authors should at least discuss the their possible involvement.

      It is correct that Tsai et al. (2009) (1) found that mutating C231 to serine dramatically reduced phosphatase activity, thus suggesting its importance in catalysis. Reactivation assays showed higher reactivation rates for C231S mutants, and they suggested that C231 suppresses reactivation in a reducing environment by competing with C227 for reduction in the catalytic pocket. Therefore, C231 could also be a target for negative regulation of PTPN22. However, our project was from the start limited to the intention of studying whether PTPN22 could be shown to be redox regulated in vivo through modification of key cysteine residues, and the aim has not been to give the full picture of how the molecule is regulated. We have now extended this point in the discussion in the paper.

      1. How is mutation of C227 affecting T cell function? Are the effects similar with those of C129S?

      This would be interesting but to analyze if also the cysteine at 227 is regulating the T cell activation by creating another transgenic C227S mouse is outside the scope of the study. As said above and clearly described in the study, we have focused on the redox-mediated effects through C129 and hope that the reviewer can agree with us that this rather focused study is solid and fully sufficient for publication on its own merits.

      1. Although the in vitro evaluation of the PTPN22 activity is of highest quality, it would be good to demonstrate that C227 redox status is modified under physiological conditions. 25-100 µM H2O2 is a high concentration that might not be reached within a cell and might be lethal for T cells.

      See response to point 1.

      1. C129 seems not to be mutated in patients with autoimmunity but is an excellent tool to test the importance of C227 redox regulation and the findings of this study suggest that its over-oxidation will support autoimmune responses. When considering the clinical relevance of the study, a drug that will protect the oxidation of the catalytic cysteine and/or stabilize the disulfide bond would have beneficial effects. The authors could test such pharmacological modulators in isolated T cells.

      Indeed, such modulators would be very interesting to test; however, developing such drugs can hardly be demanded to be within the scope of this study. We have however included a statement on this topic in the Discussion of the manuscript.

      1. The authors discuss that NOX2-derived ROS most likely originate from antigen presenting cells. I fully agree with this discussion. However, some studies have proposed that NOX2 plays an important role also in T cells, a finding which was not confirmed by other following studies. It would be great if the authors could address this controversial issue in regards to their findings.

      The finding that the ROS that modify PTPN22 in fact come from the interacting APC rather than from the T cell itself we believe is very important. However, we have not made a major point of this as we have shown that aspect before in other studies, and we wanted in the current paper to focus on the take home message that PTPN22 could hereby be shown to be redox regulated in vivo. However, the last word about the source of ROS has not been said. The controversy whether the Ncf1 containing NOX2 complex is functionally expressed in T cells stems from the paper by Jackson et al. in Nat Immunol 2004 (2). We have not been able to reproduce those findings and in addition we have never detected a NOX2 dependent response in pure T cells, which has also been shown in several of our papers. There are certainly many pitfalls, contaminating NOX2 expressing cells, NOX2 containing exosomes and peroxides, and even NOX2 complexes picked up by interactions with antigen presenting cells. However, it is dangerous to completely exclude that Ncf1 could be expressed at minimal levels or to exclude that functional NOX2 complex can indeed be formed in T cells, and we all know that minute levels of any peroxide as produced by cells could have an impact on cellular functions. But, based on the present knowledge we conclude that T cells do not functionally express Ncf1-containing NOX2 complexes. We have now added two references to enlighten this point, (3, 4; refs. 38 & 39 in the manuscript).

      1. Fig. 1: Is the addition of bicarbonate affecting the pH and thus the activity of PTPN22?

      No, we believe that addition of bicarbonate is not acting by an altered pH but is instead required for formation of peroxymonocarbonate when reacting with H2O2, which is subsequently the molecular species that bypasses the cellular antioxidant systems in order to oxidize the active site Cys residues of target PTPs. This was shown by us in an earlier publication (Dagnell et al, ref. 11 in the manuscript) (5) and a sentence has now been added in the Discussion to further emphasize this point.

      1. The H2O2 concentration dependence of PTPN22_C129S should also be shown as for WT (see Fig. 1B)

      We agree with the reviewer that titration of the mutant with additional H2O2 concentrations could potentially have been done, but we thought that the comparison of WT and C129S enzyme side-by-side using either 0 µM, 25 µM or 50 µM as in Fig. 1D was a sufficient comparison in H2O2 sensitivity. Unfortunately, we do not have the possibility to analyze more purified C129S mutant protein at the moment and it would require a major effort to run those additional experiments. We thereby hope that the reviewer would agree with having the data presented as they currently are to be sufficient.

      1. Quantification of the slope based on only 3 measuring points is not accurate (Fig. 1D).

      Each data point in those curves represents the mean ± S.D. derived from duplicate samples ran three different times, with clearly very low standard deviations. Thus, we believe that the data are reliable and that the statistically significant difference when comparing the slopes between WT and the C129S mutant as shown in the figure, should be trustworthy.

      1. The pinna thickness measurements shown in Fig. 3B and C suggest that in NCF1 mice C129S has no effect. However, the thickness in NCF1 mice is already much higher than in WT mice (compare B and C). Does this mean that NOX2-derived ROS are the only factor that affects C227 redox properties?

      The effects of the decreased ROS due to the Ncf1 mutation is likely to have consequences for the functions of many proteins, in different pathways, and not only of PTPN22. The sum effect is that the Ncf1 mutated mice responds stronger than the wild type, which explains the difference. However, the main message here is that if there is no ROS from the NOX2 complex, the effect of the PTPN22 mutation is lost.

      1. The results shown in Fig. 5D could be moved to a supplementary figure.

      We prefer to keep it within Fig 5 as it is more logical in the context or the other parts of this figure. Of course, if there is a space layout problem, we can consider moving it.

      1. The calcium measurements are not convincing and the differences are rather small. The y axis labels show 50K, 100K etc. Are this ratio values? If yes the imaging settings need to be optimized. Why is the mutant labeled as Pep? How is the C129S affecting calcium signaling? These observations need be examined in more detail or maybe calcium is not playing an important role.

      We agree that the differences in calcium measurements are not very large but have nevertheless been repeated several times, and there is a significant difference as shown. The calculation is done on the slope of the curve, which is independent of the absolute values given on the y-axis. We agree that the figure was not properly labeled and have now changed this.

      1. I would suggest a more extensive evaluation of the proteomic data presented in Fig. 6D. The results might be very exciting and can further increase the impact of this study.

      We fully agree with this. We have chosen not to go into details of the results of the proteomic analysis. The data shown confirms our conclusion and we did not plan to identify the downstream targets of the PTPN22 oxidative regulation. Highlighting some of these targets will require biological confirmation, which can be done but must await future work. The full dataset has however been deposited in PRIDE for any reader interested to analyze the results further.

      1. Is 24h BSO treatment not toxic for the T cells (ferroptosis)?

      We have not seen any evidence for toxicity upon the BSO treatment of T cells in vitro, which however has been more thoroughly checked by others. Gringhuis et al (JI, 2000) (6) have shown immunofluorescence staining on T cells 72 hours post BSO treatment with intact cell membranes. Additionally, Carilho et al. (Chem. Cent. J., 2013, 7:150) (7) noted no changes in Jurkat T cell viability after 24 hours at a maximum dose of 100 µM BSO.

      Reviewer #3 (Public Review):

      The manuscript by James, Chen Hernandez et al. reveals a novel function for PTPN22 oxidation in T-Cell activation. The authors used a broad array of methods to demonstrate that PTPN22 is catalytically impaired in addition to being more sensitive to reversible oxidation in vitro. In the characterization process, the authors found that PTPN22 could be directly reduced by Thioredoxin Reductase and that oxidation of PTPN22 oxidation could be easily monitored by the appearance of a faster migrating band in non-reducing gels. Supporting the hypothesis that the catalytic Cysteine forms a disulfide with a backdoor Cysteine (Cys129), the authors found that this C129S mutant is prone to oxidation and cannot be reduced back to its active form by Thioredoxin Reductase. Using a new mouse model in which this key Cysteine of PTPN22 is mutated to a Serine residue (PTPN22C129S mutant) and can presumably not form a stabilizing redox intermediate between the catalytic Cys residue and this backdoor Cys (C227-C129), the authors study how the oxidation prone mutant affects T-Cell activation. The authors find that the C129S mutant mouse showed an increased T-Cell dependent inflammatory response that was dependent on activation of the reactive oxygen species-producing enzyme NOX2. This data adds an interesting redox twist to the function of PTPN22 in T-Cells that contributes to conversation on the protective effects of reactive oxygen species against inflammatory diseases in vivo.

      Strengths:

      The in vitro characterization of the WT and C129S mutant form of PTPN22 is very thorough. Determination of the Km and Kcat highlights the differences between the two enzymes that go beyond redox regulation of the phosphatase. The reduction studies are masterfully done and highlight a novel reduction mechanism that merits to be further studied in cells. Demonstrating that PTPN22C129S is prone to oxidation in vitro is a key and technically challenging result that may be applicable to other members of the PTP family that also form disulfides with a backdoor cysteine. Showing that PTPN22C129S mice (backcrossed to B6Q mice making them susceptible to autoimmune arthritis) displayed higher T cell activation in two models (DTH and GPI), in addition to studies in T cells stimulated with collagen, increased this reviewer's confidence that the PTPN22C129S mouse exhibited T-cell-dependent inflammatory response phenotype similar to the PTPN22 knockout phenotype. Validation of T-cell signaling events in PTPn22C129S T cells were in line with the in vitro characterization of the phosphatase.

      We thank the reviewer very much for the detailed summary of our findings and the appreciative words.

      Weaknesses: Although the paper has many strengths, some important weaknesses need to be addressed by the authors. In particular, the authors need to characterize better their mouse model and determine if PTPN22 is reversibly oxidized following TCR activation. If PTPN22 is oxidized, does it form an intramolecular disulfide between C227 and C129? The proposed model, that PTPN22C129S is more prone to oxidation, also has to be validated in vivo. Although this could be technically challenging in theory, the authors have shown that the migration pattern of the oxidized enzyme is different that of the reduced enzyme. Another major issue is that PTPN22 does not appear to be expressed in CD4+ T cells unless these cells are activated in vitro with anti-CD3/CD28 for 24 hours. This makes acute CD3-stimulation of CD4+ T cells studies - such as the measurement of acute calcium influx in Fig. 5E - very difficult to interpret. Perhaps the authors should explain why acute signal transduction studies in Figure 6 were performed in lymph node cells. If the reason is that PTPN22 (WT and C129S mutant) expression is higher, the authors should provide immunoblots for PTPN22 in these cells. Since the PTPN22C129S mouse model has not been sufficiently validated, the claims of the authors are unfortunately weakened and the underlying molecular mechanisms do not completely support their conclusions. However, given the clear in vitro work provided in figures 1 and 2, it is this Reviewer's opinion that the authors can address the issues related to the oxidation status of PTPN22 and of PTPN22C129S in vivo, support their claims, and make a significant contribution to the field.

      We again thank the reviewer for the detailed summary of our findings and for the suggestions. With regards to the in vivo oxidation status of PTPN22, please see the discussion above.

      1. Tsai SJ, Sen U, Zhao L, Greenleaf WB, Dasgupta J, Fiorillo E, et al. Crystal structure of the human lymphoid tyrosine phosphatase catalytic domain: insights into redox regulation. Biochemistry. 2009;48(22):4838-45.
      2. Jackson SH, Devadas S, Kwon J, Pinto LA, Williams MS. T cells express a phagocyte-type NADPH oxidase that is activated after T cell receptor stimulation. Nat Immunol. 2004;5(8):818-27.
      3. Gelderman KA, Hultqvist M, Holmberg J, Olofsson P, Holmdahl R. T cell surface redox levels determine T cell reactivity and arthritis susceptibility. Proc Natl Acad Sci U S A. 2006;103(34):12831-6.
      4. Gelderman KA, Hultqvist M, Pizzolla A, Zhao M, Nandakumar KS, Mattsson R, et al. Macrophages suppress T cell responses and arthritis development in mice by producing reactive oxygen species. J Clin Invest. 2007;117(10):3020-8.
      5. Dagnell M, Cheng Q, Rizvi SHM, Pace PE, Boivin B, Winterbourn CC, et al. Bicarbonate is essential for protein-tyrosine phosphatase 1B (PTP1B) oxidation and cellular signaling through EGF-triggered phosphorylation cascades. J Biol Chem. 2019;294(33):12330-8.
      6. Gringhuis SI, Leow A, Papendrecht-Van Der Voort EA, Remans PH, Breedveld FC, Verweij CL. Displacement of linker for activation of T cells from the plasma membrane due to redox balance alterations results in hyporesponsiveness of synovial fluid T lymphocytes in rheumatoid arthritis. J Immunol. 2000;164(4):2170-9.
      7. Carilho Torrao RB, Dias IH, Bennett SJ, Dunston CR, Griffiths HR. Healthy ageing and depletion of intracellular glutathione influences T cell membrane thioredoxin-1 levels and cytokine secretion. Chem Cent J. 2013;7(1):150.
    1. Author Response

      Reviewer #1 (Public Review):

      Dias et al proposed a new method for genotype imputation and evaluated its performance using a variety of metrics. Their method consistently produces better imputation accuracies across different allele frequency spectrums and ancestries. Surprisingly, this is achieved with superior computational speed, which is very impressive since competing imputation softwares had decades of experience in optimizing software performance.

      The main weakness in my opinion is the lack of software/pipeline descriptions, as detailed in my main points 36 below.

      We have made the source code and detailed instructions available publicly at Github. The computational pipeline for autoencoder training and validation is available at https://github.com/TorkamaniLab/Imputation_Autoencoder/tree/master/autoencoder_tuning_pipeline.

      1. In the neural network training workflow, I am worried it will be difficult to compute the n by n correlation matrix if n is large. If n=10^5, the matrix would be ~80GB in double precision, and if n=10^6, the matrix is ~2TB. I wonder what is n for HRC chromosome 1? Would this change for TOPMed (Taliun 2021 Nature) panel which has ~10x more variants? I hope the authors can either state that typical n is manageable even for dense sequencing data, or discuss a strategy for dealing with large n. Also, Figure 1 is a bit confusing, since steps E1-E2 supposedly precede A-D.

      We included more details in the methods section to address this question. It is true that computing the entirety of this matrix is computationally intensive, thus, in order to avoid this complexity, we calculated the correlations in a sliding box of 500 x 500 common variants (minor allele frequency (MAF) >=0.5%). In other words, no matter how dense the genomic data is, the n x n size will always be fixed to 500 x 500. Larger datasets will not influence this as the additional variants fall below the MAF>=0.5% threshold. Thus, memory utilization will be the same regardless of chromosome length or database size. Please note that this correlation calculation process is not necessary for the end-user to perform imputation, since we already provide the information on what genomic coordinates belong to the local minima or “cutting points” of the genome. This computational burden remains on the developer side. The reviewer is right to point out that Figure 1 is misleading in its ordering, we have corrected this in the revision.

      1. I have a number of questions/comments regarding equations 2-4. (a) There seems to be no discussion on how the main autoencoder weight parameters were optimized? Intuitively, I would think optimizing the autoencoder weights are conceptually much more important than tuning hyper-parameters, for which there are plenty of discussions.

      These parameters are optimized through the training process described in “Hyperparameter Initialization and Grid Search / Hyperparameter Tuning” - where both the hyperparameters and edge weights are determined for each autoencoder for each genomic segment. There are 256 genomic segments in chromosome 22, and each segment has a different number of input variables, sparsity, and correlation structure. Thus, there is a unique autoencoder model that best fits each genomic tile (e.g.: each autoencoder has different weights, architecture, loss function, regularizes, and optimization algorithms). Therefore, while there are some commonalities across genomic tiles, there is not a single answer for the number of dimensions of the weight matrix, or for how the weights were optimized. Instructions on how to access the unique information on the parameters and hyperparameters of each one of the 256 autoencoders is now shared through our source code repository at https://github.com/TorkamaniLab/imputator_inference.

      We included an additional explanation clarifying this point in the Hyperparameter Tuning subsection of the Methods.

      (b) I suppose t must index over each allele in a segment, but this was not explicit.

      That is correct, t represents the index of each allele in a genomic segment. We included this statement in the description of equation 2.

      (c) Please use standard notations for L1 and L2 norms (e.g. ||Z||_1 for L1 norm of Z). I also wonder if the authors meant ||Z||_1 or ||vec(Z)||_1 (vectorized Z)?

      We included a clarification in the description of equation 3. ‖𝑾‖𝟏 and ‖𝑾‖𝟐 are the standard L1 and L2 norms of the autoencoder weight matrix (W).

      (d) It would be great if the authors can more explicitly describe the auto-encoder matrices (e.g. their dimensions, sparsity patterns if any...etc).

      As we answered in comment 2.a, each one of the 256 autoencoders for each genomic segment is unique, so it would be unfeasible to describe the architecture, parameters, optimizers, loss function, regularizes, of each one of them. We realized it would be more suitable to share this information in a software repository and have now done so.

      1. It is not obvious if the authors intend to provide a downloadable software package that is user-friendly and scalable to large data (e.g. HRC). For the present paper to be useful to others, I imagine either (a) the authors provide software or example scripts so users can train their own neural network, or (b) the authors provide pretrained networks that are downloaded and can be easily combined with target genotype data for imputation. From the discussion, it seems like (b) would be the ultimate goal, but is only part dream and part reality. It would be helpful if the authors can clarify how current users can benefit from their work.

      We have now shared the pre-trained autoencoders (including model weights and inference source code) and instructions on how to use them for imputation. These resources are publicly available at https://github.com/TorkamaniLab/imputator_inference. We have added this information to the Data Availability subsection of the Methods.

      1. Along the same lines, I also found the description of the software/pipeline to be lacking (unless these information are available on the online GitHub page, which is currently inaccessible). For instance, I would like to know which of the major data imputation formats (VCF/BGEN..etc) are supported? Which operating systems (window/linux/mac) are supported? I also would like to know if it is possible to train the network or run imputation given pre-trained networks, if I don't have a GPU?

      We have now made the github repository publicly available. The description of the requirements and steps performed in the hyperparameter tuning pipeline is available at https://github.com/TorkamaniLab/Imputation_Autoencoder/tree/master/autoencoder_tuning_pipeline.

      1. Typically, imputation software supplies a per-SNP imputation quality score for use in downstream analysis. This is important for interpretability as it helps users decide which variants are confidently imputed and which ones are not. For example, such a quality score can be estimated from the posterior distribution of an HMM process (e.g. Browning 2009 AJHG). Would the proposed method be able to supply something similar? Alternatively, how would the users know which imputed variants to trust?

      We included further clarification in the data availability session of methods: Imputation data format. The imputation results are exported in variant calling format (VCF) containing the imputed genotypes and imputation quality scores in the form of class probabilities for each one of the three possible genotypes (homozygous reference, heterozygous, and homozygous alternate allele). The probabilities can be used for quality control of the imputation results.

      We included this clarification in the manuscript and in the readme file of the inference software repository https://github.com/TorkamaniLab/imputator_inference.

      1. I think the authors should clarify whether input genotypes must be prephased. That is, given a trained neural network and a genotype data that one wishes to impute, does the genotype data have to be phased? The discussion reads "our current encoding approach lacks phasing information..." which can be understood both ways. On a related note, I hope the authors can also clarify if the validation and testing data (page 7 lines 1423) were phased data, or if they were originally unphased but computationally phased via softwares like Eagle 2 or Beagle 5.

      The input genotypes are not phased, nor pre-phased, and no pre-phasing was performed before imputation. We included further clarification on the method section, stating “All input genotypes from all datasets utilized in this work are unphased, and no pre-phasing was performed.”. We also included further clarification in the Discussion session.

      1. It is unclear if the reported run times (Figure 6) includes model training time, or if they are simply imputing the missing genotypes given a pre-trained autoencoder? For the later, I think the comparison may still be fair if users never have to train models themselves. However, if users currently have to train their own network, I feel it is imperative to also report the model training time, even if in another figure/table.

      The end-users do not have to train the models, the computational burden of training the models remains on the developer side, so the runtimes refer to the task of imputing the missing genotypes given a pre-trained autoencoder set. This allows for distribution without reference datasets. We included further clarification on the Performance Testing and Comparisons subsection of Methods.

      Reviewer #2 (Public Review):

      In this manuscript the authors introduce a segment based autoencoder (AE) to perform genotype imputation. The authors compare performance of their AE to more traditional HMM-based methods (e.g. IMPUTE) and show that there is a slight but significant improvement on these methods using the AE strategy.

      In general the paper is clearly presently and the work in timely, but I have some concerns with respect to the framing of the advances presented here along with the performance comparisons.

      Specific Points:

      1. The authors aren't doing a good enough job presenting the work of others in using deep neural networks for imputation or using autoencoders for closely related tasks in population genetics. For instance, the authors say that the RNN method of Kojima et al 2020. is not applicable to real world scenarios, however they seem to have missed that in that paper the authors are imputing based on omni 2.5 at 97% masking, right in line with what is presented here. It strikes me that the RNNIMP method is a crucial comparison here, and the authors should expand their scholarship in the paper to cover work that has already been done on autoencoders for popgen.

      This is an important comparison that we erroneously misrepresented. We have now separated out this particular application of the RNN-IMP in the introduction of the manuscript. The major difference is that RNN-IMP needs to be retrained on different input genetic variants, much like a standard HMM-based method. The computational burden of RNN-IMP remains on the end-user side. It appears that computational complexity is tremendous in this model, given that the only example the authors provided with their software consists of 100 genomes from 1000 Genomes Project to perform the imputation on Omni by de novo training of the data. Given their approach does not achieve the benefits of distributing a generalizable pre-trained neural network, and the computational burden associated with training these models on the 60K+ genomes we use in our manuscript, we have opted for stating the benefits and downsides of their approach in the introduction.

      1. With respect to additional comparisons-Kenneth Lange's group recently released a new method for imputation which is not based on HMM but is extremely fast. The authors would be well served to extend their comparisons to include this method (MendelImpute)-it should be favorable for the authors as ModelImpute is less accurate than HMMs but much faster.

      We appreciate the reviewer pointing out this additional method, however their parent manuscript clearly shows substantially inferior imputation performance relative to BEAGLE/Minimac etc. which we already compare against. There is not much to gain by performing this comparison. Our autoencoder-based approach is already generating results that are competitive with the best and most cited imputation tools, which are all HMM-based and outperforming MendelImpute. The outcome of this comparison is forecasted based upon the parent manuscript.

      1. The description of HMM based methods in lines 19-21 isn't quite correct. Moreover-what is an "HMM parameter function?"

      Thank you for catching this. We were referring to parameter *estimation and have corrected this in the manuscript.

      1. Using tiled AEs across the genome makes sense given the limitations of AEs generally, but this means that tiling choices may affect downstream accuracy. In particular-how does the choice of the LD threshold determine accuracy of the method? e.g. if the snp correlation threshold were 0.3 rather than 0.45, how would performance be changed?

      This choice is driven by the limitations of cutting-edge GPUs. 0.45 is the threshold that returns the minimum number of tiles spanning chromosome 22 with an average size per tile that fits into the video memory of GPUs. While developing the tiling algorithm, we tested lower thresholds, which made the tiles smaller and more abundant, and thus made the GPU memory workload less efficient (e.g. many tiles resulted in many autoencoders per GPU, which thus caused a CPU-GPU communication overhead). Due to the obstacles related to computational inefficiency, CPUGPU communication overhangs, and GPU memory limits, we did not proceed with model training on tiles generated with other correlation thresholds. We’ve added a paragraph explaining this choice in the manuscript.

      1. How large is the set of trained AEs for chromosome 22? In particular, how much disk space does the complete description of all AEs (model + weights) take up? How does this compare to a reference panel for chr22? The authors claim that one advance is that this is a "reference-free" method - it's not - and that as such there are savings in that a reference panel doesn't have to be used along with the genome to be imputed. While the later claim is true, instead a reference panel is swapped out for a set of trained AEs, which might take up a lot of disk space themselves. This comparison should be given and perhaps extrapolated to the whole genome.

      This is an interesting point. For comparison, the total combined uncompressed size of all pre-trained autoencoders together is 120GB, or 469MB per autoencoder. The size of the reference data, HRC chromosome 22 across ~27,000 samples is 1GB after compression – or nearly 10X the autoencoder size. Moreover, unlike in HMM-based imputation, the size of the pre-trained autoencoders does not increase as a function of the reference panel sample size. The size of the autoencoders remains fixed since the number of model weights and parameters remains the same regardless of sample size – though it will expand somewhat with the addition of new genetic variants. Another point to consider is that privacy concerns associated with distribution of reference data are mitigated with these pretrained autoencoders.

      1. The results around runtime performance (Figure 6) are misleading. Specifically HMM training and decoding is being performed here, whereas for the AE only prediction (equivalent to decoding) is being done. To their credit, the authors do mention a bit of this in the discussion, however a real comparison should be done in Figure 6. There are two ways to proceed in my estimation - 1) separate training and decoding for the HMM methods (Beagle doesn't allow this, I'm not sure of the other software packages) 2) report the training times for the AE method. I would certainly like to see what the training times look like given that the results as present require 1) a separate AE for each genomic chunk, 2) a course grid search, 3) training XGBoost on the results from the course grid search, and 4) retraining of the individual AEs given the XGBoost predictions, and 5) finally prediction. This is a HUGE training effort. Showing prediction runtimes and comparing those to the HMMs is inappropriate.

      We consider the prediction only during the runtime comparisons because only the prediction side is done by the enduser, whereas the computational burden remains on the developer side. For the HMMs, we included only the prediction time as well (excluded the time for data loading/writing, computing model parameters and HMM iterations). The pre-trained autoencoders, when distributed, can take as input any set of genetic variants to produce the output without any additional training or fine-tuning required.

      1. One well known problem for DNN based methods including AEs is out-of-sample prediction. While Figure 5 (missing a label by the way) sort of gets to this, I would have the authors compare prediction in genotypes from populations which are absent from the training set and compare that performance to HMMs. Both methods should suffer, but I'm curious as to whether the AEs are more robust than the HMMs to this sort of pathology.

      Our test datasets in Figures 4 and 5 are independent of the reference dataset. MESA, Wellderly, and HGDP are all independent datasets, never used for training, nor model selection. Only HRC was used as reference panel or for training, and ARIC was used for model selection during tuning. We included a statement in the methods clarifying this point.

      Reviewer #3 (Public Review):

      Over the last 15 years or so genotype imputation has been an important and widely-used tool in genetic studies, with methods based on Hidden Markov Models (HMMs) and reference panels emerging as the dominant approach. This paper suggests a new approach to genotype imputation based on denoising autoencoders (DAE), a type of neural network. This approach has two nice advantages over existing methods based on Hidden Markov Models (HMMs): i) once the DAE is trained on a reference panel the reference panel can be discarded, and users do not need access to the reference panel to use the DAE; ii) imputation using a DAE is very fast (training is slow, but this step is done upfront so users do not need to worry about it). The paper also presents data showing that the tuned DAE is competitive in accuracy with HMM methods.

      I have two main concerns.

      First, it is unclear to me whether the accuracy presented for the tuned DAE (eg Figure 3, Table 4) is a reliable reflection of expected future accuracy. This is because the tuning process was quite extensive and complex, and involved at least some of the datasets used in these assessments. While the paper correctly attempts to guard against overfitting and related issues by using separate Training, Validation and Testing data (p7), it seems that the Testing data were used in at least some of the development of the methods and tuning (eg p14, "A preliminary comparison of the best performing autoencoder..."; Figure 2 and Table 2, all involve the Testing data). Because of the complexity of the process by which the final DAE was arrived at it is unclear to me whether there is a genuine concern here, but it would seem safest and most convincing at this point to do an entirely independent test of the methods on genotype data sets that were not used at all up to this point.

      MESA, Wellderly, and HGDP were not used for training, nor for tuning, they are completely independent. So all the results showing these datasets are completely independent. Only HRC and ARIC were used for training and validation/tuning, respectively. We included a statement in the methods session clarifying this point.

      Moreover, HGDP in particular includes 828 samples from 54 different populations representing all continental populations and including remote populations like Siberia, Oceania, etc. This reference panel is described in more detail in the reference below and likely represents the most diverse human genome dataset available. Thus, we have externally validated generalizability on a dataset with much greater diversity than our training dataset:

      Bergström A, et al. Insights into human genetic variation and population history from 929 diverse genomes. Science. 2020 Mar 20;367(6484):eaay5012.

      Second, there is a potentially tricky issue of to what extent distributing a black box DAE trained on a reference sample is consistent with data sharing policies. Standards of data sharing have evolved over the last decade. Generally there currently seems to be little hesitation to publicly share "single-SNP summary data" such as allele frequency information from large reference panels, whereas sharing of individual-level genotype data is usually explicitly forbidden. It is not quite clear to me where sharing the fit of a DAE falls here, or how much information on individual genotypes the trained DAE contains. The current manuscript does not adequately address this issue.

      Currently there are no official data sharing restrictions on deep learning data. We are aware that future policies may rise, and we have started a collaboration with Oak Ridge National Laboratory to explore differential privacy techniques and privacy concerns for these autoencoders. Another point to consider is that the autoencoders segment the genome, making reconstruction of an individual genome impossible even if reference data were somehow recoverable from the neural networks. Regardless, this is an interesting and important point that should be addressed in the manuscript and we have added a paragraph discussing this point.

      Reviewer #4 (Public Review):

      In this manuscript, Dias et al proposed a novel genotype imputation method using autoencoders (AE), which achieves comparable or superior accuracy relative to the state-of-the-art HMM-based imputation methods after tuning. The idea is innovative and provides an alternative solution to the important task of genotype imputation. The authors also conducted some experiments using three different datasets as targets to showcase the value of their approach. The overall framework of the method is clearly presented but more technical details are needed. The results presented showed slight advantage of AE imputation after tuning but more comprehensive evaluations are needed. In particular, the authors didn't consider post-imputation quality control. The reported overall performance (R2 in the range of 0.2-0.6) seems low and inconsistent with the imputation literature.

      Overall, the method has potential but is not sufficiently compelling in its current form.

      We show average accuracy of 0.2-0.6 in Table 4, but that is the average R2 per variant across all variants (no MAF filtering or binning applied). The reviewer points that the accuracy should be R2>0.8, but this R2>0.8 refers to common variants only (allele frequency >1%), and we have shown r2>0.8 for these variants (Figure 4). The aggregate accuracy displayed in Table 4 is lower because the vast majority of variants fall below 1% allele frequency threshold.

      The references bellow demonstrate this issue and agree with our results:

      References:

      Rubinacci S, Delaneau O, Marchini J. Genotype imputation using the positional burrows wheeler transform. PLoS genetics. 2020 Nov 16;16(11):e1009049.

      McCarthy S, Das S, Kretzschmar W, Delaneau O, Wood AR, Teumer A, Kang HM, Fuchsberger C, Danecek P, Sharp K, Luo Y. A reference panel of 64,976 haplotypes for genotype imputation. Nature genetics. 2016 Oct;48(10):1279.

      Vergara C, Parker MM, Franco L, Cho MH, Valencia-Duarte AV, Beaty TH, Duggal P. Genotype imputation performance of three reference panels using African ancestry individuals. Human genetics. 2018 Apr;137(4):281-92.

    1. Author Response

      Reviewer #1 (Public Review):

      The manuscript by Rekler and Kalcheim examines the role of neural tube-derived retinoic acid (RA) in neural crest development. They observe that the onset of expression of the RA-synthesizing enzyme RALDH2 in the dorsal neural tube coincides with the end of neural crest production. The authors propose that this local source of RA is essential to activate the transcription of Bambi other BMP inhibitors, leading to the disruption of BMP signaling. Loss of BMP activity at the dorsal neural tube would halt neural crest production, leading to the establishment of the definite roof plate. Thus, precise temporal regulation of RALDH2 in the dorsal neural tube would dictate the timing of neural crest production and the segregation of PNS and CNS progenitors.

      Previous studies have already identified a role for RA in the control of the timing of neural crest production. MartinezMorales et al (JCB 2011) have shown that during early trunk development, mesoderm-derived RA works with FGF signaling to jumpstart the BMP/Wnt cascade that drives neural crest migration in the trunk. Rekler and Kalcheim choose to focused on a distinct function of RA at a later timepoint. The main contribution of the present study is the demonstration that - at later stages - RA produced by the neural tube has the opposite effect, acting to inhibit the BMP/Wnt cascade and halt neural crest production. Thus, RA would be a major regulator of the timing of neural crest production, acting to both trigger and repress neural crest migration.

      The study's strengths lie in an experimental strategy that allows the authors to manipulate RA function in a stagespecific manner and therefore uncover a later role for the signaling system in neural crest production. The authors also show that RA inhibition results in an incomplete fate switch and results in the generation of cells that share regulatory features of neural crest and roof plate cells. A significant limitation of the study is that the molecular mechanisms that endow RA signaling with stage-specific functions remain unknown. This is of particularly important since the early vs. late RA seem to have opposing effects, acting to either promote or terminate neural crest production.

      We thank this referee for her/his positive comments on our manuscript. We agree with the referee that a key question is understanding how RA signaling is differentially interpreted over time given its multistage activity in dorsal NT development.

      This is based on the following findings: Years ago, we uncovered that the balance between activities of BMP/Wnt and noggin in the dorsal NT trigger the onset of NC EMT. Martinez-Morales et al. strengthened our findings by reporting that a balance between somitic RA and FGF works on the reported BMP/Wnt modules to initiate the process. This group found that at gastrulation stages, RA is required for NC specification, as revealed by analysis of VAD quail embryos. Next, during somite formation, somitic RA is necessary for the onset of emigration of specified NC progenitors but at advanced somite stages it is dispensable for the subsequent maintenance of cell emigration. Presently, we find that RP-derived RA ends NC production. Together, this highlights a dynamic behavior of RA at 4 sequential stages of NC ontogeny. Clearly enough, the two first effects are mediated by an influence of RA dorsoventral patterning of the early NT, as distribution of ventral NT markers was strongly affected. In our case, RA from the nascent RP has no such effects suggesting that RP-derived RA acts at a post-patterning phase to specifically affect the dorsal NT.

      All things considered, we think that the problem is not simply a binary question of “opposing functions of RA signaling in starting or terminating NC production”. Instead, it is the understanding of a differential interpretation to the same morphogen by progenitor cells with changing states and at sequential stages.

      To the referee’s request, we begun addressing the question of how does RA inhibit BMP signaling close to the RP stage. To this end, we decided first to examine the temporal regulation of Raldh2 expression that is restricted to the RP stage, and is therefore a prerequisite for the late activity of RA. Whereas repressing RA activity extends the NC phase including the continuous transcription of Foxd3, Sox9 and Snail2 (Fig.3), we now found that extending the activity of each of these transcription factors close to the RP stage represses the onset of Raldh2 transcription in the nascent RP (new Fig. 9). We interpret these results to mean that as long as NC genes are active in the dorsal NT (NC stage), local Raldh2 and consequent RA synthesis in the NT does not take place, so Raldh2 in RP is repressed by NC-specific traits. The significance of these data is twofold: first, they explain the late onset of Raldh2 production at the RP stage. Second, since we also report the reciprocal result, that RA represses NC genes (Fig.3), we conclude that a cross repressive interaction exists between NC and RP-specific genes downstream of RA, being an emerging temporal property of the network. These data further indicate that the changing roles of RA throughout development of the dorsal neural primordium, largely depend on a different interpretation of the signal mediated by changing and mutually repressive codes.

      We have now presented these data in Fig.9. To clarify our thoughts further, we now provide a working model summarizing the effects of RA in NC to RP transition (Fig.10B).

      Our article uncovers for the first time and thoroughly documents, a role of local RA activity on the end of NC production and ensuing RP architecture. We believe that a comprehensive elucidation of the molecular mechanism responsible for inhibition of BMP signaling by local RA is the next obligatory step. We show in this study the selective activation of BMP inhibitors by endogenous RA and previously found that one of them, Hes/hairy, indeed inhibits BMP signaling and NC EMT (Nitzan et al, 2016). Therefore we propose that upregulation of BMP inhibitors by RA is a possible mechanism. However, we also predict that this is not the only one, and a deeper understanding of this problem is beyond the scope of the present study.

      Additional possibilities that fit with our data were now discussed: RA expression in somites vs. RP can be regulated by different enhancers and thus have distinct functions. For example, a specific enhancer driving expression of Raldh2 was found to be activated only at the definitive RP stage (Castillo et al., 2010). This enhancer contains Tcf binding sites and thus may be activated by Wnt signaling. In turn, as we show, RP-derived Raldh2 and resulting RA could negatively feed-back on Wnt signaling in the formed RP either directly or through BMP acting upstream of Wnt (now presented in Fig. 10B).

      Another possible scenario is that RA represses BMP signaling by inactivating Smad proteins via ubiquitination, as shown to be the case in selected cell lines (Sheng et al., 2010). These possibilities were discussed and await to be systematically explored.

      Comments:

      Previous studies have demonstrated that early RA production (presumably from the mesoderm) is necessary for the expression of early dorsal neural tube / neural crest genes like Pax7, Msx, Wnt1, and even BMP ligands. This is in contrast to the local source of RA, which presumably would be silencing these genes. Thus, mesoderm-derived RA would have the opposite effect in these progenitors than the RA synthesized in the neural tube. The study does not provide a mechanism that explains these stage-specific effects of the morphogen.

      As elaborated in our reply above to the general comment, we believe that RA whether emanating from somites or nascent RP, provides an initial signal that is later relayed upon target factors unique to each stage. It is possible that the precise source of factor plays a role; along this line we showed that somitic RA is dispensable for late events, reciprocally, there is no RA synthesis in the early NT that could affect NC cells. Having said that, there is RA activity in the NT at both stages and the output is still different. Hence, there should be more to this: In the revised version, we report that NC and RP-specific genes stand in a mutually repressive interaction downstream of RA, and this may contribute to the stage-specific effects of the morphogen.

      The effects of RA manipulation are often examined with non-quantitative techniques, like in situ hybridization (Fig. 2, 3). The incorporation of quantitative approaches (e.g., qPCR) would allow for the precise characterization of phenotypes (and better estimation of penetrance, etc.). Furthermore, the study lacks molecular/biochemical strategies to define the regulatory linkages between genes and pathways. This is a considerable limitation of the study since it prevents the establishment of a regulatory axis that would directly connect RA signaling to the BMP pathway.

      As the referee may notice, most genes examined are not restricted solely to the dorsal NT/RP domains. Since it is technically not accurate to isolate only the regions of interest for qPCR analysis, collecting entire NTs following unilateral or bilateral electroporations for qPCR would be highly inaccurate. In situ hybridization and immunohistochemistry provide a precise tool to assess the spatial localization of the transcripts/proteins of interest. To note is that in all cases examined, development of the color reactions was for the same length of time for control and experimental cases and photography was performed under identical conditions. Furthermore, in most cases, effects between treatments were dramatic, readily apparent at a qualitative level and easily quantifiable from ISH or fluorescent images.

      As to regulatory linkages between genes and pathways, the referee is correct; we do not demonstrate direct molecular interactions between the different players at the biochemical level. The present study provides a wealth of novel data connecting morphogens such as RA with BMP and Wnt activities, and those with a variety of downstream genes specific for either NC or RP stages. The next step will be to ask about the precise nature of the linkages between specific molecules/pathways.

      The function (and the regulation) of RALDH2 at the dorsal neural tube has been studied thoroughly, and RA is a known player in the dorsal-ventral patterning of the CNS. It is not clear to what extent the phenotypes observed by the authors are due to the disruption of a neural crest-intrinsic mechanism or if they are secondary to the overall changes in the cellular organization of the neural tube caused by loss of RA.

      This is a good point as RA is known to have multiple effects on NT development whose nature changes with stage. Available data emanating from young caudal neural plate explants and from VAD embryos that lack RA, showed that early RA signaling from developing somites is required for ventral patterning of the neural tube (motoneurons and V1, V2 interneurons) and for neuronal differentiation (Diez del Corral et al, 2003, Sockanathan and Jessel, 1998, Liu et al, 2001, Maden et al, 1996). These effects were shown to depend, at least partly, on antagonistic activities of RA and FGF in mesoderm which affect ventral, but not dorsal NT patterning (Diez del Corral et al, 2003). Our study focuses on a later stage when D-V neural tube patterning is already established.

      To address the referee’s comment, we now examined the effects of RA attenuation on expression of Pax7, a dorsal factor, and Hb9, a motoneuron-specific protein. We found that RARa403 does not affect the localization and/or extent of expression of Hb9, and causes only a mild 12% increase in the area of expression of Pax7. Consistent with these results, we also show in several figures that in the absence of RA signaling pSmad and Wnt activities, Foxd3, Snai2 and Sox9 expression patterns are prolonged in time but not in D-V extent.

      These data corroborate that the effects documented are directed to the dorsal NT and do not result from overall changes in D-V patterning. The data were now added as Fig.7 Supplementary 1.

      The authors rely solely upon overexpression constructs to manipulate the activity of the RA signaling pathway, which may be prone to artifacts. Furthermore, both overexpression constructs aim at inhibiting RA activity. This limits the impact of the work since there is no demonstration that RA is sufficient to activate BMP inhibitors and halt neural crest production.

      The tools we used to repress RA signaling consist of RARa403 that acts as a pan-dominant negative construct to abrogate receptor activity, and Cyp26A1, an enzyme that degrades RA. To activate RA signaling in a ligand- independent manner, we now implemented VP16-RAR-alpha in the revised version of this manuscript. All these tools are extensively and routinely employed in the literature in a variety of animal species and were shown to act in vivo as expected both by others and further confirmed by us in the present study. Having said that, we are currently optimizing the CRISPR-Cas9 method for gene editing of RA-specific genes and hope to succeed in the near future.

      We have now performed experiments to address the sufficiency of RA. Data were now added as Fig.5 Supplem 2 and 3 and Fig.6 Supp.2 .

      As we expected, gain of RA function at NC stages is not sufficient to prematurely activate BMP inhibitors like BAMBI, to end prematurely BMP signaling (pSMAD) or NC EMT, to alter the dynamics of expression of NC-specific genes, or to cause an earlier appearance of RP-specific traits. This is fully consistent with RA being active at NC stages when BMP/Wnt signaling, NC EMT, etc are operational. The fact that RA is necessary but not sufficient for these processes further suggest that the key is how NC cells at various stages of their ontogeny and then RP cells, differentially interpret the signal given the profound changes in cellular and molecular landscapes apparent between these stages.

      Reviewer #2 (Public Review):

      The manuscript presents a novel role for RA signaling during development as the mediator of the switch that occurs in the dorsal neural tube after the neural crest cells have migrated and the roof plate forms. The finding is interesting and novel as the events that take place at the end of neural crest stage are poorly understood. The strengths of the manuscript are that the study is well planned and executed to show the interesting phenotype of delayed/disturbed roof plate formation accompanied with prolonged neural crest stage caused by inhibition of RA signaling in the dorsal neural tube. The results also show that RA signaling marks the RP territory and inhibits the DI1 interneurons from invading the region. The results bring novel information to the field. The original finding of the involvement of RA in the process was revealed in a RNAseq screen comparison between the neural crest and the roof plate (which was recently published by the same lab). However, the current study doesn't use any new technology such as high throughput screens or high resolution or live imaging etc., but rather relies mainly on "old fashioned" techniques: electroporation to induce transient inhibition of RA signaling in the dorsal neural tube followed by analysis of the phenotype by using chromogenic in situ hybridization. The chosen techniques are sufficient to convincingly show the point the authors want to make and the study serves as a reminder that fancy new techniques are not necessarily a requirement for creating a solid story. The manuscript is also well written and easy to follow.

      We thank this referee for a very positive feedback on our study. Although we are always motivated by the implementation of new techniques, we agree that the primary goal is to answer a biologically meaningful question with suitable means.

      Finally, the manuscript links the activation of RA signaling to the decline of BMP signaling and specifically the upregulation of BMP inhibitors in the dorsal neural tube at the end of the NC stage, but in its current form the proof of this proposed link remains weak.

      Our article uncovers for the first time and thoroughly documents, a role of local RA activity on the end of NC production and ensuing RP architecture. We believe that a comprehensive elucidation of the molecular mechanism responsible for inhibition of BMP signaling by local RA is the next obligatory step. We show in this study the selective activation of BMP inhibitors by endogenous RA and previously found that one of them, Hes/hairy, indeed inhibits BMP signaling and NC EMT (Nitzan et al, 2016). Therefore we propose that upregulation of BMP inhibitors by RA is a possible mechanism. However, we also predict that this is not the only one, and a deeper understanding of this problem is beyond the scope of the present study.

      Additional possibilities that fit with our data were now discussed: RA expression in somites vs. RP can be regulated by different enhancers and thus have distinct functions. For example, a specific enhancer driving expression of Raldh2 was found to be activated only at the definitive RP stage (Castillo et al., 2010). This enhancer contains Tcf binding sites and thus may be activated by Wnt signaling. In turn, as we show, RP-derived Raldh2 and resulting RA could negatively feed-back on Wnt signaling in the formed RP either directly or through BMP acting upstream of Wnt (this was now presented in a working model in Fig. 10B).

      Another possible scenario is that RA represses BMP signaling by inactivating Smad proteins via ubiquitination, as shown to be the case in selected cell lines (Sheng et al., 2010). These possibilities were discussed and await to be explored systematically.

      Similarly, the manuscript does not address the consequences of exposure of RA to the dorsal neural tube during NC stage and it thus remains unknown whether RA signaling is sufficient to end the NC stage and activate roof plate formation prematurely. Additional experiments of this kind would help clarify the role of RA in the dorsal neural tube and the reciprocal roles of the two signaling pathways (RA and BMP).

      We have now performed experiments to address the sufficiency of RA. Data were now added as Fig.5 Supp.2 and Supp.3, and Fig.6 Supp.2, and discussed.

      As we expected, gain of RA function at NC stages is not sufficient to prematurely activate BMP inhibitors, to end prematurely BMP signaling (pSMAD) or NC EMT, to alter the dynamics of expression of NC-specific genes, or to cause an earlier appearance of RP-specific traits.

      This result is totally understandable in light of RA being anyway active (but not produced) in NT at NC stages (original Fig.1) when BMP/Wnt signaling, a NC-specific gene network, and NC EMT are operational.

      The fact that RA is necessary but not sufficient for these processes further suggests that the key is in the following, perhaps complementary mechanisms: 1) a different interpretation of the same signal by NC progenitors at sequential stages of their ontogeny and then by RP cells, accounted for by the profound changes in cellular and molecular landscapes apparent between these stages. 2) the possibility that somite-derived versus RP-derived RA are differentially interpreted by the dorsal NT cells owing, for example, to a distinctive mode of ligand presentation (e.g; by CRABP1 expressed in RP but not NC, etc).

    1. Author Response

      Reviewer #1 (Public Review):

      In Figure 1A, the authors should show TEM images of control mock treated samples to show the difference between infected and healthy tissue. Based on the data shown in Figure 1B-E that the overexpression of GFP-P in N. benthamiana leads to formation of liquid-like granules. Does this occur during virus infection? Since authors have infectious clones, can it be used to show that the virally encoded P protein in infected cells does indeed exist as liquid-like granules? If the fusion of GFP to P protein affects its function, the authors could fuse just the spGFP11 and co-infiltrate with p35S-spGFP1-10. These experiments will show that the P protein when delivered from virus does indeed form liquid-like granules in plants cells. Authors should include controls in Figure 1H to show that the interaction between P protein and ER is specific.

      We agree with the reviewer and appreciate the helpful suggestion. As suggested, we added TEM images of control mock treated barley leaves. We also carried out immune-electron microscope to show the presence of BYSMV P protein in the viroplasms. Please see Figure 1–Figure supplement 1.

      BYSMV is a negative-stranded RNA virus, and is strictly dependent on insect vector transmission for infecting barley plants. We have tried to fuse GFP to BYSMV P in the full-length infectious clones. Unfortunately, we could not rescue BYSMV-GFP-P into barley plants through insect transmission.

      In Figure 1H, we used a PM localized membrane protein LRR84A as a negative control to show LRR84A-GS and BYSMV P could not form granules although they might associate at molecular distances. Therefore, the P granules were formed and tethered to the ER tubules. Please see Figure 1–Figure supplement 4

      Data shown in Figure 2 do demonstrate that the purified P protein could undergo phase separation. Furthermore, it can recruit viral N protein and part of viral genomic RNA to P protein induced granules in vitro.

      Because the full-length BYSMV RNA has 12,706 nt and is difficult to be transcribed in vitro, we cannot show whether the BYSMV genome is recruited into the droplets. We have softened the claim and state that the P-N droplets can recruit 5′ trailer of BYSMV genome as shown in Figure 3B. Please see line 22, 177 and 190.

      Based on the data shown in Figure 4 using phospho-null and phospho-mimetic mutants of P protein, the authors conclude that phosphorylation inhibits P protein phase separation. It is unclear based on the experiments, why endogenous NbCK1 fails to phosphorylate GFP-P-WT and inhibit formation of liquid-like granules similar to that of GFP-P-S5D mutant? Is this due to overexpression of GFP-P-WT? To overcome this, the authors should perform these experiments as suggested above using infectious clones and these P protein mutants.

      As we known, phosphorylation and dephosphorylation are reversible processes in eukaryotic cells. Therefore, as shown in Figure 5B and 6B, the GFP-PWT protein have two bands, corresponding to P74 and P72, which represent hyperphosphorylation and hypophosphorylated forms, respectively. Only overexpression of NbCK1 induced high ratio of P74 to P72 in vivo, and then abolished phase separation of BYSMV.

      In Figure 5, the authors overexpress NbCK1 in N. benthamiana or use an in vitro co-purification scheme to show that NbCK1 inhibits phase separation properties of P protein. These results show that overexpression of both GFP-P and NbCK1 proteins is required to induce liquid-like granules. Does this occur during normal virus infection? During normal virus infection, P protein is produced in the plant cells and the endogenous NbCK1 will regulate the phosphorylation state of P protein. These are reasons for authors to perform some of the experiments using infectious clones. Furthermore, the authors have antibodies to P protein and this could be used to show the level of P protein that is produced during the normal infection process.

      We detected the P protein existed as two phosphorylation forms in BYSMV-infected barley leaves, and λPPase treatment decreased the P44 phosphorylation form. Therefore, these results indicate that endogenous CK1 cannot phosphorylate BYSMV P completely.

      Based on the data shown in Figure 6, the authors conclude that phase separated P protein state promotes replication but inhibits transcription by overexpressing P-S5A and P-S5D mutants. To directly show that the NbCK1 controlled phosphorylation state of P regulates this process, authors should knockdown/knockout NbCK1 and see if it increases P protein condensates and promote recruitment of viral proteins and genomic RNA to increase viral replication.

      In our previous studies, BLAST searches showed that the N. benthamiana and barley genomes encode 14 CK1 orthologs, most of which can phosphorylated the SR region of BYSMV P. Therefore, it is difficult to make knockdown/knockout lines of all the CK1 orthologues. Accordingly, we generated a point mutant (K38R and D128N) in HvCK1.2, in which the kinase activity was abolished. Overexpression of HvCK1.2DN inhibit endogenous CK1-mediated phosphorylation of BYSMV P, indicating that HvCK1.2DN is a dominant-negative mutant.

      It is important to note that both replication and transcription are required for efficient infection of negative-stranded RNA viruses. Therefore, our previous studies have revealed that both PS5A and PS5D are required for BYSMV infection. Therefore, expression of HvCK1.2DN in BYSMV vector inhibit virus infection by impairing the balance of endogenous CK1-mediated phosphorylation in BYSMV P.

      Reviewer #2 (Public Review):

      The manuscript by Fang et al. details the ability of the P protein from Barley yellow striate mosaic virus (BYSMV) to form phase-separated droplets both in vitro and in vivo. The authors demonstrate P droplet formation using recombinant proteins and confocal microscopy, FRAP to demonstrate fluidity, and observed droplet fusion. The authors also used an elaborate split-GFP system to demonstrate that P droplets associate with the tubulur ER network. Next, the authors demonstrate that the N protein and a short fragment of viral RNA can also partition into P droplets. Since Rhabdovirus P proteins have been shown to phase separate and form "virus factories" (see https://doi.org/10.1038/s41467-017-00102-9), the novelty from this work is the rigorous and conclusive demonstration that the P droplets only exist in the unphosphorylated form. The authors identify 5 critical serine residues in IDR2 of P protein that when hyper-phosphorylated /cannot form droplets. Next, the authors conclusively demonstrate that the host kinase CK1 is responsible for P phosphorylation using both transient assays in N. benthamiana and a co-expression assay in E. coli. These findings will likely lead to future studies identifying cellular kinases that affect phase separation of viral and cellular proteins and increases our understanding of regulation of condensate formation. Next, the authors investigated whether P droplets regulated virus replication and transcription using a minireplicon system. The minireplicon system needs to be better described as the results were seemingly conflicting. The authors also used a full-length GFP-reporter virus to test whether phase separation was critical for virus fitness in both barley and the insect vector. The authors used 1, 6-hexanediol which broadly suppresses liquid-liquid phase separation and concluded that phase separation is required for virus fitness (based on reduced virus accumulation with 1,6 HD). However, this conclusion is flawed since 1,6-hexanediol is known to cause cell toxicity and likely created a less favorable environment for virus replication, independent of P protein phase separation. These with other issues are detailed below:

      1. In Figure 3B, the authors display three types of P-N droplets including uniform, N hollow, and P-N hollow droplets. The authors do not state the proportion of droplets observed or any potential significance of the three types. Finally, as "hollow" droplets are not typically observed, is there a possibility that a contaminating protein (not fluorescent) from E. coli is a resident client protein in these droplets? The protein purity was not >95% based on the SDS-PAGE gels presented in the supplementary figures. Do these abnormalities arise from the droplets being imaged in different focal planes? Unless some explanation is given for these observations, this reviewer does not see any significance in the findings pertaining to "hollow" droplets.

      Thanks for your constructive suggestions. We removed the "hollow" droplets as suggested. We think that the hollow droplets might be an intermediate form of LLPS. Please see PAGE 7 and 8 of revised manuscript.

      1. Pertaining to the sorting of "genomic" RNA into the P-N droplets, it is unlikely that RNA sorting is specific for BYSMV RNA. In other words, if you incubate a non-viral RNA with P-N droplets, is it sorted? The authors conclusion that genomic RNA is incorporated into droplets is misleading in a sense that a very small fragment of RNA was used. Cy5 can be incorporated into full-length genomic RNAs during in vitro transcription and would be a more suitable approach for the conclusions reached.

      Thanks for your constructive suggestions. Unfortunately, we could not obtain the in vitro transcripts of the full-length genomic RNAs (12706 nucleotides). We have softened the claim and state that the P-N droplets can recruit the 5′ trailer of BYSMV genome as shown in Figure 3B. Please see line 22, 177 and 190.

      According to previous studies (Ivanov, et al., 2011), the Rhabdovirus P protein can bind to nascent N moleculaes, forming a soluble N/P complex, to prevent from encapsidating cellular RNAs. Therefore, we suppose that the P-N droplets can incorporate viral genomic RNA specifically.

      Reference: Ivanov I, Yabukarski F, Ruigrok RW, Jamin M. 2011. Structural insights into the rhabdovirus transcription/ replication complex. Virus Research 162:126–137. DOI: https://doi.org/10.1016/j.virusres.2011.09.025

      1. In Figure 4C, it is unclear how the "views" were selected for granule counting. The methods should be better described as this reviewer would find it difficult to select fields of view in an unbiased manner. This is especially true as expression via agroinfiltration can vary between cells in agroinfiltrated regions. The methods described for granule counting and granule sizes are not suitable for publication. These should be expanded (i.e. what ImageJ tools were used?).

      We agree with the reviewer that it is important to select fields of view in an unbiased manner. We selected the representative views and provided large views in the new Supplement Figures. In addition, we added new detail methods in revision. Please see Figure 4–Figure supplement 1, Figure 5–Figure supplement 1, and method (line 489-498).

      1. In Figure 4F, the authors state that they expected P-S5A to only be present in the pellet fraction since it existed in the condensed state. However, WT P also forms condensates and was not found in the pellet, but rather exclusively in the supernatant. Therefore, the assumption of condensed droplets only being found in the pellet appears to be incorrect.

      Many thanks for pointing this out. This method is based on a previous study (Hubstenberger et al., 2017). The centrifugation method might efficiently precipitate large granules more than small granules. As shown in Figure 4B, GFP-PS5A formed large granules, therefore GFP-PS5A mainly existed in the pellet. In contrast, GFP-PWT only existed in small granule and fusion state, thus most of GFP-PWT protein was existed in supernatant, and only little GFP-PWT protein in the pellet. These results also indicate the increased phase separation activity of GFP-PS5A compared with GFP-PWT. Please see the new Figure 4F.

      Reference: Hubstenberger A, Courel M, Benard M, Souquere S, Ernoult-Lange M, Chouaib R, Yi Z, Morlot JB, Munier A, Fradet M, et al. 2017. P-Body Purification Reveals the Condensation of Repressed mRNA Regulons. Molecular Cell 68(1): 144-157 e145.

      1. The authors conclude that P-S5A has enhanced phase separation based on confocal microscopy data (Fig S6A). The data presented is not convincing. Microscopy alone is difficult for comparing phase separation between two proteins. Quantitative data should be collected in the form of turbidity assays (a common assay for phase separation). If P-S5A has enhanced phase separation compared to WT, then S5A should have increased turbidity (OD600) under identical phase separation conditions. The microscopy data presented was not quantified in any way and the authors could have picked fields of view in a biased manner.

      Thanks for your constructive suggestions. As suggested, turbidity assays were performed to show both GFP-PWT and GFP-PS5A had increased turbidity (OD600) compared with GFP. Please see Figure 4–Figure supplement 3.

      1. The authors constructed minireplicons to determine whether mutant P proteins influence RNA replication using trans N and L proteins. However, this reviewer finds the minireplicon design confusing. How is DsRFP translated from the replicon? If a frameshift mutation was introduced into RsGFP, wouldn't this block DsRFP translation as well? Or is start/stop transcription used? Second, the use of the 2x35S promoter makes it difficult to differentiate between 35S-driven transcription and replication by L. How do you know the increased DsRFP observed with P5A is not due to increased transcription from the 35S promoter? The RT-qPCR data is also very confusing. It is not clear that panel D is only examining the transcription of RFP (I assume via start/stop transcription) whereas panel C is targeting the minireplicon.

      Thank you for your questions and we are sorry for the lack of clarity regarding to the mini-replicon vectors. Here, we updated the Figure supplement 14 to show replication and transcription of BYSMV minireplicon, a negative-stranded RNA virus derivative. In addition, we insert an A after the start codon to abolish the translation of GFP mRNA, which allow us to observe phase separation of GFP-PWT, GFP-PS5A, and GFP-PS5D during virus replication. Use this system, we wanted to show the localization and phase separation of GFP-PWT, GFP-PS5A, and GFP-PS5D during replication and transcription of BYS-agMR. Please see Figure 6–Figure supplement 1.

      1. Pertaining to the replication assay in Fig. 6, transcription of RFP mRNA was reduced by S5A and increased by S5D. However, the RFP translation (via Panel A microscopy) is reversed. How do you explain increased RFP mRNA transcription by S5D but very low RFP fluorescence? The data between Panels A, C, and D do not support one another.

      Many thanks for pointing this out! We also noticed the interesting results that have been repeated independently. As shown the illustration of BYSMV-agMR system in Figure 6–Figure supplement 1, the relative transcriptional activities of different GFP-P mutants were calculated from the normalized RFP transcript levels relative to the gMR replicate template (RFP mRNA/gMR), because replicating minigenomes are templates for viral transcription.

      Since GFP-PS5D supported decreased replication, the ratio of RFP mRNA/gMR increased although the RFP mRNA of GFP-PS5D is not increased. In addition, the foci number of GFP-PS5D is much less than GFP-PWT and GFP-PS5A, indicating mRNAs in GFP-PS5D samples may contain aberrant transcripts those cannot be translated the RFP protein. In contrast, mRNAs in GFP-PS5A samples are translated efficiently. These results were in consistent with our previous studies using the free PWT, PS5A, and PS5D.

      Reference: Gao Q, et al. 2020. Casein kinase 1 regulates cytorhabdovirus replication and transcription by phosphorylating a phosphoprotein serine-rich motif. The Plant Cell 32(9): 2878-2897.

      1. The authors relied on 1,6-hexanediol to suppress phase separation in both insect vectors and barley. However, the authors disregarded several publications demonstrating cellular toxicity by 1,6-hexanediol and a report that 1,6-HD impairs kinase and phosphatase activities (see below). doi: 10.1016/j.jbc.2021.100260,

      We agree with the reviewer that 1, 6-hexanediol induced cellular toxicity. Therefore, we removed these results, which does not affect the main conclusion of our results.

      1. The authors state that reduced accumulation of BYSMV-GFP in insects and barley under HEX treatment "indicate that phase separation is important for cross-kingdom infection of BYSMV in insect vectors and host plants." The above statement is confounded by many factors, the most obvious being that HEX treatment is most likely toxic to cells and as a result cannot support efficient virus accumulation. Also, since HEX treatment interferes with phosphorylation (see REF above) its use here should be avoided since P phase separation is regulated by phosphorylation.

      We agree with the reviewer that 1, 6-hexanediol induced cellular toxicity and hereby affected infections of BYSMV and other viruses. In addition, 1, 6-hexanediol would inhibit LLPS of cellular membraneless organelles, such as P-bodies, stress granules, cajal bodies, and the nucleolus, which also affect different virus infections directly or indirectly. Therefore, we removed these results, which does not affect the main conclusion of our results.

      Reviewer #3 (Public Review):

      Membrane-less organelles formed through liquid-liquid phase separation (LLPS) provide spatiotemporal control of host immunity responses and other cellular processes. Viruses are obligate pathogens proliferating in host cells which lead their RNAs and proteins are more likely to be targeted by immune-related membrane-less organelles. To successfully infect and proliferate in host cells, virus need to efficiently suppressing the immune function of those immune-related membrane-less organelles. Moreover, viruses also generate exogenous membrane-less organelles/RNA granules to facilitate their proliferation. Accordingly, host cells also need to target and suppress the functions of exogenous membrane-less organelles/RNA granules generated by viruses, the underlying mechanisms of which are still mysterious.

      In this study, Fang et al. investigated how plant kinase confers resistance against viruses via modulating the phosphorylation and phase separation of BYSMV P protein. They firstly characterized the phase separation feature of BYSMV P protein. They also discovered that droplets formed by P protein recruit viral RNA and other viral protein in vivo. The phase separation activity of P protein is inhibited by the phosphorylation on its intrinsically disordered region. Combined with their previous study, this study demonstrated that host casein kinase (CK1) decreases the phase separation of P protein via increasing the phosphorylation of P protein. Finally, the author claimed that the phase separation of P protein facilitates BYSMV replication but decreases its transcription. Taking together, this study uncovered the molecular mechanism of plant regulating viral proliferation via decreasing the formation of exogenous RNA granules/membraneless organelles. Overall, this paper tells an interesting story about the host immunity targeting viruses via modulating the dynamics of exogenous membraneless organelles, and uncovers the modulation of viral protein phase separation by host protein, which is a hotspot in plant immunity, and the writing is logical.

      Thanks for your positive comment on our studies.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: Figures we made to respond to the referee comments appear to be not supported by the ReviewCommons system.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      The authors characterized a new lncRNA locus named FLAIL that controls flowering time in Arabidopsis thaliana. The functional validation of this locus is strongly supported by the use of several different tools (CRISPR-Cas9 deletions, T-DNA insertion, amiRNA gene silencing, and transgene complementation of KO lines). It is also suggested that FLAIL lncRNA works in trans but not in cis. There are strong observations supporting that FLAIL works in trans.

      Moreover, it is suggested that FLAIL regulates gene expression by interacting with distant chromatin loci. This was assessed using RNA-Seq and ChIRP-Seq. Yet, the overlap between DEGs in the flail mutant and FLAIL binding sites at the chromatin is very small, with only 12 genes. From those, only 2 flowering genes' expression was rescued by FLAIL transgene complementation. The final conclusion that FLAIL lncRNA represses flowering by direct inhibition of the 2 flowering genes expression is correlative, and lacks genetic validation.

      #1.1 We plan to support the conclusions in the manuscript genetically as the reviewer suggests. We started these experiments yet they will require the timeframe of the full revision.

      In addition inspection of the supplementary file shows that the ChIRP analysis was done without filtering for the FDR so that some of the positive hits have an FDR of 0,232.

      #1.2 We strengthened the manuscript by implementing and FDR filter of ChIRP-seq results. The distribution of FLAIL binding sites in Fig. S7B and Table S4, and overlapping numbers between DEGs and FLAIL-ChIRP in Fig. S8A were correspondingly updated.

      In addition, many of the peaks land in intergenic regions with is not mentioned in the text a graph with the position of the peaks in respect to nearby genes would help.

      #1.3 Thank you for the suggestion, we strengthened the manuscript with the requested analysis. We implemented the FDR filter, then we used "tssRegion" in ChIPseeker to set distance to the nearest TSS as (-1000, 1000), then most peaks were located in promoter regions (67.24%) and in intergenic regions with 16.38%. Since many papers present the position of the peaks by ChIPseeker (PMID: 32338596, PMID: 28221134, PMID: 31081251, PMID: 32012197, PMID: 31649032, PMID: 32633672) we also applied a similar method to display a distribution of FLAIL binding loci relative to distance from the nearest TSS in Fig. S7C.

      In one sentence, the authors used the right model system and methodology, including advanced techniques, to characterize a new trans-acting lncRNA important for controlling the flowering time in Arabidopsis but lack evidence supporting a mechanism of action that goes beyond the interaction with several chromatin loci.

      **minor points:**

      line#63-64 the authors say the COLDAIR and ASL work on FLC in cis in my view the original papers suggested/showed they work in trans.

      #1.4 We increased precision by changing this sentence to ‘Vernalization-induced flowering associates with several lncRNAs such as ____COOLAIR____, COLDAIR____, ANTISENSE LONG (ASL), and COLDWRAP____ that in cis or in trans locally repress gene expression of FLOWERING LOCUS C (FLC), a key flowering repressor at different stages of vernalization’____.

      Fig 1B please add some more protein-coding RNAs for the bio-info analysis for comparison

      #1.5 ____done.

      Order of Supplementary Fig citation is mixed with S2 coming before S1B

      #1.6 Thank you, we ordered all figures by appearance in the text. __

      __

      It would help the reader to have a schematic of the crisper deletions, T-DNA insertion, and position of primers used for the RT-qPCR.

      #1.7 We enhanced our presentation of Fig. 1A. It shows a schematic of them as well as positions of primers.

      In the supplementary PDF file, some of the text is missing on page 3 beginning and end of lines.

      #1.8 we ensured all text in new submission.

      Reviewer #1 (Significance (Required)):

      The use of several different tools to validate the biological function of FLAIL locus is a major strength of this work.

      The authors propose that flowering time and its gene regulation are controlled by sense FLAIL lncRNAs. However, the sense transcription of FLAIL locus is not detected in wild-type plants by TSS-Seq, TIF-Seq, or plaNET-Seq.

      #1.9.1 There appears to be some confusions. Transcription of sense FLAIL can be observed in chr-DRS, TSS-seq, TIF-seq in wild type and even in plaNET-seq in NRPB2-FLAG nrpb2-1 plant. We enhanced presentation of Fig. 1 and provided a more clear description in Line 81-99.

      If the authors would have explored further the expression of FLAIL transcripts in different stages of development (vegetative and non-vegetative) and in response to different conditions, it would make their claims on the function of FLAIL lncRNAs more convincing. Additionally, flail mutants could have been obtained in the hen-2 background, since it's there where we can observe FLAIL transcription.

      #1.9.2 Thank you for the suggestion. We included additional analyses in ____Fig. S2 for FLAIL transcription level in different tissues and different abiotic stress conditions base on 20,000 publicly available RNA-seq libraries (PMID: 32768600). Although many libraries are non-stranded, this analysis determined that sense FLAIL or total FLAIL (including sense and antisense) is broadly expressed over many tissues and induced in response to many abiotic stresses (Fig. S2A-B), therefore suggesting that FLAIL may be needed broadly in Arabidopsis.

      FLAIL locus lays on the proximal promoter region of PORCUPINE (PCP), an important regulator of plant development. As flail mutants, pcp mutants display an early flowering phenotype. The authors show no link between FLAIL and PCP from the overlap between re-analysis of published RNA-Seq data for pcp and RNA-Seq and ChIRP-Seq from the authors. This analysis is not enough to exclude the involvement of PCP from the FLAIL function. PCP expression using RT-qPCR should be performed in flail mutants to further support that FLAIL works independently from PCP.

      #1.10 We strengthened this conclusion by adding the requested experiment. PCP transcription level in flail3 mutant was provided by RT-qPCR and RNA-seq in Fig. S11A-B.

      This work does not hypothesize any molecular mechanism besides the interaction of FLAIL lncRNAs with several chromatin loci. It was recently proposed in Arabidopsis that a trans-acting lncRNA interacts with distant loci via the formation of R-loops. The authors do not comment on that. This work would benefit in correlating FLAIL binding sites with R-loop-forming regions mapped in Arabidopsis, regardless of the results from this analysis. Additionally, the authors could attempt to look for a motif responsible for FLAIL binding.

      Check R-loop forming data R-loops (Santos-Pereira and Aguilera, 2015) in Arabidopsis, determined by DRIP-seq (Xu et al., 2017).

      #1.11 Thanks very much for this excellent suggestions.

      First, we searched for a consensus DNA motif on FLAIL binding regions by Homer. We determined four commonly enriched DNA sequence motifs among FLAIL target genes (Fig. 4G). Notably, the target genes CIR1 and LAC8 contained consensus sequences that matched to all FLAIL binding motifs (Fig. 4G). These data are consistent with a model where FLAIL binds DNA targets through a sequence complementary mechanism. Functionally important sequences are frequently conserved among evolutionarily distant species, we observed three motifs that appeared to cross-species conserved (Fig. S9), suggesting a potential evolutionarily constrained role.

      Second, we indeed identified R-loops peaks on several of FLAIL binding sites by DRIP-seq (Xu et al., 2017). For example, we observed R-loop formation over three FLAIL binding motifs at CIR1 locus and one at LAC8 (Fig. R1), indicating that R-loop formation may also be a factor determining FLAIL binding. Even though R-loop peaks are present at several FLAIL targets, full elucidation if R-loop formation determines FLAIL targeting requires further experimental evidence is beyond the scope of the current manuscript.

      Fig. R1 Representative tracks at LAC8 and CIR1 showing R-loop formation by DRIP-seq on Watson strand (w-R loops), Crick strand (c-R loops). Undetectable R-loops after RNAse-H treatment was shown as negative control. Four conserved sequence regions of FLAIL binding motifs were indicated by red arrows at LAC8 and CIR1 loci. Gene annotation was shown at the bottom.

      Most of the key conclusions are convincing, except for the flowering time control directly through CIR1 and LAC8, which should be mentioned as speculative

      ____#1.12____ Thank you for finding most key conclusions convincing. We plan strengthen the manuscript with additional genetic evidence to as part of the full revision.

      The words locus and loci are latin and they should be written in italic. The word Brassicaceae, referring to the family should be in italic, and should not be "Brassicaceaes". The word analysis has the wrong spelling.

      #1.13 We follow conventions given in Scientific Style and Format: The CBE Manual for Authors, Editors and Publishers (1994) Cambridge University Press, Cambridge, UK, 6th edn. The words locus and loci are common Latin terms and should not be italicized. However, should the format of the final prefer these words in italics we will change it later. We improved consistency of using italics. “Brassicaceaes” was changed to “Brassicaceae”.

      "How much time do you estimate the authors will need to complete the suggested revisions: this is difficult to answer as it depends to which level the author would like to take their work. In my view, if all new experiments would have to be started from scratch it is too far away to be estimated.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this ms, the authors identified the FLAIL lncRNA that represses flowering in Arabidopsis from a locus producing sense and antisense transcripts. They use an allelic series involving T-DNA insertions, CRISPR/Cas9 and artificial miRNAs to study the role of FLAIL in flowering. A complementation series of constructs of the flail3 allele allowed them to show that the sense FLAIL lncRNA can act in trans. RNAseq revealed a small group of genes linked to the regulation of flowering whose expression is affected in the mutant and restored in the complementation line. To gain further insight into FLAIL function, the authors used a ChIRPeq approach to test whether the lncRNA can recognize potential target genes along the genome and they could show that FLAIL binds specific genomic regions. Clearly, this paper shows very nice evidence that the FLAIL lncRNA can act in trans to regulate gene expression. Nevertheless, there are certain points that need to be clarified to further support the action of the sense FLAIL transcript.

      1.According to Fig. 1 A, the antisense FLAIL is "internal" to the DNA genomic area spanning the sense FLAIL. Hence, with direct RT-qPCR is very difficult to distinguish between these molecules as a minor "RT" activity of the Taq polymerase may lead to detection of low levels of antisense, idem if RDRs may generate low antisense levels. Although I think that the plaNET seq brings strong evidence about the start and ends of these molecules, to measure them by RT-qPCR is not trivial and requires the use of strand-specific RT-PCR using a 5' extension of the oligo and amplification with one oligo of the FLAIL sequence (sense or antisense) and the added oligo.

      #2.1.1 Thanks for this good suggestion. We tested both sense and antisense FLAIL transcription using oligo linked gene specific reverse primers for RT and a pair of the linked oligo and gene specific forward primer for qPCR. Primer locations were shown in Fig. 1A and new data were in Fig. 1C-D, Fig. 2B-C, and Fig. S4B-C.

      It is not clear how they could distinguish precisely sense and antisense particularly when both RNAs correlate as it is the case here in all alleles (Fig. 1 C and 1D). This should be more explicitly mentioned in the materials and methods section.

      #2.1.2 We gave a description of strand specific RT-qPCR method in detail in Line 397-402.

      2.In Fig. 2, what are the levels of antisense in the complementing lines with the sense transcript? And reciprocally sense levels in antisense constructs?

      #2.2 We added this data in Fig. 2B-C and described in Line 136-143. We indeed observed that sense FLAIL transcripts in the transformed asFLAIL construct or asFLAIL transcripts in the transformed sense FLAIL construct was similar to the control 35S:GUS (Fig. 2B-C), validating that NOS terminator inhibits antisense transcripts. We also noted that the transformed 35S:GUS and sense FLAIL construct expressed higher asFLAIL compared to the flail3 mutant (Fig. 2C). This may be caused by a T-DNA insertion of the resulting transgenic plants.

      This will definitively demonstrate the assumption that the T-NOS termination will not allow any expression on the other strand. At present, only one of the lncRNAs is measured in each experiment?

      #2.3 We appreciate the next-level reflection of this reviewer, with so many regions initiating cryptic antisense transcription it is an interesting challenge to identify a 3´- terminator that initiates no or poor antisense transcription.

      First, previous published data argue that the NOS terminator is largely abolishing initiation of antisense transcription (PMID: 33985972, PMID: 30385760, PMID: 27856735). All these studies address roles of antisense transcription by generating mutations abolishing antisense lncRNA transcription using the NOS terminator sequences.

      Second, to satisfy the curiosity of this reviewer, we provide data below that from another manuscript of the lab in preparation. It’s a screenshot of plaNET-seq in fas2-4 NRPB2-FLAG nrpb2-1 mutant carrying a pROK2 construct. The pROK2 T-DNA coincidentally carries a NOS terminator. We mapped plaNET-seq reads to the pROK2 scaffold to display the reads. In pROK2, a NOS promoter activates NPTII expression (red) with NOS terminator as a terminator sequence. No antisense transcription (blue) is detectable by this sensitive method to detect nascent transcripts. Taken together, the selection of the NOS terminator as a region suppressing initiation of antisense transcription represents a valid choice.

      Fig. R2 Genome browser screenshot of plaNET-seq at NPTII locus of pROK2 T-DNA vector in fas2-4 NRBP2-FLAG nrpb2-1 mutant. This mutant carries a pROK2 construct, in which a NOS promoter activates NPTII expression with NOS terminator a terminator sequence. Sense strand was shown in red and antisense strand in blue. pROK2 annotation was shown at the bottom.

      3.In Fig. 3, it will be important to also show the FLAIL locus in the flail3 mutants (in comparison to the wt) as well as the transgene locus. Here the reads will be strand specific and furthermore this will allow to show that the transgene is not generating antisense transcripts (through RDRs for gene silencing?) and confirm that the sense FLAIL is required for the complementation.

      #2.4 Thank you very much for this suggestion. NGS reads for endogenous FLAIL and transgenic FLAIL both map to the FLAIL locus, so we show the FLAIL locus in Fig 3B. This representation shows that sense FLAIL transcripts were significantly reduced in flail3 and rescued in complementation line comparing to wild type. These data argue against the idea of gene silencing and linked antisense production from the transgene. However, RNA-seq suggests that an isoform of asFLAIL appears to accumulate in flail3. Since we fail to identify this accumulation by strand specific RT-qPCR result in flail3 and in CRISPR-deletion lines, this may be an asFLAIL isoform resulting from the T-DNA insertion.

      4.In Fig. S5, the expression of FLAIL is shown in the artificial miRNA lines. Is the antisense FLAIL affected "indirectly" by the cleavage of the amiRNA or remains constant? This is likely the case but should be shown.

      #2.5 We added this result in Fig. S4C and expression level of asFLAIL remains constant compared to the transformed empty vector control.

      5.The ChIRPseq data adds major novelty to the ms and brings new ideas about the way of action of FLAIL. However, are there any common epigenetic states between ChIRP targets (e.g. histone modifications, antisense RNA production, homologies "detected" in the conserved regions between Camelina and Arabidopsis and the target loci? Or others) that may highlight potential mechanisms leading to repression mediated by FLAIL of these loci? There are many databases that could be explored (even during flowering) to search for potential relationships. Although precise description of the mechanism is out of the scope of this ms, this can be discussed in more detail to further expand on the nice data obtained.

      #2.6 We searched for a consensus DNA motif on FLAIL binding regions by Homer. We determined four commonly enriched DNA sequence motifs in target genes. Notably, the target genes CIR1 and LAC8 contained consensus sequences that matched to all FLAIL binding motifs (Fig. 4G). These data are consistent with a model where FLAIL binds DNA targets through a sequence complementarity mechanism. Functionally important sequences are frequently conserved among evolutionarily distant species, we observed three motifs that appeared to cross-species conserved (Fig. S9), suggesting a potential evolutionarily constrained role.

      **Minor comments:**

      6.In Fig. S3, a global alignment between FLAIL and two loci in Arabidopsis and Camelina is sown. What is the extent of homology? How conserved is this sequence at nucleotide level (small or very long?) to support the conservation of this lncRNA. Are there potential structures conserved among these lncRNAs?

      #2.7 T____wo consensus regions of ____FLAIL____ sequences among eleven disparate Brassicaceae genomes were shown in Fig. S9. ____Camelina sativa_ shared 98-nucleotide_ conserved sequences with Arabidopsis thaliana. In the future, it will be interesting to explore evolutional conserved structures among Brassicaceae genomes. However, these analyses are beyond the scope of the current manuscript.

      7.In Fig. S4B, arrows may help to understand which seeds were selected.

      __#2.8 Thanks. Arrows were included.____

      __

      Reviewer #2 (Significance (Required)):

      This paper is a very nice piece of work and demonstrate the action of a long non-coding RNA (lncRNA) in trans on specific targets involved in the regulation of a developmental process, flowering. There is growing evidences that the non-coding genome hides large number of lncRNAs and there is little detailed genetic support for the action of lncRNAs globally. In contrast to many descriptive papers in the field, this ms demonstrates genetically, through an allelic series and complementation experiments, that this lncRNA locus is involved in flowering regulation and that its sense lncRNA recognizes target loci genome-wide, bringing interesting perspectives on potential new mechanisms of transcriptional regulation mediated by non-coding RNAs.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In the manuscript by Jin et al authors characterize the FLAIL DNA locus in Arabidopsis (using a wide array of publicly available datasets), which produces a set of sense and anti-sense lncRNAs.

      While our work on the FLAIL manuscript was ongoing we published the manuscripts where we presented these novel genomics methods and related data to capture nascent transcription and cryptic isoforms. We shared most data with TAIR, so we are happy to hear that these data are considered publically available.

      Authors determined that the sense FLAIL lncRNA (or a set of sense lncRNAs, which isn't fully clear from the way the data are presented) is involved in flowering time in Arabidopsis based on the fact that the several flail mutants lead to the early flowering phenotype and this flowering defect is complemented by transgenic FLAIL DNA, meaning that FLAIL lncRNA acts in trans.

      A series of experiments lead us to conclude that the sense isoform of FLAIL is responsible for the effect. We improved the data representation and writing of the manuscript to enhance accessibility.

      The T DNA flail3 - mutant results in expression changes (up or down) of 1221 genes, including twenty genes linked to flowering in various ways. Expression of a group of these flowering-related genes could be either fully (for eight genes) or partially (for five genes) rescued by transgenic FLAIL. Authors also conducted the ChIRP-seq to determine which genes are physically bound by FLAIL lncRNA genome-wide. It was found that 210 genes in the genome are bound by FLAIL lncRNA. Comparison of the dataset of differentially expressed genes in the T-DNA flail3 mutant with the ChIRP-seq dataset of genes that are bound by FLAIL lncRNA revealed the 12 overlapping genes.

      Among these twelve overlapping genes, four were found to be functionally connected to flowering with expression of these four genes being down in flail3 T-DNA mutant. Two out of these four genes were ruled out from being involved in the regulation of flowering by FLAIL. Authors conclude that the two other genes (Cir1 and Lac8) are responsible for the late flowering phenotype of flail mutants based on the three lines of evidence: (i) these genes expression is reduced in the flail mutant, (ii) FLAIL lncRNA directly interacts with these genes chromatin, (iii) the mutants of these genes were previously reported by others to display early flowering phenotypes too. While I find many of the findings reported in the manuscript very interesting, building a good foundation on which to expand the study and providing a very good leads for follow up experiments, I also have serious concerns about the manuscript in its current form.

      Most importantly, this reviewer doesn't think that the mechanism of FLAIL lncRNA action was convincingly demonstrated. The main question would be how FLAIL lncRNA works and this question wasn't fully answered. It is great that FLAIL lncRNA binds directly to the two flowering-related genes, but what does it mean? Does it change any chromatin context of these genes quantitatively or qualitatively to affect the transcription? Or does it bind any components of transcriptional machinery and thus controls the transcriptional output?

      #3.1 This manuscript addresses an important question in the field question: what is the evidence for functional elements in non-coding regions of genomes? Despite many efforts, convincing genetic support for these functions often remained limited. In addition to our strong genetic data, we provided new evidence that FLAIL recognizes targets with evolutionally conserved sequence motifs as part of the revision in Fig 4F and Fig. S9. Additionally, we plan to do ChIP-qPCR to identify histone modifications on FLAIL targets.

      Additionally, flail3 T-DNA mutant affects the expression of 1221 genes and FLAIL lncRNA physically interact with 210 genes, so how can authors be fully sure that FLAIL lncRNA has only direct effect on these two genes and doesn't also contribute to the regulation of the upstream to Cir1 and Lac8 genes or even components of the transcriptional machinery that regulate these genes?

      #3.2 We agree with this opinion. It is the reason why we felt stating this exact conclusion in our previous manuscript was justified. We improved accessibility of our manuscript in the revision, these clarify our model, that the trans-acting lncRNA sense FLAIL can interact with the chromatin regions of its target genes to directly or indirectly regulate gene expression changes involving flowering (Line 274).

      Theoretically, doing RNA-seq in the amiR-FLAIL sense lncRNA mutant might have a chance of reducing the number of affected DEGs, making it easier to analyze the FLAIL targets, even if the allele can't be used for complementation experiments.

      #3.3 Thanks for this suggestion. We plan to confirm key gene expression changes using amiRNA-FLAIL in full revision.

      Also, auhors totally neglect putting the Cir1 and Lac8 genes into the context of flowering regulation, but it is something that needs to be done.

      #3.4 ____We discussed roles of CIR1 and LAC8 in flowering regulation in Line 260-272. Flowering is fine-tuned to maximize reproductive success and seed production and by endogenous genetic cues and external environmental stimuli such as photoperiod. Nevertheless, many details of the flowering pathways and their integration remain to be investigated____. CIR1 is a circadian clock gene, induced by light and involved in a regulatory feedback loop that controls a subset of the circadian outputs and thus determines flowering time. Our GO analysis supports that a subset of DEGs are connected to the response to red or far red light that contains among other key flowering genes such as ____phytochrome interacting factor____ 4____ (PIF4) and CONSTANS (CO)____. FLAIL also binds the chromatin region of LAC8. LAC8 is a laccase family member that mainly modulates phenylpropanoid pathway for lignin biosynthesis____. Similar to flail, lac8 mutants flower early. While intermediates in this pathway or dysregulation of lignin-related genes could promote flowering in plants, the molecular connections of reduced LAC8 expression to effects on flowering time will require further investigation.

      Lastly, the paper needs to be totally rewritten to be even properly evaluated. In its current state it reads like a very short draft.

      #3.5 We reorganized the structure of manuscript, improved clarity and provided new mechanistic evidence in Fig. 4G and Fig. S9 to present a more complete manuscript.

      The Abstract is weak, the Introduction is written in a such telegraphic style that it is barely readable, in many places there is no connections between sentences leading to the information appear to be presented as random, even if it isn't.

      #3.6- We strengthened the Abstract by providing new evidence and improved for the Introduction.

      The Results section is written rather rudimentary with information not being sufficiently provided to describe the results but rather scattered between the Results and Figure legends.

      #3.7 Thanks for your suggestions, we described each FLAIL length and all constructs in detail in Results, put a schematic of T-DNA and CRISPR mutants in Fig. 1A, moved comparative genomics data to the end of Results and ensured all figures in order.

      The Discussion is the best written part of the manuscript.

      Thanks for your appreciation of the Discussion.

      The Conclusion section carries no specific information and reads more like a little summary suitable for a review article rather than experimental paper.

      #3.8 We agree this opinion, this paragraph fits Discussion better and Conclusion was removed.

      Therefore, this reviewer thinks that regardless of how authors will choose to proceed with the current experimental version of the manuscript, it'd be in the authors' best interests to at least fully revise the paper before resubmitting anywhere. I'd also advise authors to seek professional editorial help specifically using an editor with the background in the plant sciences.

      Authors might also want to consider moving Fig.3 into the Suppl. as it doesn't carry much weight or significance and perhaps make existing figures more meaningful and comprehensive and by including a better diagram of the locus (e.g., Fig. S1), etc.

      #3.9 We thank this helpful suggestion. Fig.3 represents the RNA-seq data. In combination with supporting data in the supplementary material, it gives an easy visual readout of the reproducibility of the findings in replicates of stranded RNA-seq. In a new submission, we moved it to Fig. S5B and highlighted 13 differentially expressed flowering genes as well as sense FLAIL in flail3 that were rescued in complementation line in Fig. 3A. Moreover, we gave screenshots of FLAIL itself and four flowering related FLAIL targets in RNA-seq with a clear schematic representation of each locus. We believe these revisions improve Fig. 3.

      It's not practical to list all issues with the writing as the paper requires total re-writing, so I can just make a few suggestions without any specific order to help authors improve the paper:

      We are happy to improve our manuscript with the help of the reviewers. We addressed all comments including from reviewer #3 with a constructive spirit. However, since colleagues and reviewers #1 and #2 found the manuscript comprehensible to the point where they could make expert-level comments that illustrate understanding of the manuscript, a total re-writing did not feel like the most constructive suggestion to improve the manuscript.

      --There is no statement anywhere that states the goal of the study.

      #3.10____ We stated the goal of the study in line 50-69 and we think this is a misunderstanding. We summarized three issues currently exist in characterization of functional lncRNA in the last sentence of the first three paragraphs in Background: 1 in Line 50, the broad range of candidate hypotheses by which lncRNA loci may play functional roles call for multiple approaches to distinguish alternative molecular mechanisms. 2 in Line 59, functional characterization of trans-acting lncRNAs remains a key knowledge gap to understand the regulatory contributions of the non-coding genome. 3 in Line 69, the contribution of trans-acting lncRNAs to the regulation of distant flowering genes is currently unclear. So in the last paragraph of the background, we claimed that our goals are to address these questions through characterization of functional FLAIL lncRNA in flowering repression using multiple genetic approaches and various genomic data.

      --No rational is provided on why authors decided to examine this specific genomic locus.

      #3.11 For several years, our lab studies the rules and roles of non-coding transcription. We characterized and are characterizing several loci with evidence of non-coding transcription in a range of species. Early experiments suggested that FLAIL functioned in flowering, this manuscript clarifies that the function is executed as trans-acting lncRNA of the sense FLAIL isoform.

      --Typically, the significance is in studying the function of lncRNA or a group of lncRNAs produced from a genomic locus, I don't think I ever encountered the instances when it was exciting to study just a specific genomic locus. If the locus does indeed have any significance for initiating the study, it needs to be explained.

      #3.12 This study is remarkable in many aspects. We fully discuss key strengths in the discussion. First, we ____exhibit a trans-acting lncRNA FLAIL that represses flowering by promoting the expression of floral repressor genes as discussed in Line 247-281_; Second, in Line 284-306, we informed that this study provide a compelling model about how to apply _series of convincing genetic data____ to functionally characterize lncRNA loci. Third, in Line 307-312, evolutionary conserved FLAIL sequences across species is key to characterize the functional _microhomology in other _Brassicaceae.

      --The locus can produce lncRNAs, but it can't harbor them.

      #3.13 We clarified this confusion by enhancing ____presentation of Fig. 1 and providing a more clear description of each sequencing method and results in Line 81-99. Although we provided evidence that transcription of both sense and antisense FLAIL are more stable in hen2-2, they were clearly observed in chr-DRS in wild type and plaNET-seq in NRBP2-FLAG nrpb2-1 and sense FLAIL was even detected in TSS-seq and TIF-seq in wild type.

      --No length of FLAIL lncRNAs or their range is provided in the first section of Results.

      #3.14 We gave the length of sense FLAIL in Line 82 and antisense FLAIL in Line 86.

      --On many occasions authors don't state rational for doing experiments, which leads to information often flowing as random.

      #3.15 we enhanced clarity of the rational for each experiment and made some connections between sentences to make more fluent. For example, in sentences in Line 99, Line 113, Line 126, Line 159, Line 183, Line 214, and Line 219.

      --What do authors mean by the subtitle "FLAIL characterizes a trans-acting lncRNA repressing flowering"? How can lncRNA FLAIL or FLAIL locus characterize lncRNA?

      #3.16 We changed it to “FLAIL represses flowering as trans-acting lncRNA” in Line 112.

      --Check all figures. E.g., Fig. 3B-E mentions only accession numbers for the genes.

      #3.17 The systematic gene IDs are a valid way to represent data, in particular for genomics data since it facilitates cross-comparisons. To make it more accessible we also show systematic names of each gene in Fig. 3A-F, Fig. S6 and Table S3.

      --It is not clear where exactly the T-DNA insertion is located relative to sense FLAIL in flail3 mutant (Fig. S4).

      #3.18 We moved the schematic to clarify this to revised Fig. 1A and the exact T-DNA insertion site is mentioned in the legend.

      --- What is the length of the complementing sense FLAIL lncRNA?

      #3.19 We now include the length of the complementing sense and antisense FLAILs in Line 351-352.

      --Check the description of each and every construct used and provide explanation for each in the Results. E.g., the pFLAIL:gFLAIL18/88 and pasFLAIL:gasFLAIL18/39 constructs aren't explained in Results, and can only be found in Fig. 2 legends.

      #3.20 We described each construct including pFLAIL:gFLAIL18/88 and pasFLAIL:gasFLAIL18/39 constructs in Line 133, amiR-FLAIL-11 and amiR-FLAIL-11 in Line 149.

      Reviewer #3 (Significance (Required)):

      Tens of thousands of lncRNAs have been identified in various eukaryotes, but their biological roles have been shown only for a small fraction of them, and the mechanisms of their action are delineated for only a very few of them. Most of the advances on the field of lncRNAs are reported in metazoan, while the field of lncRNAs in plants is lagging far behind in terms of knowledge about lncRNAs with assigned biological functions or lncRNAs with delineated mechanisms of action. From this point of view, this reviewer is always excited to see any new functional plant lncRNAs for which either biological or mechanistic functions have been determined, and deems the information on this subject significant. The manuscript's findings are potentially very interesting and present a decent body of work that lays a very solid groundwork for future experiments. My main concern about the manuscript's significance in its current form is the fact that no real solid mechanism of action for the described lncRNA or a set of lncRNAs (?) has been demonstrated. The best mechanistically studied lncRNAs in Arabidopsis are involved in the regulation of flowering time, particularly those that function in the vernalization flowering pathway and to lesser extent in autonomous pathway. The new FLAIL lncRNA or lncRNAs (?) described in this manuscript also appear to regulate the flowering time in Arabidopsis, however more experiments would be needed to provide a definite conclusion about how direct FLAIL's effect is and how exactly it functions. That unfortunately obviously diminishes the significance of the manuscript and makes it potentially interesting only to researches studying flowering in Arabidopsis and even then the manuscript results would be incomplete to make solid conclusion.

      Lots of functional phenotype have

      Additionally, the manuscript requires complete re-writing.

      We thank this reviewer for the appreciation of __a decent body of work and a very solid groundwork for future experiments. We are confident that our revisions make the manuscript more comprehensible to highlight the qualities of our manuscript more accessibly.____

      __

    1. We need to be open to what takes placeand able to change our plans and go with whatmight grow at that very moment both inside thechild and inside ourselves.

      Isn't this the way children are though ? The way things are born inside them, and it may because of something we may take for granted as ordinary, something small or something big can cause a tremble or ripple...and it can grow into something exciting. I like to think it is that way for us too.

    1. Author Response

      Reviewer #1 (Public Review):

      Yang, Bhoo-Pathy, Brand et al detail their investigation of a large Swedish cohort compared with age matched controls to estimate the risk of short- and long-term cardiotoxicities of breast cancer therapies in a general breast cancer patient population. They find that breast cancer patients are at significantly increased risk of developing arrhythmia and heart failure both within the first year of cancer diagnosis as well as at least 10 years after. Interestingly, they find that there is an increased risk of ischemic heart disease within the first year after diagnosis, but no increased risk of ischemic heart disease in the long term.

      The authors should be commended for this large cohort study that achieves its goal of identifying the incidence and hazard ratio of cardiotoxicity associated with breast cancer treatment within a general breast cancer population. Their findings of increased risk of heart failure in patients treated with anthracyclines and trastuzumab is consistent with multiple prior studies in the field of cardio-oncology and adds to the validity of the data.

      The finding that there is only a slightly increased (and statistically insignificant) risk of ischemic heart disease after left sided radiotherapy is quite interesting, and as noted by the authors, differs from prior understandings about risk of ischemic heart disease associated with breast radiation therapy. Without data on mean heart dose or total radiation administered the results are hypothesis generating, but should not be utilized to guide medical decision making.

      One of the major limitations of this study is that the authors' goal is to identify the incidence and risk of cardiotoxicity associated with the various breast cancer treatment regimens and determine these risks over time, and as noted by the authors, the registry utilized only includes planned treatment not whether patients did receive this therapy (and what dose of therapy). This is a key point that should be emphasized when interpreting the results.

      As noted by the reviewer, the Stockholm-Gotland Breast Cancer Register only included the intended treatment without a detailed dosage of the therapy. However, the agreement between intended and administrated treatment was about 95% in Sweden (Löfgren,L et, al BMC Public Health. 2019). We have now further explained this in the discussion section.

      In Discussion: “Overall, our results indicate only small risk of heart disease due to radiotherapy in women treated in Sweden after year 2000. Further studies with detailed information on the mean heart dose of radiation or total cumulative radiation dose administered are therefore needed to confirm and provide more context to this finding.”

      In Discussion: “Besides, the Stockholm-Gotland Breast Cancer Register only records intended treatment, not whether patients actually received these therapies. However, the agreement between the intended and administered breast cancer treatment in Sweden has been previously reported to be about 95% (Löfgren et al., 2019).”

      There are several conclusions included in the discussion section that are not supported by the data from the results section and the authors should be careful to suggest mechanisms of cardiotoxicity from an observational population-based study. Examples include suggesting anthracyclines cause cardiotoxicity of the myocardium but not the cardiac vessels; attributing early increased risk of ischemic heart disease to emotional distress alone; and that inhibition of HER2 receptors in myocytes may explain cardiotoxicity caused by trastuzumab. These are interesting hypotheses that would be better supported by references to lab/animal model studies.

      We thank the reviewer for the suggestions and have now added the reference for the suggested mechanisms of cardiotoxicity with lab/animal model studies in the discussion section.

      In Discussion: “As the long-term risk was observed for heart failure but not ischemic heart disease, the cardiotoxic effect of chemotherapy might be mainly on the myocardium mediated by the effect of DNA double-strand breaks through topoisomerase (Top) 2β, but not the cardiac vessels. (Lyu et al., 2007)”

      In Discussion: “The finding that risk of ischemic heart disease in breast cancer patients was only transiently elevated after diagnosis is not unexpected, considering the emotional distress of dealing with a new cancer diagnosis in the patients, which may lead to higher short-term rates of ischemic heart disease (Fang et al., 2012; Schoormans, Pedersen, Dalton, Rottmann, & van de Poll-Franse, 2016). In addition, surgery after breast cancer diagnosis might increase the risk of arterial thromboembolism (Gervaso, Dave, & Khorana, 2021), which includes myocardial infarction, and the effect appears to attenuate one year after diagnosis. (Navi et al., 2017; Navi et al., 2019).”

      In Discussion: “The cardiotoxic effect of trastuzumab meanwhile may be explained by inhibition of the HER2 receptors in myocytes, that activates the mitochondrial apoptosis pathway through modulation of Bcl-xL and -xS, which regulates cell development and growth (Grazette et al., 2004; Yeh & Bickford, 2009)”

      The authors succeed in highlighting the increased risk of cardiotoxicity associated with breast cancer treatment in the observed patient population. Rather than exploring the mechanism of cardiotoxicity for the treatment regimens observed, the data presented may be more useful to propose a longitudinal cardiac monitoring schedule for patients who have been treated for breast cancer, and who the current data suggest, are at long term risk for heart failure and arrhythmia.

      As we found increased long-term risk of heart failure in breast cancer patients, especially for those treated with Anthracyclines +Taxanes and Trastuzumab, we therefore suggest for a prolonged longitudinal cardiac monitoring schedule for ten or more years in these treated patients. We have added the suggestion in the discussion section.

      In Discussion: “Analysis by time since diagnosis revealed long-term increased risks of arrhythmia and heart failure following breast cancer diagnosis, suggesting that a longitudinal cardiac monitoring schedule might be helpful to improve cardiac health in breast cancer patients.”

      Reviewer #2 (Public Review):

      This is a registry based study in which patients diagnosed with locoregional breast cancer ( stage 1-111) from 2001-2008, between the ages of 25-75 were compared to a randomly sampled cohort of 10 women matched by the year of birth and for three specific cardiac conditions as outlined in the key objective. Data was gathered by cross referencing Subject's unique identification numbers in Swedish Cancer Register, Patient Register, Cause of Death, and Migration Register. Prescribed Drug Register was reviewed to gather information about prescribed medication to perhaps infer the medical comorbid conditions for which medication was prescribed. Breast cancer treatment specific information was missing in cases and presumption of use of Anti Her2 therapy was made based on HER2 neu status in some cases. While the primary objective of the study to show increased evidence primarily Heart failure and arrythmias seem to have been met in this patient registry based study, there is some question of the specificity of the data since it was gathered from the various registers and is subject to operator dependent biases.

      Strengths: Study is a long term follow up of patients treated with potential cardiotoxic drugs, confirming the previously known association of specific heart disease to the use of these drugs. Longest follow up seems to be for 16 yrs for the earliest cohort of 2001 and minimum approximately 10 yrs for the cohort of 2008. This study does confirm that long term risk that remains even after the treatment is completed and potentially suggests that more robust cardiac function monitoring guidelines for survivors may be warranted.

      Weaknesses: This is a patient register based study. As outlined above, data was extracted by cross referencing various patient registers. Since the data was dependent on the ICD codes entered in the patient register, there seems to be potential for missed information.

      The Swedish Patient Register has quite high validity for the heart diseases analyzed in this study, with a positive predictive value between 88%-98%, by using the main diagnosis in the register. However, it is still possible that we have missed some information for heart disease and we have emphasized this limitation in the discussion section.

      In discussion: “The Swedish Patient Register has high validity for heart failure, arrhythmia and ischemic heart disease (with positive predictive value between 88%-98%) (Hammar et al., 2001; Ludvigsson et al., 2011), by analysing main diagnoses only. However, misclassification of heart diseases may still have occurred.”

      Preexisting comorbidities were also extracted through Patient Registers hence may be subject to same potential for missed information.

      The Swedish Patient Register has relatively high validity for the majority of comorbid diseases. However, patients without severe symptoms of the diseases might be treated in the primary health care centers, which were not included in the patient register. We have therefore pointed out this limitation in the discussion section.

      In discussion: “In addition, preexisting comorbidities extracted from the patient registers may not include those patients with slight symptoms.”

      In addition, information for use of Trastuzumab was extrapolated from the Her2neu status of the patient when such information may not have been accessible through Prescribed Drug Registers.

      As the majority of HER-2 positive patients were treated in the clinics, the Swedish Prescribed Drug Register does not register their information. Because ~90% of HER-2 positive cancers were treated with trastuzumab between 2005 and 2008 in the Stockholm-Gotland region, we therefore used HER-2 positivity as a proxy for trastuzumab treatment. We have now further explained this in the methods section.

      In Materials and Methods: “As ~90% of HER-2 positive cancers were treated with trastuzumab between 2005 and 2008 in the Stockholm-Gotland region and the Swedish Prescribed Drug Register does not cover data on treatment with trastuzumab, HER-2 positivity was used as a proxy when no registry data on trastuzumab was available during this time period (30% of the HER-2 positive patients had missing information on trastuzumab).

      It is also unclear if there was any protocol in place for cardiac monitoring for patients receiving cardiotoxic chemotherapy or Anti Her2neu agents.

      In Sweden, there is no cardiac monitoring for chemotherapy in routine clinical practice. For HER2-therapy, cardiac monitoring with a thorough cardiac assessment prior to treatment, including history, physical examination, and determination of left ventricular ejection fraction before, during and right after treatment has been mandatory since introduction in clinical routine. We have now added this information to the discussion.

      In discussion: “As there is no cardiac monitoring for chemotherapy in routine clinical practice and cardiac assessment is only performed prior to and during the treatment period for HER-2 positive patients in Sweden, a longer-term cardiac monitoring program might be helpful for these patients.”

      Reviewer #3 (Public Review):

      This matched analysis uses data from patients newly diagnosed with breast cancer the Stockholm-Gotland Breast Cancer Register and data from patients in the general female population in Sweden to ask the question of whether breast cancer diagnosis (and subsequent treatments of breast cancer) is associated with an increased rate of heart disease after treatment. It is impossible to answer this question in a randomized controlled setting and would be unethical to randomize patients to not be treated for their cancer, thus a matched approach in theory would seem to make sense at face value. However, I have some concerns about the analysis that I believe impede their answering the research aims.

      1. With regard to the matched analysis of time to heart disease diagnosis, I have several critiques/questions. First, for the breast cancer cohort, were patients with a diagnosis of heart disease prior to cancer diagnosis included in the analysis? If so, how was the event (which precedes time = 0) incorporated into the analysis? If not, please make sure to make note of this important restriction. I think the latter approach is the better / correct.

      As suggested by Referee 3, we have now excluded those patients with a diagnosis of heart disease prior to cancer diagnosis. We have updated the results and the methods section accordingly.

      In Materials and Methods:

      “We included all patients diagnosed with non-metastatic breast cancer (stages I-III) and without prior diagnosis of heart disease at age 25 to 75 years (N = 8015).”

      Second, for the matched cohort, what is time = 0 for these persons? i.e. how does one interpret "Time since diagnosis" on Figure 1 for a patient who has not been diagnosed with breast cancer?

      We apologize for this misunderstanding and have revised it to “Time since index date (= date of diagnosis, which is the same date for corresponding matched individual from the general population) ” in Figure 1.

      Third, how was the matching incorporated into the FPM? Presumably there should be a frailty term of some sort to indicate the matched groups, within which there is expected to be correlation.

      In the flexible parametric survival model for matched cohort data, a shared frailty term was incorporated into the model to indicate the matched cluster. The maximum (penalized) marginal likelihood method is used to estimate the regression coefficients and the variance for the frailty. We have added this explanation in the methods part.

      In Materials and Methods: “Considering the correlation within the matched clusters, a shared frailty term (as random effects) was incorporated into the model and the maximum (penalized) marginal likelihood method was used to estimate the regression coefficients and the variance for the frailty.”

      1. It is noted that Kaplan Meier curves were used to estimate the cumulative incidence of heart disease. How was death of the patient prior to diagnosis of heart disease handled? I do not think that Kaplan Meier is the correct approach here but rather a Aaalen-Johansen-type estimator that treats death as a competing event. See e.g. https://pubmed.ncbi.nlm.nih.gov/10204198/ A Kaplan Meier will tend to overestimate the event rate when competing events are counted as censoring.

      As suggested by the reviewer, we have now used the Aalen-Johansen method to estimate the cumulative incidence of heart disease and revised the text in the Methods, as well as the tables and figures in the supplement.

      In Materials and Methods,: “Aalen-Johansen estimation was used to assess the cumulative incidences of heart diseases in breast cancer patients and matched reference individuals, while other causes of death were considered as competing events.”

      1. The sentence "Missing indicators were included for the analysis of these covariates in the model" and the results in Table 3 suggest that some missing values were analyzed 'as is', meaning that missingness was used as a category itself. This of course is not desirable and there exists methodology+software for more appropriately handling these data, e.g. multiple imputation with chained equations. For example, how does one interpret that 'unknown chemotherapy' status is positively associated with heart failure but less so than anthracycline based chemo.

      Missingness of the type of adjuvant treatment was considered as a category in the previous version of our manuscript. To address potential biases resulting from missing data, we have now used multiple imputation with chained equations and revised the methods and Table 3 accordingly.

      In Materials and Methods: “Multiple imputation with chained equations was used to deal with the treatment categories with missing information. We replaced the missing data with 10 rounds of imputations and all the covariates were included in the imputation model.”

      1. The reported HRs at the top of page 10 seem incongruous with the FPM model demonstrated in Figure 1, since there is clearly a non-linear relationship between the hazard and the outcome. In other words, there is little sense in which the hazards are proportional at all time points.

      As shown in the FPM model in Fig. 1, HRs were not constant according to time since index date. Therefore, in the revised version, we only showed the HRs separately in <1, 1-2, 2-5, 5-10 and 10-17 years after diagnosis. We have revised the abstract, methods, and Table 2.

      In Abstract: “Time-dependent analyses revealed long-term increased risks of arrhythmia and heart failure following breast cancer diagnosis. Hazard ratios (HRs) within the first year of diagnosis were 2.14 (95% CI = 1.63-2.81) for arrhythmia and 2.71 (95% CI = 1.70-4.33) for heart failure. HR more than 10 years following diagnosis was 1.42 (95% CI = 1.21-1.67) for arrhythmia and 1.28 (95% CI = 1.03-1.59) for heart failure. The risk for ischemic heart disease was significantly increased only during the first year after diagnosis (HR=1.45, 95% CI = 1.03-2.04).”

      In Materials and Methods: “We compared the risk of heart diseases in breast cancer patients with that observed in the matched cohort, using flexible parametric model (FPM) with time since index date as underlying time scale.”

      In Results: “A short-term increase in risks of arrhythmia and heart failure was found in breast cancer patients (Table 2, Figure 1, HR at first year for arrhythmia= 2.14; 95% CI = 1.63-2.81, for heart failure =2.71; 95% CI = 1.70-4.33, respectively).”

      1. It seems unlikely that breast cancer diagnosis could ever be 'protective' for ischemic heart disease. A more constrained model that does not allow for the possibility of HR < 1 could provide a more sensible estimate of this time-dependent HR.

      To the best of our knowledge, the inverse association between breast cancer and the long-term risk of ischemic heart disease is possible considering that some of the reproductive risk factors for breast cancer have protective effect on the risk of ischemic heart disease. We have now discussed about this in Discussion.

      In Discussion: “The long term lower risk of ischemic heart disease in breast cancer patients compared to age-matched women might be explained by the opposite role of reproductive factors in breast cancer and ischemic heart disease. Women with younger age at menarche and older age at menopause were associated with increased risk of breast cancer, while decreased risk of ischemic heart disease were found among these women (Collaborative Group on Hormonal Factors in Breast, 2012; Okoth et al., 2020).”

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2021-01118

      Corresponding author(s): Jun, Nakayama and Kentaro, Semba

      1. General Statements

      We are grateful to all of the reviewers for their critical comments and insightful suggestions that have helped us considerably improve our paper. As indicated in the responses that follow, we have taken all of these comments and suggestions into account in the revised version of our paper, including the supplementary information.

      In the revised manuscript, we focus on the existence of two cancer stem cell-like populations in TNBC xenograft model and patients. The response to each reviewer is described below.

      Sincerely,

      Jun Nakayama

      Kentaro Semba

      Department of Life Science and Medical Bioscience

      School of Advanced Science and Engineering

      Waseda University

      E-mail: junakaya@ncc.go.jp or jnakayama.re@gmail.com to JN

      ksemba@waseda.jp to KS

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): * **Summary:** Nakayama and colleagues use their previously developed automated tissue microdissection punching platform to perform spatial transcriptomics on a breast cancer xenograft model. Using transcriptomics on multiple clumps of 10-30 cells from different regions in a tumor and a lymph node metastasis they identified different cell-type clusters. Two of these clusters expressed different cancer stem cell markers. This led the authors to suggest that two distinct cancer stem cell(-like) populations may exist within one (breast) tumor, which could potentially make tumors more drug-resilient.

      **Major comments:** While the quality of the presented sequencing data is good and the manuscript is mostly written in a clear and accessible style, there are some concerns that limit the impact of this story. Most importantly, the manuscript in its present form does not convince me that the MDA-MB-231 xenografts indeed contain two distinct populations of cancer stem(-like) cells.

      1.The data obtained are not single cell data, which makes it difficult -if not impossible- to draw conclusions about presence of cancer stem cells. Each data point is the average of 10-30 cells, and the interpretation of the data is severely limited by this. How can the quantification of expression of CD44/MYC/HMGA1 in clumps of 10-30 cells teach us something about the stemness of tumor cells? *

      Answer: We would thank the comment. The reviewer’s suggestion is an important point; however, this is technical limitation of spatial transcriptomics technology. Most advanced spatial transcriptomics technologies, e.g. Visium (10x Genomics), also have the same problem. It means that our technology and the advanced technologies are technics to analyze gene expression and characteristics of tissues from 10-30 cells in each spot. Although high resolution spatial transcriptomics has been developed in 2021 [1], it is not generally used yet as described in the comment (Significance) from reviewer1.

      From our spatial analysis, we identified that CD44, MYC, and HMGA1 were expressed from human cancer cell. Their expression profiles were distinct among specific parts of the tumor section. To validate the existence of two types of cancer stem-like cells in TNBC tumors, we performed the additional analysis with the public scRNA-seq datasets of high-metastatic MDA-MB-23-LM2 xenograft model (GSE163210) [2]. This study performed scRNA-seq analysis of primary tumor and circulating tumor cells in MDA-MB-231-LM2 xenograft model. We analyzed it with Seurat/R (Figure A-1). As a result of reanalysis, HMGA1 and CD44 expression were confirmed at single-cell resolution (Figure A-2,3). These results verified the existence of two cancer stem cell-like populations (HMGA1-high, CD44-high) in MDA-MB-231 xenograft. Hence, the study of MDA-MB-231 xenograft supported our findings from spatial transcriptomics.

      Additionally, we performed the immuno-staining of sections using anti-CD44 antibody and anti-HMGA1 antibody as described in reviewer’s comment 5. As a result, CD44 and HMGA1 were detected in primary tumor sections. There were cells that express either CD44 or HMGA1 and cells that co-express both CD44 and HMGA1 (Figure B). We believe that our findings are solid results because the findings were also validated by other methods.

      In the revised manuscript, Figure A are incorporated as Figure 3B-E. Figure B is incorporated as Figure 3A. Hope our new results will be now accepted by the learned Reviewer and Editor.

      Figure A-1. Reanalysis of scRNA-seq of metastatic MDA-MB-231 xenograft

      Flowchart of the public single-cell RNA-seq (scRNA-seq) reanalysis using GSE163210 datasets.

      Figure A-2. UMAP plots of xenograft and CD44/HMGA1 expression

      UMAP plot of MDA-MB-231-LM2 xenograft tumors and circulating tumor cells (Left). Expression of CD44 and HMGA1 in the UMAP plot (Right).

      Figure A-3. Pie chart of CD44/HMGA1 positive cancer cells in MDA-MB-231 xenograft

      Pie chart of cancer stem cell-like population ratio in MDA-MB-231-LM2 xenografts.

      Figure B. Fluorescent immuno-staining of MDA-MB-231 primary tumor

      Representative images immunostained with CD44 and HMGA1 in primary tumor sections of the MDA-MB-231 xenograft model. Red: HMGA1, Green: CD44, and Blue: Nucleus. Scale bars, 20 μm (left), 10 μm (right). White arrows represent cancer cells that independently expressed or co-expressed.

      * 2.Furthermore, the authors should better explain their data analysis strategy with identification of gene expression profiles. It is unclear how they found CD44, MYC, and HMGA1 other than by cherry-picking from the list of cluster markers. *Answer: In this research, to identify the characteristics of clusters, we analyzed differentially expressed genes (DEGs) by ‘FindAllMarkers’ function of Seurat. As a result, ‘Cluster 0’ significantly expressed HMGA1 gene, and ‘cluster 1’ significantly expressed CD44. HMGA1 and CD44 are popular cancer stem cell markers in triple-negative breast cancer [3, 4]. In this study, we focus on metastasis-related genes and cancer stem cell markers (described in introduction section). Therefore, we focus on cancer-stem cell markers in the presented study. Cancer stemness is an important concept in cancer metastasis [5-7]. These results suggested that the existence of two cancer stem cell-like populations could potentially make tumors more drug-resilient in xenograft models and clinical patients.

      To improve the manuscript, we revised the description in the revised manuscript (Pages 5-6, Lines 97-105).

      * 3.Following up on the above point: I looked in the supplementary tables, but couldn't find MYC. How did the authors conclude that MYC is involved in cluster 1? In fact, when I ran a quick analysis in EnrichR, I saw that putative MYC target genes were strongly enriched among the markers in the HMGA1 cluster, but not the CD44/MYC. That's opposite to what I would expect. *__Answer: __We apologize for our confusing data and description. First, we found the expression of CD44 and HMGA1 in each cluster. Therefore, we performed the up-stream enrichment analysis using gene signatures of FindAllMakers by Metascape. From the result of enrichment analysis, we found the MYC activation in CD44 high-cluster; therefore, we named the cluster “CD44/MYC-high” cluster.

      To improve the manuscript, we revised the Figure2, Supplementary Table S3, and manuscript (Pages 5-6, Lines 103-106).

      * 4.All data were produced from 1 primary tumor and 1 metastasis. Thus, reproducibility and robustness of the methodology cannot be evaluated. The interpretation of the data could be strengthened when xenografts from at least 3 different mice are shown. *__Answer: __We would thank the suggestion. As the reviewer’s comment, we performed 1 primary tumor and 1 metastasis lesion from a transplanted mouse. Since this experiment take a long time, we tried to validate the findings by other methods (Figure A: scRNA-seq analysis of MDA-MB-231 xenografts, Figure B: Immuno-staining of MDA-MB-231 primary tumor, Figure C: scRNA-seq analysis of TNBC patients).

      First, we reanalyzed the public dataset which performed single-cell RNA-seq analysis of MDA-MB-231 xenografted tumor and circulating tumor cells in immunodeficient mice as shown in the answer to comment 1 (Figure A). Next, we performed the immuno-staining of sections using anti-CD44 antibody and anti-HMGA1 antibody as described in reviewer’s comment 5. As results, CD44 and HMGA1 were detected in primary tumor sections. There were cells that express either CD44 or HMGA1 and cells that co-express both CD44 and HMGA1 (Figure B). Next, we performed the reanalysis of 19 scRNA-seq samples from integrated 3 TNBC cohorts (Figure C-1). In a UMAP plot, differences between CD44-positive cancer cell and HMGA1-positive cancer cell were observed; however, these cells did not visually form the specific clusters (Figure C-2). CD44 and HMGA1 expressed globally in the UMAP plot, but CD44 makes some specific clusters (cluster at right side). Additionally, following the comment, we performed the population analysis in each patient (Figure C-3 and C-4). Detection of double-positive population in TNBC patients suggested that the population may be more undifferentiated cancer stem cells diving into both CD44-positive cells and HMGA1-positive cells.

      In addition, we reanalyzed primary tumors and metastasis lesions from other mice as a test trial sample (Figure D-1). The microspots including test trial samples showed 3 human clusters which were classified into CD44/MYC, HMGA1, and Marker-low clusters. We believe that our findings are solid results because the findings were also validated by other methods.

      In the revised manuscript, Figure A are incorporated as Figure 3B-E. Figure B is incorporated as Figure 3A. Figure C is incorporated as Figure 5. We only showed Figure D in the response to the reviewer’s comment. Hope our new results will be now accepted by the learned Reviewer and Editor.

      Figure C-1. Reanalysis of integrated TNBC patients scRNA-seq

      A flowchart of the reanalysis of a public scRNA-seq dataset. We downloaded GSE161529, GSE176078, and GSE180286 (scRNA-seq data of 19 TNBC patients). Integrated datasets were analyzed with Seurat. Log normalization, scaling, PCA and UMAP visualization were performed following the basic protocol in Seurat. To extract the cancer cells, cells expressing EPCAM/KRT8 (epithelial marker) were filtered. A UMAP plot of cancer cell from 19 TNBC patients (right).

      Figure C-2. CD44/HMGA1 expression in TNBC patients

      Expression analysis of CD44 (Expression level > 2) and HMGA1 (Expression level > 2) with UMAP plots.

      Figure C-3. CD44/HMGA1-positive cancer cell with UMAP plot

      UMAP plots of CD44-high, HMGA1-high, HMGA1/CD44-high, and Negative cancer cells.

      Figure C-4. Ratio of CD44/HMGA1-positive cancer cell in each patient

      The bar plot showed the ratio of cancer cells that expressed CD44 and HMGA1.

      Figure D-1. Analysis of microspots of MDA-MB-231 xenografts including test trial samples

      UMAP plots of CD44-high, HMGA1-high, and Marker-low clusters with test trial samples (2 primary tumors and 1 lung metastasis). ‘Primary tumor 1’ has 20 microspots, ‘Primary tumor 2’ has 24 microspots, and ‘lung metastasis’ has 7 microspots. Most microspots of lung metastasis failed extraction of RNA; therefore, these spots classified into Marker-low cluster.

      Figure D-2. Expression analysis of CD44, HMGA1, and MYC

      Feature plot of CD44-high, HMGA1-high, and Marker-low clusters with test trial samples.

      * 5.The only methodology is single cell RNA-sequencing. Immuno-staining on relevant markers such as CD44, MYC, HMGA1 plus human epithelium and cell cycle markers would provide strong additional support for the claims made by the authors, because it's a complementary technique and it allows quantification at single cell resolution. *__Answer: __We would thank the comment. As described in the responses to the reviewer’s comment 1 and 4, we performed the immuno-staining of sections using anti-CD44 antibody and anti-HMGA1 antibody as described in reviewer’s comment 5. As a result, CD44 and HMGA1 were detected in primary tumor sections. There were cells that express either CD44 or HMGA1 and cells that co-express both CD44 and HMGA1 (Figure B).

      In the revised manuscript, Figure B is incorporated as Figure 3A.

      * 6.Line 173-175. The marker-low cluster look to me simply like spots containing a relatively high amount of dead/dying (tumor) cells. The identity/state of cells in the marker-low cluster should be characterized and discussed more extensively. *__Answer: __We would thank the comment. This suggestion is important. In fact, total count of RNA in the Marker-low cluster decreased as compared to HMGA1-high and CD44/MYC-high (Supplementary Figure S1B). Additionally, Ttr-high mouse cluster also has low total count of RNA (Supplementary Figure S1C).

      Following the comment, we described that the Marker-low cluster and Ttr-high cluster have the possibility to include dead/dying cells (Page 13, Lines 268-279).

      * 7.Figure 5 and accompanying text in line 182-194; the authors try to infer cell-to-cell interactions using a previously published tool. However, any biological interpretation is lacking. What can be concluded from this analysis? *__Answer: __Initially, algorithms of cell-to-cell interaction were reported with previously published tool [8, 9]; however, in this manuscript, we originally conducted the code for cell-to-cell interaction with the interaction database of the Bader laboratory from Toronto University (https://baderlab.org/CellCellInteractions#Download_Data) as previously described [10, 11]. We aimed to estimate the cell-to-cell interaction in each spot (including 10-30 cells). We think that this analysis will be helpful for discovering the cancer stem cell niche and metastatic niche [6].

      However, in the revised manuscript, we focused on the existence of two cancer stem cell-like populations in TNBC xenograft and patients. Therefore, CCI analysis in previous Figure 5 moved to Supplementary Figure S7. Previous Figure 6 is removed from revised manuscript.

      * 8.Figure 6. Can the authors please explain more clearly what they mean by "PT" and "Mix" groups? I had a very hard time to understand what the data in figure mean. Again, an overall interpretation at the end (line 211) is lacking. *__Answer: __We apologize for the confusing result. We examined the combinations of human cancer cell cluster and mouse stromal cell cluster. To summarize, there are 10 combinations in the MDA-MB-231 xenograft. The combination groups in only primary tumor were named “PT”; on the other hand, the combination groups in both primary tumor and lymph-node metastasis were named “Mix”. These CCI analysis focused on cluster types of cancer cell and stromal cell. However, according to this revision, our presented study mainly focuses on the existence of two types of cancer stem cell-like population in TNBC xenograft and patients. Therefore, CCI analysis with cluster types was deleted from revised manuscript.

      In the revised manuscript, we focused on the existence of two cancer stem cell-like populations in TNBC xenograft and patients. Previous Figure 6 was removed from the revised manuscript.

      * 9.Figure 7. I like the effort to align the results with public scRNA-seq data. But although the expression of the cluster-signatures is heterogeneous, there is no evidence for distinct (CSC-like) cell populations. Why don't these HMGA1 vs CD44 signature cells cluster away from each other in the UMAPs? Perhaps the patient-to-patient heterogeneity overwhelms differences within tumors, but in that case the authors could re-run their analysis for each patient separately, to make 6 patient-specific UMAPs. In its present form, this analysis does not convince me that two distinct CSC(-like) populations within one TNBC exist. *Answer: We would thank the comment. To improve the quality of reanalysis of clinical cohorts, we performed the reanalysis of 19 scRNA-seq samples from integrated 3 TNBC cohorts (Figure C-1). In a UMAP plot, there are differences between CD44-positive cancer cells and HMGA1-positive cancer cells; however, these cells did not visually form the specific clusters (Figure C-2). CD44 and HMGA1 were expressed globally in the UMAP plot, but CD44 made some specific clusters (cluster at right side). Additionally, following the comment, we performed the population analysis in each patient (Figure C-3 and C-4). There is double-positive population in TNBC patients suggesting that this population may be more undifferentiated cancer stem cells, dividing into both CD44-positive cells and HMGA1-positive cells.

      In the revised manuscript, Figure C is incorporated as Figure 5.

      * **Minor comments:** 10.In the Supplemental table 2 noticed that many of the marker genes have adjusted P values well above 0.05 (and even above 0.1). That makes the statistical analysis rather weak. This could especially be problematic since the authors entirely base their main claims on this marker analysis, and I recommend that the authors use more stringent P-value cut-offs in the cluster analysis. *Answer: We would thank the comment. We reshaped the list of differentially expressed genes (DEGs). Significantly expressed genes (adjusted p-value In mouse clusters, the enrichment analysis using significantly DEGs showed that only Tcell-like clusters had a lot of enriched terms. Citric acid (TCA) cycle, chemical stress response, and fatty acid oxidation were enriched in Tcell-like populations (Page 7, Lines 141-144).

      In the revised manuscript, enrichment analyses are showed as Supplementary Figure S2 and S3B. We revised the sentence of enrichment analyses (Page 6, Lines 114-121), (Page 7, Lines 141-144). The network visualization of enrichment analysis was removed from the revised manuscript because this result did not support conclusions of the presented study.

      * 11.Line 129/130. If I look at figure 3A, I don't see this tendency that the authors describe. Can the authors provide statistical support or visual aid to make their claim more apparent to the reader? *__Answer: __We would thank the suggestion. Following the comment, we performed the statistical analysis of spot position. The spots were categorized outer side (tumor edge) and Inner site (Center of tumor) in the primary tumor section (Figure E-1 upside). We counted the spot numbers of the clusters (Figure E-1 table) and performed statistical test by chi-test. As a result, CD44/MYC clusters significantly resided at outer side of primary tumor (Figure E-1 barplot). On the other hand, the spots in lymph-node metastasis are not readily defined the outer or inner. In addition, cell cycle analysis in the primary tumor and lymph node metastasis was performed with statistical test. As a result, HMGA1-high cluster and CD44/MYC-high cluster significantly proliferated in the lymph node metastasis section (Figure E-2).

      Therefore, in the revised manuscript, we revised the sentence of spot position in lymph-node metastasis (Pages 8-9, Lines 159-172). Figure E-1 is incorporated as Figure 4D. Figure E-2 is incorporated as Figure 4F. Hope our new results will be now accepted by the Reviewer and Editor.

      Figure E-1. Statistical analysis of spot position

      Chi-test was performed by R. *p Figure E-2. Statistical analysis of cell cycle index

      Fisher’s exact test was performed by R. *p * 12.Line 217; shouldn't this be 6 patients? I see six clusters and in the original paper six patients are mentioned. *Answer: We would thank the comment. ‘6 patients’ is correct, we revised it. However, in the revised manuscript, we added integrated analysis of TNBC as shown in the answer to comment 9.

      Previous reanalysis of clinical scRNA-seq (previous Figure 7) was removed from the revised manuscript. The reanalysis using 3 integrated TNBC cohorts (Figure C) is incorporated as Figure 5.

      Reviewer #1 (Significance (Required)): * Conceptual/biological impact: Showing the existence of distinct populations of CSCs within one (breast-)tumor potentially has a high impact on the field of fundamental and translational cancer research. As the authors state, it could be one key reason underlying drug resistance. However, the technology used by the authors does in my view not allow to make such a claim. First and foremost because the technology does not allow analysis at single cell resolution.

      Technical impact: The platform used by the authors can be of interest for some applications, but they already published this in Scientic Reports a few years ago. I'm afraid that with the rapid recent developments in the field of spatial single cell transcriptomics (See for example Srivatsan et al Science 2021; 373: 111-117), the technical impact on the field is relatively low.

      Audience: Researchers in the field of cancer biology with an interest to perform low-cost molecular analysis at low-resolution spatial-resolved tissue specimens (transcriptomics, but perhaps expanded with bisulfite sequencing, or ATAC sequencing) could be interested in the technology presented in this manuscript.

      My expertise: single cell transcriptomics, (cancer) cell cycle, cancer drug resistance, cell plasticity, mouse models. *

      **Referee Cross-commenting** I have read the comments and align mostly with reviewer #2. The authors need to improve this manuscript a lot before it's suitable for publication in any of the Review Commons journals. Answer: We are grateful to the reviewers. As indicated in the responses that follow, we have taken all of these comments and suggestions into account in the revised version of our paper, including the supplementary information.

      *

      *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): * This manuscript uses spatial transcriptomics to perform single cell-like expression analysis between a breast cancer cell line and tumor microenvironment in mice xenografted with these cells. Unfortunately, from the title, abstract, and introduction, it is difficult to understand exactly what the authors are focusing and discussing. It is also unclear the advantage of their technique for evaluating the populations observed within this manuscript. Furthermore, there is very little explanation of the results, and it does not appear to be a scientific logical structure. Hence, this manuscript is not suitable for acceptance in the journal. In order to improve the scientific quality of this study, the following concerns are presented.

      **Major concerns:** 1.Is cell-cell interaction (CCI) analysis novel method? If so, please specify detail in the manuscript. If the basic concept and the principle of CCI analysis have not been published, please mention in the discussion section as a limitation that a manuscript on CCI analysis is under submission to the preprint. In addition, please revise the abstract and related text. *__Answer: __Initially, algorithms of cell-to-cell interaction were reported with previously published tool [8, 9]; however, in this manuscript, we originally conducted the code for cell-to-cell interaction with the interaction database of the Bader laboratory from Toronto University (https://baderlab.org/CellCellInteractions#Download_Data) as previously described [10, 11]. We aimed to estimate the cell-to-cell interaction in each spot (including 10-30 cells). We think that this analysis will be helpful for discovering the cancer stem cell niche and metastatic niche [6].

      However, in the revised manuscript, we focused on the existence of two cancer stem cell-like populations in TNBC xenograft and patients. Therefore, CCI analysis in previous Figure 5 is moved to Supplementary Figure S7. Previous Figure 6 are removed from the revised manuscript. We revised the description in the manuscript (Page 18, Lines 385-387).

      * 2.The reviewer thinks that spatial transcriptomics plays an important role in your manuscript. Please describe the technique in the introduction. *__Answer: __We would thank the comments. Following the comments, we described the spatial technics in Introduction section. We revised the manuscript (Page 4, Lines 63-65) (Page 12, Lines 250-253).

      * 3.The classification by expression profile (HMGA1, CD44/MYC and marker-low) lacks an explanation. Authors should mention in detail how these populations were extracted from breast cancer cell lines. *Answer: In this research, to identify the characteristics of clusters, we analyzed differentially expressed genes (DEGs) by FindAllmarkers function of Seurat. As a result, ‘Cluster 0’ significantly expressed HMGA1 gene, and ‘cluster 1’ significantly expressed CD44. Next, we performed the up-stream enrichment analysis using gene signatures of FindAllMakers by Metascape. From result of enrichment analysis, we found the MYC activation in CD44 high-cluster; therefore, we named the cluster “CD44/MYC-high” cluster.

      HMGA1 and CD44 are popular cancer stem cell markers in triple-negative breast cancer [3, 4]; therefore, we focus on cancer-stem cell marker in presented study. Cancer stemness is an important concept in cancer metastasis [5-7].These results suggested that the existence of two cancer stem cell-like populations could potentially make tumors more drug-resilient in xenograft model and clinical patient.

      To improve the manuscript, we revised the Figure2, Supplementary Table S2 and S4, and manuscript (Pages 5-6, Lines 97-106).

      * 4.The description of the results is back and forth and confusing. Please reconsider the flow of the analysis. *__Answer: __We would thank the comment. We reconsidered the description and structure of manuscript. In revised manuscript, we focused on the existence of two cancer stem cell-like populations in TNBC xenograft and patients.

      To improve the manuscript, we revised the Figure2 for examination of cluster characteristics by clustering and gene expression profiling. Figure 3 was revised for the validation of two cancer stem cell-like populations in TNBC xenograft model. Figure 4 was revised for the elucidation of spatial characteristics of each cluster. Figure 5 was revised for the validation of two cancer stem cell-like populations in TNBC patients.

      * 5.How did you evaluate the outsides of the samples with very different spot positions in Figure 3A? Please mention your evaluation method in a scientific manner. In particular, authors should clearly indicate the outer evaluation for the metastatic case. *

      Answer: We would thank the suggestion. Following the comment, we performed the statistical analysis of spot position. The spots were categorized outer side (tumor edge) and Inner site (Center of tumor) in primary tumor section (Figure E-1 upside). We counted the spot numbers of the clusters (Figure E-1 table) and performed statistical test by chi-test. As a result, CD44/MYC clusters significantly resided at outer side of primary tumor (Figure E-1 bar plot). On the other hand, the spots in lymph-node metastasis are not readily defined the outer or inner. In addition, cell cycle analysis in the primary tumor and lymph node metastasis was performed with statistical test. As a result, HMGA1-high cluster and CD44/MYC-high cluster significantly proliferated in the lymph node metastasis section (Figure E-2).

      Therefore, in the revised manuscript, we revised the sentence of spot position in lymph-node metastasis (Pages 8-9, Lines 153-172). Figure E-1 are incorporated as Figure 4D. Figure E-2 are incorporated as Figure 4F. Hope our new results will be now accepted by the Reviewer and Editor.

      Figure E-1. Statistical analysis of spot position

      Chi-test was performed by R. *p Figure E-2. Statistical analysis of cell cycle index

      Fisher’s exact test was performed by R. *p * 6.The spots in primary tumor have few counts derived from mouse stromal/immune cells, as shown in Figure S1A. Nevertheless, Figure 3C shows that mouse stromal/immune cells are evaluated in the same way in primary and metastatic sites. The reviewer thinks that the regions identified as Tcell-like in the metastatic site, where there are many mouse-derived counts, and in the primary, where there are few mouse-derived counts, do not have the same characteristics. If many mouse-derived counts were detected in a spot using the spatial transcriptomics, then there must be many mouse-derived cells in the spot. Please discuss how this expression is evaluated on this technique, which is not a single cell analysis. *__Answer: __We would thank the comment. The reviewer’s suggestion is an important point; however, this suggestion is technical limitation of spatial transcriptomics technology. Most advanced spatial transcriptomics technologies, e.g. Visium (10x Genomics), also have the same problem. It means that our technology and the advanced technologies are technics to analyze gene expression and characteristics of tissues from 10-30 cells in each spot.

      In this spatial transcriptome analysis of mouse genes, we first performed the log normalization and scaling. Since Seurat used variable features among the samples for single-cell or spot clustering, we extracted the variable features for detection of clusters using the ‘FindVariableFeatures’ function. PCA and clustering using only mouse genes was performed for detecting the neighboring samples. After the clustering of mouse spots, we identified the character of clusters by finding the gene signatures. As the indication by the reviewer, the detected RNA counts and features are different, so it is difficult to define the exact character and cell type of stromal cells. Theoretically, spatial transcriptomics could only detect some kinds of stromal cells expressing the T-cell marker gene in the spot. Therefore, we named the cluster as “Tcell-like”. Not all of the Tcell-like cluster have the same characteristics or cell types, but they certainly express T-cell marker genes. This is also a technical limitation of spatial transcriptomics. Spatial transcriptomics with higher resolution probably is able to detect the stromal cells as a single-cell resolution, such as the one developed in previous research [1].

      In the revised manuscript, we focused on the two types of cancer stem cell-like populations that were validated by other methods (scRNA-seq and Immuno-staining). As the method is not able to define the exact cluster characters, we moved CCI analyses to supplementary figures or removed partly.

      We also revised the discussion in the revised manuscript (Pages 13-14, Lines 279-283).

      * 7.Please explain how the gene symbols listed in Figure 4A were selected. Also, please indicate the characteristics of the gene groups that are not listed. *__Answer: __We selected the gene signature list from results of ‘FindAllMarker’ function in Seurat. ‘FindAllMarker’ function enables to extract the significantly expressed genes in each cluster. Heatmap in previous Figure 4A was drawn using these marker genes (Adjusted p-value 0.1). Highlighted genes in the heatmap have been reported as cancer-related genes or cell cycle-related genes.

      The genes used for drawing heatmap are shown in Supplementary Table S2 and S4.

      * 8.Please describe the details of the division and cycle index in lines 141-142. *__Answer: __Cell cycle index is a basic function of Seurat [12] (https://satijalab.org/seurat/archive/v3.1/cell_cycle_vignette.html). A list of cell cycle markers is loaded with Seurat. We can segregate this list into markers of G2/M phase and markers of S phase. We subjected this function into our spatial transcriptomics to estimate the cell cycle in each spot.

      We revised the description manuscript (Page 16, Lines 331-332).

      * 9.In Line 148-151, the expression and prognosis of TMSB10, CTSD, and LGALS1 is mentioned based on the previous reports. Aren't these findings the result of bulk? Is the HMGA1 cluster that the authors found involved in the prognosis of mice? Please clarify, as it is unclear what you want to discuss. *

      Answer: We apologize for our confusing data and description. These highlighted genes (TMSB10, CTSD, LGALS1, CENPK, and CENPN) were extracted as DEGs of human cancer clusters (Supplementary Table S2). Previously, these genes have been reported as cancer-related genes or cell cycle-related genes, described in the manuscript (Page 6, Lines 107-110). To show the other expressed genes in each human cluster, we focused on these genes in the manuscript.

      We extracted the gene signatures from DEGs and showed the gene signatures from HMGA1-high cluster correlated to poor prognosis in TNBC patients. Our data suggested that the HMGA1 signatures from the microspot resolution has the potential to be a novel biomarker for diagnosis, and HMGA1-high cancer stem cells may contribute to poor prognosis.

      In this revision, since we reperformed DEGs analysis with significant threshold; therefore, survival analysis was reperformed with novel gene signatures with METABRIC TNBC cohorts (Figure F).

      To improve the manuscript, we revised the description of DEGs extraction and heatmap (Page 6, Lines 106-112). Hope our Reviewer will approve this revised sentence.

      Figure F. Survival analysis with gene signatures of HMGA1-high and CD44/MYC-high

      Survival analysis of TNBC patients (claudin-low subtype and basal-like subtype) in METABRIC cohorts by the Kaplan-Meier method. (Left) Survival analysis with the expression of the HMGA1 signatures (High = 151, Low = 247). Shading along the curve indicates 95% confidential interval. Log-rank test, p = 0.012. (Right) Survival analysis with the expression of the CD44/MYC signatures (High = 333, Low = 65). Log-rank test, p = 0.079.

      * 10.Please provide details of all statistical tests used in this manuscript and describe significance levels used in the p-values and FDR. *__Answer: __We performed the extraction of differentially expressed genes (DEGs) by ‘FindAllMarkers’ function with MAST method. MAST method identifies differentially expressed genes between two groups of cells using a hurdle model tailored to scRNA-seq data [13]. Adjusted p-value is calculated based on Bonferroni correction using all features in the dataset. In spatial spot analysis, statistical analyses were performed by Chi-test and Fisher’s exact test.

      We revised materials and methods section in the manuscript (Page 19, Lines 391-394).

      * 11.Please mention CCI score (line 198). *Answer: As described in answer to comment 1, the algorithms of CCI score calculation were performed using previously published tool [8, 9]; however, we originally conducted the code for cell-to-cell interaction with the interaction database of the Bader laboratory from Toronto University (https://baderlab.org/CellCellInteractions#Download_Data). We extracted the genes whose expression value was greater than 2. We selected the combinations representing ligand__-__receptor interactions, in which both ligand genes and receptor genes were expressed in the same spot.

      We revised materials and methods section in the manuscript and Supplementary Legends (Page 18, Lines 385-387).

      * 12.Lines 204-206 and Figure 6G show specific interaction of ITGB1 and CST3, but it is unclear why only these molecules were extracted. What about the other molecules? At least ITGB1 is not scored in mix5. *Answer: We selected genes that have been reported as cancer-related ones in breast cancer to discuss the interactions in primary tumor and lymph-node metastasis. However, according to this revision, our presented study mainly focused on the existence of two types of cancer stem cell-like population in TNBC xenografts and patients. Therefore, CCI analysis with cluster types moved to supplementary Figure or some were not shown now.

      In the revised manuscript, previous Figure 6 is removed.

      * 13.HMGA1 signature appears in Line 214, please explain in detail. *__Answer: __As described in answer to comment 7, we selected the gene signature list from results of ‘FindAllMarker’ function. ‘FindAllMarker’ function enables to extract the significantly expressed genes in each cluster. HMGA1 signature genes were selected from significantly differentially expressed genes of HMGA1-high clusters.

      We revised the description in the revised manuscript (Pages 9-10, Lines 190-193).

      * 14.Authors should discuss how the previously reported bulk expression data used in Figure 7E can be linked to the single-cell-like analysis in this study. *__Answer: __Previous research reported that gene signatures extracted from specific clusters in scRNA-seq study have the potential to be a prognosis marker [14]. We showed the gene signatures from HMGA1-high cluster correlated to poor prognosis in TNBC patients. Our results suggested that the gene signatures from the resolution of microspot (10-30 cells) could have the potential to be prognosis markers. This punching microdissection system enables to extract only the parts of a section that are necessary for diagnosis of cancer and to analyze at low-cost. It could be applied to diagnostics instead of the laser-capture microdissection methods.

      We performed additional survival analysis with METABRIC cohorts. As described in this revision, since we reperformed DEGs analysis with significant threshold, survival analysis was reperformed with novel gene signatures with METABRIC TNBC cohorts (Figure F).

      In revised manuscript, Figure F were incorporated as Figure 6. The usefulness of gene signatures from microspot resolution was additionally discussed (Page 12, Lines 242-245, 250-253).

      * **Minor concerns:** 15.Please describe how the normalized centrality was calculated in UMAP algorithm and explain what this means in the results. __Answer: __The data showed that the expressional diversity in each cluster based on the network centrality of a correlational network with graph theory. The differences in the centrality among the clusters suggested expressional diversity in each (Supplementary Figure 4). Higher centrality represented lower expressional diversity and vice versa*. The detailed method for the calculation of centrality was previously shown to reveal the difference between smokers and never-smokers [10, 11].

      We added the description in the Legend (Pages 7-8, Lines 145-150).

      * 16.Please mention an explanation for the red X in Figure 1B to the legend. *__Answer: __The red X means failure spot for RNA extraction. We added the description in Figure 1B.

      * 17.Please spell out the abbreviations in all figure legends. *__Answer: __We added the abbreviations in the legends of all figures.

      * 18.Please explain what is meant by the color of the lines and the size of the circles in Figure 4D. *__Answer: __The network analysis was performed by Metascape (https://metascape.org/gp/index.html#/main/step1) [15]. The node size is proportional to the number of genes belonging to the term, and the node color represents the identity of the cluster. However, as described in the answer to reviewer’s comment 9, we reperformed enrichment analysis with significant DEGs. As a result, only CD44/MYC cluster had a lot of enrichment terms.

      Therefore, network visualizations were removed from the revised manuscript.

      * 19.Please mention an explanation for the color of the spots in Figure 5D and 5F to the legend. *__Answer: __The color showed the spots categorized into the selected group.

      In the revised manuscript, previous Figure 5 was incorporated as Supplementary Figure S7. We added the description in Supplementary Figure S7 and S8 with the legends.

      * 20.Is "S51" in Line 148 a typo for "S5A"? *Answer: Thank you. We revised “S5A”.

      * 21.Please mention an explanation for the bars in Figure 6D and 6F to the legend. *__Answer: __The bars showed relative CCI scores. As described below, we removed the results of CCI analysis with cluster group (previous Figure 6) in the revised manuscript.

      * 22.Please mention an explanation for the colors in Figure 7E to the legend. *__Answer: __The color showed patients’ group based on expression levels of gene signatures. We added the description in the Legend of Figure 6.

      *

      *

      Reviewer #2 (Significance (Required)): * The approach in Figure 5 is interesting, but the rest of the results do not take full advantage of the technology developed by the authors. The structure of the manuscript should be re-examined and new perspectives added. I look forward to the future of the authors' research.

      *

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): Microtissue transcriptome analysis of triple-negative breast cancer cell line MDA-MB-231 xenograft model using automated tissue microdissection punching techonology revealed that the existence of three cell-type clusters in the primary tumor and axillary lymph node metastasis. The CD44/MYC-high cluster showed aggressive proliferation with MYC expression, the HMGA1-high cluster exhibited HIF1A activation and upregulation of ribosomal processes. The cell-cell-interaction analysis revealed the interaction dynamics generated by the combination of cancer cells and stromal cells in primary tumors and metastases. The gene signature of the HMGA1-high cancer stem cell-like cluster has the potential to serve as a novel biomarker for diagnosis. The key conclusions are convincing. The data and methods are presented in a reproducible way. The experiments are adequately replicated and statistical analysis is adequate. Prior studies are appropriately referenced. The text and figures are clear and accurate. __Answer: __We would thank the valuable comments. As the reviewer mentioned, our findings showed that the existence of two cancer stem cell-like populations has the potential to make tumors more drug-resilient. Our results suggested that the gene signatures from the resolution of microspot (10-30 cells) could have the potential to be prognosis markers. This punching microdissection system enables to extract only the parts of a section that are necessary for diagnosis of cancer and to analyze at low-cost. It could be applied to diagnostics instead of the laser-capture microdissection methods.

      In this revision, we focused on the existence of two cancer stem cell-like populations in TNBC xenografts and patients. Following the other reviewer’s comments, we performed the extraction of DEGs with significant threshold; therefore, we revised the results of enrichment analysis but it did not influence our main findings.

      To validate the existence of two types of cancer stem-like cells in TNBC tumors, we performed the additional analyses (reanalysis of public scRNA-seq datasets and immuno-staining of MDA-MB-231 primary tumor). These results verified two cancer stem cell-like populations (HMGA1-high, CD44-high) in MDA-MB-231 xenograft and TNBC patients. We believe that our findings are solid results because the findings were also validated by other methods.

      Again, we would thank kind reviewing our manuscript.

      Reviewer #3 (Significance (Required)): * In the past several studies showed the heterogeneity of cell-cell interactions between cancer cells and stromal cells in situ (Andersson et al, 2021; Wu et al, 2021) and tumor microheterogeneity (Jiang et al, 2016; Liu et al, 2016; Zhang et al, 2020). Spatial transcriptomics methods are important to reveal microheterogeneity of cancer. As a physician working in gynecology and obstetrics in my opinion the results of the study and spatial transcriptomic methods could be relevant to detect new biomarkers for diagnosis and prognosis of breast cancer in future and to find novel therapeutic targets to overcome drug resistance and facilitate curative treatment of breast cancer.

      *

      References in response letter

      1. Srivatsan SR, Regier MC, Barkan E, Franks JM, Packer JS, Grosjean P, et al. Embryo-scale, single-cell spatial transcriptomics. Science. 2021;373(6550):111-7. Epub 2021/07/03. doi: 10.1126/science.abb9536. PubMed PMID: 34210887.
      2. Moravec JC, Lanfear R, Spector DL, Diermeier SD, Gavryushkin A. Cancer phylogenetics using single-cell RNA-seq data. bioRxiv. 2021:2021.01.07.425804. doi: 10.1101/2021.01.07.425804.
      3. Liu H, Patel MR, Prescher JA, Patsialou A, Qian D, Lin J, et al. Cancer stem cells from human breast tumors are involved in spontaneous metastases in orthotopic mouse models. Proc Natl Acad Sci U S A. 2010;107(42):18115-20. Epub 2010/10/06. doi: 10.1073/pnas.1006732107. PubMed PMID: 20921380; PubMed Central PMCID: PMC2964232.
      4. Pegoraro S, Ros G, Piazza S, Sommaggio R, Ciani Y, Rosato A, et al. HMGA1 promotes metastatic processes in basal-like breast cancer regulating EMT and stemness. Oncotarget. 2013;4(8):1293-308. Epub 2013/08/16. doi: 10.18632/oncotarget.1136. PubMed PMID: 23945276; PubMed Central PMCID: PMC3787158.
      5. Weiss F, Lauffenburger D, Friedl P. Towards targeting of shared mechanisms of cancer metastasis and therapy resistance. Nat Rev Cancer. 2022. Epub 2022/01/12. doi: 10.1038/s41568-021-00427-0. PubMed PMID: 35013601.
      6. Oskarsson T, Batlle E, Massagué J. Metastatic Stem Cells: Sources, Niches, and Vital Pathways. Cell Stem Cell. 2014;14(3):306-21. doi: https://doi.org/10.1016/j.stem.2014.02.002.
      7. Turdo A, Veschi V, Gaggianesi M, Chinnici A, Bianca P, Todaro M, et al. Meeting the Challenge of Targeting Cancer Stem Cells. Front Cell Dev Biol. 2019;7:16. Epub 2019/03/06. doi: 10.3389/fcell.2019.00016. PubMed PMID: 30834247; PubMed Central PMCID: PMC6387961.
      8. Armingol E, Officer A, Harismendy O, Lewis NE. Deciphering cell-cell interactions and communication from gene expression. Nat Rev Genet. 2021;22(2):71-88. Epub 2020/11/11. doi: 10.1038/s41576-020-00292-x. PubMed PMID: 33168968; PubMed Central PMCID: PMC7649713.
      9. Kumar MP, Du J, Lagoudas G, Jiao Y, Sawyer A, Drummond DC, et al. Analysis of Single-Cell RNA-Seq Identifies Cell-Cell Communication Associated with Tumor Characteristics. Cell Rep. 2018;25(6):1458-68.e4. Epub 2018/11/08. doi: 10.1016/j.celrep.2018.10.047. PubMed PMID: 30404002; PubMed Central PMCID: PMCPMC7009724.
      10. Watanabe N, Nakayama J, Fujita Y, Mori Y, Kadota T, Shimomura I, et al. Single-cell Transcriptome Analysis Reveals an Anomalous Epithelial Variation and Ectopic Inflammatory Response in Chronic Obstructive Pulmonary Disease. medRxiv. 2020:2020.12.03.20242412. doi: 10.1101/2020.12.03.20242412.
      11. Nakayama J, Yamamoto Y. Single-cell meta-analysis of cigarette smoking lung atlas. bioRxiv. 2021:2021.12.09.472029. doi: 10.1101/2021.12.09.472029.
      12. Stuart T, Butler A, Hoffman P, Hafemeister C, Papalexi E, Mauck WM, 3rd, et al. Comprehensive Integration of Single-Cell Data. Cell. 2019;177(7):1888-902.e21. Epub 2019/06/11. doi: 10.1016/j.cell.2019.05.031. PubMed PMID: 31178118; PubMed Central PMCID: PMC6687398.
      13. Finak G, McDavid A, Yajima M, Deng J, Gersuk V, Shalek AK, et al. MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data. Genome Biol. 2015;16:278. Epub 2015/12/15. doi: 10.1186/s13059-015-0844-5. PubMed PMID: 26653891; PubMed Central PMCID: PMCPMC4676162.
      14. Cheng S, Li Z, Gao R, Xing B, Gao Y, Yang Y, et al. A pan-cancer single-cell transcriptional atlas of tumor infiltrating myeloid cells. Cell. 2021;184(3):792-809.e23. Epub 2021/02/06. doi: 10.1016/j.cell.2021.01.010. PubMed PMID: 33545035.
      15. Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10(1):1523. Epub 2019/04/05. doi: 10.1038/s41467-019-09234-6. PubMed PMID: 30944313; PubMed Central PMCID: PMC6447622.
    1. Author Response

      Reviewer #1 (Public Review):

      The authors use ribosome profiling (RiboSeq) and RNA sequencing (RNASeq) to characterise the transcriptome and translatome of two PRRSV species as well as the host in response to infection. One particularly exciting feature of the study is that the analysis is carried out at different times of infection, which shows how both the virus and the host regulate their gene expression. The authors identify several new regulatory mechanisms of virus gene expression. Unexpectedly, they also find that the frameshifting efficiency at the ORF1ab frameshifting site changes with time. This contradicts the dogma in the field, which states that frameshifting is constant and has evolved to be constant to produce the a particular ratio of the two protein isoforms. The strength of the paper is in its comprehensible analysis. The paper is extremely rich in data, with 12 main and 23 Supplemental Figs and 11 Supplemental Tables, all of them rather complex. The main weakness is that it is written in a technical language that will be hardly readable by a non-specialist readership. Unfortunately, the authors do not make a good job in guiding the reader through their findings and hardly identify the the most important findings, while leaving the details to the specialists. This is particularly exemplified in Fig. 12, which should present the summary of the findings and would be extremely helpful, but hardly provides any text at all. This is potentially a very interesting paper, but the impact on the field could be increased considerably by better presentation of the work.

      We would like to thank this reviewer for the positive comments about the scientific findings, and for their suggestions for improving the presentation of the work. This outside perspective was very useful in helping us see which parts of the paper required clearer explanation or less detail, which can be hard to discern when very close to the work. We have incorporated all of this reviewer’s suggestions and we think this has improved the manuscript and made it easier to follow.

      Reviewer #2 (Public Review):

      The authors used the ribosome profiling technique to study gene expression at transcriptional and translational levels in the cells infected with porcine reproductive and respiratory syndrome virus (PRRSV-1 and PRRSV-2) using ribosome profiling. The ribosome profiling was carried out on the cells at different time points within the first 12 hours of infection, thus providing information on gene expression changes during the time of infection.

      The analysis of ribosome profiling data is exceptionally detailed and includes scrupulous characterization of footprint read lengths, de novo prediction of translated ORFs, characterisation of local pauses and differential gene expression of host and viral genes. The RNA-seq analysis is on par with that, the authors did a superb job at characterising the composition of the viral transcriptome that included identification of heteroclite RNAs and defective interfering RNAs. This provided the authors with reliable information for the interpretation of translational mechanisms responsible for the translation of ORFs discovered with ribosome profiling data.

      A specific focus of the manuscript was placed on the characterisation of two instances of ribosomal frameshifting occurring in PRRSVs. In addition to "canonical" -1 frameshifting at a slippery sequence stimulated by downstream RNA secondary structure (common to many viruses), PRRSVs genome contains an additional frameshifting site whose efficiency is stimulated by a viral protein. The authors demonstrated that the efficiency of this frameshifting is increasing over time which is expected since the concentration of stimulating protein is increasing. Furthermore, the authors found that the efficiency of "canonical" frameshifting is also changed. The authors describe this as surprising since it directly contradicts the common description of its function as "setting the fixed ratio" between the synthesized products upstream and downstream of the frameshift site. Perhaps it is not so surprising in the hindsight, given that the frameshifting is dependent on so many different factors, folding states of RNA pseudoknots which are dynamic, ribosome density upstream, etc. it would be more surprising if the efficiency of frameshifting were indeed fixed. I think the "fixed ratio" was proposed mainly to draw a difference to ribosomal frameshifting occurring in cellular genes (like antizyme or bacterial release factor 2) where there seems to be only one functional product, but its synthesis level depends on the efficiency of frameshifting sensing certain conditions. It is great though that the authors observed such changes and I agree with the authors' speculations that this is unlikely to be unique to PRRSVs.

      While I found the work to be largely descriptive, the authors did not shy away from speculating about potential mechanisms responsible for observed regulation. The manuscript is hard to get through simply due to its large length and a lot of data, but reading it is rewarding.

      Again, we would like to thank this reviewer for their positive comments about the work, and to reiterate that hopefully the revised version of the manuscript will be easier to read.

      Reviewer #3 (Public Review):

      The manuscript by Cook et al. describes the first comprehensive gene expression analysis of two species of PRRSV, an important agricultural pathogen. Using ribosome profiling and RNA-sequencing, the authors systematically analyze the transcriptome of the virus and its translation, and their temporal kinetics. The analysis revealed non-canonical RNA species that are suggested to contribute to translation of parts of ORF1ab, changing the stoichiometry between the NSPs. In addition, the authors use the ribosome profiling data to identify novel overlapping ORFs, including a conserved uORF in the 5' leader, and to analyze the efficiency of frame-shift in two sites in the viral genome, one of which is trans-regulated by the viral nsp1β. The frame-shift efficiency in both sites is presented to be increasing late in infection. The authors also present conservation analysis from hundreds of available genomes. Finally, analysis of host gene expression uncovers a pattern suggesting translation inhibition of induced transcripts, and by comparing a WT virus to a mutant virus lacking the nsp2 site frame-shift, the authors identify a gene (TXNIP) whose expression is affected by nsp2TF.

      In this rigorous work, the authors uncover new insights on an important pathogen, which can be of value to the wider field of virology. However, due to technical issues a few of the authors claims may require reconsideration.

      We are grateful to this reviewer for their comments on the rigour and the impact of the work, as well as the suggestions for improvement which they included in their more detailed review. Within the detailed review, this reviewer expressed some concerns that ribosome run-off (seen in Figure 1—figure supplement 1 [formerly Supplementary Figure 1]) might confound the comparison of ribosome densities in different regions of the viral genome (particularly ORF1ab). However, this run-off only noticeably affects the first ~100 nt of host CDSs, which is very small compared to the ~12,000 nt total length of ORF1ab. The regions of ORF1ab in which we compare ribosome density in our study are almost all > 1,000 nt downstream of this ~100 nt run-off region and will therefore not be significantly affected by run-off. The exception to this is our assessment of heteroclite sgRNA translation, where the “heteroclite” region does include the first ~100 nt of ORF1a. As such, run-off may have a slight effect on this analysis, but we expect this to be minor, as the ~100 nt run-off region represents only a small proportion of the 1,550-nt “heteroclite” region. Further, any such effect would actually lead to under-estimation of heteroclite sgRNA translation, by artefactually reducing the relative RPF density in the heteroclite region. This would therefore strengthen our conclusion that our data provide evidence for heteroclite sgRNA translation.

    1. Author Response

      Reviewer #2 (Public Review):

      Romand et al investigates the role of hyperphosphorylated guanosine nucleotides (ppGpp) in acclimation of plant chloroplasts to nitrogen limitation. The signaling role of ppGpp as alarmone is well established in the stringent response of bacteria. The stringent response allows bacteria to adapt to amino acid or carbon starvation and other acute abiotic stress conditions by downregulation of resource-consuming cell processes. A series of studies, including the current one, have demonstrated the retention of the bacterial-type ppGpp-mediated signaling response in plant and algal chloroplasts. The current study convincingly demonstrates the involvement of ppGpp in remodeling of photosynthetic machinery under nitrogen limitation. Using three Arabidopsis RSH lines (two underaccumulators and one overaccumulator of ppGpp), the authors show that the ppGpp is required for preventing excess ROS accumulation, oxidative stress and death of cotyledons under nitrogen limiting condition. The authors show a transient accumulation in ppGpp upon nitrogen limitation, which is followed by a sustained increase in the ratio of ppGpp to GTP. There is a prompt decline in maximum photochemical efficiency of photosystem II (PSII) and linear electron transport under nitrogen deficiency in wild type and ppGpp overaccumulator plants. However, mutants with low amount of ppGpp have a delayed decrease in these photosynthetic parameters. PpGpp is further shown to decrease (or degrade) photosynthetic proteins, and a remodeling of PSII that involves uncoupling of LHC II from the reaction center core has been suggested to occur under nitrogen starvation. The authors also show a ppGpp-mediated downregulation of chloroplast gene transcription and a coordinated plastid-nuclear gene expression under nitrogen deficiency.

      Strengths 1. The conclusions of this paper are mostly well supported by data. With three different RSH lines, there is a convincing demonstration of the specific involvement of ppGpp in nutrient acclimation. The line carrying conditional overexpression of Drosophila ppGpp hydrolase (MESH) nicely complements the RSH lines and strengthens many of the conclusions. This is a detailed analysis of ppGpp function in a plant species. The data supplement accompanying each main figure is extensive and helpful. 2. The genomic analysis in nitrogen replete and deplete wild type uncovers an interesting regulation of RSH enzymes at the transcriptional level. This is likely to be part of a signaling response that works in conjunction with allosteric modulation of RSH activity under nitrogen limitation. 3. The large-scale analysis of plastid and nuclear gene transcripts supports the involvement of ppGpp in coordinated repression of plastid and nuclear gene transcription. 4. By the inclusion of mitochondrial genes and proteins in their analysis, the authors clearly show that the ppGpp action is limited to plastids and does not extend to mitochondria, which like chloroplasts, have a bacterial ancestry. 5. The thorough demonstration of the involvement of ppGpp in low nitrogen acclimation of photosynthetic metabolism adds greatly to the understanding of plant abiotic stress tolerance mechanisms and ppGpp function in both plants and bacteria.

      We thank the reviewer for these observations on our work.

      Weaknesses: 1. With two earlier reports from a different laboratory (Maekawa et al 2015 and Honoki et al 2018) showing the involvement of ppGpp in acclimation to nitrogen deficiency, the novelty of the current study is diminished. The authors mention that the double mutant (rsh2 rsh3) used by Honoki et al does not show a clear phenotype other than a delay in Rubisco degradation. It is not clear to me why the lack of two major RSH isoforms, involved in synthesis of ppGpp under light, would not produce any phenotype. This discrepancy should be discussed further in the manuscript.

      The work of Maekawa et al., 2015 and Honoki et al., 2018 was indeed important for highlighting the potential involvement of ppGpp in the acclimation to nitrogen deficiency. However, these studies were based on the constitutive overaccumulation of ppGpp. Here, we demonstrate a physiological requirement for ppGpp signalling by the plant to allow acclimation to an abiotic stress- we consider this to be a major step forwards in understanding the role of ppGpp in plants, and one of the few examples of a physiological requirement for ppGpp in plants.

      We mention the use of an RSH2 RSH3 mutant by Honoki et al. 2018 while putting our results into the context of previous findings in the discussion. We bring the attention of the reviewer to our analysis of an RSH2 RSH3 mutant in this study, and that in our hands the mutant phenotype was indistinguishable from the RSH quadruple mutant (rshQM) (Figure 2- figure supplement 1 panel B). Therefore, we do indeed consider that RSH2 and RSH3 are the main RSH isoforms involved in ppGpp-mediated acclimation to nitrogen deficiency, and we state this ( see p7 l161-164 in original manuscript). As we explain in the discussion there are probably technical reasons for the discrepancy with the results reported by Honoki et al. 2018. We also note here that the RSH2 RSH3 mutants used in our study and by Honoki et al. 2018 are not identical: the same SAIL insertion SAIL_305_B12 was used for rsh2, while the rsh3 allele used by Honoki was the GABIkat insertion GABI129D02 and here SAIL_99_G05). We now add this difference in the genetic identity of the mutants as an additional potential explanation for the different findings in the two studies.

      1. The authors at times show a tendency to overinterpret their results. A ppGpp-mediated repression of chloroplast transcription and translation is sufficient to explain most of the observations in this study. However, the authors seem to go beyond this simple explanatory framework by invoking specific roles for ppGpp in remodeling of PSII antenna-core interaction and in blocking of PSII reaction center repair. There is no data in the manuscript in support of these two propositions. A coordinated decrease in synthesis of most chloroplast proteins, including the D1 reaction center protein of PSII, is sufficient to explain the decrease in Fv/Fm. There is no evidence in the manuscript for "photoinactivation gaining an upper hand via ppGpp-mediated signaling"

      The circuit breaker analogy of PSII photoinhibition that the authors discuss in support is just an interpretation. The remodeling of PSII antenna-core interaction, likewise, could be a simple consequence of the ppGpp-mediated decrease in D1 protein synthesis. The high antenna-core ratio under nitrogen starvation likely reflects the lag in the decrease of LHCB1 (which eventually decreases significantly by day 16).

      Since ppGpp-signaling primarily affects plastid transcription and translation, there is a rapid decrease in plastid psbA gene product (D1) relative to the nuclear-encoded LHCB1. The unconnected LHCII might simply be a result of the mismatch in antenna-core stoichiometry rather than an active regulation of PSII functional assembly by ppGpp.

      We have re-worked the discussion to make these points more clearly, and also to tone down certain points where we may have over-stretched our interpretation.

      We think that our interpretation is essentially the same as the reviewer’s- the ppGpp mediated inhibition of chloroplast translation and transcription is sufficient to explain the majority of our results. In the discussion we also discuss the possibility that ppGpp stimulates the active degradation of some chloroplast proteins, and put this in context of studies showing that N-starvation activates the specific proteolysis of certain photosynthetic proteins in Chlamydomonas and has an effect on the half lives of different chloroplast proteins in plants. We do not propose or present data suggesting that ppGpp has any other specific targets/effectors- for example within the PSII repair cycle or in remodelling PSII stoichiometry- although we also cannot exclude the possibility of targets in these processes.

      We think that the ppGpp dependent change in PSII stoichiometry during N-starvation is not just a side effect of a general downregulation or a temporary mismatch as suggested- but due to its size, persistence and effect on photosynthesis is likely to be part of the acclimation process. For example, the ppGpp-dependent drop in Fv/Fm is maintained at day 16 and even beyond (Fig 2D). We also see that photosynthetic proteins are still degraded in low ppGpp mutants (Fig. 3A), but that the high Fv/Fm is maintained throughout. These points and the fact that the alteration of PSII stoichiometry is not caused by the direct action of ppGpp on PSII (but via transcription/translation) does not mean that it is not important or does not play a role in acclimation. Other studies report that PSII RC inactivation can protect PSI (e.g. Tikkanen et al. 2014) and ppGpp may be working in a similar fashion here by reducing the flow of energy into the photosynthetic electron transport chain. This interpretation is consistent with our results showing that wild-type plants and high ppGpp plants (rsh1-1) accumulate less ROS and ROS-related damage than plants defective in ppGpp biosynthesis (Fig. 1).

      1. The work is mostly descriptive of the involvement of ppGpp in low nitrogen tolerance without any data on how the nitrogen deficiency is sensed by the RSH enzymes and how ppGpp orchestrates the multi-faceted acclimatory response. Perhaps, these aspects are beyond scope of the current manuscript, but they could be discussed more.

      We agree that these are very important questions, and also that they are out of the scope of the current work. We think that our work goes beyond the descriptive by demonstrating the physiological functions of ppGpp-signalling during nitrogen deficiency and a framework for how it occurs (i.e downregulation of chloroplast function and avoidance of excess oxidative stress).

      Reviewer #3 (Public Review):

      The manuscript by Romand et al. explores the role of guanosine penta- and tetraphosphate, ppGpp, in the acclimation of plants to nitrogen limitation. It shows that an early and transient ppGpp accumulation - and a controlled ppGpp/GTP ratio - is necessary for a proper acclimation of plants to such stress. The pathway is shown to act on remodeling the photosynthetic machinery and downregulating photosynthesis during stress, thus limiting ROS damage to the plants. This regulation most likely takes place by affecting chloroplast transcription, maintaining the balance between nucleus- and chloroplast-encoded proteins.

      The manuscript proposes a thorough analysis of the ppGpp-induced response including extensive wild type and mutant analyses at the gene and protein expression level as well as at the physiological level under nitrogen limitation together with heterologous expression of ppGpp hydrolase from Drosophila. The conclusions are carefully backed by the data (but for the lack of gene expression analysis in the high ppGpp line, rsh1-1), the figures and text clear, well-written and easy to follow. Altogether it represents a solid new step in improving the comprehension of plant response to nitrogen limitation, as well as on the role of ppGpp in plants and possibly throughout the green lineage. An alternative hypothesis to ppGpp photoprotective role could be discussed in that photoprotection may be an indirect effect due to photosynthetic protein degradation enabled by ppGpp, possibly through modulation of ppGpp/GTP ratio affecting chloroplast protease activity.

      On this last point we agree with the reviewer- our data indicates that the photoprotective role of ppGpp is via the ppGpp-dependent control of the abundance of photosynthetic proteins. This is indirect in the sense that we have no evidence that ppGpp itself interacts with components of the photosynthetic machinery. However, as discussed below we do not think that photoprotection is just a side-effect of ppGpp’s action- we show that the capacity to synthetise ppGpp is required for avoiding the generation of ROS and tissue death.

    1. Author Response

      Reviewer #1 (Public Review):

      In this paper, the authors examine the role of feedback from primary visual cortex (V1) to the dorsolateral geniculate nucleus of the thalamus (dLGN) under a variety of visual stimulus conditions. This is a well-defined circuit originating from a specific population of Layer 6 cells in the cortex, and the authors test the role of this projection by recording in dLGN during silencing of V1 via ChR2 expression in PV inhibitory cells. This is a well-established technique for strong silencing of cortex. However, because there are other disynaptic pathways from V1 to thalamus, they also perform a similar set of experiments using more targeted optogenetic inhibition of a genetically-defined class of Layer 6 (NTSR1) cells that make up most of the L6 corticothalamic projections. The fact that these experiments elicit similar results supports their interpretation that these direct projections are largely responsible for the observed results. While previous studies have manipulated corticothalamic projections pharmacologically, via V1 lesions, or via optogenetics, the authors rightly point out that most previous studies have focused on simple parametric stimuli and/or have been performed in anesthetized animals. The results of this study suggest feedback during natural visual stimuli and locomotion reveal effects that are distinct from these previous studies.

      Overall, these are important and carefully-performed experiments that significantly advance our understanding of the role of corticothalamic feedback to the dLGN.

      We thank the reviewer for the appreciation of our methods and results.

      The authors suggestion that the different effects observed during simple and complex stimuli may be due to increased surround suppression during the full-field gratings seems reasonable, but I didn’t understand how the analysis of blank periods during these two conditions supported this argument. It wasn’t clear to me what mechanisms would be expected to support the alternative outcome, where suppressing feedback during the blank periods interleaved with the two different stimuli would have different effects - unless they are testing whether natural movies elicit some longer-lasting state change that would change the results observed during blank periods. This seems somewhat implausible, and unless the authors wish to expand the study to include different stimulus sizes, I think the interpretation regarding surround suppression is best left to the discussion, where it is already treated well.

      We thank the reviewer for the recommendation. We fully agree that explaining the difference in CT feedback across blanks, gratings, and movies will require more experiments. We have followed the recommendation of the reviewer and removed the interpretation related to differences in surround suppression from the results section and treat it now in the discussion only.

      The paper would benefit from more clearly highlighting results that agree or disagree with previous studies, with a brief mention of how the authors interpret these similarities or differences. For example the results of Olsen et al 2012 seem to be consistent with what the authors observe here with gratings but not with natural movies, and although Olsen et al performed some awake recordings, I think the LGN recordings were all under anesthesia. Specifically highlighting these differences (and suggesting an interpretation for them) would help emphasize the novelty of the study.

      We thank the reviewer for the recommendation and now highlight throughout the results and discussion where our results agree or disagree with previous studies. As mentioned by the reviewer, we have similar results for gratings to the results obtained by Olsen et al. (2012), although in our study we have not explicitly centered the full field gratings on the RFs and we have not measured surround suppression. The results for the blank stimuli and the movies, however, are different, at least in terms of how CT feedback affects ring rate. A key insight of our study, at least in our view, is that CT feedback effects might well differ for different stimuli, and understanding the underlying mechanism (e.g., differential engagement of the excitatory and indirect inhibitory CT feedback pathway) will be an important avenue of research in the future.

      The authors should comment more on the spatial extent of V1 silencing and potential effects of the variability observed across mice, especially given that they appear to have made only a single injection of ChR2 to label PV cells. While silencing with this method extends beyond the injection site, it probably doesn’t cover all of V1. Was any analysis done of variability across mice based on the size or location of the ChR2 expression measured post-hoc?

      Unfortunately, we did not preserve enough slices to precisely quantify the extent of expression across animals. However, visual inspection of the slices revealed that even a single injection typically resulted in a widespread pattern of expression. In fact, we think that activation of PV neurons was determined in its spatial extent not so much by the virus expression but rather by the photoactivation light. With a distance of 0.5 0.1 mm of the optical fibre from the cortical surface, most of V1 was covered by light. A previous study performing a quantitative characterization of the lateral spread of optogenetic suppression by PV activation demonstrates that pyramidal neuron ring can be suppressed 2 3 mm from the laser center Li et al. (2019). Hence, we think that variability in opsin expression across mice is unlikely to have a substantial impact on our results.

      The decrease in reliability and sparseness during running is attributed partially to increased eye movements. In cortex this has been studied in awake animals with natural movies in a variety of studies where the opposite effects are observed including Froudarakis et al 2014 where there was a small increase in both metrics during running, and Reimer et al 2014 where reliability strongly increased during pupil dilation. If there is enough data to condition on running periods where eye movements are stable or dilation outside of running to measure the effects of feedback suppression during these periods, this would be useful information.

      We thank the reviewer for bringing up this interesting issue. We fully agree that our results recorded in dLGN are different from those measured by Froudarakis et al. (2014) and Reimer et al. (2014) in V1.

      As suggested by the reviewer, we have repeated the analysis proposed by Reimer et al. (2014) to identify periods in the movie with the most rapid pupil dilation / constriction in face of continuous changes in overall luminance. Besides the effects of pupil dilation / constriction on ring rate, we have computed reliability both according to what we had used throughout our manuscript and in the way proposed in Reimer et al. (2014), which resembles our measure of SNR. We find that both measures of reliability are unaffected by pupil dilation.

      Interestingly, in the meantime other studies have also reported that reliability might be differently affected by behavioral state in V1 compared to dLGN. For instance, Nestvogel and McCormick (2022) found that consistent with our results variability of membrane potential in visual thalamic neurons was not significantly altered by locomotion or whisker movement.

      Reviewer #2 (Public Review):

      Spacek et al. study the corticothalamic feedback of different visual stimuli on visual thalamus. With optogenetic suppression of visual cortex feedback and simultaneous multi-channel recordings in visual thalamus, the authors succeeded to acquire important data about this essential feedback loop in awake, behaving animals. The authors show in detail that the cortical feedback acts as a gain factor in thalamus for the transmission of signals from retina to cortex. They also show that naturalistic scenes result in robust feedback from cortex. As expected from anatomy, the authors find that modulatory feedback from cortex and modulatory input from brain stem act rather independently on thalamus. The paper is technically very impressive and the results are important for a wide range of readers.

      We thank the reviewer for the positive feedback.

      It is advisable to revise the Introduction and Discussion to better integrate the new findings into the existing literature.

      We thank the reviewer for this advice, and have revised the title, abstract, introduction and discussion to better integrate our new findings into the existing literature, and highlight our advances in relation to previous findings.

      The authors distinguish between awake, resting state and running state. However, the awake, resting state in mice comprises a wide range of alertness levels. This range of alertness will most likely affect the bursting probability of thalamocortical neurons.

      We thank the reviewer for this comment. So far, our manuscript had only taken locomotion as a proxy for behavioral state, as locomotion typically goes along with increased pupil size (Erisken et al., 2014; McGinley et al., 2015) and increased levels of arousal (McGinley et al., 2015; Vinck et al., 2015). To also study the effects of locomotion-independent arousal, we have now applied the analysis mentioned by the reviewer: following methods originally suggested by Reimer et al. (2014), we identified periods of the movie presentation without locomotion that corresponded to the upper or the lower quartile of pupil size change. Similar to the results that Reimer et al. (2014) found for primary visual cortex, we observed that ring rate in dLGN is enhanced during times when the pupil was dilating faster than usual vs. when it was constricting faster than usual. Like the effects of running, the modulations by pupil-indexed arousal persisted even with V1 suppression. We present these new results in Figure 5 - Supplement 2.

    1. Author Response

      Reviewer #1 (Public Review):

      In this manuscript, the authors find CpGs within 500Kb of a gene that associate with transcript abundance (cis-eQTMs) in children from the HELIX study. There is much to admire about this work. With two notable exceptions, their work is solid and builds/improves on the work that came before it. Their catalogue of eQTMs could be useful to many other researchers that utilize methylation data from whole blood samples in children. Their annotation of eQTMs is well thought out and exhaustive. As this portion of the work is descriptive, most of their methods are appropriate.

      Unfortunately, their use of results from a model that does not account for cell-type proportions across samples diminishes the utility and impact of their findings. I believe that their catalog of eQTMs contains a great deal of spurious results that primarily represent the differences in cell-type proportions across samples.

      Lastly, the authors postulate that the eQTM gene associations found uniquely in their unadjusted model (in comparison to results from a model that does account for cell type proportion) represent cell-specific associations that are lost when a fully-adjusted model is assumed. To test this hypothesis, the authors appear to repurpose methods that were not intended for the purposes used in this manuscript. The manuscript lacks adequate statistical validation to support their repurposing of the method, as well as the methodological detail needed to peer review it. This section is a distraction from an otherwise worthy manuscript. But provide evidences that enriched for cell sp CpGs.

      Major points

      1. Line 414-475: In this section, the authors are suggesting that CpGs that are significant without adjusting for cell type are due to methylation-expression associations that are found only in one cell type, while association found in the fully adjusted model are associations that are shared across the cell types. I do not agree with this hypothesis, as I do not agree that the confounding that occurs when cell-type proportions are not accounted for would behave in this way. Although restricting their search for eQTMs to only those CpGs proximal to a gene will reduce the number of spurious associations, a great deal of the findings in the authors' unadjusted model likely reflect differences in cell-type proportions across samples alone. The Reinius manuscript, cited in this paper, indicates that geneproximal CpGs can have methylation patterns that vary across cell types.

      Following reviewers’ recommendations, we have reconsidered our initial hypothesis about the role of cellular composition in the association between methylation and gene expression. Although we still think that some of the eQTMs only found in the model unadjusted for cellular composition could represent cell specific effects, we acknowledge that the majority might be confounded by the extensive gene expression and DNA methylation differences between cell types. Also, we recognize that more sophisticated statistical tests should be applied to prove our hypothesis. Because of this, we have decided to report the eQTMs of the model adjusted for cellular composition in the main manuscript and keep the results of the model unadjusted for cellular composition only in the online catalogue.

      1. Line 476-488: Their evidence due to F-statistics is tenuous. The authors do not give enough methodological detail to explain how they're assessing their hypothesis in the results or methods (lines 932-946) sections. The methods they give are difficult to follow. The results in figure S19A are not compelling. The citation in the methods (by Reinius) do not make sense, because Reinius et al did not use F-statistics as a proxy for cell type specificity. The citation that the authors give for this method in the results does not appear to be appropriate for this analysis, either. Jaffe and Irizarry state that a CpG with a high Fstatistic indicates that the methylation at that CpG varies across cell type. They suggest removing these CpGs from significant results, or estimating and correcting for cell type proportions, as their presence would be evidence of statistical confounding. The authors of this manuscript indicate that they find higher F-statistics among the eQTMs uniquely found in the unadjusted model, which seems to only strengthen the idea that the unadjusted model is suffering from statistical confounding.

      We recognize the miss-interpretation of the F-statistic in relation to cellular composition. We have deleted all this part from the updated version of the manuscript.

      1. The methods used to generate adjusted p-values in this manuscript are not appropriate as they are written. Further, they are nothing like the methods used in the paper cited by the authors. The Bonder paper used permutations to estimate an empirical FDR and cites a publication by Westra et al for their method (below). The Westra paper is a better one to cite, because the methods are more clear. Neither the Bonder nor the Westra paper uses the BH procedure for FDR.

      Westra, H.-J. et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat. Genet. 45, 1238-1243 (2013).

      We apologize for this misleading citation. Although Bonder et al applied a permutation approach to adjust for multiple testing, our approach was inspired by the method applied in the GTEx project (GTEx consortium, 2020), using CpGs instead of SNPs. The citation has been corrected in the manuscript. Moreover, we have explained in more detail the whole multiple-testing processes in the Material and Methods section (page 14, line 316):

      “To ensure that CpGs paired to a higher number of Genes do not have higher chances of being part of an eQTM, multiple-testing was controlled at the CpG level, following a procedure previously applied in the Genotype-Tissue Expression (GTEx) project (Gamazon et al., 2018). Briefly, our statistic used to test the hypothesis that a pair CpGGene is significantly associated is based on considering the lowest p-value observed for a given CpG and all its pairs Gene (e.g. those in the 1 Mb window centered at the TSS). As we do not know the distribution of this statistic under the null, we used a permutation test. We generated 100 permuted gene expression datasets and ran our previous linear regression models obtaining 100 permuted p-values for each CpG-Gene pair. Then, for each CpG, we selected among all CpG-Gene pairs the minimum p-value in each permutation and fitted a beta distribution that is the distribution we obtain when dealing with extreme values (e.g. minimum) (Dudbridge and Gusnanto, 2008). Next, for each CpG, we took the minimum p-value observed in the real data and used the beta distribution to compute the probability of observing a lower p-value. We defined this probability as the empirical p-value of the CpG. Then, we considered as significant those CpGs with empirical p-values to be significant at 5% false discovery rate using BenjaminiHochberg method. Finally, we applied a last step to identify all significant CpG-Gene pairs for all eCpGs. To do so, we defined a genome-wide empirical p-value threshold as the empirical p-value of the eCpG closest to the 5% false discovery rate threshold. We used this empirical p-value to calculate a nominal p-value threshold for each eCpG, based on the beta distribution obtained from the minimum permuted p-values. This nominal p-value threshold was defined as the value for which the inverse cumulative distribution of the beta distribution was equal to the empirical p-value. Then, for each eCpG, we considered as significant all eCpG-Gene variants with a p-value smaller than nominal p-value.”

      References:<br /> GTEx consortium, The GTEx Consortium atlas of genetic regulatory effects across human tissues, Science (2020) Sep 11;369(6509):1318-1330. doi: 10.1126/science.aaz1776.

      Reviewer #2 (Public Review):

      Strength:

      Comprehensive analysis Considering genetic factors such as meQTL and comparing results with adult data are interesting.

      We thank the reviewer for his/her positive feedback on the manuscript. We agree that the analysis of genetic data and the comparison with eQTMs described in adults are two important points of the study.

      Weakness:

      • Manuscript is not summarized well. Please send less important findings to supplementary materials. The manuscript is not well written, which includes every little detail in the text, resulting in 86 pages of the manuscript.

      Following reviewers’ comments, we have simplified the manuscript. Now only the eQTMs identified in the model adjusted for cellular composition are reported. In addition, functional enrichment analyses have been simplified without reporting all odds ratios (OR) and p-values, which can be seen in the Figures.

      • Any possible reason that the eQTM methylation probes are enriched in weak transcription regions? This is surprising.

      Bonder et al also found that blood eQTMs were slightly enriched for weak transcription regions (TxWk). Weak transcription regions are highly constitutive and found across many different cell types (Roadmap Epigenetics Consortium, 2015). However, hematopoietic stem cells and immune cells have lower representation of TxWk and other active states, which may be related to their capacity to generate sub-lineages and enter quiescence.

      Given that we analyzed whole blood and that ROADMAP chromatin states are only available for blood specific cell types, each CpG in the array was annotated to one or several chromatin states by taking a state as present in that locus if it was described in at least 1 of the 27 bloodrelated cell types. By applying this strategy we may be “over-representing” TxWk chromatin states, in the case TxWk are cell-type specific. As a result, even if each blood cell type might have few TxWk, many positions can be TxWk in at least one cell type, inflating the CpGs considered as TxWk. This might have affected some of the enrichments.

      On the other hand, CpG probe reliability depends on methylation levels and variance. TxWk regions show high methylation levels, which tend to be measured with more error. This also might have impacted the results, however the analysis considering only reliable probes (ICC >0.4) showed similar enrichment for TxWk.

      Besides these, we do not have a clear answer for the question raised by the reviewer.

      References:

      Bonder MJ, Luijk R, Zhernakova D V, Moed M, Deelen P, Vermaat M, et al. Disease variants alter transcription factor levels and methylation of their binding sites. Nat Genet [Internet]. 2017 [cited 2017 Nov 2];49:131–8. Available from: http://www.ncbi.nlm.nih.gov/pubmed/27918535

      Roadmap Epigenomics Consortium, Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, Kheradpour P, Zhang Z, Wang J, Ziller MJ, Amin V, Whitaker JW, Schultz MD, Ward LD, Sarkar A, Quon G, Sandstrom RS, Eaton ML, Wu YC, Pfenning AR, Wang X, Claussnitzer M, Liu Y, Coarfa C, Harris RA, Shoresh N, Epstein CB, Gjoneska E, Leung D, Xie W, Hawkins RD, Lister R, Hong C, Gascard P, Mungall AJ, Moore R, Chuah E, Tam A, Canfield TK, Hansen RS, Kaul R, Sabo PJ, Bansal MS, Carles A, Dixon JR, Farh KH, Feizi S, Karlic R, Kim AR, Kulkarni A, Li D, Lowdon R, Elliott G, Mercer TR, Neph SJ, Onuchic V, Polak P, Rajagopal N, Ray P, Sallari RC, Siebenthall KT, Sinnott-Armstrong NA, Stevens M, Thurman RE, Wu J, Zhang B, Zhou X, Beaudet AE, Boyer LA, De Jager PL, Farnham PJ, Fisher SJ, Haussler D, Jones SJ, Li W, Marra MA, McManus MT, Sunyaev S, Thomson JA, Tlsty TD, Tsai LH, Wang W, Waterland RA, Zhang MQ, Chadwick LH, Bernstein BE, Costello JF, Ecker JR, Hirst M, Meissner A, Milosavljevic A, Ren B, Stamatoyannopoulos JA, Wang T, Kellis M. Integrative analysis of 111 reference human epigenomes. Nature. 2015 Feb 19;518(7539):317-30. doi: 10.1038/nature14248. PMID: 25693563; PMCID: PMC4530010.

      • The result that the magnitude of the effect was independent of the distance between the CpG and the TC TSS is surprising. Could you draw a figure where x-axis is the distance between the CpG site and TC TSS and y-axis is p-value?

      As suggested by the reviewer, we have taken a more detailed look at the relationship between the effect size and the distance between the CpG and the TC’s TSS. First, we confirmed that the relative orientation (upstream or downstream) did not affect the strength of the association (p-value=0.68). Second, we applied a linear regression between the absolute log2 fold change and the log10 of the distance (in absolute value), finding that they were inversely related. We have updated the manuscript with this information (page 22, line 504):

      “We observed an inverse linear association between the eCpG-eGene’s TSS distance and the effect size (p-value = 7.75e-9, Figure 2B); while we did not observe significant differences in effect size due to the relative orientation of the eCpG (upstream or downstream) with respect to the eGene’s TSS (p-value = 0.68).”

      Results are shown in Figure 2B. Of note, we winsorized effect size values in order to improve the visualization. The winsorizing process is also explained in Figure 2 legend. Moreover, we have done the plot suggested by the reviewer (see below). It shows that associations with smallest p-values are found close to the TC’s TSS. Nonetheless, as this pattern is also observed for the effect sizes, we have decided to not include it in the manuscript.

      • Concerned about too many significant eQTMs. Almost half of genes are associated with methylation. I wonder if false positives are well controlled using the empirical p-values. Using empirical p-value with permutation may mislead since especially you only use 100 permutations. I wonder the result would be similar if they compare their result with the traditional way, either adjusting p-values using p-values from entire TCs or adjusting pvalues using a gene-based method as commonly used in GWAS. Compare your previous result with my suggestion for the first analysis.

      Despite the number of genes (TCs) whose expression is associated with DNA methylation is quite high, we do not think this is due to not correctly controlling false positives. Our approach is based on the method used by GTEx (GTEx consortium) and implemented in the FastQTL package (Ongen et al. 2016), to control for positives in the eQTLs discovery. As in GTEx, we run 100 permutations to estimate the parameters of a beta distribution, which we used to model the distribution of p-values for each CpG. Then, to correct for the number of TCs among significant CpGs, we applied False Discovery Rate (FDR) at a threshold < 0.05. Finally, we defined the final set of significant eQTMs using the beta distribution defined in a previous step.

      For illustration, we compared the number of eQTMs with our approach to what we would obtain by uniquely applying the FDR method (adjusted p-value <0.05), getting fewer associations with our approach: eQTMs (45,203 with FDR vs 39,749 with our approach), eCpGs (24,611 vs 21,966) and eGenes (9,937 vs 8,886). Among the 8,886 significant eGenes, 6,288 of them are annotated to coding genes, thus representing 27% of the 23,054 eGenes coding for a gene included in the array.

      References:

      GTEx consortium, The GTEx Consortium atlas of genetic regulatory effects across human tissues, Science (2020) Sep 11;369(6509):1318-1330. doi: 10.1126/science.aaz1776.

      Ongen et al. Fast and efficient QTL mapper for thousands of molecular phenotypes, Bioinformatics (2016) May 15;32(10):1479-85. doi: 10.1093/bioinformatics/btv722. Epub 2015 Dec 26.

      • I recommend starting with cell type specific results. Without adjusting cell type, the result doesn't make sense.

      As suggested by other reviewers, we have withdrawn the model unadjusted for cellular composition.

      Reviewer #3 (Public Review):

      Although several DNA methylation-gene expression studies have been carried out in adults, this is the first in children. The importance of this is underlined by the finding that surprisingly few associations are observed in both adults and children. This is a timely study and certain to be important for the interpretation of future omic studies in blood samples obtained from children.

      We agree with the reviewer that eQTMs in children are important for interpreting EWAS findings conducted in child cohorts such as those of the Pregnancy And Childhood Epigenetics (PACE) consortium.

      It is unfortunate that the authors chose to base their reporting on associations unadjusted for cell count heterogeneity. They incorrectly claim that associations linked to cell count variation are likely to be cell-type-specific. While possible, it is probably more likely that the association exists entirely due to cell type differences (which tend to be large) with little or no association within any of the cell types (which tend to be much smaller). In the interests of interpretability, it would be better to report only associations obtained after adjusting for cell count variation.

      Following reviewers’ recommendations, we have reconsidered our initial hypothesis about the role of cellular composition in the association between methylation and gene expression. Although we still think that some of the eQTMs only found in the model unadjusted for cellular composition could represent cell specific effects, we acknowledge that the majority might be confounded by the extensive gene expression and DNA methylation differences between cell types. Also, we recognize that more sophisticated statistical tests should be applied to prove our hypothesis. Because of this we have decided to report the eQTMs of the model adjusted for cellular composition in the main manuscript and keep the results of the model unadjusted for cellular composition only in the online catalogue.

      Several enrichments could be related to variation in probe quality across the DNA methylation arrays.

      For example, enrichment for eQTM CpG sites among those that change with age could simply be due to the fact age and eQTM effects are more likely to be observed for CpG sites with high quality probes than low quality probes. It is more informative to instead ask if eQTM CpG sites are more likely to have increasing rather than decreasing methylation with age. This avoids the probe quality bias since probes with positive associations with age would be expected to have roughly the same quality as those with negative associations with age. There are several other analyses prone to the probe quality bias.

      See answer to question 2, below.

    1. Author Response

      Reviewer #2 (Public Review):

      The visual system must extract two basic features of visual stimuli: luminance, which we perceive as brightness, and contrast, the change in luminance over space or time (this paper focuses on changes over time). Contrast is separately processed by ON and OFF pathways, which encode luminance increments or decrements, respectively. Contrast must be robustly detected even if the overall luminance changes rapidly, as might occur if an animal is moving in and out of shadows. This paper addresses how such a luminance correction occurs in the fly.

      In the fly, three types of first-order interneurons - L1, L2, and L3 - transmit information from photoreceptors to the medulla, where ON and OFF encoding emerges. Previous work suggested that all three interneurons primarily encode contrast signals and that they project to distinct pathways: L1 to the ON pathway and L2 and L3 to the OFF pathway. Ketkar et al. show that, contrary to this model, these interneurons encode both contrast and luminance in specific ways and are not cleanly segregated into ON versus OFF inputs.

      This study reveals several new insights into early visual processing that are interesting and well-supported by the data:

      1) The authors show that behavioral responses to ON stimuli can compensate for rapid changes in luminance. However, the purported sole input to the ON pathway, L1, shows activity that is highly dependent on luminance. This suggests that a luminance correction must arise downstream of L1. These results are analogous to findings previously made by the same group regarding the OFF pathway (Ketkar et al., 2020). The previous paper showed that L2 provides contrast information to the OFF pathway, and L3 provides luminance information to allow for a luminance correction in downstream contrast encoding. But unlike the multiple inputs to the OFF pathway, the ON pathway was thought to only receive input from L1, provoking the question of whether L1 is able to provide both contrast and luminance information.

      2) Using well-designed calcium imaging studies, the authors surveyed the responses of the three interneurons and found that they encode different stimulus features: L1 encodes both contrast and luminance, L2 purely encodes contrast, and L3 purely encodes luminance (with a different dependence than L1). These are interesting and important findings revealing how both contrast and luminance encoding are distributed across the three interneurons.

      3) Using neuronal manipulations, the authors dissected the contributions of the three interneurons to ON and OFF behavior under changing luminance. These experiments showed that L1 and L3 are required for the luminance correction in the behavior. Moreover, the finding that all three interneurons contribute to both ON and OFF behavior contrasts with the existing model of segregated pathways. Thus, this paper could change the way we think about early visual processing in the fly: rather than relaying similar information to distinct downstream pathways, first-order interneurons relay distinct information to common pathways.

      Overall, the major claims of this paper are important and supported by the experiments. There are just a few concerns that I would note:

      Thank you for the overall positive evaluation of our work, as well as for the constructive criticism, which we are going to address below.

      1) The authors state that they have shown luminance invariance in ON behavior (e.g. line 376-377 of the Discussion), but this is not entirely accurate: the ON behavior decreases as luminance increases. This is still an interesting effect since it's the opposite of what L1 activity does, so it's clear that the circuit is implementing a luminance correction, but it is not "luminance invariance".

      As pointed out in response to essential comment #2, we carefully edited the manuscript to talk about ‘near’ luminance invariance, or data approaching luminance invariance. More prominently, we rephrased the text to highlight the need for a luminance gain to scale behavioral responses to contrast, even if the resulting behavior is not entirely luminance invariant.

      2) The visual stimuli presented for most imaging experiments (full-field) are not the same as those presented for behavior (moving edges). It is possible neuronal responses and their encoding of luminance and contrast may differ if tested with the moving edge stimuli (if so, this would be concerning). The authors did image L1 with both types of stimuli and could compare these responses. Also, testing behavior at 34º and imaging at 20º presents a possible discrepancy in comparing these data.

      We use moving ON edges in Figure 1, and these data suggest that the transient response of L1 scales with step changes in luminance, consistent with data in Figure 2B. Although we did not point this out in the paper, the L1 responses in Figure 1 also decay to different response levels, consistent with the luminance-sensitive component that static stimuli reveal in Figure 2. Furthermore, for other ongoing projects in the lab, we have for example measured physiological responses in L2 with the same stimuli used in behavior, and there is no discrepancy with the data reported here. Overall, there is no reason to believe, following a vast amount of literature in Drosophila and other flies, that LMCs would respond any different to moving vs. static stimuli.

      We can additionally point out that the behavioral data of L3 silencing (at 34ºC) nicely correlate with physiological contrast responses of L1 and L2 (at 20ºC, predicted from electrophysiological recordings for LMCs in Ketkar et al. 2020, measured for L1 here). Many previous studies, for example in motion detection, have linked data from physiological recordings at room temperature with behavioral experiments done at higher temperature (e.g., Ammer et al., 2015; Clark et al., 2011; Creamer et al., 2019; Fisher et al., 2015; Leonhardt et al., 2017; Salazar-Gatzimas et al., 2016; Serbe et al., 2016; Silies et al., 2013; Strother et al., 2017). We therefore do not think that these are major concerns.

      3) I find it puzzling that silencing L1 has little effect on ON behavior at 100% contrast and varying luminance (Figure 3A), but severely affects ON behavior to 100% contrast (and lower values) when different contrasts are interleaved (Figure S1). The authors note this but do not provide a clear explanation of why this might be the case. Aside from mechanism, it is not clear whether the difference is due to varying luminance in the first experiment or varying contrast in the second one (e.g. they could test 100% contrast without varying luminance).

      The two stimulus sets used here do not allow us to pinpoint why the L1 silencing phenotype differs between them, since they comprise more than one difference as discussed above (see point 4) in “Essential Revisions”). We now include two additional experiments that dissect the role of different stimulus parameters (Supp. Figure 2). To understand whether the difference is due to varying luminance, we tested responses to ON edges of fixed (100%) contrast and luminance at the same stimulus parameters (motion duration, speed) as used in Figure 3, and did not find reduced turning responses when silencing L1. Thus, varying luminance does not change the effect of L1 on ON behavior. However, when repeating this experiment with a bright inter-stimulus interval, L1 silencing lead to a strong response deficit. Therefore, differences in the interval luminance explain the differences in the L1 silencing phenotype observed not only in this study but also across studies. Although we hypothesize a role of contrast adaptation that may function differently with altered contrast statistics, a more detailed investigation would be necessary to understand the mechanism. Nevertheless, our experiments allow us to conclude that L1 is not the sole major input to the ON pathway, even though it is required under certain stimulus conditions.

      4) I do not entirely agree with the authors' interpretation of the L1 ort rescue experiment for OFF behavior. They state that rescue flies "responded similarly to positive controls". However, the graph shows that the rescue flies generally fall in between the mutant and heterozygote control flies; they resemble the controls at low luminance but resemble the mutants at high luminance. One may conclude that L1 is sufficient to enhance OFF behavior at low luminance, but it is a stretch to say it's a complete rescue.

      Sorry, we just meant to say that they “responded similarly to positive controls (...) at low luminance”, but the sentence was badly written. We corrected this to: “L1 ort rescue flies responded similarly to positive controls at low luminances, rescuing responses to OFF edges at dim backgrounds.”

      5) The authors typically use t-tests to analyze experiments with 2 variables (genotype and luminance) and 3 or more conditions per variable. This is not the most appropriate statistical test; typically one would use a two-way ANOVA. At the least, it should be clear whether they are performing corrections for multiple comparisons if performing many t-tests on the same dataset.

      Thank you for the suggestion, we now use a two-way ANOVA followed by corrected pairwise comparisons and state this clearly in the figure captions (also addressed above in essential comment #5).

      Reviewer #3 (Public Review):

      Ketkar et al combine calcium imaging and behavioral experiments to investigate the encoding of luminance and contrast in 3 first-order interneurons in the Drosophila lamina: L1, L2, and L3, as well as the role of these signals in moving ON edge behavior across luminance. The behavioral experiments are well performed. The rescue experiments are particularly interesting. Together with silencing they support and nicely extend previous work showing that L1/2/3 are not simply segregated between ON and OFF pathways. My main issue is the link that the authors make between the cellular responses and the behaviors performed and therefore the overall conclusions and claims of the paper about the roles of contrast vs luminance encoding of each neuron type (particularly L1) in the behaviors.

      Major concerns:

      1) The authors state that the main behavior they study, namely optomotor response to moving light edges at 100% contrast, is "luminance invariant". A strict definition of this would be that behavioral responses are constant with increasing luminance. However, there are very few plots in this paper where this is the case. In almost all examples, the response is decreasing with respect to increasing luminance. The authors do qualify a "nearly" invariant behavior, but this does not change the fact that interpretation of the data in the context of the framing of the paper is often problematic.

      We thank the reviewer for this critical comment. The main point (that we apparently failed to make clear enough) is that there is a clear requirement for a luminance gain. Physiological LMC responses measured using calcium imaging to ON stimuli in Figure 1, or predicted from previous electrophysiological recordings to OFF stimuli in (Ketkar et al., 2020) cannot account for any of the (control) behavioral data. We now edited the text to tone down statements about luminance invariance, and instead highlighted the need for a luminance gain.

      2) The manuscript would benefit from clear definitions of luminance and contrast, as well as an explanation of how contrast and luminance sensitivity can be inferred from experiments. In particular, the authors use transient vs. sustained response properties in L1, L2, and L3 as indicators of contrast and luminance sensitivity, but this is not stated clearly. It would be important to explain this to the reader early on.

      We now added definitions of general terms to the introduction and added data and analysis to the manuscript (Figure S1, and Figure 2B-D) to more clearly test which component of the neurons’ responses encode contrast or luminance.

      3) In the manuscript, it is often stated that "calcium imaging experiments reveal that each first order interneuron is unique in its contrast and luminance encoding properties" (line 110). This was shown clearly for L2 and L3 in their previous work in Ketkar et al. 2020, with a welldesigned two-step stimulus that was able to tease apart contrast vs. luminance invariance. Unfortunately it does not seem that this level of experimental detail and analysis is applied to L1 here. In particular, the authors state " L1 encodes both contrast and luminance in distinct response components." Line 112, in the summary of their findings. I would not agree that the authors have actually shown this properly in this manuscript.

      Addressed above, in point 6 of “Essential Revisions”

      4) The results as they are stated, are at times not well supported by the data. The manuscript would benefit from a careful assessment of the accuracy and precision of the language used to interpret the data. Sometime just moving some conclusions to the discussion and explaining the assumptions made to reach a particular conclusion would be enough. A few of examples:

      We carefully edited the entire manuscript, in addition to addressing the specific points below.

      o Figure 2: "Lamina neuron types L1-L3 are differently sensitive to contrast and luminance". It is overall true that from the raw traces, the response are different. However the quantification in C-E only pertains to luminance.

      As stated above, we now did further analysis on the contrast encoding properties of L1 and L2 and pointed out the major differences between these neurons (Figure 2B-D).

      o Figure 3: "L1 is not required but sufficient for ON behavior across luminance". The data convincingly shows this. I would however point out that the statement "this data [..] highlights its behavioral relevant role of its luminance component" line 231 is an overstatement.

      We deleted this statement at the end of the paragraph.

      o Figure 6: "L1 luminance signal is required and sufficient for OFF behavior" the data presented shows convincingly that when L1 is inactive the behavior becomes (more) intensity variant. However, it does not show that it is the "luminance signal" in L1 that is required for this effect. In general, because L1 has a sustained and a transient response, it is difficult to strictly implicate one or the other in supporting any behavior, short of manipulating L1 to make it fully transient or fully sustained.

      We agree. The figure title now reads “L1 function is required and sufficient for OFF behavior”.

      o It is often not clear which conclusions stem from this work and which from their previous work Ketkar et al. 2020, or even other previous work on contrast sensitivity in particular. Clarifying this might help with my concern about statements not well supported by the data in this paper, and also justify their overall novelty. In general the manuscript assumes familiarity with this previous work, which is not always helpful for the reader.

      As stated above, we now more clearly separate previous findings from novel findings in the abstract, and throughout the text. We also expanded the introduction to better explain the core concepts that are needed to understand this work, without having read Ketkar et al. 2020.

    1. Author Response

      Reviewer #1 (Public Review):

      In this paper, Fernandes et al. take advantage of synthetic constructs to test how Bicoid (Bcd) activates its downstream target Hunchback (Hb). They explore synthetic constructs containing only Bcd, Bcd and Hb, and Bcd and Zelda binding sites. They use these to develop theoretical models for how Bcd drives Hb in the early embryo. They show that Hb sites alone are insufficient to drive further Hb expression.

      The paper's first half focuses on how well the synthetic constructs replicate the in vivo expression of hb. This approach is generally convincing, and the results are interesting. Consistent with previous work, they show that Bcd alone is sufficient to drive an expression profile that is similar to wild‐type, but the addition of Hb and Zelda are needed to generate precise and rapid formation of the boundaries. The experimental results are supported by modelling. The model does a nice job of encapsulating the key conclusions and clearly adds value to the analysis.

      In the second part of the paper, the authors use their synthetic approach to look at how the Hb boundary alters depending on Bcd dosage. This part asks whether the observed Bcd gradient is the same as the activity gradient of Bcd (i.e. the "active" part of Bcd is not a priori the same as the protein gradient). This is a very interesting problem and good the authors have tried to tackle this. However, the strength of their conclusions needs to be substantially tempered as they rely on an overestimation of the Bcd gradient decay length.

      Comments:

      ‐ My major concern regards the conclusions for the final section on the activity gradient. In the Introduction it is stated: "[the Bcd gradient has] an exponential AP gradient with a decay length of L ~ 20% egg‐length (EL)". While this was the initial estimate (Houchmandzadeh et al., Nature 2002), later measurements by the Gregor lab (see Supplementary Material of Liu et al., PNAS 2013) found that "The mean length constant was reduced to 16.5 ± 0.7%EL after corrections for EGFP maturation". The original measurements by Houchmandzadeh et al. had issues with background control, that also led to the longer measured decay length. In later work, Durrieu et al., Mol Sys Biol 2018, found a similar scale for the decay length to Liu et al. Looking at Figure 5, a value of 16.5%EL for the decay length is fully consistent with the activity and protein gradients for Bcd being similar. In short, the strength of the conclusions clearly does not match the known gradient and should be substantially toned down.

      The reviewer is right: several studies aiming to quantitatively measure the Bicoid protein gradient ended‐up with quite different decay lengths.

      A summary of the various decay lengths measured, and the method used for these measurements is given below:

      As indicated, these measurements are quite variable among the different studies and the differences can potentially be attributed to different methods of detection (antibody staining on fixed samples vs fluorescent measurements on live sample) or to the type of protein detected (endogenous Bicoid vs fluorescently tagged).

      We agree with the reviewer that given these discrepancies, the exact value of the Bcd protein gradient decay length is not known and that we only have measurements that put it in between 16 and 25 % EL (see the Table above). Therefore, we agree that we should tone down the difference between the protein vs activity gradient and focus on the measurements of the effective activity gradient decay length allowed by our synthetic reporters. This allows us to revisit the measurement of the Hill coefficient of the transcription step‐like response, which is based on the decay‐length for the Bcd protein gradient, and assumed in previous published work to be of 20% EL (Gregor et al., Cell, 2007a; Estrada et al., 2016; Tran et al., PLoS CB, 2018). Importantly, the new Hill coefficient allows us to set the Bcd system within the limits of an equilibrium model.

      As mentioned by the reviewer, it is possible that the decay length of the protein gradient measured using antibody staining (Houchmandzadeh et al,, Nature, 2002) was not correct due to background controls. Such measurements were also performed in Xu et al. (2015) which agree with the original measurements (Houchmandzadeh et al., Nature 2002). As indicated in the table above, all the other measurements of the Bcd protein gradient decay length were done using fluorescently tagged Bcd proteins and we cannot exclude the possibility the wt vs tagged protein might have different decay lengths due to potentially different diffusion coefficients or half‐lives. Before drawing any conclusion on the exact value of the endogenous Bcd protein gradient decay length, it is essential to measure it again in conditions that correct for the background issues for immuno‐staining as it was done in Liu et al., PNAS, 2013 for the Bcd‐eGFP protein. In this study, the authors only measured the decay length of the Bcd fusion protein using immuno‐staining for the Bcd protein. Unfortunately, in this study, the authors did not measure again the decay length of the endogenous Bcd protein gradient using immuno‐staining and the same procedure for background control. Therefore, they do not firmly exclude the possibility that the endogenous vs tagged Bcd proteins might have different decay length.

      We thank the reviewer for his comment which helped us to clarify the message. In addition, as there is clearly an issue for the measurements of the Bcd protein gradient, we added a section in the SI (Section E) and a Table (Table S4) describing the various decay length measured for the Bcd or the Bcd‐fluorescently tagged protein gradients from previous studies. In the discussion, together with the possibility that there might be a protein vs activity gradient (as we originally proposed and believe is still a valid possibility), we also discuss the alternative possibility proposed by the reviewer which is that the protein vs activity gradients have the same decay lengths but that the decay length of the Bcd protein gradient was potentially not correctly evaluated.

      ‐ All of the experiments are performed in a background with the hb gene present. Does this impact on the readout, as the synthetic lines are essentially competing with the wild‐type genes? What controls were done to account for this?

      We agree with the reviewer that this concern might be particularly relevant at the hb boundary where a nucleus has been shown to only contain ~ 700 Bicoid molecules (Gregor et al., Cell, 2007b). However, ~1000 Bicoid binding regions have been identified by ChIP seq experiments in nc14 embryos (Hannon et al., Elife, 2017) and given that several Bcd binding sites are generally clustered together in a Bcd region, the number of Bcd binding sites in the fly genome is likely larger than 1000. It is much greater than the number of Bicoid binding sites in our synthetic reporters. Therefore, we think that it is unlikely that adding the synthetic reporters (which in the case of B12 only represents at most 1/100 of the Bcd binding sites in the genome) will severely alter the competition for Bcd binding between the other Bcd binding sites in the genome. Additionally, the insertion of a BAC spanning the endogenous hb locus with all its Bcd‐dependent enhancers did not affect (as far as we can tell) the regulation of the wildtype gene (Lucas, Tran et al., 2018).

      We have added a sentence concerning this point in the main text (lines 108 to 111).

      ‐ Further, the activity of the synthetic reporters depends on the location of insertion. Erceg et al. PLoS Genetics 2014 showed that the same synthetic enhancer can have different readout depending on its genomic location. I'm aware that the authors use a landing site that appears to replicate similar hb kinetics, but did they try random insertion or other landing site? In short, how robust are their results to the specific local genome site? This should have been tested, especially given the boldly written conclusions from the work.

      This concern of the reviewer has been tested and is addressed Fig S1 where we compare two random insertions of the hb‐P2 transgene (on chromosome II and III; Lucas, Tran et al., 2018) and the insertion at the VK33 landing site that was used for the whole study. As shown Fig. S1, the dynamics of transcription (kymographs) are very similar. In the main text, the reference Fig. S1 is found in the Materials and Methods section (bottom of the 1st paragraph concerning the Drosophila stocks, lines 518).

      ‐ Related to the above, it's also not obvious that readout is linear ‐ i.e. as more binding sites are added, there could be cooperativity between binding domains. This may have been accounted for in the model but it is not clear to me how.

      The reviewer is totally correct. It is clear from our data that readout is not linear: comparing (increase of 1.5 X in the number of BS) B6 with B9 leads to a 4.5 X greater activation rate and this argues against independent activation of transcription by individual bound Bcd TF. There is almost no impact of adding 3 more sites when comparing B9 to B12 (even though it corresponds to an increase of 1.33 X in the number of BS). This issue has been rephrased in the main text (lines 200 to 203) and further developed for the modeling aspects in the SI section C and Figure S3. It is also discussed in the second paragraph of the discussion (lines 380 to 383).

      ‐ It would be good in the Introduction/Discussion to give a broader perspective on the advantages and disadvantages of the synthetic approach to study gene regulation. The intro only discusses Tran et al. Yet, there is a strong history of using this approach, which has also helped to reveal some of the approaches shortcoming. E.g. Gertz et al. Nature 2009 and Sharon et al. Nature Biotechnology 2012. Again, I may have missed, but from my reading I cannot see any critical analysis of the pros/cons of the synthetic approach in development. This is necessary to give readers a clearer context.

      One sentence was added in the introduction concerning this point (lines 79 to 82).

      A short review concerning the synthetic approach in development has also been added at the beginning of the discussion (lines 347 to 359).

      Reviewer #2 (Public Review):

      It is known that Bicoid increases in concentration across the syncytial division cycles, the gradient length scale for Bicoid does not change, and hunchback also increases in concentration during the syncytial cycles but the sharp boundary of the hunchback gradient is constantly seen despite the change in concentration of Bicoid. This manuscript shows that by increasing the Bicoid concentration or by adding Zelda binding sites, the expression of hunchback can be recapitulated to that of a previously studied promoter for hunchback.

      I have the following comments to understand the implications of the study in the context of increasing concentrations of Bicoid during the syncytial division cycles:

      ‐ Bicoid itself is also increasing over the syncytial division cycles, how does this change in concentration of Bicoid affect the activation of the hunchback promoter given the cooperative binding of Bicoid and Bicoid and Zelda as documented by the study?

      We thank the reviewer for this remark about the dynamics of the Bcd gradient, which we may have taken for granted. A seminal work on the dynamics of the Bcd gradient using fluorescent‐tagged Bcd (Gregor et al, Cell, 2007a) has shown that the gradient of Bcd nuclear concentration (this nuclear concentration is the one that matter for transcription) remains stable over nuclear cycles, despite a global increase of Bcd amount in the embryo. This can be explained by the fact that Bcd molecules are imported in the nuclei and that the number of nuclei double at every cycle, such that both processes compensate each other. Thus, we assumed that the gradient of Bcd nuclear concentration was stable over nc11 to nc13.

      We have clarified this assumption in the model section in the manuscript (lines 165‐168).

      Supporting our assumption, when looking at the transcription dynamics regulated by Bcd, in Lucas et al, PLoS Gen, 2018, we observed very reproducible expression pattern dynamics of the hb‐P2 reporter at each cycle nc11 to nc13. Such reproducibility in the pattern dynamics were also observed in this current work for hb‐P2, B6, B9, B12 and H6B6 reporters (Fig. S6A). Also, in Lucas et al, PLoS Gen, 2018, the shift in the established boundary positions of hb‐P2 reporter between nc11 to nc13 is ~2%EL (approximately a nucleus length ~10μm) and it is thus marginal.

      In addition, as mentioned in the text (lines 105 to 107), we only focused our analysis on nc13 data which are statistically stronger given the higher number of nuclei analyzed. Thus, any change of Bcd nuclear concentration that would happen over nuclear cycles will not matter.

      Concerning Zelda: Zelda’s transcriptional activity when measured on a reporter with only 6 Zld binding sites changes drastically over the nuclear cycles, with strong activity at nc11 and much weaker activity at nc13 (Fig S4A). This indicates that the changes in expression pattern dynamics of Z2B6 from nc11 to nc13 are caused predominantly by decreasing Zelda activity: the effect of Zld on the Z2B6 promoter is very strong during nc11 and nc12. It is also very strong at the beginning of nc13 (even though the Z6 reporter is almost silent) and became a bit weaker in the second part of nc13 (Fig S4B‐D).

      ‐ Does the change in concentration of Bicoid across the nuclear cycles shift the gradient similar to the change in numbers of Bicoid binding sites?

      In both Lucas et al, PLoS Gen, 2018 and in this work (Fig. 1, Fig. 3 and Fig. S6A), we found that the positions of the expression boundary are very reproducible and stable in time for hb‐P2, B6, B9, B12, H6B6 during the interphase of nc12 to 13. For hb‐P2, the averaged shift of the established boundary position in nc11, 12 and 13 is within 2 %EL. This averaged shift between the cycles is of similar magnitude to the difference caused by embryo‐to‐embryo variability within nc13 (~2 %EL) (Gregor et al, Cell, 2007b, Lucas et al, PloS Gen, 2018). This shift is much smaller than the difference between the expression boundary positions of B6 and B9 (~ 8 % EL) and between B6 and Z2B6 (~17.5 %EL) in nc13.

      For these reasons, we conclude that the difference between the expression patterns of B6, B9 and Z2B6 are caused predominantly by changing the TF binding site configurations of the reporters, rather than variability in the Bcd gradient.

      The assumption of gradient stability has been clarified in the previous answer and in the manuscript (lines 165‐168).

      ‐ The intensity is a little higher for B9 and B12 at the anterior in 2B? Is this statistically different? is this likely to change the amount of Bicoid expression at the locus and lead to more robust activation?

      We performed statistical tests to distinguish the spot intensities at the anterior pole for every pair of reporters in Fig. 2B (hb‐P2, B6, B9 and B12). All p‐values from pair‐wise KS tests are greater than 0.067, suggesting that the spot intensities at the anterior pole are not distinguishable between these reporters.

      We have clarified this in the manuscript (line 157).

      ‐Are the fraction of active loci not changing across the syncytial cycles when the concentration of Bicoid also changes and consistent with the synthetic promoters?

      To measure the reproducibility of the expression pattern dynamics in different nuclear cycles, we compared the boundary position of the fraction of active loci pattern as a function of time for all hbP2 and synthetic reporters (Fig. S6A). In this figure panel, for all reporters except Z2B6, the curves in nc12 and nc13 largely overlap, suggesting high reproducibility in the pattern dynamics between cycles and consequently low sensitivity to the subtle variation in the Bcd nuclear concentration gradient between the cycles.

      For Z2B6, we attributed the difference in pattern dynamics between nc12 and nc13 to the changes in Zelda activity, as validated independently with a synthetic reporter with only 6 Zld binding sites (Fig. S4A).

      ‐How do the numbers of Hb BS change the expression of Hb? H6B6 has 6 Hb BS whereas the Hb‐P2 has 1? Are more controls needed to compare these 2 contexts?

      As our goal was to determine to which mechanistic step of our model each TF (Bcd, Hb, Zld) contributed, we added BS numbers that are much higher than in the hb‐P2 promoter. The added number of Hb BS remains very low when compared to total number of Hb binding sites in the entire genome (Karplan et al, PLOS Gen, 2011), therefore, it is very unlikely to affect the endogenous expression of Hb protein.

      We clarified this in the manuscript (lines 211 to 212).

      Does Zelda concentration change across the syncytial division cycles? How does the change in concentration in the natural context affect the promoter activation of Hb?

      Zelda concentration is stable over the nuclear cycles, as observed with the fluorescently‐tagged Zld protein (Dufourt et al., Nat Com, 2018). However, Zelda’s transcriptional activity when measured on a reporter with only 6 Zld binding sites changes drastically over the nuclear cycles, with strong activity at nc11 and much weaker activity at nc13 (Fig S4A, this work).

      The impact of this change in Zld activity can be observed with the Z2B6 promoter, with the expression boundary moving from the posterior region toward the anterior region over the nuclear cycles (Fig. S4B‐D). However, we don’t detect any changes in the expression pattern dynamics of hb‐P2 over the nuclear cycles (Fig. S6A and in Lucas et al., PLoS Gen, 2018).

      We have clarified this in lines 250‐251 of the main manuscript.

      ‐Changing the dose of Bicoid shifts the boundary of hunchback expression. It would be nice to model or test this in the context of varing doses of zelda or even reason this with respect to varying doses of zelda across the syncytial division cycles.

      We thank the reviewer for this insight. Concerning Zelda, we did not perform any experiment reducing the amount of Zelda in the embryo. However, in a previous study (Lucas et al., PLoS Genetics, 2018), we observed that the boundary of hb was shifted towards the anterior when decreasing the amount of Zelda consistent to the fact that the dose of Zelda is critical to set the boundary position and the threshold of Bcd concentration required for activation. However, as Zelda is distributed homogeneously along the AP axis, it cannot bring per se positional information to the system.

      Reviewer #3 (Public Review):

      I think the framing could be improved to better reflect the contribution of the work. From the abstract, for example, it's unclear to me what the authors think is the most meaningful conclusion. Is it the observations about the finer details of TF regulation (bursting dynamics), the fact that Bcd is probably the sole source of "positional information" for hb‐p2, that Bcd exists in active/inactive form, or the fact that an equilibrium model probably suffices to explain what we observe? The first sentence itself seems to suggest this paper will discuss "dynamic positional information", in which case it's somewhat misleading to say this kind of work is "largely unexplored"; Johannes Jaeger in particular has been a strong proponent of this view since at least 2004. On that note some particularly relevant recent papers in the Drosophila early embryo include:

      1) Jaeger and Verd (2020) Curr Topics Dev Biol

      2) Verd et al. (2017) PLoS Comp Biol

      3) Huang, Amourda, et al. and Saunders (2017) eLife

      4) Yang, Zhu, et al. (2020) eLife [see also the second half of Perkins (2021) PLoS Comp Biol for further discussion of that model]

      ‐Some reviews from James Briscoe also discuss this perspective.

      We agree with the reviewer that the phrasing of the abstract was not clear enough to emphasize the contribution of the work and we are also sorry if it suggested that the dynamic positional information is largely unexplored because this was not at all our intention.

      We rephrased the abstract aiming to better highlight the most meaningful conclusions.

      ‐I would also recommend modifying the title to reflect the biology found in the new results.

      We modified the title to better reflect the new results:<br /> “Synthetic reconstruction of the hunchback promoter specifies the role of Bicoid, Zelda and Hunchback in the dynamics of its transcription”

      ‐A major point that the authors should address is the design of the synthetic constructs. From table S1, the sites are often very closely linked (4‐7 base pairs). From the footprint of these proteins, we know they can cover DNA across this size (see, https://pubmed.ncbi.nlm.nih.gov/8620846/). As such, there may be direct competition/steric hindrance (see https://pubmed.ncbi.nlm.nih.gov/28052257/). What impact does this have on their interpretations? Note also that the native enhancer has spaced sites with variable identities.

      We completely agree with the reviewer comment in the sense that we named our reporters according to the number (N) of Bcd binding sites sequences that they contain, even though we cannot prove definitively that they can effectively be bound simultaneously by N Bcd molecules. It is thus possible that B9 is not a B9 but an effective B6 (i.e. B9 can only be bound simultaneously by 6 molecules) if, for instance, the binding of a Bcd molecule to one site would prevent by the binding of another Bcd molecule to a nearby site (as proposed by the reviewer in the case of direct competition or steric hindrance).

      Even though we cannot exclude this possibility, we think that our use of B6, B9, B12, in reference to the 6 Bcd BS of hb‐P2 promoter, is relevant for several reasons : i) some of the Bcd BS in the hb‐P2 promoter are also very close from each other (see Table S1); ii) the design of the synthetic construct was made by multimerizing a series of 3 strong Bcd binding sites with a similar spacing as found for the closest sites in the hb‐P2 promoter (as shown in Figure 1A and Table S1); iii) the binding of the Bicoid protein has been shown in foot printing experiments in vitro to be more efficient on sites of the hb‐P2 promoter that are close from each other, and this has even been interpreted as binding cooperativity (Ma et al., 1996); iv) even though these experiments were not performed with full‐length proteins, two molecules of the paired homeodomain (from the same family of DNA binding domain as Bcd) are able to simultaneously bind to two binding sites separated by only 2 base pairs. This binding to very close sites is even cooperative while when the two sites are distant by 5 base pairs or more, the simultaneous binding to the two sites occurs without cooperativity (Wilson et al., 1993).

      Conversely, as it is very difficult to demonstrate that 9 Bcd molecules can effectively bind to our B9 promoter, it is very difficult to know exactly how many binding sites for Bcd the hb‐P2 contains, and a large debate concerning not only the number but also the identity of the Bcd sites in the hb promoter is still ongoing (Park et al., 2019; Ling et al., 2019).

      As we cannot exclude the possibility that B9 is an effective B6, it remains possible that B9 and hb‐P2 (which is supposed to only contains 6 sites) have the same number of effective Bcd binding site and this could explain why the two reporters have very similar transcription dynamics and features.

      Regarding other interpretations in the manuscript, we identified two other aspects that will be affected if our synthetic reporters have fewer effective sites than the number of sites they carry. The first one concerns the synergy, as the increase in the number of sites of 1.5 from B6 to B9 might be over‐estimated but this would even increase the synergistic effect given the 4.5 difference in activity of the two reporters (Fig. S3). The second one concerns the discussion on the Hill coefficient and the decay length where the effective number of binding sites (N) is required to determine the limit of concentration sensing (Fig. 5). This would particularly be important for the hb‐P2 promoter.

      Except for these specific points, we don’t think that the possibility that reporters do not exactly contain as many as effective binding sites than proposed, has a huge impact on our interpretations and the general message conveyed in this manuscript. Most importantly, it is very clear that our B6 and B9 reporters differ only by three Bcd binding sites and have yet very distinct expression dynamics: while B9 recapitulates almost all transcription features of hb‐P2, B6 is far from achieving it. Similarly, H6B6 and Z2B6 have very different transcription features than B6 and these differences have been key for understanding the mechanistic functions of the three TF we studied.

      This discussion has been added to the discussion (lines 400 to 414)

    2. Reviewer #3 (Public Review):

      I think the framing could be improved to better reflect the contribution of the work. From the abstract, for example, it's unclear to me what the authors think is the most meaningful conclusion. Is it the observations about the finer details of TF regulation (bursting dynamics), the fact that Bcd is probably the sole source of "positional information" for hb-p2, that Bcd exists in active/inactive form, or the fact that an equilibrium model probably suffices to explain what we observe? The first sentence itself seems to suggest this paper will discuss "dynamic positional information", in which case it's somewhat misleading to say this kind of work is "largely unexplored"; Johannes Jaeger in particular has been a strong proponent of this view since at least 2004. On that note some particularly relevant recent papers in the Drosophila early embryo include:<br /> 1) Jaeger and Verd (2020) Curr Topics Dev Biol<br /> 2) Verd et al. (2017) PLoS Comp Biol<br /> 3) Huang, Amourda, et al. and Saunders (2017) eLife<br /> 4) Yang, Zhu, et al. (2020) eLife [see also the second half of Perkins (2021) PLoS Comp Biol for further discussion of that model]<br /> Some reviews from James Briscoe also discuss this perspective.

      I would also recommend modifying the title to reflect the biology found in the new results.

      A major point that the authors should address is the design of the synthetic constructs. From table S1, the sites are often very closely linked (4-7 base pairs). From the footprint of these proteins, we know they can cover DNA across this size (see, https://pubmed.ncbi.nlm.nih.gov/8620846/). As such, there may be direct competition/steric hindrance (see https://pubmed.ncbi.nlm.nih.gov/28052257/). What impact does this have on their interpretations? Note also that the native enhancer has spaced sites with variable identities.

    1. Author Response:

      Evaluation Summary:

      This paper will be of interest to researchers who perform single-molecule fluorescence imaging experiments as well as those who want to include machine learning in their data analyses. The authors have developed a machine learning algorithm that addresses some of the data analysis challenges in the field of single-molecule fluorescence imaging. The methods are rigorously benchmarked using simulated data and tested using real data. There are some concerns whether Tapqir is general enough for use by the broader community of single-molecule fluorescence researchers.

      We thank the reviewers for their thorough review of the manuscript. In response to the reviewer comments, we posted to bioRxiv a revised manuscript with new data and edits to text. Concerns about generality are addressed in the revised manuscript and in the responses to specific reviewer comments below.

      Reviewer #1 (Public Review):

      "Bayesian machine learning analysis of single-molecule fluorescence colocalization images" by Ordabayev, et al. reports the development, benchmarking, and testing of a Bayesian machine learning-based method, which the authors name Tapqir, for analyzing single-molecule fluorescence colocalization data. Unlike currently available, more conventional analysis methods, Tapqir attempts to holistically model the microscopy images that are recorded during a colocalization experiment. Tapir uses a physics-based, global model with parameters describing all of the features of the experiment that are expected to contribute to the recorded microscopy images, including shot noise of the spots and background, camera noise, size and shape of the spots, and specific- and non-specific binders. Based on benchmarking on simulated data with widely varying properties (e.g., signal-to-noise; amounts, rates, and locations of specific and non-specific binders; etc.), Tapqir generally does as well and, in some cases, better than currently existing methods. The authors also test Tapqir on real microscopy images with similarly varying properties from studies that have been previously published by their research group and demonstrate that their Tapqir-based analysis is able to faithfully reproduce the previously published results, which were obtained using the more conventional analysis methods available at the time the data were originally published. This is a well-designed and executed study, Tapqir represents a conceptual and practical advance in the analysis of single-molecule fluorescence colocalization experiments, and its performance has been comprehensively and rigorously benchmarked on simulated data and tested on real data. The conclusions of this study are well supported by the data, but some of the limitations of the method need to be clarified and discussed in more depth, as outlined below.

      1. Given that the AOI is centered at the target molecule and there is a strong prior for the binder also being located at the center of the AOI, the performance of Tapqir is dependent on several variables of the microscopy/optical system (e.g., the microscope point-spread function, magnification, accurate alignment of target and binder imaging channels, accurate drift correction, etc.). Although this caveat is mentioned and some of these factors are listed in the main text of the manuscript, the authors could have expanded this discussion in order to clarify the extent to which the performance of Tapqir depends on these factors.

      We added relevant new data to the revised manuscript in Table 5. The question about alignment accuracy is now discussed in the Materials and Methods:

      “Tests on data simulated with increasing proximity parameter values σxy (true) (i.e., with decreasing precision of spatial mapping between the binder and target image channels) confirm that the cosmos model accurately learns σxy (fit) from the data (Figure3–Figure Supplement 3D; Table 5). This was the case even if we substituted a less-informative σxy prior (Uniform vs. Exponential; Table 5).

      The CoSMoS technique is premised on colocalization of the binder spots with the known location of the target molecule. Consequently, for any CoSMoS analysis method, classification accuracy will in general decline when the images in the target and binder channels are less accurately mapped. However, for the Tapqir cosmos model, low mapping precision has little effect on classification accuracy at typical non-specific binding densities (λ = 0.15; see MCC values in Table 5).”

      The more general point about priors is now addressed in the Materials and Methods as follows:

      “All simulated and experimental data sets in this work were analyzed using the prior distributions and hyperparameter values given above, which are compatible with a broad range of experimental conditions (Table 1). Many of the priors are uninformative and we anticipate that these will work well with images taken on variety of microscope hardware. However, it is possible that highly atypical microscope designs (e.g., those with effective magnifications that are sub-optimal for CoSMoS) might require adjustment of some fixed hyperparameters and distributions (those in Eqs. 6a, 6b, 11, 12, 13, 15, and 16). For example, if the microscope point spread function is more than 2 pixels wide, it may be necessary to increase the range of the w prior in Eq. 13. The Tapqir documentation (https://tapqir.readthedocs.io/en/stable/) gives instructions for changing the hyperparameters.”

      1. The Tapqir model has many parameters, each with its own prior. The majority of these priors are designed to be uninformative and/or weak and the only very strong prior is the probability that a specific binder is located at or very near the center of the AOI. The authors could have tested and commented on how the strength of the prior on the location of a specific binder affects the performance of Tapqir.

      The revised manuscript includes new data on and expanded discussion of this point. In our model, the position of a target-specific spot relative to the target position has a prior distribution illustrated as the green curve in Figure 2-Figure supplement 2. Importantly, the peak in this distribution does not have an a priori set width. Instead, the width of the peak is a model hyperparameter, σxy, that is learned from the image data set without user intervention. To make sure that this point is understood, we expanded and clarified the relevant Methods section and modified the legend of Figure 2-Figure supplement 2.

      To address the reviewers’ specific question, we constructed simulated data sets with different mapping precision values and analyzed them; the results are presented in the (new) Table 5 and discussed:

      “The CoSMoS technique is premised on colocalization of the binder spots with the known location of the target molecule. Consequently, for any analysis method, classification accuracy declines when the images in the target and binder channels are less accurately mapped. For the Tapqir cosmos model, low mapping precision has little effect on classification accuracy at typical non-specific binding densities (λ = 0.15; see MCC values in Table 5).”

      1. Given the priors and variational parameters they report, the authors show that Tapqir performs robustly and seems to require no experiment-to-experiment optimization. This is expected to be the case for the simulated data, since they were simulated using the same model that Tapqir uses to perform the analysis. With regard to the real data, however, it is quite likely that this is due to the fact that the analyzed data all come from the same laboratory and, therefore, likely the same microscope(s). It would have therefore been very useful if the authors would have listed and discussed which microscope settings, experimental conditions, and/or other considerations, beyond those described in point 1 above, would result in a need for re-optimization of the priors and/or variational parameters.

      As noted above, we now address this point in the Materials and Methods as follows:

      “All simulated and experimental data sets in this work were analyzed using the prior distributions and hyperparameter values given above, which are compatible with a broad range of experimental conditions (Table 1). Many of the priors are uninformative and we anticipate that these will work well with images taken on variety of microscope hardware. However, it is possible that highly atypical microscope designs (e.g., those with effective magnifications that are sub-optimal for CoSMoS) might require adjustment of some fixed hyperparameters and distributions (those in Eqs. 6a, 6b, 11, 12, 13, 15, and 16). For example, if the microscope point spread function is more than 2 pixels wide, it may be necessary to increase the range of the w prior in Eq. 13. The Tapqir documentation (https://tapqir.readthedocs.io/en/stable/) gives instructions for changing the hyperparameters.”

      1. Based on analysis of the simulated data shown in Figure 5, where the ground truth is known, the use of Tapqir to infer kinetics is less accurate that the use of Tapqir to infer equilibrium binding constants. The authors do a great job of discussing possible reasons for this. In the case of the real data analyzed in Figure 6 and in Figure 6 - Figure Supplements 1 and 2, the kinetic results obtained using Tapqir have different means and generally larger error bars than those obtained using Spot-Picker. To more comprehensively assess the performance of Tapqir versus Spot-Picker, the authors could have used the association and dissociation rates to calculate the corresponding equilibrium binding constants and then compared these kinetically calculated equilibrium binding constants to the population-calculated equilibrium binding constants that the authors calculate and report in the bottom plot in Panel D of Figure 6 and Figure 6 - Figure Supplements 1 and 2. This would provide some information on the accuracy of the kinetics in that the closer the kinetically and population-calculated equilibrium binding constants are to each other, the more accurately the kinetics have been estimated. Performing this type of analysis for the kinetics obtained using Tapqir and Spot-Picker would have allowed a more comprehensive comparison of the two methods.

      This comment seems to reflect a misunderstanding. Fig. 6 and its figure supplements do not report any dissociation kinetics or binding equilibrium constants. Instead, they report ka (pseudo first-order target-specific association rate constant), kns (pseudo first-order target non-specific association rate constant), and Af (the active faction, i.e., the fraction of target molecules capable of association with binder). ka and Af values from the two methods agree within experimental uncertainty for all four data sets analyzed. kns values differ, but as we point out:

      “We noted some differences between the two methods in the non-specific association rate constants kns. Differences are expected because these parameters are defined differently in the different non-specific binding models used in Tapqir and spot-picker (see Materials and Methods).”

      (There is additional discussion of this point in Materials and Methods). The reviewer is correct that the estimated uncertainties (i.e., error bars in panels D) in ka and Af are generally larger for Tapqir than for spot-picker. This is expected, for the reasons that we explain:

      “In general, previous approaches in essence assume that spot classifications are correct, and thus the uncertainties in the derived molecular properties (e.g., equilibrium constants) are systematically underestimated because the errors in spot classification, which can be large, are not accounted for. By performing a probabilistic spot classification, Tapqir enables reliable inference of molecular properties, such as thermodynamic and kinetic parameters, and allows statistically well-justified estimation of parameter uncertainties. This more inclusive error estimation likely accounts for the generally larger kinetic parameter error bars obtained from Tapqir compared to those from the existing spot-picker analysis method (Figure 6, Figure 6–Figure Supplement 1, Figure 6–Figure Supplement 2, and Figure 6–Figure Supplement 3). ”

      Reviewer #2 (Public Review):

      The work by Ordabayev et al. details a Bayesian inference-based data analysis method for colocalization single molecule spectroscopy (CoSMoS) experiments used to investigate biochemical and biophysical mechanisms. By using this probabilistic framework, their method is able to quantify the colocalization probabilities for individual molecules while accounting for the uncertainty in individual binding events, and accounting for camera and optical noise and even non-specific binding. The software implementation of this method, called Tapqir, uses a Python-based probabilistic programming language (PPL) called pyro to automate and speed-up the optimization of a variational Bayes approximation to the posterior probability distribution. Overall, Tapqir is a powerful new way to analyze CoSMoS data.

      Tapqir works by analyzing small regions (14x14 pixels) of fluorescence microscopy images surrounding previously identified areas of interest (AOI). The collection of images of these AOIs through time are then analyzed collectively using a probabilistic model that accounts for each time frame of each AOI and is able to determine whether up to K "binders" (K=2 here) are present and which of them is specifically bound. This approach of directly modeling the contents of the image data is relatively novel, and few other examples exist. The details of the probabilistic model used incorporate an impressive amount of physical insight (e.g., camera gain) without overparameterization.

      We thank the reviewer for these positive comments.

      The gamma-distributed noise model used in Tapqir captures quite a lot of physics and, given the analyses in Figs. 3-6, clearly works, but might be limited to certain types of cameras used in the fluorescence microscopy (e.g., EMCCDs). For instance, sCMOS cameras have pixel-dependent amplification and noise profiles, rather than a single gain parameter, and are sometimes approximately modeled as normal distributions with both mean and variance having an intensity-dependent and independent contribution that is different for each pixel on the camera. It is unclear how Tapqir performs on different cameras.

      In the revised manuscript, we expanded the discussion of the Image likelihood component of our model to emphasize that 1) all data sets we analyze are experimental or simulated EMCCD images, 2) sCMOS images have the different noise characteristics alluded to by the reviewer, and 3) optimal sCMOS image analysis might require a modified model, possibly including the ability to use per-pixel calibration data as a prior as was done in super-resolution work (now cited) that uses sCMOS data.

      sCMOS cameras have in recent years become very popular for some kinds of single-molecule imaging (e.g., PALM/STORM or live-cell single-particle tracking). However, for the low-background/low-signal in vitro single-molecule TIRF that is our target application for the approach described in the manuscript, EMCCD is still preferable over sCMOS for many, but not all, imaging conditions (see https://andor.oxinst.com/learning/view/article/what-is-the-best-detector-for-single-molecule-studies). Thus, we think there will be plenty of interest in the approach we describe in the manuscript even if (which is not certain) the program functions better with EMCCD than with sCMOS images.

      Going forward to develop and test an sCMOS-targeted version of the model, as we have done for EMCCD, will require revised model and code, but will also necessitate accurately simulating sCMOS CoSMoS images, obtaining experimental sCMOS CoSMoS images reflecting a broad range of realistic experimental conditions, and using the simulated and experimental images to test the new model. These may well be useful things to do in the future but would be a considerable step beyond the scope of the present manuscript.

      The variational Bayes solution used by Tapqir provides many computational benefits, such as numerical tractability using pyro and speed. It is possible that the exact posterior, e.g., as obtained using a Markov chain Monte Carlo method, would be insignificantly different with the amount of data typical for CoSMoS experiments; however, this difference is not explored in the current work.

      We agree. However, since we have not done any analyses using MCMC, there is nothing in particular that we can say about it in the context of CoSMoS data analysis. Implementation of an MCMC approach using our model will be easier in the future because the Pyro developers are currently working to optimize the implementations of MCMC methods in their software.

      The intrinsic use of prior probability distributions in any Bayesian inference algorithm is extremely powerful, and in Tapqir offers the opportunity to "chain together" subsequent analyses by using the marginalized posteriors from one experiment as the basis for the priors for subsequent experiments (e.g., in \sigma^{xy}) for extremely high accuracy inference. While the manuscript discusses setting and leveraging the power of priors, it does not explore the power of such "chaining" and the positive effects upon accuracy.

      Chaining is beneficial in principle. However, in practice it will help significantly only if the uncertainty in the posterior parameter values from the non-chained analysis is larger than the experiment-to-experiment variability in the “true” parameter values. For σxy we obtain very narrow credence intervals without chaining (Table 1). In our judgement, these are unlikely to be made more accurate by using prior information from another experiment where such factors as microscope focus adjustment may be slightly different.

      A significant number of CoSMoS experiments use multiple, distinct color fluorophores to probe the colocalization of different species to the target. The current work focuses only upon analyzing data with a single color-channel. Extensions to multiple independent wavelengths are computationally trivial, given the automated variational inference ability of PPLs such as pyro, and would increase the impact of the work in the field.

      Our current approach can be used to analyze multi-channel data simply by analyzing each channel independently. However, we agree that there would be advantages to joint analysis of multiple wavelength channels (especially if there is crosstalk between channels) and that implementing multi-channel analysis is a logical extension of our study. It is straightforward (though not trivial, in our experience) to implement such multi-wavelength models. However, testing the functioning of candidate models and validating them using simulation and experimental data would require extensive work that in our view goes beyond what is reasonable to include in the present manuscript.

      Tapqir analysis provides time series of the probability of a specific binding event, p(specific), for each target analyzed (c.f., Fig. 5B), and kinetic parameters are extracted from these time series using secondary analyses that are distinct from Tapqir itself.

      The method reported here is well designed, sound, and its utility is well supported by the analyses of simulated and experimental data sets reported here. Tapqir is a cutting-edge image analysis approach, and its proper treatment of the uncertainty inherent to CoSMoS experiments will certainly make an impact upon the analysis of CoSMoS data. However, many of the (necessary) assumptions about the data (e.g., fluorescence microscopy) and desired information (e.g., off-target vs on-target binding) are quite specific to CoSMoS experiments and therefore limit the direct applicability of Tapqir for the analysis of other single-molecule microscopy techniques. With that in mind, the direct Bayesian inference-based analysis of image data, as opposed to integrated time series, as demonstrated here is very powerful, and may encourage and inspire related methods to be developed.

      Our approach is a powerful way to analyze CoSMoS data in part because it is specific to CoSMoS – it is premised on a physics-based model that incorporates known features of CoSMoS experiments. We agree that the general approach could be adapted to other image analysis applications.

      Reviewer #3 (Public Review):

      In this manuscript, the authors seek to improve the reproducibility and eliminate sources of bias in the analysis of single molecule colocalization fluorescence data. These types of data (i.e., CoSMoS data) have been obtained from a number of diverse biological systems and represent unique challenges for data analysis in comparison with smFRET. A key source of bias is what constitutes a binding event and if those events are colocalized or not with a surface-tethered molecule of interest. To solve these issues, the authors propose a Bayesian-based method in which each image is analyzed individually and locally around areas of interest (AOIs) identified from the surface tethered molecules. A strength of the research is that the approach eliminates many sources of bias (i.e., thresholding) in analysis, models realistic image features (noise), can be automated and carried out by novice users "hands-free", and returns a probability score for each event. The performance of the method is superb under a number of conditions and with varying levels of signal-to-noise. The analysis on a GPU is fairly quick-overnight-in comparison with by-hand analysis of the traces which can take days or longer. Tapqir has the potential to be the go-to software package for analysis of single molecule colocalization data.

      The weaknesses of this work involve concerns about the approach and its usefulness to the single-molecule community at large as wells as a lack of information about how users implement and use the Tapqir software. For the first item, there are a number of common scenarios encountered in colocalization analysis that may exclude use of Tapqir including use of CMOS rather than EM-CCD cameras, significant numbers of tethered molecules on the surface that are dark/non-fluorescent, a high density/overlapping of AOIs, and cases where event intensity information is critical (i.e., FRET detection or sequential binding and simultaneous occupancy of multiple fluorescent molecules at the same AOI). In its current form, the use of Tapqir may be limited to only certain scenarios with data acquired by certain types of instruments.

      In the following paragraphs, we address 1) concerns about application to CMOS, 2) dark target molecules, 3) overlapping AOIs, and 4) application to methods (e.g., smFRET) that require extraction of both colocalization and intensity data.

      1) Application to CMOS images.

      In the revised manuscript, we expanded the discussion of the Image likelihood component of our model to emphasize that 1) all data sets we analyze are experimental or simulated EMCCD images, 2) sCMOS images have the different noise characteristics alluded to by the reviewer, and 3) optimal sCMOS image analysis might require a modified model, possibly including the ability to use per-pixel calibration data as a prior as was done in super-resolution work (now cited) that uses sCMOS data.

      sCMOS cameras have in recent years become very popular for some kinds of single-molecule imaging (e.g., PALM/STORM or live-cell single-particle tracking). However, for the low-background/low-signal in vitro single-molecule TIRF that is our target application for the approach described in the manuscript, EMCCD is still preferable over sCMOS for many, but not all, imaging conditions (see https://andor.oxinst.com/learning/view/article/what-is-the-best-detector-for-single-molecule-studies). Thus, we think there will be plenty of interest in the approach we describe in the manuscript even if (which is not certain) the program functions better with EMCCD than with sCMOS images.

      Going forward to develop and test an sCMOS-targeted version of the model, as we have done for EMCCD, will require revised model and code, but will also necessitate accurately simulating sCMOS CoSMoS images, obtaining experimental sCMOS CoSMoS images reflecting a broad range of realistic experimental conditions, and using the simulated and experimental images to test the new model. These may well be useful things to do in the future but would be a considerable step beyond the scope of the present manuscript.

      2) Dark target molecules.

      In their detailed comments, the reviewers suggested a “no target molecules in sample” (NTIS) control instead of the “no fluorescent target molecules in control AOIs” (NFTICA) design that we illustrate in Fig. 1. Both types can be used as a Tapqir control dataset without any modification of the program or model. We have edited the Fig. 1 caption to explain that either type is acceptable. The reviewers are correct that, all else being equal, NTIS may be better if the target molecules are incompletely labeled. However, in practice experimenters usually know the fraction of molecules that are labeled and reduce the fluorescent target molecule surface density to hold the fraction of spots with two or more coincident target molecules (fluorescent or not) below a chosen threshold (typically 1 % or less), negating the possible advantage of NTIS (but at the expense of collecting less data per sample). On the other hand, NFTICA has the practical advantage that it is a control internal to the sample and is thus immune to problems caused by temporal or sample-to-sample variability (e.g., of surface properties).

      3) Overlapping AOIs.

      The method does not require non-overlapping AOIs – we used partially overlapping AOIs in the experimental data analyzed in the manuscript. Even though our analysis used larger AOI sizes (and hence, more overlap) than the spot-picker method, there was good agreement in the results, indicating that overlap does not cause any undue problems.

      In the revised manuscript Results section we added the following discussion of the effect of AOI size:

      “Since target-nonspecific spots are built into the cosmos model, there is no need to choose excessively small AOIs in an attempt to exclude non-specific spots from analysis. We found that reducing AOI size (from 14 x 14 to 6 x 6 pixels) did not appreciably affect analysis accuracy on simulated data (Table 2). In analysis of experimental data, smaller AOI sizes caused occasional changes in calculated p(specific) values reflecting apparent missed detection of a few spots (Figure 3–Figure supplement 4). Out of caution, we therefore used 14 x 14 pixel AOIs routinely, even though the larger AOIs somewhat reduced computation speed (Table 2 and Figure 3–Figure Supplement 4).”

      4) Methods requiring extraction of intensity data.

      The cosmos model we describe in the manuscript does not incorporate phenomena where the spot intensity at a single target changes, such as when there is FRET or multiple binders. As we point out in the final paragraph of the Discussion, more elaborate versions of the cosmos model that incorporate these phenomena could be developed. This would entail implementation, optimization, and validation with simulations and real data of the new model, which is beyond the scope of the present manuscript.

      Second, for adoption by non-expert users information is missing in the main text about practical aspects of using the Tapqir software including a description of inputs/outputs, the GUI (I believe Taqpir runs at the command line but the output is in a GUI), and if Tapqir integrates the kinetic modeling or not.

      This information is given in the online Tapqir documentation. The kinetic analysis (as in Fig. 6) is a simple Python script that is run after Tapqir; the instructions for using it are included in the documentation. Tapqir runs can be initiated using either a CLI or GUI. Output can be viewed in Tensorboard, in a Tapqir GUI, and/or passed to a Jupyter notebook or Python script for further analysis, plotting, etc.

      Given that a competing approach has already been published by the Grunwald lab, it would be useful to compare these methods directly in both their accuracy, usefulness of the outputs, and calculation times.

      The reviewer does not explain why comparing with the Grunwald method would be preferable to the comparison with spot-picker that is included in the manuscript. To be sure there is no misunderstanding, the following are the same for the two methods and therefore are not reasons to prefer one or the other of these methods for the comparison in Fig. 6 (see also Discussion):

      1) Like Tapqir, both spot-picker and Grunwald methods analyze 2-D images, not integrated intensities.

      2) Unlike Tapqir, neither spot-picker nor Grunwald is fully objective; both require subjective selection of classification thresholds by the analyst in order to tune the algorithm performance for analysis of a particular dataset.

      3) Neither spot-picker nor Grunwald is a Bayesian method. “Bayesian” in the Grunwald paper title refers to their excellent work on a separate analytical method (described in the same paper) for evaluating the number of binder molecules colocalized with a target spot; this method is not relevant to a comparison with the model presented in our manuscript.

      4) Unlike Tapqir, neither spot-picker nor Grunwald estimate classification probabilities. Instead, they simply assign binary spot/no-spot classifications that do not convey to downstream analyses the extent of uncertainty in each classification.

      5) Neither spot-picker nor Grunwald has been validated previously using simulated image data. Consequently, the validity of image classification has not been established for either.

      The comparison of Fig. 6 and supplements does not claim to and is not intended to show that Tapqir is better than spot-picker for real experimental data; we cannot make such a claim for these or any other methods because we do not know the true kinetic process and rate constants that generated the experimental data. Instead, our comparison uses experimental data sets with a broad range of characteristics (Table 1) to show that Tapqir yields similar association rate constants to those produced by spot-picker even though the former is objective and automatic while the latter requires subjective tuning by an analyst. Our choice to use spot-picker over Grunwald for this comparison was dictated by the fact that among the co-authors we have such an expert in the use of spot-picker, whereas we lack comparable expertise with Grunwald. We have little doubt that Grunwald would also produce results similar to the other methods in the hands of an expert user who is able to subjectively adjust classification parameters.

      Along these lines, the utility of calculating event probability statistics (Fig. 6A) is not well fleshed-out. This is a key distinguishing feature between Tapqir and methods previously published by Grunwald et al. In the case of Tapqir, the probability outputs are not used to their fullest in the determination of kinetic parameters. Rather a subjective probability threshold is chosen for what events to include. This may introduce bias and degrade the objective Tapqir pipeline used to identify these same events.

      This comment reflects a misunderstanding. No probability threshold is used in the kinetic analyses (Figs. 5 and 6). Instead, we make full use of the p(specific) probability output using the posterior sampling strategy that is illustrated in Fig. 5B and is described in the Results and in Materials and Methods. In the revised manuscript we modified the Results section to further emphasize this point.

      Finally, the manuscript could be improved by clearly distinguishing between the fundamental approach of Bayesian image analysis from the Tapqir software that would be used to carry this out.

      We have revised the manuscript to adopt this recommendation. We now call the mathematical model “the cosmos model” and use “Tapqir” to refer to the software.

      A section devoted to describing the Tapqir interface and the inputs/outputs would be valuable. In the manuscript's current form, the lack of information on the interface along with the potential requirement for a GPU and need for the use of a relatively new programming language (Pyro) may hamper adoption and interest in colocalization methods by general audiences.

      Description of the interface and inputs/outputs is given in the online Tapqir documentation.

      Users do not need to own a GPU; they can instead run the program on a readily available cloud computing service. We have now added to Table 1 data showing that computation time on the Google Colab Pro cloud service is actually faster than that on our local GPU system. Colab Pro is inexpensive, readily accessible, and user friendly. We have added to the user manual a tutorial that shows how to run a sample data set using Tapqir on Colab.

      Users do not need any knowledge of Pyro to use Tapqir; Pyro is merely used internally in the coding of Tapqir.

    1. thinking out loudMaybe we found love right where we are

      The main phrase “thinking out loud” is repeated throughout the entire song. It means to share one's thoughts so that other people can hear them. This can be seen throughout the poem as the speaker repeats the idea of displaying affection for his partner. “Thinking out loud” happens when people need help working though their thoughts. Likewise for the speaker, he has a lot of pent-up feelings and emotions that prevent him from thinking clearly thus this causes him to think out loud. And the one thought that always becomes apparent to him is that “Maybe we found love right where we are” which shows that the speaker thinks that he has found true meaning of love all because of his partner. This causes me to feel delight for the speaker as he is sure that no matter what doubts he may have about their relationship, he is convinced that he has found his true love. This also pushes me to reflect on the theme of love and that true love will find its way and will remain strong no matter the circumstances or the difficult situations they are put through.

    1. Author Response:

      Reviewer #2 (Public Review):

      The authors have developed a new method that allows for two-color STED imaging. They have applied this method to measure spine head size and PSD95 changes following exposure to an enriched environment.

      Strengths

      -The new method is well-described and seems to have considerably less crosstalk than previous attempts at in vivo two-color STED imaging. The analyses and controls of the method are compelling. I think that this method could be valuable for examining how different components of the synapse are changing in response to sensory or environmental changes.

      -The method is appropriate for measuring the size of PSD95 and spine head size in the enriched environment paradigm they use here. They find that in the short-term spine head size and PSD95 size are not always correlated.

      -They also find that there is less variability in the spine head size in animals in an enriched environment.

      Weaknesses<br /> -The authors use an enriched environment plasticity paradigm to showcase the method and measure spine head and PSD95 size and how they change over short periods of time. This particular biological study is not well-motivated and there is not a stated reason for studying the short-term (30-120 minutes) dynamics of PSD95 and spine head size, and their correlations. They also show that the variability in spine head size is decreased with the enriched environment, but do not show what the implications of that change would be from a biological point of view for synaptic dynamics or synaptic function.

      -The authors show that there are differences in the morphology of PSD95 between mice reared in enriched environments and those in control environments. While this quantification is done blindly by three different analysts, it is not done in a quantitative way. Also the authors do not show or explain the biological relevance of differences in the morphologies of PSD95, thus it is not clear what this measure means for synaptic plasticity or function.

      -The authors use a cranial window preparation, which is commonly used in the literature. However, it is not clear how long they wait to image the mice after the cranial window. Previous work from Xu et al. (PMID: 17417634) suggests that there is in an increase in glial activation for a period of up to a month after surgery. The authors have not shown the degree of glial activation that follows after their surgeries and if they have not waited a month, there may be upregulation of microglia, which may alter synaptic stability (also demonstrated in the same paper). The authors have not discussed this point or the implications for their findings.

      We thank the reviewer for his/her valuable input.

      The time-scale we study is similar to what is known from structural changes after LTP and thus we wanted to study the same time scale in vivo. We revised the motivation and explained better the biological relevance of the observed changes. We absolutely agree with the reviewer on his/her concern for chronic imaging. However, we performed acute experiments and imaged directly after implanting the window in the same session. After imaging the mice were sacrificed.

      Reviewer #3 (Public Review):

      Wegner et al. use two-color STED to follow spines and their PSDs in layer1 of mouse visual cortex over 2 hours under anesthesia. They compare mice that were kept in an enriched environment (EE) to control mice housed in standard laboratory cages. Spines in EE mice are larger and show larger fluctuations in size. PSDs in EE mice shrink during anesthesia and tend to change their nanostructure. Very importantly, changes in spine size were not driven by PSD size changes, or vice versa. Technologically, this is a landmark study, as tracking two different labeled structures in individual synapses at the nanoscale can obviously be applied to a large number of synaptic proteins and organelles, two at a time. Single-color superresolution microscopy is much less useful, as 'puncta in space', without cellular context, are difficult to interpret. This pioneering work is the first proof-of-concept of two-color in-vivo STED and of major importance for the community. Although stochastic processes seem to drive much of the synaptic dynamics under anesthesia, the environment shapes the spine size distribution and affects synaptic dynamics in a lasting fashion.

      One major comment:

      l.259: "These results suggest that Ctr housed mice undergo stronger morphological changes." This I find a bit misleading. What about: These results suggest that anesthesia induces stronger morphological changes in Ctr housed mice? Altogether, a discussion of the potential effects of anesthesia on spine/PSD dynamics is missing (see e.g. Yang et al., DOI: 10.1371/journal.pbio.3001146). The fact that there was weak correlation between spine head and PSD fluctuation could have something to do with the state of suppressed activity the system was in during imaging. Under conditions of intense processing of visual information, changes might have been more rapid and more tightly correlated. This could be mentioned as a perspective for the future - to visually stimulate the anesthetized animal.

      We agree with the reviewer that it should be mentioned here that the morphological change was observed under anesthesia. However, the sentence suggested by the reviewer is also a bit misleading since it suggests that the anesthesia has triggered the change. We think that anesthesia might affect the amplitude and dynamic of the observed changes but does not induce the change. Thus we rephrased as follows: These results suggest that Ctr housed mice undergo stronger morphological changes under anesthesia.

      We absolutely agree about the potential influence of the anesthesia on the spine and PSD95 nanoplasticity and added the following comment. Of course, we would like to perform the measurement in the future also in awake mice and after visual stimulation.

      Added to discussion: However, it was shown that MMF anesthesia reduces spiking activity and mildly increases spine turnover in the hippocampus (Yang et al., 2021). Thus, the plasticity of spine heads and PSD95 assemblies might be different in the awake state and under intense processing of visual information.

    1. Author Response:

      Reviewer #2 (Public Review):

      The reported study includes an overall well-conducted and well-presented set of experiments. Ample data are reported and a clear and conclusive picture of the findings is portrayed.

      1. The Introduction falls short of providing the background needed for fully appreciating the current findings and their importance. The authors don't present the current understanding regarding the role of 4-vinylanisole in locusts (mostly their own work). Nor do they present the accepted knowledge of the control of sexual maturation in locusts (mostly several decades-old work). Moreover, the importance of reproductive synchrony in the life history of gregarious locusts, including its tentative roles in maintenance of the homogeneity and integrity of the swarm, in ensuring high density conditions for the next generation, and more, is also not adequately presented.

      We appreciate the reviewer’s helpful comments. According to these comments, we have revised the introduction part by enriching the significance of reproductive synchrony in ecological adaption of gregarious locusts and the research progresses on sexual maturation control in locusts. Details were shown as: “Depending on population density, locusts display striking phenotypic plasticity, with a cryptic solitarious phase and an active gregarious phase (Wang and Kang, 2014). Gregarious locusts, compared to solitarious conspecifics, show much higher synchrony in physiological and behavioral events, such as egg hatching and sexual maturation, as well as synchronous feeding and marching behaviors (Norris, 1954, Uvarov, 1977). Reproductive synchrony in gregarious locusts provides benefits for individuals in several aspects, such as more favorable microenvironment, lower risk of predation, efficiently forging, as well we more encounters with mates, therefore ensures high density conditions for the next generation, and is essential for maintenance of locust swarm (Beekman et al., 2008, Maeno et al., 2021). Some sort of vibratory stimulus, maternal microRNAs, and SNARE protein play important roles in the egg-hatching synchrony of gregarious locusts (Chen et al., 2015b, He et al., 2016, Nishide and Tanaka, 2016). It has been revealed that the presence of mature male adults has effectively accelerating effects on synchrony of sexual maturation of immature male and female conspecifics in two locust species, Schistocerca gregaria and Locusta migratoria (Norris, 1952, Loher, 1961, Guo and Xia, 1964, Norris, 1964). The accelerating effects of several prominent volatiles released by gregarious mature males in male maturation have been exampled in the desert locust. Four volatile pheromones (benzaldehyde, veratrole, phenylacetonitrile, and 4-vinylveratrole) have significantly stimulatory effects on sexual maturation of male adults, with phenylacetonitrile having the most pronounced effect. (Mahamat et al., 1993, Assad et al., 1997). However, how conspecific interaction affects female sexual maturation remains unclear and the pheromones those contribute to maturation synchrony of females have not been determined so far”. In the current study, we identify 4-vinylanisole as a key pheromone promoting sexual maturation synchrony through validating the role of five gregarious male-abundant volatiles one by one, instead of following up our previous work on 4-VA. Thus, we have fully elaborated the multifunction of 4-VA as both aggregation pheromone and maturation accelerating pheromone in the formation and maintenance of locust swarm in the discussion part.

      2. Research on pheromonal signaling in locusts have traditionally focused on compounds with a putative role in density-dependent phase-specific behaviors. Hence, it is common to compare the response of crowd-reared vs. solitary locusts to applied chemicals. The challenge, however, is maintaining the density context, while attempting to conduct controlled similar experiments with locusts of the two phases (i.e. keeping the solitary phase locusts isolated, while the gregarious locusts must always be crowded). This is even more challenging when studying reproductive physiology. By the basic nature of the two phases, there can be a multitude of interacting factors (behavioral and/or physiological) affecting the much-desired reproductive synchronization in gregarious locusts, while such synchronization is not expected at all in solitary ones (it may even be claimed to have no fitness-related advantage).

      3. In general, the authors of the current report have dealt well with these challenges, taking extra care to conduct multiple controls and making an effort to specifically test all the possible factors. However, there are several points that raise some uncertainties. For example:

      o If I am not mistaken, females of both phases were included in the study only if already mated by day A+7 (LL355-357). While this is reasonable for gregarious locusts, it may not be suitable for the solitary locusts, imposing an undesired and unequal selection criterion.

      We thank the reviewer’s comments. We don’t think the criterion (mated at PAE 6-7 days) cause significant bias in either gregarious locusts or solitarious locusts. In fact, the limitation of mating before PAE 7 days is used to rule out the effects on oviposition synchrony caused by difference in mating age among individuals. This criterion is only limited during the analysis of the first oviposition date. On the premise of consistent mating time, oviposition consistency in gregarious female adults may largely present the sexual maturation synchrony among individuals (Figure 1A). For subsequent experiments, we mainly concentrate on regulation of sexual maturation using only virgin females in all experiments.

      o In the test of the effects of conspecifics interactions, 10 gregarious locusts provided stimulation to the tested gregarious female, while only one insect was the stimulating factor for the solitary female.

      Actually, we carried out two independent experiments to test the effects of conspecifics interactions. The population densities were kept in solitarious context for comparison of female sexual maturation synchrony between typical gregarious and solitarious phases (Figure 1D). For locust emissions treatments, ten solitarious locusts were used to ensure the stimulations at the same density level (Figure 1F). Both of two experiments suggested that solitarious male adults had no effects on female sexual maturation.

      o It is not clear how were egg pods attributed to specific gregarious females (maintained in groups of 10)

      Thanks for the reviewer’s comments. To monitor the oviposition activities of each individual of gregarious females in a group, locusts were individually marked, and their first oviposition times were determined by collecting egg pods every 4 hours per day after mating. Females those laid new eggs could be easily distinguished by much thinner abdomen with white foam around ovipositor. We have provided the method details in the revised manuscript.

      Overall, since the focus of this study is actually not on the comparison between the phases, it might have been beneficial to the readers if the focus was on the gregarious locusts only, with maybe a couple of experiments conducted on solitary insects and presented separately.

      We understand the reviewer’s concern. Actually, the aim of this study is to explore the mechanism underlying sexual maturation synchrony by comparing phase- and sex-dependent conspecific interactions in locusts. The reproductive synchrony in gregarious might be not highlighted without comparison with solitarious locusts, including both first oviposition time and sexual maturation, although the mechanism studies were mostly performed in gregarious locusts. Moreover, phase-dependent comparison of volatile contents is helpful for us to screen candidate volatiles responsible for the acceleration of sexual maturation synchrony in females.

      4. Assuming that within a locust group there is overall agreement in the age of males and females, there seem to be a not-fully-explained mismatch between the age of max 4-VA release by males (linearly increasing with age) and the age of max effect in females (critical period at A+3-4)

      We appreciate the reviewer’s query. We have provided additional discussions on the “mismatch” of between age-dependent release of 4-VA by males and the age of max effect in females (PAE 3-4 days). Details were shown as: “. We find that the release of 4-VA by gregarious males continuously increased after adult eclosion, with maximal 4-VA release at PAE 8 days. The age of maximal 4-VA production outwardly seems to be unmatched with the sensitive developmental stage to 4-VA of females (PAE 3-4 days). In insects, it is very common for males to mature earlier than females (Alonzo, 2013). In the locust, male adults also display earlier sexual maturation for several days, compared to females. In given locust population, individuals emerge to adults successively in a couple of days, not in completely synchronous period. Therefore, age-dependent increase in 4-VA release in gregarious male adults presents a persistent stimulus for less-developed young female adults, and thus maximizes synchronous maturation of female locusts, which could reduce male competitions for mate selection”.

      5. Similar to the introduction, the discussion section also does not present comprehensive arguments regarding the importance of reproductive synchronization in female locusts. Points that could have been discussed include: females' oviposition disrupting migration, synchronization affecting sexual selection, accelerating intra-sex competition over mates as well as oviposition sites, and more.

      We appreciate the reviewer’s nice suggestions. We have provided additional discussions on this point following these suggestions. Details were shown as: “Reproduction synchrony involves consistence in maturation, mating, and egg laying, among which sexual maturation synchrony serves as the most foundational step for oviposition uniformity (Hassanali et al., 2005). Extremely high energy cost for female reproduction could restrict migration to pre, post, or inter oviposition period in locusts, thus have crucial effects on collective movement of local populations (Min et al., 2004). Given this, a balance of sexual maturation timing among female members presents an essential subject for maintenance of locust swarms. We here demonstrated that young female adults reared with older gregarious male adults show faster and more synchronous sexual maturation in the migratory locust, supporting the accelerate role of crowding in sexual maturation of females (Guo and Xia, 1964, Norris and Richards, 1964,). Together with the accelerating effects on immature male sexual maturation induced by older gregarious male adults reported previously (Torto et al., 1994, Mahamat et al., 2000), young adults of both sexes lived in gregarious conditions prefers more synchronous maturation than individuals reared in solitary. The consistent maturation in both sexes will greatly reduce intra- and inter-sexes competitions for mate selection and thus ensures reproductive synchronous in whole locust populations. We demonstrated that a single minor component (4-VA) of the volatiles abundantly released by gregarious male adults is sufficient to induce the maturation synchrony of female adults. By comparison, four volatiles (benzaldehyde, veratrole, phenylacetonitrile, and 4-vinylveratrole) showed stimulatory effects on male maturation (Mahamat et al., 2000). Thus, there might exist a sex-dependent action modes of maturation-accelerating pheromones: multi-component pheromones for males and single active component for females, possibly due to different selective pressures between two sexes in response to social interaction. Further exploration will be performed to confirm this hypothesis by determining whether 4-VA has maturation-accelerating effects on male adults in the migratory locust in future”.

      Reviewer #3 (Public Review):

      Strengths: Grouping behavior for marching, sexual maturation, swarming, oviposition and egg hatching in gregarious locusts is complex and it's mediated by a combination of cues-olfactory, tactile, and visual cues to ensure synchronous behavior. The authors show that only olfactory cues released by gregarious adult males mediates maturation synchrony of females. This finding is a confirmatory result of a well-established phenomenon for maturation synchrony in both sexes of adult locusts, although in this study, the authors focused on only females. Further, the authors validated their findings using gene editing techniques to show that maturation synchrony was diffused in Or35-/- mutant adult females but not in wild type females exposed to adult male volatiles and the individual component identified as 4-vinylanisole among five male-abundant volatiles as promoting synchronous sexual maturation in only post adult eclosion females (PAE) 3-4 days old. Use of molecular and single sensillum recordings, followed by physiological experiments focused on the interaction between this specific adult pheromone and juvenile hormone to validate the behavioral results found for females add scientific value to the study.

      Weaknesses: Firstly, synchronous and accelerated sexual maturation of young adults by older pheromone-producing ones, is a primer effect driven by males and this facilitates 'integration and cohesion' of both sexes of adults. In my view, the fact that this study focused on only females but not on both sexes, weakens the contribution of the study towards increased understanding of the biology/ecology of locusts.

      We accepted the reviewer’s comment that synchronous and accelerated sexual maturation of young adults by older pheromone-producing ones occurs in both sexes. In fact, early studies have reported that mature males can accelerate sexual maturation of young males through several candidate compounds (Mahamat et al.,1993, Chemoecology; and Mahamat et al., 2000; International Journal of Tropical Insect Science). However, the effects of conspecific interaction on sexual maturation of females are rarely reported. Moreover, distinct volatiles that can accelerate female sexual maturation have not been characterized before this work. Therefore, we focus on female sexual maturation synchrony in the current study. A comparison of regulatory mechanisms underlying sexual maturation synchrony in males and females has been discussed in the revised manuscript.

      There are also weaknesses in the methods, such as focusing on only the five-abundant male volatiles based on heat maps. Basically, the decision as to which components in adult male volatiles may be contributing to sexual maturation should be made by antennae of different ages of PAE females and males to avoid selecting only abundant compounds based on artificial intelligence (AI). Since most studies in this subject area have demonstrated that there is no direct correlation between volatile abundance and detection at the periphery or central nervous systems of an insect, I believe that the authors will agree with me that often some of the minor volatile components tend to contribute more to the chemical ecology of an insect than the more abundant components. Without testing minor components identified in male volatiles as a blend or individually, as additional controls to increase the robustness of the study, I am not convinced that the authors have fully achieved their aim in identifying a male-produced volatile that promotes sexual maturation in females.

      We agree the reviewer’s comments that the activities of volatiles are not always determined by the absolute contents. In fact, in our work, the selection of candidate effective compounds for female sexual maturation did not rely on the absolute content of these volatiles, but mainly based on comparative analysis of their relative contents between gregarious and solitarious male adults, because only volatiles from gregarious male adults could accelerate sexual maturation of females (Figure 1C-F). In the revision process, given that the volatiles released by gregarious males, rather than gregarious females and solitarious males, have the accelerate effects on female sexual maturation, we further performed more comparative analysis of volatile contents among these three groups (G-males, G-females, and S-males). Compared to volatiles released by G-females, and S-males, only five kinds of volatiles display significantly higher emission in G-males (PAN, guaicol, 4-VA, vertrole, and anisole). The roles of five candidate volatiles in female sexual maturation were individually validated by removing the volatile from the stimulation blend one by one. The results showed that only the omission of 4-VA from the blends lost the accelerating effects on sexual maturation synchrony of gregarious females (Figure 2B). Based on these findings, we inferred that 4-VA played major roles in promoting female sexual maturation synchrony.

      JH experiments- My main concern is the lack of proper controls to fully investigate the interactive effect of the male-produced pheromone promoting sexual maturation and juvenile hormone production. JH titers were not measured in females exposed to the other male-abundant compounds including PAN, guaiacol, veratrole and anisole or blend/individual minor components.

      We understand the reviewer’s query. In fact, the potential role of JH pathway was inferred firstly by the RNA-seq analysis of CC-CA, which showed that the expression levels of JH metabolism-related genes were significantly affected by 4-VA treatment at PAE 3-4 days. The measurement of JH titer after 4-VA treatment was further performed to support the involvement of JH in 4-VA-accelerated sexual maturation in female adults. Since other male-abundant compounds have been excluded due to the omission of any of the four volatiles (Figure 2B), we don’t think it is necessary to detect their effects on JH titers in females including PAN, guaiacol, veratrole, or anisole.

      Another notable weakness is the 'JH Rescue Experiment'. The authors did not inhibit JH synthesis in the corpora allata (allalectomized locusts) in treated locusts before injecting the JH-analog methoprene to accelerate maturation and reproduction in females.

      Thanks for the reviewer’s comments. The JH rescue experiments in Figure 4D-F were performed in Or35 female mutants, which showed lower JH levels and sexual maturation rate. Thus, the JH analog was applied to Or35^-/- females to test whether activation of JH pathway could recover sexual maturation rate and Vg expression. To provide additional evidence, we performed addition rescue experiments in WT females by inhibiting JH synthesis using Precocene (PI) before JH treatment. The results showed that PI treatment significantly inhibited sexual maturation rate and Vg expression in 4-VA-exposed WT females, whereas JH treatment post PI application can obviously recovered the sexual maturation rate and Vg expression (Figure 4G-I).

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review of 'Mother centrioles generate a local pulse of Polo/PLK1 activity to initiate mitotic centrosome assembly' from Wong et al.

      In this paper, Wong et al address the mechanisms of centrosome assembly in flies. They start with the interesting observation that Polo localized at centrosomes oscillates before cells enter mitosis, while Cnn (and with it centrosome maturation) either increases or reaches a plateau. The phenomenon is local, since Polo levels at in the cell are high during mitosis. They propose that the oscillation is driven by a negative feedback loop whereby Polo inhibits its own binding to the centrosome, Ana1 being the most likely relevant receptor. Finally, they discuss the possible meaning of this oscillatory behavior, in the light of the rapidity of the early embryonic cell cycles.

      Major comments

      1- One can imagine different reasons for the fact that the model displays different dynamics for Cnn and Spd-2/Polo. For example, a major difference may be due to the different dissociation rates of the clusters Cstar and Shat. These are governed by different laws and different parameters (kdis vs kidsCstar1/n). If I understand, both parameters and dependency on Cstar^2 are assumptions. Hence, it would be important to pinpoint which component of the model is more directly responsible for the observed behavior. The analysis should not be limited to the dissociation, but should be extended to the whole model. To this aim, one could test the robustness of the model's parameters. The results of this analysis will also be a prediction of the model.

      2- The presence of a positive-feedback loop involving Cnn could offer an alternative and more robust explanation for the slower dynamics of Cnn. Such a loop between Cnn and Spd-2 was proposed by the authors (Conduit, eLife, 2014). I think some comment on this point would be interesting (eg, could the Cnn/Spd-2 loop proposed earlier work in this context? If not, why? If yes, should not this option be explored?).

      3- The prediction presented in Figure 6 is very relevant. I wonder how robust this behavior is to changes in parameters values.

      4- Additional testing of the model would be important to confirm that the negative feedback loop is actually in place, although I understand experiments may be difficult to be performed. Possible examples: constantly high levels of Polo are expected to decrease its centrosomal localization, is that correct and, if so, testable? Is it possible to delay one cycle, and then observe the decay in Cnn values? This latter experiment, for example, could help to distinguish positive feedback vs slow decay rates. If the experiments are not possible, it may be worth anyway to present some predictions worth testing.

      5- The difference between Models 2 and 3 is not clear to me. In mathematical terms, they seem to be basically the same thing: reaction (50)=(33), (51)~(34) given (40) and (52)~(35) again given (40). This is precisely since the model comes with the assumption of a well-stirred system, and thus adding P in solution is not so different from assuming P=Rphat (40). I would have imagined that also Model 2 accounts for the fact that in Spd-2-S16T and Ana1-S347T Polo is recruited slower and for a longer period. Is it not true? If so, is model 3 really needed? More in general, assuming a role for an increase of local concentration of P* is quite a jump, especially given the small distances involved, and the fast diffusion occurring within cells.

      Minor points

      1-Could the authors use the FRAP data to estimate the different kdis? If so, a comparison with the 20-fold difference used in the model would be useful.

      2- p. 6, The authors should state clearly for the worm-uneducated like me whether the fusions were done with the endogenous proteins or not.

      3- p.7 Figure 1B, in the text it is referred to display 'levels of peaks' and in the figure and legend we find 'growth period'. Not clear how the two refer to the same quantity.

      4- Spd2-mCherry is present in both Figure 1C and D, but with very different amplitudes. Why is that the case?

      5- The fact that Polo peaks in mitosis is a key observation. Unfortunately, this is often reported as a personal communication. The authors never tried to produce this piece of data?

      6- p.11 It is explained that NM and OM differ for their initial values because the OM starts with some PCM from the previous cycle. However in Figure 3A, for example, the values of Polo at the end of the cycle are identical in the two. Is not this in contrast with the explenation?

      Still p11, there is reference to Figure 3C,D, but Figure 3D does not exist, I guess it should be 3A,C.

      7- In the formulation of the model (page numbers in Suppl Mat are unfortunately missing..), one citation for the total amount of Polo being large is needed.

      8- I do not understand this point: scaled c output is 1, and the initial condition for c=1 also?

      9- It has been shown in different systems (from yeast -- haase winey reed, NCB, 2001-- to worms -- McCLeland O-Farrell CB 2008) that centrosome duplication can occur independently from the cell cycle oscillator. I was wondering whether the proposed negative feedback loop may play a role in this phenomenon. This is only a curiosity, which does not need to be addressed.

      Significance

      The new observation and hypotheses presented in the paper provide a sizeable advance. The presence of an oscillation in Polo, uncoupled from cellular levels, is new, and the model proposes a testable hypothesis to explain it. Some additional experiments to verify the model would strengthen the manuscript.

      The work is probably more appropriate for experts in the centrosome field. My primary expertise for this review was in mathematical models.

    1. A single mom on disability struggles to provide food and clothing for her teenage daughter. A recent college graduate forgoes therapy. A young professional puts off buying a home and taking the next steps in his life. And a 74-year-old in a senior living community knows her monthly Social Security budget down to the cent.

      The lives of these Utahns are all being shaped by spending around 50% or more of their income on housing each month.

      Their challenges are part of a larger statewide housing crisis, one that is being blamed on both a shortage of homes and sluggish income growth that isn’t keeping pace with soaring real estate prices.

      While past spikes in housing costs have priced people out of home ownership in Utah, the current affordability crisis is more all-encompassing — so it’s also stretching renters to the breaking point, said James Wood of the University of Utah’s Kem C. Gardner Policy Institute.

      “I speak from personal experience,” said Wood, a senior fellow at the Gardner Institute. “I have people in my basement, and I’ve tried to help them find places. It’s really tough.”

      Nearly one in five renters in Utah is severely cost-burdened, meaning they spend at least half their income on housing and often struggle to pay for food, transportation and other bills, according to federal data for 2013 to 2017. And more than 63% of the state’s lowest-income residents fall into this category, this data shows.

      [Read more: Do you spend more than half your income on rent? Here are resources that can help.]

      The disparities are particularly acute for Utahns of color, with a recent Gardner analysis showing that Black and Hispanic renters are more likely to face severe housing cost burdens. The research found that 32% of Black renters in the state spend more than half of their income on housing, making them almost twice as likely to face severe cost burden as white renters.

      For a minimum wage worker in Utah, a rental home would have to cost $377 per month or less in order to be affordable, according to an analysis by the National Low Income Housing Coalition. But the average rent for a one-bedroom Salt Lake City apartment is nearly triple that, at $1,099 a month, according to a June report from the popular rental website, Zumper.

      Wood said some of the state’s lowest-income residents receive public housing assistance, but there’s not enough money to reach everyone who needs help. Without government support, this group of Utahns lives on the brink of homelessness, with any additional hardship potentially pushing them over the edge.

      “Whether it’s domestic violence, or whether it’s the loss of job or a health incident or a traffic accident,” he said. “That’s a disaster.”

      Tara Rollins, executive director of the Utah Housing Coalition, notes that it’s easier to prevent people from losing their housing than it is to get them off the streets. The coalition advocates for increased wages and additional units of deeply affordable housing, to help people in this position before they’re pushed into homelessness.

      Even those who are moderately cost-burdened — meaning they spend more than a third of their income on housing — face challenges, Rollins noted. But she doesn’t think that many Utahns and policy makers are paying enough attention to this swath of Utahns who are barely keeping their heads above water.

      “Unless they feel it, see it, they don’t get it,” she said. “You can’t see somebody’s wallet and how empty it is.”

      (Rick Egan | The Salt Lake Tribune) Jan Aus, a 74-year-old apartment resident in Sandy, says her rent keeps rising.(Rick Egan | The Salt Lake Tribune) Jan Aus, a 74-year-old apartment resident in Sandy, says her rent keeps rising. (Rick Egan/)

      ‘Nobody has my back’

      When Jan Aus, 74, moved into a senior living community in Sandy seven years ago, she was shelling out $720 for rent each month.

      “And then they raised it $10 two or three years after that,” she recounted. “And then, bing, they hit me with $65 a year.”

      Today, Aus is paying $925 to live in her one-bedroom apartment ― a figure that sucks up the bulk of her Social Security check. She knows the amount she has to budget each month down to the cent: $1,251.80.

      Aus said there are people who are “worse off than I am,” noting that she’s receiving government assistance available to low-income Utahns to help pay for electricity and food. She owns her car and considers her health insurance “good,” as long as she makes sure to get generic prescriptions.

      Still, she said the amount of money she’s putting toward rent has become stressful, especially as she waits to see whether the apartment complex where she lives will raise her rent again this fall.

      “It scares me,” she said. “And like I said, they’re going to hit me in September ... and it scares me to think they’re going to raise it again. I just feel like nobody has my back.”

      If not for the federal pandemic stimulus checks, Aus said, she wouldn’t have any kind of savings, money she’s socked away in hopes that she can put a security deposit down on a more affordable apartment soon.

      The problem, she said, is that there’s very little available in her price range of $800 to $900 a month, other than a room in someone else’s house.

      “I don’t see myself going that way,” she said. “That to me is kind of scary. I think we need more affordable housing, I really do. Because it’s not going to get any better. To me, it’s going to get worse.”

      ‘I want to take care of more of my health’

      Jazmin May has cut back her therapy sessions from once a week to once a month. The 24-year-old Salt Lake City resident can’t go in for an eye exam as soon as she’d like. And she’s had to ask her parents to chip in some money when her car needed repairs.

      That’s all because rent claims about 50% of her income, and she has to stretch the rest to pay her other bills.

      “For right now, it works for me,” she said. “I do wish I had more money left over in my paycheck just to be able to afford other things. I want to take care of more of my health.”

      May says many of her other friends from college are also struggling, as they strain their early-career salaries to cover the cost of housing. Some have found it impossible and have gone back to live with their parents, she said.

      She and a friend signed a lease for their two-bedroom apartment near Liberty Park in 2019, but the pandemic that arrived just months later quickly jeopardized the living arrangement. Her friend lost a retail job and had to move back to her family home in Ogden.

      May said she considered looking for another roommate and decided it would be better for her mental health if she lived alone for a while.

      Taking on the entire rental payment meant accepting a job at an area museum rather than continuing to hop between political campaigns — work that she loves but is too unreliable for her right now.

      “I feel like I have sacrificed, in a way, my passion, to be able to afford housing, because I love campaigns and politics and outreach,” she said. “But campaigns are also not a stable job, and often you don’t get benefits. So I decided to just take a break from politics for a little.”

      Even with the more predictable salary, May said home ownership seems like an unattainable goal at this point, especially since she’s worried that housing costs will always be one step ahead of her income growth.

      She peruses rental listings for fun sometimes, but she’s not convinced she could find something cheaper, especially considering the pet fees she’d have to pay for her cat, Lilith. Her only other option, she said, would probably be to move into her parents’ home, as some of her friends have done.

      ‘You just kind of want to be an adult’

      Fresh out of college 10 years ago, Orem resident Eric Wilson set a long-term goal of saving enough for a down payment on a home.

      He’s passed up concerts he wanted to attend, vacations he wanted to take and movies he wanted to see. He’d love to buy the latest tech gadgets and the newest iPhone, but he’s socking away every extra penny in his investment portfolio instead.

      Still, the 31-year-old marketing specialist said he doesn’t feel much closer to buying a house than he did a decade ago — and perhaps even further away, as he watches home values grow at warp speed compared to his slow-and-steady savings. So he can’t help but wish Utah’s economy would hit the tiniest snag.

      “Just a little bit. Not enough to hurt anybody,” he half-jokes. “Just to make house prices go down.”

      Making it especially hard to save is his current rent, which eats up nearly half of his salary.

      Wilson has lived in the two-bedroom unit since shortly after he graduated from Utah Valley University. He had a roommate initially but opted not to get another one after his friend married and moved out.

      “You just kind of want to be an adult and go off and do your own thing and have your own space and not have to worry about marking your milk,” he said. “But at a certain point, if prices keep rising, it’s not really feasible.”

      Wilson had gotten about a fifth of the way to his goal of saving $100,000 when COVID-19 struck and his marketing agency had to cut jobs, including his. His unemployment lasted six months, forcing him to deplete the nest egg he’d spent so long accumulating.

      He keeps browsing online real estate listings, despite knowing how far away he is from becoming a homeowner. The hobby is becoming increasingly demoralizing, though, he said.

      Three years ago, he toured a modest home in a nice neighborhood that was pretty affordable for him, priced at a bit less than $200,000. Recently, he saw the same place had sold again for $415,000.

      Wilson said he likes his apartment and knows he’d have to pay much more if he relocated. But he’s also weary of renting. He’s tired of feeling like he has to put off his life — and delay buying a dog or becoming a foster parent.

      “I’m just kind of not at that point where I can do that space-wise,” he said. “But I would love to do that. And having a home would help make that possible.”

      (Christopher Cherrington | The Salt Lake Tribune)(Christopher Cherrington | The Salt Lake Tribune)

      ‘This is where you belong’

      Anna, 50, is a single mother supported by disability payments from Social Security and living in an income-restricted apartment complex in Holladay — but with around $700 left over each month after she pays her rent, she said, she’s still struggling.

      While her monthly housing costs have ballooned from $850 when she first moved into the two-bedroom apartment in 2016 to $1,077 now, her disability income hasn’t increased at the same rate.

      “It’s a huge stress,” Anna said. She fears she might be pushed out of the unit for speaking out about her rent increase, and The Salt Lake Tribune is not publishing her surname.

      Among her biggest challenges is making sure her 13-year-old daughter can access nutritious foods, a goal she said is easier thanks to assistance from The Church of Jesus Christ of Latter-day Saints.

      “Otherwise, my daughter probably would just be eating rice,” Anna said.

      She’s also struggled at times to supply her daughter with new clothes that fit as she outgrows old ones.

      Anna said she’s trying to save money, but “life keeps happening,” such as a car problem earlier this year that claimed everything she had saved and more. Sometimes, she worries that one misstep could land her and her daughter on the streets.

      Drowning in monthly housing costs, Anna said she’s been searching online for a more affordable apartment in the hopes that she wouldn’t have to stretch so much to make ends meet — but she’s growing increasingly discouraged.

      “I do keep on looking,” she said. “I keep hoping maybe I’ll find somewhere that is rent manageable as well as safe that I can move my daughter and I to so we can be able to provide for ourselves without having to rely on governmental programs completely. And basically feel like we’re being pushed into that hole that, well, if you can’t work for yourself, then this is where you belong. That’s how it feels.”

      Hours after speaking with The Tribune, Anna received a notice taped to her door that her rent was being increased once again: to $1,122 starting July 1.

      Crédito: Taylor Stevens, Bethany Rodgers

      Word count: 2156 Copyright The Salt Lake Tribune Jun 10, 2021

      Related items Latter-day Saints are overrepresented in Utah’s Legislature, holding 9 of every 10 seats Davidson, Lee. The Salt Lake Tribune; Salt Lake City, Utah [Salt Lake City, Utah]. 14 Jan 2021.

      Salt Lake County keeps losing Latter-day Saints, and there are multiple theories as to why Davidson, Lee. The Salt Lake Tribune; Salt Lake City, Utah [Salt Lake City, Utah]. 14 Jan 2021.

      Filmmaker gets to Sundance with a dark comedy inspired by his Latter-day Saint mission Means, Sean P. The Salt Lake Tribune; Salt Lake City, Utah [Salt Lake City, Utah]. 28 Jan 2021.

      Jana Riess: The big question — Will Latter-day Saints follow their prophet and get the COVID vaccine? Riess, Jana. The Salt Lake Tribune; Salt Lake City, Utah [Salt Lake City, Utah]. 20 Jan 2021.

      Sex therapist expelled from Church of Jesus Christ of Latter-day Saints Sarah Pulliam Bailey. The Washington Post; Washington, D.C. [Washington, D.C]. 23 Apr 2021: A.8.

      Show more related items Search with indexing terms Subject Cost control Affordable housing Pandemics Home ownership Location Utah Company/organization Salt Lake Tribune Back to top About ProQuestContact UsTerms and ConditionsPrivacy PolicyCookie Policy Cookie Preferences Accessibility

  3. data-ethics.jonreeve.com data-ethics.jonreeve.com
    1. Do numbers speak for themselves? We believe the answer is ‘no’. Significantly,Anderson’s sweeping dismissal of all other theories and disciplines is a tell: itreveals an arrogant undercurrent in many Big Data debates where other formsof analysis are too easily sidelined. Other methods for ascertaining why peopledo things, write things, or make things are lost in the sheer volume ofnumbers. This is not a space that has been welcoming to older forms of intellectualcraft. As Berry (2011, p. 8) writes, Big Data provides ‘destablising amounts ofknowledge and information that lack the regulating force of philosophy’. Insteadof philosophy – which Kant saw as the rational basis for all institutions – ‘compu-tationality might then be understood as an ontotheology, creating a new ontological“epoch” as a new historical constellation of intelligibility’ (Berry 2011, p. 12)

      Big data can provide a lot of information, and we will finally get the analysis results when we analyze it. But does huge data necessarily give us the right result, I don't think so. Excessively large data sometimes not only brings us a greater amount of calculation and analysis difficulty, but also provides me with some repetitive, complicated and useless information. This information may lead us to deviate from the correct results, or to obtain results that are too dependent on the same environment. If we want to get a general conclusion, we may not only rely on the analysis of these numbers, but also have some prior knowledge or more efficient data processing methods.

    1. Both novels tackle the issue of racism and were removed after parents complained of “profanity”.

      I really think that banning books that address the topics of racism, we are limiting our progress as a society. If we can't learn about the problem, it won't ever be addressed, and true change cannot occur. I was thinking about this in my Spanish class as this week we were reading about bullfighting and the running of the bulls. While it made me feel uncomfortable, I noted the importance of reading about both sides of the issue. If I remained in ignorance, how could I add my voice to changing customs? I may not live in Spain, and may not be able to change much in regards to bullfighting practices, but I can help make a difference in the racism embedded in our society by learning more about the issue through reading books on the subject, especially personal experiences of others.

    1. In view of all this we may say, not, I think, that psychology is all there is of philosophy, as Wundt does, nor even that it is related to the systems as philosophy to theology, nor that it is a philosophy of philosophy, implying a higher potence of self-consciousness, but only that it has a legitimate standpoint from which to regard the history of philosophy,-- a standpoint from which it does not seem itself a system in the sense of Hegel, but the natural history of mind, not to be understood without parallel [p. 131] study of the history of science, religion, and the professional disciplines, especially medicine, nor without extending our view from the tomes of the great speculators to their lives and the facts and needs of the world they saw. It strives to catch the larger human logic within which all systems move, and which even at their best they represent only as the scroll-work of an illuminated missal resembles real plants and trees, in a way which grows more conventionalized the more finished and current it becomes. In a word, it urges the methods of modern historic research, in a sense which even Zeller has but inadequately seen, in the only field of academic study where they are not yet fully recognized.

      Hall states his belief that psychology is much more than philosophy. Psychology is its own science, just like medicine. Despite the contributions of theology and philosophy, psychology is scientific and researchable.

    1. The Self by Soul, not trample down his Self, Since Soul that is Self’s friend may grow Self’s foe. Soul is Self’s friend when Self doth rule o’er Self, But Self turns enemy if Soul’s own self Hates Self as not itself. The sovereign soul Of him who lives self-governed and at peace Is centred in itself, taking alike Pleasure and pain; heat, cold; glory and shame.

      This excerpt from the passage might seem a bit overwhelming as the phrasing is a bit odd to modern language but I think I arrived at a general basis for what Krishna is saying in the beginning of this chapter. The practice of Yoga to many have a direct correlation to harmony and control but Krishna considers another meaning, one that might surprise many. Yoga is learning to let go, it is to detach oneself from their desires and thus coming to the realization that the desire has a direct link to the pain we all face in life. By letting go of those desires you are breaking the tether that binds you to worldly aspects. Ones soul can turn into an enemy if hatred takes over.

      Source: V, Jayaram. Descriptions of Soul or Atman In The Bhagavad Gita. HInduWebsite.com. https://www.hinduwebsite.com/soul.asp

    1. If you consume too little, you could be leaving potential gains on the table and missing out on fat loss, just because you didn’t want to eat an extra chicken breast or protein shake. In this sense, we think of having a high protein intake as a sort of anabolic insurance. It covers you in a similar way as car insurance in that you may not necessarily need it, but it’s a good idea to have it just in case.

      It's clear that they are writing the book in the perspective of maximizing body recomposition in whatever way possible. I believe that this is a little misguided for most people as I personally don't feel that good if I eat a ton of protein and nothing else. It makes my stomach feel a little poopy.

    Annotators

    1. “You don’t want all of your Hispanic kids looking up to a bunch of white teachers and that’s basically what we have so, yeah, it’s an issue,”

      I think it's extremely important that children in the education system have someone from their cultures who they can look towards and relate to. However, it is equally as important to provide children with exposure to different cultures that they may not interact with or know much about. It's so important that school districts start hiring a diverse range of teachers in their schools instead of it being primarily white teachers.

    1. Our brains work not that differently in terms of interconnectedness.Psychologists used to think of the brain as a limited storage spacethat slowly fills up and makes it more difficult to learn late in life. Butwe know today that the more connected information we alreadyhave, the easier it is to learn, because new information can dock tothat information. Yes, our ability to learn isolated facts is indeedlimited and probably decreases with age. But if facts are not kept

      isolated nor learned in an isolated fashion, but hang together in a network of ideas, or “latticework of mental models” (Munger, 1994), it becomes easier to make sense of new information. That makes it easier not only to learn and remember, but also to retrieve the information later in the moment and context it is needed.

      Our natural memories are limited in their capacities, but it becomes easier to remember facts when they've got an association to other things in our minds. The building of mental models makes it easier to acquire and remember new information. The down side is that it may make it harder to dramatically change those mental models and re-associate knowledge to them without additional amounts of work.


      The mental work involved here may be one of the reasons for some cognitive biases and the reason why people are more apt to stay stuck in their mental ruts. An example would be not changing their minds about ideas of racism and inequality, both because it's easier to keep their pre-existing ideas and biases than to do the necessary work to change their minds. Similar things come into play with respect to tribalism and political party identifications as well.

      This could be an interesting area to explore more deeply. Connect with George Lakoff.

    1. Author Response:

      Reviewer #2:

      Weaknesses:

      The competition assay used in this study may not truly reflect the competitiveness of SSIMS males. The mating assay used 20 virgin WT females and 4 males (including both WT and SSIMS), resulting 5:1 sex ratio so the males are not really competing for females. A more competitive ratio (such as WT females: WT males: SSIMA males at 1:1:1) should be designed to address this. Also, the sperm competition assay mixed the mated WT females with SSIMS males for 12 days, allowing plenty of time for the females to remate with these males. Therefore, it's more like a sperm replacement assay rather than competition assay. The authors should either repeat it with a strict time control, or soften their statements for sperm competitiveness.

      We have repeated the experiment at a 1:1:1 ratio as suggested. The new results are reported in the revised Figure 3. It is not clear to us how the timing of the mating experiments differentiates sperm competition versus sperm displacement, but we agree that sperm displacement is a better term to describe what we did. We have repeated the sperm displacement experiment with strict time control based on several published literature precedents and describe the results in the revised manuscript.

      Some necessary information or statistics are not shown or mis-presented. For example, the alternative splicing diagram in Figure 1c likely was taken from the original transformer gene, but here it's the tTA gene so the male intron should be removed since it's not in the construct;

      We have revised text in the manuscript to clarify some of these points. First of all, the male intron is still in the construct, even though we fused the intron to the tTA gene. The alternative splicing between males and females is caused by use of alternative 5' splice sites, which means the intron that is spliced out in males is just a smaller section of the intron that is spliced out in females. Use of an alternative 5' splice site in males means that a protein-coding sequence with multiple stop codons is incorporated to the mature mRNA. We do not support the precise splicing mechanism with empirical data in this paper, but this has been done in a number of previous publications (https://doi.org/10.1016/j.ibmb.2014.06.001; https://doi.org/10.1371/journal.pone.0056303).

      Because the construct works as predicted (100% female lethality in the absence of tetracycline), and we did not change the genetic design in a way that would impact the mechanism of female lethality, we think there is little reason to believe that the splicing is occurring in a different way.

      the panels of Figure 2 were not consistent to the legend and confusing; the statistics for different tetracycline concentration tests were not shown in Figure 2 or text to answer their hypothesis "(to) optimize rearing of SSIMS stock, …..we titrated Tet in the food";

      We re-wrote the text describing Figure 2 to make the results more clear. We clarified in the legend that the symbol signifies p<0.0001 (we were not trying to imply that all experiments had this level of significance, only the ones marked with the symbol in the figure). We removed the word ‘optimize’ from the main text. Optimization was not the true aim of the experiment, and as the review points out, we did not statistically determine an optimal concentration of Tet. Our main goal was to show a dose- dependent response in the number of females surviving on Tet-free medium, which the data supports and which does not require statistical support.

      Figure 3b shows 5-8 day old females were used but in the text it's 5-6 day, and it didn't mention the duration of the first crossing and time lag until the second crossing which are critical in such experiments; the conclusion and statistics for Figure 3c among tests with mixed males should also be mentioned.

      We have corrected the figure (now Figure 3c) to indicate that the females were 5-6 days old. The first mating was for 5-6 days and there was no lag time between being co-housed with different males. We have performed multiple new experiments in revision that have been added to Figure 3. We have revised the discussion of these new experiments (and how they relate to the originally performed experiments) in the revised submission.

      The discussion is largely towards the merits of SSIMS but missing some key points that might decide how it can be translated into applications or transferred to other species. First, the actual basis for tTA lethality that employed in this study is still unknown which is subject to suppression by a pre-existing inherent variation in the targeted field population. The very phenomenon may also be true for any gene-overexpression-based lethality including EGI lines generated here. Second, the complete penetrance observed from the relatively small sample size here can be hardly used to predict field or mass-rearing condition. Previous study showed that mutations in such lethal construct could occur at a one out of 10,000 frequency, and typical SIT program release millions of sterile insects every week. Third, while the authors claimed SSIMS is "one of the most complex engineered systems in insects", they also proposed that "the genetic design is likely to be portable to other species" without mention any potential obstacles along the way. Therefore, efforts should be made to give full picture of SSIMS including rain and sunshine.

      We have added discussion of possible failure modes for this genetic biocontrol approach to the discussion section. We have also added text to discuss how the complexity of SSIMS is a potential obstacle to its translation to non-model organisms.

    1. Author Response:

      Reviewer #1 (Public Review):

      This manuscript is a follow-up of an earlier manuscript using the LRET technology, but extends the study by identifying a new "open" state and using experimental distance constraints to provide molecular models of the different states. All in all, the manuscript is well written, the experiments are described in sufficient details and experiments are done to high quality with the appropriate controls. The data corroborate the partially open state as published early, but extend the study to a second, open state. It is very good to see that the observed states are not only present in the catalytic head but the authors also use the full-length protein and find similar states. However, in the present manuscript, I find the conceptual advance with respect to the mechanism of MR somewhat limited. The authors curiously do not include any DNA in their structural studies, so the observed states are only relevant for the free MR complex, but not the complex "in action" bound to DNA where quite different conformations might occur. As one consequence, the structurally proposed states do not directly correlate with the functional nuclease states that are necessarily bound to DNA. Perhaps as a consequence, in the author's model, Rad50 is merely a gate-keeper for Mre11, but this is not the case as recent structural work shows that Rad50 forms a joint DNA binding surface with Mre11. Likewise, biochemical studies are done with physiologically unclear/less relevant 3' exonuclease activity only, but not with the physiological important 5' endonuclease activity. In my opinion, it is important for a publication in a journal with the scope of eLife and addressed to a broad audience to provide structural analysis in the presence of DNA and validate the structures using the endonuclease activity.

      We thank the reviewer for these comments.

      Specific recommendations:

      1) Instead of using the physiological unclear exo activity, I suggest to use the more relevant endonuclease activity to validate the mutants.

      We now include plate- and gel-based endonuclease activity assays, using a variety of DNA substrates, for all of the validation mutants. We have expanded Fig. 3 and included a new Supplemental Fig. S4 to show this data. We have expanded the Results section of the modified manuscript to present and discuss these findings.

      2) Since the authors mutated one side of newly identified/proposed salt-bridges, I also suggest to test whether a charge reversal on both sides of the salt bridge rescues the phenoptype. I find this important because MR has quite many conformations, and mutating a single residue might not unambiguously validate the proposed conformation, a rescue by a charge reversed salt bridge is much stronger.

      We thank the Reviewer for this suggested experiment, and we tried to do it. Although we were successful in generating each of the charge reversal mutations in full-length Rad50, all of the mutants unfortunately had issues with either expression or purification. For example, the 6x His-tag for several of the new Rad50 mutants was not accessible to the TEV protease for cleavage indicating that the mutated proteins were mis-folded (the His-tag of the WT full-length Rad50 is readily cleaved off by TEV). As such, we did not feel confident using these proteins in subsequent MR activity assays.

      3) Since all LRET experiments are done without DNA, the authors do not capture relevant DNA processing states and comparison of structural (w/o DNA) and biochemical data (w/ DNA) is not really justified, in my opinion. Also, they might miss critical conformations. Is there a technical reason for not including DNA in the LRET studies?

      We have collected LRET data on ATP-bound MRNBD in the presence of a hairpin DNA or a ssDNA as substrates. We still observe three states in the presence of both DNAs; however, the open conformation appears to be slightly more compact (i.e., closer distance between Rad50NBD protomers) in the presence of ssDNA. As described above, we have added to the Results section of the modified manuscript and included a new figure (Fig. 4) describing these data.

      4) If the authors want to claim processive movement coupled to partially open/open state interchanges, they should provide experimental evidence. Where would the energy come from for such a movement, this is not clear from the model?

      On the surface, ATP hydrolysis by Rad50 would seem to be the perfect source of energy for the conformational changes that drive the sequential and/or processive nuclease functions of the MR complex. However, the D313K mutant is not as good at ATP hydrolysis as the wild type enzyme (Fig. 3E), and the data in Fig. 3 and Supplemental Fig. S4 clearly demonstrate that D313K is by far the best nuclease. If the free energy for the movement does not come from ATP hydrolysis, where else could it come? Richardson and co-workers measured a release of -5.3 kcal mol-1 (-22.17 kJ mol-1) of free energy for the hydrolysis of a DNA phosphodiester bond (Dickson, K.S. et al. 2000 J. Biol. Chem. 275:15828–15831). Thus, the free energy released from the Mre11 nuclease activity could be the driving force for the conformational changes we propose. We have made this point in the Discussion of the revised manuscript.

      5) The SAXS data for the "open" state do not validate the model, in my opinion. Experimental data and model are not inconsistent, but the curve looks to me as if the open state is perhaps much more flexible (i.e. an ensemble) or extended? Please comment.

      We agree with the Reviewer on this point. We have updated Fig. 5A (original Fig. 4) to include the two-state fits to the experimental SAXS data. Although the multi-state fit to the apo MR SAXS data is better than any of the single model fits (2 = 1.05 vs. 1.26, respectively), the 2 is still larger than the multi-state fits to the ATP-bound MR SAXS data. Thus, an additional unobserved conformation (perhaps the so-called “extended”) might be present in solution for apo MRNBD. We have added a sentence to the revised manuscript with this point.

      To explore the possibility that the previously described “extended” structure might be contributing to the SAXS data, we built a model of the extended conformation of Pf MRNBD based on the Tm MRNBD structure (PDB: 3QG5) and used Rosetta to connect the coiled-coils and add the linker to the Mre11 HLH. When this model was used in the FoXS calculations for the apo SAXS data, the 2 was 4.77 (versus 2 of 1.26 for the “open” model). The MultiFoXS two-state fit gave 90% open + 10% closed (2 of 1.04), whereas the three-state fit gave 65% open + 20% extended + 15% part open (2 of 0.84). Thus, there is some improvement when using the extended model, but since that model is not measurable in our LRET experiments and we are unsure of its validity as we have modeled it for Pf MR, we have chosen to omit it from the analysis.

      6) Distance errors for the full complex are much smaller than those for the catalytic module only (Fig. 1d). Does that mean that the full complex is more rigid, please comment?

      From looking at the data presented in Fig. 1D, it is logical to suggest that the full-length complex may be more rigid or better defined by the LRET data. However, we note that there are nearly as many distance errors which are similar between MRNBD and MR as there are MR errors less than MRNBD. And although many are not identical, most are of a similar magnitude. Because of this, we do not think the variations in LRET errors are systematic (i.e., related to a more rigid full-length complex).

    1. assumptions are evident in the thinking that assumes that implied consent will reach the parts that generic consent does not reach; but proponents of specific consent procedures also assume that consent travels beyond the propositions to which it is explicitly and literally given in signing a consent form. Yet strictly speaking, consent (like other propositional atti tudes) is not transitive. I may consent to A, and A may entail B, but if I am blind to the entailment I need not consent to B. Consent is said to be opaque because it does not shadow logical equivalence or other logical implications: when I consent to a proposition its logical implications need not be transparent to me. Transitivity fails for propositional attitudes. Consent and other propositional attitudes also do not shadow most causal connections. I may consent to C, and it may be well known that C causes D, but if I am ignorant of the causal link I need not consent to D. Again, transitivity fails for propositional attitudes. When I consent to a proposition describing an intended transaction, neither its logical implications nor the causal links between transactions falling under it and subse quent events need be transparent to me; a fortiori I may not consent to them. Events at Alder Hey illustrate the opacity of consent. Some parents consented to removal of tissue, but objected that they had not consented to the removal of organs?although, of course, organs are composed of tissues. They did not agree that their consent to removal of tissue implied their consent to the removal of organs. As a point of logic the parents were right. These simple facts create a dilemma. The real limits of patient and donor comprehension suggest that it is unreason able to seek consent for every detail of a proposed treatment, or of a proposed research protocol, or of a proposed use of tissues. Yet the logic of propositional attitudes suggests that we cannot simply assume that implied consent will spread from one proposition to another, or from one proposition to the expected consequences of that which it covers, making any further consent unnecessary. There are many ways of skinning this cat. I conclude by sketching one approach that I think plausible.

      propositional attitude SHOULD ONLY BE LEFT PARAGRAPH. Also, there's a bug in the code here.

    Annotators

    1. Then, after speaking with the person about the ways in which they don’t hold privilege, I ask in what ways they do. (I’ll use myself as an example: while I am a woman, dyslexic, and have a chronic medical condition, I ALSO have the privilege of being upper-middle class, living in the United States, holding a graduate degree, having financial resources, and being white.)

      This makes you think about privilege in another light. Although someone may have privilege they could also have some disadvantages. Which isn't a bad thing but we have to ask ourselves these things.

    1. A couple of weeks ago I did a mock interview with an executive I’m coaching. One of the interview questions I posed was this: “You have employees, external customers, internal customers (stakeholders or peers), and your boss. Put them in order of priority in terms of serving their needs.Regardless of the type of company or organization, here’s the answer and why:1. External customersThe purpose of any company or business is to win and keep customers. Without customers, there’s no business, no shareholder value, and no jobs. Since there are a finite number of customers, in practical terms, they are irreplaceable. They’re always the highest priority.2. Your bossYour boss is more important to the success of the company than you and your peers. You may not like hearing that, but in just about every case, it’s true. You may think you’re more competent than your boss and you might even be right. But that doesn’t change the fact that his function incorporates yours and is higher up on the org chart so, by definition, his needs top yours or your peers.3. Internal customers (stakeholders or peers)Each and every one of you has peers, stakeholders, internal customers whose functions are intertwined with yours and whose needs are important. Marketing folks, for example, should count product groups and sales as their stakeholders. You should make it a priority to meet with them periodically and ask them how you’re doing. Next to paying customers and your boss, they’re needs matter most.   4. EmployeesSo, here we are. The dirty little secret no executive, business leader, or manager ever wants to admit. Nevertheless, it’s true. Employees are at the bottom of the totem pole in terms of how important their needs are to their management. That’s all there is to it.Don’t get me wrong. Creating a culture where employees are empowered, challenged, and supported, where they can really make a difference, should be huge for any company. But all things being equal, as priorities go, employees come in dead last on that list. Sobering as that sounds, it’s entirely as it should be.

      This really gets to the heart of the matter, it is justifiable that Employees are the lowest of the priorities for an executive.

      Based on the article priorities are: 1. External Customers - They bring money into the company 2. Your boss - They being money into you 3. Internal Customers (stakeholders or peers) - They make things work for external customers and your boss 4. Employees - They are paid to work for the company and are the lowest of the four priorities if you have to stack rank

    1. SciScore for 10.1101/2022.01.30.22270029: (What is this?)

      Please note, not all rigor criteria are appropriate for all manuscripts.

      Table 1: Rigor

      NIH rigor criteria are not applicable to paper type.

      Table 2: Resources

      <table><tr><th style="min-width:100px;text-align:center; padding-top:4px;" colspan="2">Software and Algorithms</th></tr><tr><td style="min-width:100px;text=align:center">Sentences</td><td style="min-width:100px;text-align:center">Resources</td></tr><tr><td style="min-width:100px;vertical-align:top;border-bottom:1px solid lightgray">Materials: The video-conferencing sessions took place using ZOOM software (Zoom Video Communications, Inc., Version 4.4; https://zoom.us/).</td><td style="min-width:100px;border-bottom:1px solid lightgray"><div style="margin-bottom:8px"><div>ZOOM</div><div>suggested: (ZOOM, RRID:SCR_002175)</div></div></td></tr><tr><td style="min-width:100px;vertical-align:top;border-bottom:1px solid lightgray">Statistical analyses were conducted using GraphPad Prism v.</td><td style="min-width:100px;border-bottom:1px solid lightgray"><div style="margin-bottom:8px"><div>GraphPad Prism</div><div>suggested: (GraphPad Prism, RRID:SCR_002798)</div></div></td></tr><tr><td style="min-width:100px;vertical-align:top;border-bottom:1px solid lightgray">8 and SPSS v.</td><td style="min-width:100px;border-bottom:1px solid lightgray"><div style="margin-bottom:8px"><div>SPSS</div><div>suggested: (SPSS, RRID:SCR_002865)</div></div></td></tr></table>

      Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).

      Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:


      Limitations and future clinical considerations: The main limitation of this study was also what made it possible - the unexpected circumstance of supporting families during home-confinement orders. It was not possible to randomise groups or complete formal measures of child and parent outcome, and our satisfaction questionnaires needed to be created very quickly. This was a period of great uncertainty, and relative solidarity, where parents seemed open to try new modes of communication and were motivated to keep a sense of continuity for their child’s program, all of which may have impacted their level of participation and satisfaction. It should also be noted that the parents in our study had all met or worked with their therapists in-person prior to being asked to meet online, which may have increased their willingness to take part in the new approach.1,30 This study did not examine whether a family without previous experience in early intervention would have the same level of engagement with the remote delivery of services. In considering ideal sessions frequency, the current study did not compare parent experience between varying durations of sessions (30 vs 60 vs 90 minutes), which would be important to take into account in future research. The COVID-19 pandemic disrupted our intervention program for children on the autism spectrum, forcing us to re-think our service provision model and giving us the chance to experience very frequent interactions with the families. It r...


      Results from TrialIdentifier: No clinical trial numbers were referenced.


      Results from Barzooka: We found bar graphs of continuous data. We recommend replacing bar graphs with more informative graphics, as many different datasets can lead to the same bar graph. The actual data may suggest different conclusions from the summary statistics. For more information, please see Weissgerber et al (2015).


      Results from JetFighter: We did not find any issues relating to colormaps.

      Results from rtransparent:


      • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
      • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
      • No protocol registration statement was detected.

      Results from scite Reference Check: We found no unreliable references.


      <footer>

      About SciScore

      SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.

      </footer>
    1. analysis is to make us better producers of persuasion, the immediate purpose here is to see the tools available for analysis, as this brief consideration of two opposing audiences illustrates. LOGOS The third kind of proof, according to Aristotle, that rhetors may use to appeal to their audiences is logos. You may readily associate the term with “logic,” and while there is some reason for doing so, we shouldn't think too narrowly about logic when conceiving logos as a mo

      The most interesting/helpful idea here is logos= logic. It's not about emotion or facts, it's using logic to inform the audience/ reader. The four parts help me understand logos the best. They are the claim, data to support it, a warrant connecting the data to the claim, and backing. This is a solid structure to follow when using logos.

    1. Author Response:

      Evaluation Summary:

      This manuscript addresses a phenomenon of great interest to researchers in cell metabolism and cancer biology: namely, why do cancer cells often secrete high levels of lactate, despite the presence of abundant oxygen to power nutrient oxidation (Warburg effect). The authors propose that lactate export and subsequent extracellular acidification provides a selective advantage and the concomitant rise in intracellular pH is sufficient to drive flux through glycolysis, thereby sustaining the Warburg effect. This is an intriguing hypothesis that ties together many published observations, but it would require further support both from the technical and conceptual side.

      The concept proposed in the evaluation summary is not quite correct, in this paper we have tried to show that it is not lactate export that drives extracellular acidification, but that cells which can increase proton export, via over-expression or increased activity of proton exporting proteins, can subsequently drive upregulation of glycolysis and increased lactate production, likely due to increased intracellular pH (pHi) and the ability of glycolytic enzymes to have enhanced activity under slightly higher pHi. As mentioned in the summary, although some of these observations are known, the novelty lies in that they have not been directly proven by inducing acid export prior to a glycolytic phenotype, we believe showing the casual nature of proton export on glycolysis is the novelty of this research.

      Reviewer #1 (Public Review):

      In this manuscript, the authors tackle an interesting puzzle: why do cancer cells secrete most of their glucose as lactate? The authors propose that acid export is sufficient to enhance glycolysis and provide a selective advantage to cancer cells growing in vivo. To this end, the authors show that clonal lines expressing CA-IX or PMA1, each of which will facilitate proton export, have elevated capacity to acidify extracellular medium and can drive increased migration/invasion and tumor growth or metastases. In support of the model that extracellular pH is a key driver of metastases, the effect of CA-IX expression on lung metastases is reversed following bicarbonate treatment. While many of the individual conclusions of the manuscript are not novel-for example, pH has been reported to control glycolysis and it is established that CA-IX expression modulates migration/metastases-providing a comprehensive assessment of the ability of proton export to drive the Warburg effect, and assessing the significance of metabolic rewiring driven by acid export on tumor growth, would represent an important resource for researchers intrigued by the pervasive observation that cancer cells secrete lactate despite potential bioenergetic disadvantages of discarding biomass.

      The strength of the manuscript lies therefore in tying these disparate observations together in a coherent model and testing the role of acid export per se on glycolytic flux. The technical weaknesses of the paper prevent such coherent model building. A major concern is that all cell lines appear to be generated by transient transfection followed by clonal selection, giving rise to cells with notable variability and inconsistent phenotypes. More traditional approaches to manipulate enzyme expression will provide more robust model systems to test the proposed model. Similarly, direct measures of glycolytic flux are required to make conclusions about the role of acid export in promoting glycolysis. Another strength is the use of heterologous enzyme systems to alter proton export in cancer cells, but alternative explanations for these results are not fully considered. Ultimately, to what extent acid export per se, as opposed to altered metabolism driven by acid export, drives enhanced tumor metastases is not addressed.

      We agree wholly with Reviewer 1 that although individual components of this manuscript have previously been implicated in cancer research, the novelty lies in directly assessing metabolic changes, specifically the Warburg effect, as a result of proton production to determine causality rather than correlation as previous studies have shown. The reviewer makes a valid point about our use of clones and this is something we considered at length. When originally designing these experiments, we had many conversations within our lab and with collaborators and colleagues, and the overall consensus was that bulk populations are more likely to have heterogeneous expression levels unrelated to transfection, which could result in the phenotype generated being noisy and not indicative of what occurs when proton exporters are over-expressed. We chose to isolate single clones, maintaining these in antibiotic selection media, to ensure stable over-expression. After confirming over-expression, cells were grown without antibiotics and screened regularly for maintenance of protein expression. This was also one of the reasons why we utilized over-expression of two different proton exporters in multiple different cell lines to be confident that proton export was changing the metabolic phenotype and not just due to changes in an individual isolated clonal line. We utilized bulk population for the MOCK clones, to ensure we weren’t selecting for a clone which had inherently different metabolic traits from the parental population. As described in the paper, while some of the behaviors of the different clones are indeed divergent, the impact of expression on increased glucose uptake and lactate production is wholly consistent and highly correlated to expression of PMA1 or CA-IX. Although we utilized metabolic profiling, we do not claim to infer flux from these data. Flux was assessed via lactate production and glucose consumption rates. The metabolomic analyses showed that glycolytic intermediates upstream of Pyruvate Kinase (PK) were uniformly increased in transfectants. This was an unequivocal finding and, given the increased flux, we have concluded that transfection results in activating glycolytic enzymes upstream of PK. The pleiotropic nature of these effects have led us to propose that intracellular pH was increasing and likely enhancing glycolytic enzyme activity throughout the glycolytic pathway. We measured the intracellular pH and showed that it was generally elevated in the transfectants. Finally, the reviewer was concerned that we did not address the mechanism by which pH increases metastases. Such a study would be beyond the scope of this paper and, indeed, was the subject of a two-volume special issue of Cancer Mets. Rev. in 2019 (PMC6625888). Hence, in this paper, we were not trying to address the mechanism by which pH affects metastasis, but simply wanted to show additional biological relevance.

      Reviewer #2 (Public Review):

      The work by Xu et al proposes that the Warburg effect - the increase of glycolytic metabolism usually displayed by tumor cells, is driven by increased proton excretion rather than by oncogenic dysregulation of glycolytic enzyme levels. As a proof-of-principle, they engineered tumor cells to increase proton excretion. They observed an increase in glycolytic rate, pH, and malignancy in their engineered cells.

      1. My main issue with this work is that I do not agree with the authors when they say that the "canonical view" is that oncolytic mutations are thought to drive the Warburg effect. What I understand the consensus to be, is that it is fast proliferating cells - rather than malignant cells - the ones who display this form of metabolism. The rationale is that glycolytic metabolism allows keeping biomass by redirecting lactate and from the phosphate pentose pathway. In contrast, the end product of oxidative phosphorylation is CO2 that cannot be further utilized in cell metabolism.

      They claim that they Vander Heiden et al., 2009 shows that "fermentation under aerobic conditions is energetically unfavorable and does not confer any clear evolutionary benefits." This is incorrect. While that review states that the Warburg effect has little effect on the ATP/ADP ratio, they do show this form of metabolism has significant benefits for fast proliferating cells. In fact, the whole review is about how the Warburg effect is a necessary metabolic adaptation for fast proliferation rather than a unique feature of malignant cells.

      1. Their main observation is not surprising. From a biochemical standpoint, protons are final product of glycolysis (from the production of lactic acid). Thus, by mass action, any mechanism to remove protons from the cell will result in accelerated glycolytic rate. Similarly, reducing intracellular pH will necessarily slow down LDHA's activity, which in turn will slow down pyruvate kinase and so on.

      2. Their experiments are conducted on transformed cells - that by definition - have oncogenic driver mutations. They should test the effect of proton exporter using primary non-transformed cells (fresh MEFs, immune cells, etc). I would expect that they will still see the increase in glycolysis in this case. And yet, I would still have my concerns I expressed in my previous point.

      3. The fact that they can accelerate the Warburg effect by increasing proton export does not mean is the mechanism used by tumor cells in patients or "the driver" of this effect. As I mentioned, their observation is expected by mass action but tumors that do not overexpress proton transporter may still drive their Warburg effect via oncogenic mutations. The biochemical need here is to increase the sources of biomass and redox potential and evolution will select for more glycolytic phenotypes.

      Comment 1: We disagree with the reviewer that the energetic demands of a faster proliferating cell drive glycolysis in order to produce the biomass needed for generation of new cells. Available evidence does not support this hypothesis. As the reviewer mentioned, there is a correlation between proliferation and aerobic glycolysis (i.e. if cells are stimulated to grow they will consume more glucose), and the same can be said for motility (i.e. more motile cells have higher aerobic glycolysis). This is also true for normal cells and tissues that exhibit high levels of aerobic glycolysis. We agree that glycolytic ATP generation is more rapid than oxidative phosphorylation and that this may confer some selective advantage for transporters, as we described in PMC4060846. Nonetheless, it is clear that under conditions of similar proliferation and motility, more aggressive cancer cells ferment glucose at much higher rates. However, correlations between neither proliferation nor motility are the “Warburg Effect” which is a higher rate of aerobic glycolysis in cancers, regardless of proliferation or migration. As we described in PMID 18523064, the prevailing view in the cancer literature is that the Warburg effect is driven by oncogenes (ras, myc), transcription factors (HIF) and tumor suppressors (p53/TIGAR) through increased expression of glycolytic enzymes. This assumes that expression levels drive flux which has not been proved empirically. In biochemical pathways, it is canon that flux is regulated by demand (e.g. ATP) or through some post-transcriptional control (e.g. pH). In Vander Heiden’s paper the steady state levels are reported of ATP/ADP ratios, not flux. The first paragraph of the intro has been modified to accommodate this concern.

      Comment 2: The fact that our results are not surprising is our major argument: i.e. that glycolytic flux can be enhanced by increasing the rate of H+ export. We saw an increase in intracellular pH (pHi), but our metabolomics data do not support a direct effect on LDHA or PK. Instead, we show that clones with higher pHi have a crossover point at PK, due to reduced inhibition of upstream enzymes which is not there in clones at lower pHi.

      Comment 3: We agree it would be interesting to study the effects of proton export on immune cells especially given the increase in immunotherapy use in cancer treatment. We did utilize HEK 293 cells shown in supplemental figure S6, to show this was not a cancer cell line specific phenomenon, and we saw increased aerobic glycolysis with over-expression of CA-IX.

      Comment 4: We agree that oncogenic mutations can alter glycolytic rate, but we observed that increased expression and activity of proton exporters is sufficient to drive a Warburg effect. Although the reviewer indicates that glycolysis is responsible for generating the biomass needed for these faster proliferating cells, we have shown that proton exporter driven aerobic glycolysis does not increase proliferation rates. The literature, see Vander Heiden’s paper below, suggests that amino acids, mainly glutamine, can support the majority of biomass needs of a proliferating cell. Hence, reliance on aerobic glycolysis remains energetically inefficient and inefficient in that most of the carbons are removed, and thus will not be selected by evolution.

      Hosios, A.M., Hecht, V.C., Danai, L.V., Johnson, M.O., Rathmell, J.C., Steinhauser, M.L., Manalis, S.R., & Vander Heiden, M.G. (2016). Amino Acids Rather than Glucose Account for the Majority of Cell Mass in Proliferating Mammalian Cells. Developmental cell, 36 5, 540-9 .

      Reviewer #3 (Public Review):

      The authors claim that "proton export drives the Warburg effect". For this, they expressed proton-exporting proteins in cells and measured the intracellular proton concentration and the Warburg effect. Based on their data, however, I do not see elevated Warburg effect in these cells and thus conclude that the claim is not supported.

      The authors concluded that the CA-IX or PMA1 expressing cells had increased Warburg effect. I don't think this conclusion can be made based on the data presented. For the MCF-7 cells, the glucose consumption is ~18 pmol/cell/24hr (Fig. 5E) and lactate production is ~0.6 pmol/cell/24hr (Fig. 5F), indicating that 0.6/18/2 = 1.7% of the glucose is excreted as lactate. This low percentage remains true for the PMA1 expressing cells. For example, for the PMA1-C5 cells, the percentage of glucose going to lactate is about 1.8/38/2 = 2.4% (Fig. 5EF). While indeed there was an increase of both the glucose and lactate fluxes in the PMA1 expressing cells, the vast majority of the glucose flux ends up elsewhere likely the TCA cycle. This is a very different phenotype from cancer cells that have Warburg effect. The same calculation can be done for the CA-IX cells but the data on the glucose and lactate concentration there are inconsistent and expressed in confusing units (which I will elaborate in the next paragraph). Nevertheless, as there were at most a few folds of increase in lactate production flux in the M1 and M6 cells, the glucose flux going to lactate production is likely also a few percent of the total glucose uptake flux. Again, these cells do not really have Warburg effect.

      The glucose and lactate concentration data are key to the study. The data however appear to lack consistency. The lactate concentration data in Fig. 1F shows a ~5-fold increase in the M1 and M6 cells than the controls but the same data in S. Fig. 2 shows a mere ~50% increase. The meaning of the units on these figures is not clear. While "1 ng/ug protein" means 1ng of lactate is produced by 1 ug protein of cells over a 24 hour period, I do not understand what "ng/ul/ug protein" means (Fig. 1F). Also, "g/L/cell" must be a typo (S. Fig. 2). Furthermore, regarding the important glucose consumption flux, it is not clear why the authors did not directly measure it as they did for the PMA1 cells (Fig. 5E). Instead, they showed two indirect measurements which are not consistent with each other (Fig. 1E and S. Fig. 1).

      The reviewer pointed out discrepancies in our data and, upon reviewing, we have identified a dilution error leading to miscalculation of glucose consumption in Fig 5E. We have also repeated these experiments which agree with our re-calculation. Originally, it appeared from the data we presented that there was very little lactate flux, we have re-calculated the glucose excreted as lactate (average % using data from Fig. 5E and 5F) and present in a table below. We do believe we observed a Warburg effect in our proton exporting cells consistently. The reviewer points out that we utilized multiple methods to measure glycolysis in these cells leading to inconsistency, however we felt using multiple methods/instruments/kits to assess glucose consumption, lactate production, and glucose induced proton production rates was a strength of our findings as we consistently saw increased glycolysis in our proton exporting clones, irrespective of proton exporter, cell line, or method utilized. We are also not suggesting that glucose is solely being metabolized through glycolysis and do agree that it can metabolized through other metabolic pathways too such as TCA cycle, as the reviewer stated. The units used for these graphs are described in the methods and figure legends, in some assays such as Fig. 1F lactate was graphed as the ng of lactate per ul of cell culture media and then normalized per ug protein, which was determined by calculating the protein concentration of cells per well of the assay. Supplementary figure 2 has been re plotted per 10K cells to match other normalization values in the paper. Fig 1E and Fig. S1 are two different time points, M6 acidified media faster than M1 and this is likely why at 1 hour we are not yet seeing substantial increase in glucose uptake of M1.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      The manuscript reports the identification of a novel protein complex involved in denervation-induced desmin degradation. The first protein to be identified was the ATPAse Atad1. A clever isolation strategy was based on the fact that the ATPAse p97/VCP is involved in the extraction of ubiquitinated myofibrillar proteins but is not required for the removal of ubiquitinated desmin filaments. The authors reasoned that a related ATPAse might be specifically required for desmin filaments. Atad1 was identified by treating desmin filaments with a nonhydolyzable ATP analog and looking for ATPases that are associated with desmin filaments by proteomics. Knockdown of Atad1 causes a loss of desmin degradation and led to a loss of denervation-induced muscle atrophy. It seems that Atad1 binds desmin in a phsphorlation-dependent manner, although the binding maybe mediated by a protein that hasn't yet been identified. The authors went on and identified two additional proteins which together with Atad1 form a protein complex involved in recruiting calpain for desmin degradation.<br> Overall, this study is very convincing providing novel important insight. I have only some minor comments

      Minor comments

      1. I wondered whether Aatd1 is expressed at higher-than-normal levels in muscle and heart. I looked that expression pattern up and it seems that they are especially abundant in muscle and heart and expressed at lesser levels in smooth muscle and overall have a restricted expression.

      We now analyzed ATAD1 levels in various tissues by Western Blotting and the new data is presented as Fig. S2. ATAD1 is present in many tissues and thus may have many cellular roles.

      Maybe you have some data on their expression in muscle tissue. Did you perform some staining of muscle tissue at baseline and after denervation with regard to the protein localization by immunostaining?

      The new associations between ATAD1 and its protein partners reported herein were further validated by an immunofluorescence staining of longitudinal sections from 7 d denervated muscles and super-resolution Structured illumination microscopy (SIM). The new data presented as Fig. 3E demonstrate colocalization of ATAD1 with calpain-1, PLAA and UBXN4. To confirm that these proteins in fact colocalize, we measured the average colocalization of ATAD1 with calpain-1, PLAA and UBXN4 using the spots detection and colocalization analysis of the Imaris software (Fig. 3E). Only spots that were within a distance threshold of less than 100 nm were considered colocalized (Fig. 3E, graph).

      1. The string data presented in Figure 3C needs some further explanation with regard to the colors used for the different proteins. While the authors explained the meaning of the proteins labeled in red, there is no explanation for the other colors.

      These were arbitrary colors assigned to protein nodes by the STRING database. The current color code we use is only meant to group the UPS enzymes based on function (e.g. E2s, E3s, DUBs etc). This information has now been added to figure legend.

      1. Molecular weights in Fig. 2E, 3D needs to be 'repaired' and additional MW information is required in case of the ubiquitin blot shown in 3D.

      All molecular weight values and protein ladders have been added.

      1. Fiber size distributions shown in Fig. 1D and 4F. Have the differences been statistically tested?

      We thank the reviewer for raising this important point because we just established an approach to quantitate these effects statistically using Vargha-Delaney A-statistics test and Brunner-Manzel test. Our new paper on this topic entitled “A semi-automated measurement of muscle fiber size using the Imaris software” by Gilda et al. was recently published in the AJP Cell Physiol. As requested by the reviewer, we now also apply A-statistics test and Brunner-Manzel test on the fiber size measurements presented in our current manuscript (Figs. 1C, 4F and Table I), which show a significant difference in size distributions of fibers expressing shAtad1 vs. adjacent non-transfected fibers. As indicated in our paper (Gilda et al, 2021), the A-statistics is a direct measure of the fiber size effect, and it shows significant beneficial effects on cell size by shAtad1 (Table I). Such effects can be simply missed by traditional measurements of median, average, and Student’s t-test.

      1. For my taste the referral to the individual data (Fig. numbers) in the discussion section is too detailed and becomes a second results section. This should be substituted by a summary paragraph before the implications are discussed.

      We agree and revised the discussion section accordingly.

      1. The summary slide is very good. However, could you please add information, which protein of the three in the Atad1 complex is depicted by each symbol?

      The model slide has been revised to include all enzymes studied in this paper, and a legend to improve clarity.

      Reviewer #1 (Significance)

      Novel insight into the proteins involved in desmin filament degradation. Since this is an important subject both in muscle and heart and plays an important role in muscle and heart disease, it is of significant clinical importance. Currently it has only been implicated in denervation-induced skeletal muscle atrophy, but it is likely that desmin filament metabolisms is also similarly regulated in the heart.

      I am a researcher mainly focusing on the cardiac biology with some expertise also on muscle, however no specific knowledge about desmin filament biology. <br> Referee Cross-commenting Overall, I think all three reviewers agree that this is a significant and important paper. I think that the comments made by the reviewers are fair and probably add to the quality of the manuscript.

      We are pleased that the reviewers found our paper novel and important.

      Thus, both myself and reviewer 2 agree that it would be useful to visualize Atad1 and partners localization in muscle fibers by immunofluorescence. These data would provide independent support to the model the authors are proposing, which currently is only based on biochemical analysis.

      These data have been added as new Fig. 3E.

      I also support the proposed use of proximity ligation to provide further evidence of the presence of the Atad1, Ubxn4 and PLAA in a complex. However, this experiment depends on the quality of the available antibodies and I would consider this not absolutely required.

      Because our antibodies are not suitable for proximity ligation assay (PLA), we used a super-resolution SIM microscope, immunofluorescence, and the spots detection and colocalization analysis of the Imaris software to confirm colocalization of ATAD1 and its partners (new Fig. 3E). Similar to PLA (where signal is generated only if two antibodies used for staining are 100nm apart), only spots that were within a distance threshold of less than 100 nm were considered colocalized (Fig. 3E, graph). In addition, we present immunoprecipitation (Fig. 3D) and use three independent mass spectrometry-based proteomic approaches to validate these new associations.

      I also agree that some further information on the proteomics data (as suggested by reviewer 3) is required with regard to the method of filtering for UPS components was performed.

      We agree and thank the reviewer for this comment. More information on the proteomics data have been added to the text and legend to Table II.

      The proposed request for further information on the electroporation approach is a valid comment and if the authors have this information, it would be good to provide. However, I do not recommend further experiments as overall the data are very consistent and the findings are very significant and represent a major advance in our understanding of desmin degradation.

      With regard to the electroporation approach, i) representative images have been added to Figs. 1C and 4F, ii) a statement was added to Methods under “in vivo electroporation” about the percent of transfection routinely used in our experiments (60-70%), iii) we determine transfection efficiency by dividing the number of transfected fibers (also express GFP) by the total number of fibers in the same muscle cross section (using the Imaris software). This approach was fully validated in our recent papers by Goldbraikh et al EMBO Rep, 2020 (see supplementary material) and Gilda et al AJP-Cell Physiol, 2021.

      Reviewer #2 (Evidence, reproducibility and clarity)

      In their manuscript the authors show the involvement of the AAA ATPase Atad1 in Desmin degradation. They identify PLAA and Ubxn4 as partners of Atad1 that participate to its function in desmin degradation.<br> A general comment is that some conclusions are overstated. The authors mention several times that Atad1 depolymerises desmin filaments. The data show that Atad1 participates to the degradation of Desmin and to its solubilization. "Depolymerisation" should be kept for the model presented in figure 8 but not used in the result section.

      We respectfully disagree with the reviewer that our conclusions are overstated. Early studies from Fred Goldberg’s group showed that filaments are not accessible to the catalytic core of the proteasome (Solomon and Goldberg, JBC, 1996), and therefore must depolymerize before degradation. Accordingly, more recent studies by us and others identified distinct enzymes and cellular steps promoting disassembly and subsequent degradation of ubiquitinated desmin filaments (Cohen, JCB, 2012; Aweida, JCB, 2018) and myofibrils (Cohen, JCB, 2009; Volodin, PNAS, 2017). In the current manuscript, we employed a similar approach as we used before to analyze disassembly of filamentous myofibrils by p97/VCP (Volodin, PNAS, 2017), and demonstrate a critical role for ATAD1-PLAA-UBXN4 complex in promoting desmin IF disassembly and loss (figures 2C, 3D, 3G, 4C, 4G, 4H). We show that ATAD1 binds intact insoluble desmin filaments in an early phase during atrophy (3 d after denervation)(figures 2B, 2F) and later accumulates in the cytosol bound to soluble ubiquitinated desmin (figure 3D). Moreover, downregulation of ATAD1, PLAA or UBXN4 in mouse muscles prevents the solubilization of desmin IF (figures 2C, 3G, 4C) because in these muscles desmin accumulates as ubiquitinated insoluble filaments. Based on these data we conclude that Atad1 complex promotes desmin IF disassembly and subsequent loss.

      Major comments:<br> 1) It would be useful to visualize Atad1 and partners localization in muscle fibers in immunofluorescence. Do they colocalize with desmin filaments, with calpain?

      As requested, the new associations between ATAD1 and its protein partners reported herein were further validated by an immunofluorescence staining of longitudinal sections from 7 d denervated muscles and super-resolution Structured illumination microscopy (SIM). The new data presented as Fig. 3E demonstrate colocalization of ATAD1 with calpain-1, PLAA and UBXN4. To confirm that these proteins in fact colocalize, we measured the average colocalization of ATAD1 with calpain-1, PLAA and UBXN4 using the spots detection and colocalization analysis of the Imaris software (Fig. 3E). Only spots that were within a distance threshold of less than 100 nm were considered colocalized (Fig. 3E, graph). Given the antibodies in hand and new ones that we purchased, as well as the species of the antibodies, we were able to perform and optimize the staining only for the presented combinations of antibodies.

      2) In the same line, interactors were obtained from large crosslinked complexes. It would make the model more convincing if direct interactions with Atad1 were shown, for example using Proximity Ligation Assays.

      Because our antibodies are not suitable for proximity ligation assay (PLA), we used a super-resolution SIM microscope, immunofluorescence, and the spots detection and colocalization analysis of the Imaris software to confirm colocalization of ATAD1 and its partners (new Fig. 3E). Similar to PLA (where signal is generated only if two antibodies used for staining are 100nm apart), only spots that were within a distance threshold of less than 100 nm were considered colocalized (Fig. 3E, graph). In addition, we present immunoprecipitation (Fig. 3D) and use three independent mass spectrometry-based proteomic approaches to validate these new associations._

      3) Evaluation of atrophy is made on cross-sections of muscles electroporated with shRNAs. Histology pictures should be shown.

      As requested, representative images of transfected muscles were added to figures 1C and 4F.

      4) What is the percentage of electroporated fibers? To evaluate the effect of shRNAs it is important to have this information. For example, if the efficiency is 50% it means that the reduction in expression of the target in electroporated fibers is twice the value reported for the whole muscle. Alternatively, immunofluorescence could be provided to see the decrease in targeted proteins in electroporated fibers.

      We determine transfection efficiency by dividing the number of transfected fibers (also express GFP) by the total number of fibers in the same muscle cross section (using the Imaris software). This approach is fully validated in our recent papers by Goldbraikh et al EMBO Rep, 2020 (see supplementary material) and Gilda et al AJP-Cell Physiol, 2021. For our biochemical studies we always analyze muscles that are at least 60-70% transfected (added to methods).

      As shown in figures 1B, 3F, and 4A-B, our shRNAs reduced gene expression by at least 40-50%, which in a whole muscle was sufficient to promote the beneficial effects on muscle (as mentioned in the text, shCAPN1 was validated in Aweida, JCB, 2018). Similar reduction in gene expression is commonly seen by the in vivo electroporation of a fully developed mouse muscles because transfection efficiency is never 100%. This means that the beneficial effects on muscle by the electroporated shRNA must underestimate the actual protective effects by gene downregulation. To prove that these beneficial effects on muscle result from specific gene downregulation, we compare and analyze in parallel in each experiment muscles transfected with shLacz scrambled control.

      5) The same is true for all the experiments quantifying the effect of shRNAs in western blot. Since quantifications are probably made on whole muscles (ie a mix between electroporated and non electroporated fibers) and since the percentage of electroporated fibers is not given it is not possible to estimate the efficiency of the shRNAs in electroporated fibers.

      As mentioned above and now also in the text, for our biochemical studies we always analyze muscles that are ~60-70% transfected. This methodology is very well established in our lab, and a reduction of 40-50% in gene expression by our shRNAs is sufficient to promote the beneficial effects on mouse muscle (see our papers in JCB, PNAS, Nat Comm, EMBO rep).

      6) Figure 2C: by decreasing solubilization of desmin, one would expect a decrease in the levels of soluble desmin. Conversely the authors observe an increase in both insoluble and soluble desmin. Of course, this can be explained by reduced desmin degradation once solubilized but this should be demonstrated at least by showing that UPS inhibitors induces an increase in soluble ubiquitinated Desmin.

      The reviewer raises an important point that we now discuss in the text. Soluble pool of desmin, its homolog vimentin as well as other Type III IF proteins is small as these proteins mostly exist in the cell assembled within filaments (see papers by RA Quinlan and WW Franke). This soluble pool of desmin may function either as precursors to the mature filament or as components released during filament turnover. Because we block desmin IF disassembly by downregulating Atad1, the soluble desmin that accumulates in the cytosol likely represents new precursors whose degradation also requires ATAD1. Therefore, we conclude that ATAD1 promotes degradation of desmin filaments and of soluble proteins (see also figures 2E and 4D).

      As requested by the reviewer, we inhibited proteasome activity by injecting mice with Bortezommib and measured the effects on desmin content in denervated muscle (new figure 2D). Our new data clearly demonstrate accumulation of ubiquitinated desmin in atrophying muscles where proteasome activity was inhibited, indicating that in denervated muscles desmin is degraded by the proteasome.

      7) Figure 2E: the levels of Atad1 in the insoluble fraction seem to be the same in the shLacZ and GSK3DN conditions, whereas the phosphor Ser is different. In other words, there should be more Atad1 in the insoluble fraction with shLacZ than with GSAK3DN since the phosphorylation level with shLacZ is significantly higher.

      To quantitate the changes in ATAD1 association with desmin and avoid confusion by the reader, we performed densitometric measurements of ATAD1 and desmin, and depict in a graph the ratio of ATAD1 to desmin in the insoluble fraction. The new data was added to figure 2F and clearly demonstrate that ATAD1 association with desmin is significantly reduced in muscles expressing GSK3b-DN. These findings further support our conclusions that Atad1 association with desmin IF requires desmin phosphorylation.

      8) Figure 4E: the authors state that phosphorylation decreases because of increased degradation (lanes 6-8). However, Calpain also increases degradation and phosphorylation is increased (lanes 2-4), so increasing degradation does not systematically cause a decrease in phosphorylation. Similarly, lane 5 Atad1 induces less degradation than Calpain, however, it causes a decrease in phosphorylation. Explain.

      Here we use a cleavage assay, which was established and validated in our recent JCB paper (Aweida 2018). Desmin filaments were isolated from mouse muscle and the obtained preparation was divided between 9 tubes (hence there is no situation for “increase in phosphorylation” as indicated by the reviewer). Recombinant calpain-1 was then added to the tubes and cleavage of phosphorylated desmin was analyzed over time. Because the substrate for calpain-1 is phosphorylated desmin, we measured the content of both desmin and its phosphorylated form in the tube throughout the duration of the experiment. Only when cleavage of phosphorylated desmin by calpain-1 was accelerated (i.e., in the presence of Atad1), a rapid reduction in the amount of phosphorylated desmin could be detected (compare lanes 6-8 with 5) concomitantly with accumulation of small desmin fragments in short incubation times (compare lanes 6-7 with 2-3).

      With respect to the reviewer’s comment that “Atad1 induces less degradation than Calpain” in lane 5, please note that Atad1 is not a protease and cleavage of desmin occurs in this experiment only in the presence of calpain-1. However, if there is a slight reduction in phosphorylated desmin, it should account for the ability of ATAD1 appears to slowly disassemble desmin IF (as our in vivo data by shATAD1 show).

      9) The AAA ATPase VCP shares partners with Atad1 and is involved in muscle atrophy. It would greatly add to the manuscript if the authors inhibited VCP to compare its effect to Atad1

      As stated in the text, we previously demonstrated that p97/VCP is not required for desmin filament loss: “the AAA-ATPase, p97/VCP disassembles ubiquitinated filamentous myofibrils and promotes their loss in muscles atrophying due to denervation or fasting (Piccirillo and Goldberg, 2012; Volodin et al., 2017). However, desmin IF are lost by a mechanism not requiring p97/VCP (Volodin et al., 2017). We show here that their degradation requires a distinct AAA-ATPase, ATAD1”. Therefore, our current studies were undertaken to specifically identify the AAA-ATPase that is involved in desmin filament disassembly and loss. Accordingly, p97/VCP was not detected by our mass spectrometry-based proteomic analyses presented here (stated in the discussion).

      We did identify PLAA and UBXN4 as ATAD1 partners and show they are required for desmin loss, and therefore state in the text that “PLAA and UBXN4 are also known cofactors for p97/VCP (Liang et al., 2006; Papadopoulos et al., 2017), a AAA-ATPase that was not in our datasets, indicating that p97/VCP adaptors can bind and function with other AAA-ATPases”.

      Minor comments:

      1) The soluble fraction contains a large number of ubiquitinated proteins. Please explain how it can be stated that an increase in total soluble polyubiquitinated proteins corresponds to an increase in ubiquitinated desmin.

      We do not state in the text that “an increase in total soluble polyubiquitinated proteins corresponds to an increase in ubiquitinated desmin”. We state that “stabilization of desmin filaments attenuates overall proteolysis. The reduced structural integrity of desmin filaments on denervation is likely the key step in the destabilization of insoluble proteins (e.g. myofibrils) during atrophy, leading to the enhanced solubilization and degradation in the cytosol”. We invite the reviewer to read our papers about this topic by Cohen 2012, Volodin 2017, and Aweida 2018. Using a dominant negative of desmin polymerization we show that disassembly of desmin filaments is sufficient to trigger myofibril destruction and consequently overall proteolysis (because myofibrils comprise ~70% of muscle proteins).

      2) Page 11: the authors conclude that denervation enhance the interactions with Atad1. Figure 3D indeed show an increase for Ubxn4, but it is not clear for the other proteins.

      Figure 3D shows that in 7 d denervated muscles there is an increase in associations between ATAD1 and ubiquitinated desmin, UBXN4, PLAA and calpain-1.

      3) Figure 4 F: show muscle sections

      A representative image was added as requested.

      4) Page 21 in vivo transfection: it is stated "see details under immunofluorescence" but there is no immunofluorescence section in materials and methods.

      Thank you. An immunofluorescence section has been added to Methods.

      5) The authors show that Atad1 inhibition in innervated muscle is sufficient to induce muscle hypertrophy (Figure 4E). They conclude that the hypertrophic effect of Atad1 is due to the inhibition of Desmin degradation. However, this hypertrophic effect could be independent of the action of Atad1 on Desmin.

      We believe the reviewer refers to figure 4F-H, where we show that downregulation of ATAD1 prevents the basal turnover of desmin and of soluble proteins and causes muscle fiber growth. Based on this data we speculate in the text that “ATAD1 attenuated normal muscle growth most likely by promoting the loss of desmin filaments and of soluble proteins … Thus, ATAD1 seems to function in normal postnatal muscle to limit fiber growth, and suppression of its activity alone can induce muscle hypertrophy”. We agree with the reviewer that in addition to these beneficial effects on desmin and soluble proteins, ATAD1 downregulation may contribute to muscle growth by additional mechanisms.

      Reviewer #2 (Significance)

      This is new information in the field since calpain cannot hydrolyze desmin insoluble filaments and that the mechanisms that give calpain access to desmin are not known.

      The authors already made important contribution in the study of muscle atrophy and especially in desmin degradation. This work constitutes a new advance in their attempts to understand the molecular mechanisms leading to desmin degradation and muscle atrophy.

      Audience: desmin is the main intermediate filament in skeletal muscle. This work will therefore interest scientists working on skeletal muscle.

      Expertise of the reviewer: molecular and cellular biology of skeletal muscles, muscle atrophy.

      Referee Cross-commenting

      I fully agree with reviewer 1.

      Reviewer #3 (Evidence, reproducibility and clarity)

      Summary:

      The manuscript by Aweida & Cohen introduces a novel complex formed by the AAA-ATPase ATAD1 and its interacting partners PLAA and UBXN4 as initiator of calpain-1-mediated disassembly of ubiquitylated desmin intermediate filaments (IF) during muscle atrophy. The authors use a denervation model of murine tibialis anterior muscles as their main resource for experimentation. They apply a kinase trap-assay and co-immunoprecipitation method followed by mass spectrometry as starting point for identifying novel interactors of desmin IF (Aweida et al. 2018 in JCB). They continue to analyze their candidates using immunoblotting, co-immunoprecipitation, shRNA-mediated intramuscular knock-down, gel filtration, mass spectrometry, and enzyme assays. In their experiments, thee authors show an accumulation of ATAD1 in the insoluble desmin filament fraction of denervated muscle fibers together with an increase in ubiquitylation of desmin filaments. Both proteomics experiments of size-exclusion chromatography of denervated muscles and ATAD1 immunoprecipitation identify several components of the ubiquitin-proteasome system as novel interactors of ATAD1, that are also bound to insoluble desmin filaments after muscle denervation. Following additional co-immunoprecipitation and knock-down experiments, the authors confirm PLAA and UBXN4 as novel cofactors of Atad1 that help in extracting previously GSK3-β-phosphorylated and TRIM32-ubiquitylated (Aweida et al. 2018 in JCB, Volodin et al. 2017 in PNAS) desmin from desmin IF. The authors further show that ATAD1 encourages calpain-1-dependent proteolysis of soluble desmin after extraction from the desmin IF in an in vitro enzymatic proteolysis assay.

      Major comments:

      The authors present clear and convincing arguments from in vivo and in vitro experiments for their proposed model of ATAD1/PLAA/UBXN4-aided calpain-1-mediated proteolysis of desmin IF.

      In my opinion, no additional experimental evidence is essential to underlining their statement.

      Data and methods are presented clearly and understandably to allow for the reproduction and the reapplication of the utilized methods for verifying the presented data and analyzing complementary aspects in a similar fashion.

      A concern is with the presentation of mass spectrometry results, particularly regarding Table I: I am wondering whether the presented UPS components were the only proteins found in the proteomics screens or whether any filtering has taken place to only show UPS components in this manuscript. If so, please note the total number of proteins identified in the respective proteomics analyses and explain how filtering for UPS components was performed. This comment goes in line with the first minor comment on Figure 1A, see below.

      We thank the reviewer for this valuable comment, as it helps clarify a point that was not completely lucid in the previous version of this manuscript. Because our paper focuses on protein degradation, we extracted from our datasets only UPS components that were identified with ³ 2 unique peptides using DAVID annotation tool-derived categories (Table II). Column 1 includes UPS components that were co-purified with ATAD1 by size exclusion chromatography (SEC)(20 out of 427 total proteins), and column 2 includes UPS components that were co-purified with ATAD1 by immunoprecipitation from muscle homogenates (17 out of 592 total proteins). These two proteomics experiments were oriented specifically towards identifying ATAD1-binding partners. To further validate our observations, we compared these lists of ATAD1-interacting components to our previous kinase-trap assay dataset (Aweida 2018, 1552 total proteins were identified) and included in column 3 only the proteins that overlapped with the other two proteomics approaches. The kinase trap assay was used to identify proteins that utilize ATP for their function and act on desmin, and as mentioned in the text, ATAD1 was one of the most abundant proteins in the sample. Of note is UBXN4, which was identified only by our kinase trap assay, and accumulated on desmin after denervation. These interactions between active enzymes in vivo must be transient and very dynamic, hence using three approaches did not identify the exact same subset of putative adaptors (see “discussion”). These points are now further elaborated in the text and the legend for Table II.

      The relatively small number of individuals analyzed per experiment is owing to the limiting nature of mouse research and therefore acceptable. The observed alignment of the individual results is commendable, underlines the experimentator's ability, and strengthens the reached conclusion of the study.

      We thank the reviewer for this comment.

      Minor comments:

      Figure 1A seems redundant, since the experimental approaches are described in the text and the Venn diagram does not integrate the identification of ATAD1 into the setting of the conducted screens, e.g. by showing how many additional proteins were identified in these two screens before the authors tended to their candidate ATAD1.

      We agree and therefore removed Fig. 1A.

      Word order mistake on page 6 in the sentence: "To test whether Atad1 is important for atrophy, we suppressed...".

      Corrected.

      Figure 1D: statistical analysis of the significance of the fiber area difference missing

      Statistics for these effects is now included in new Table I. We quantitated the effects statistically using Vargha-Delaney A-statistics test and Brunner-Manzel test, based on our recent methodology paper in AJP Cell Physiol: “A semi-automated measurement of muscle fiber size using the Imaris software” (Gilda et al. 2021). The new statistical analyses show a significant difference in size distributions of fibers expressing shAtad1 vs. adjacent non-transfected fibers (Table I). As indicated in our paper (Gilda et al, 2021), the A-statistics is a direct measure of the fiber size effect.

      Figure 2A: desmin ubiquitylation is not shown in these samples by immunoblotting against (poly-)ubiquitin, but only by the identification of high molecular weight bands of the desmin blot. I wonder about the specificity of the desmin antibody in this case and about the manner of sample extraction/isolation for this particular blot, as a detailed description is missing. There seems not to have been any muscle tissue fractionation beforehand, if I am correct?

      This blot presents an analysis of desmin filaments isolated from mouse muscle, which are purified with associated proteins. In order to specifically detect ubiquitinated desmin filaments we must use a specific desmin antibody (antibody and methodology are validated in Cohen 2012 JCB, Volodin 2017 PNAS, and Aweida 2018 JCB). An antibody against ubiquitin conjugates will detect all proteins that are ubiquitinated in this insoluble preparation (e.g. proteins that bind desmin).

      Orthography mistake "demin" instead of "desmin" on page 7 in sentence "It is noteworthy that the amount of ubiquitinated demin..."

      Corrected.

      Figure 3C: image quality is insufficient; some protein names are rather difficult to decipher

      The figure has been revised to improve clarity.

      Word missing on page 13 in sentence "In addition, by 10 minutes of incubation, phosphorylated ... due to their processive cleaveage by calpain-1 ..."

      We thank the reviewer for reading the paper thoroughly and carefully. The missing word was added to the text.

      Figure 4F: statistical analysis of the significance of the fiber area difference missing

      Statistics is now included in new Table I. Asmentioned above, we quantitated the effects statistically using Vargha-Delaney A-statistics test and Brunner-Manzel test, based on our recent methodology paper in AJP Cell Physiol: “A semi-automated measurement of muscle fiber size using the Imaris software” (Gilda et al. 2021).

      "ug" on page 21 in "Briefly, 20ug of plasmid DNA..." is probably supposed to be "µg". In general, please be aware of correct unit declaration and space character usage before units.

      Corrected.

      Please be aware of the usage of correct nucleic acid and protein nomenclature and style: When referring to gene or transcript levels mark the candidate characters in italic, e.g. Atad1 mRNA levels, shUbxn4, versus ATAD1 protein etc. In addition, please be aware to use the correct gene and protein name styles: e.g. shCapn1 instead of shCAPN1 for shRNA targeting the murine Capn1 transcript in Figure 4 in comparison to CAPN1 the protein. Helpful link: https://www.biosciencewriters.com/Guidelines-for-Formatting-Gene-and-Protein-Names.aspx

      We thank the reviewer for this comment. The nomenclature for all genes and proteins have been revised accordingly.

      Reviewer #3 (Significance)

      Aweida & Cohen present evidence for the involvement of the AAA-ATPase ATAD1 not only in regulation of synaptic plasticity and the extraction of mislocalized proteins from the mitochondrial membrane, but also in a collaboration with the ubiquitin-binding proteins PLAA and UBXN4 in the disassembly of desmin intermediate filaments in muscle atrophy. The authors compare this newly discovered function of the AAA-ATPase ATAD1 to the numerous functions of the AAA+ ATPase p97/VCP and raise compelling arguments for their statement. Previously, E3 ligases that ubiquitylate sarcomere components in muscle atrophy have been identified, such as MuRF1 (Bodine et al. 2001 in Science) and TRIM32 (reviewed in Bawa et al. 2021 in Biomolecules), but the complete extraction mechanism of monomers from the diverse macromolecular fibrillary structures in muscle has been lacking.

      Both, researchers of general proteostasis mechanisms, in particular their impact on muscle function and metabolism, as well as medical researcher investigating therapeutic roads may appreciate the authors' work. This study opens up various roads to follow with complementing investigations on the many functions of the UPS in the regulation of muscle fiber architecture and functionality.

      I am working on proteostasis and particularly the UPS. I have a long-standing track record on muscle assmebly mechanisms, the regulation of E3 ligases and p97/VCP functions.

    1. Background

      Reviewer 2. Dean Giustini This is a well-written manuscript. The methods are well-described. I've confined my comments to improving the reporting of your methods, some comments about the paper's structure, and a few about the readability of the figures and tables (which I think in general are too small, and difficult to read). Here are my main comments for your consideration as you work to improve your paper:

      1) Title of manuscript - the title of your paper seems inadequate to me, and doesn't really convey its content. A more descriptive title that includes the idea of the "first wave" might be useful from my point of view as a reader who scans titles to see if I am interested. I'd recommend including words in the title that refer to your methods. What type of research is this - a quantitative analysis of citations? Title words say a lot about the robust nature of your methods. As you consider whether to keep your title as is, keep mind that title words will aid readers in understanding your research at a glance, and provide impetus to read your abstract (and one hopes the entire manuscript). These words will help researchers find the paper later as well via the Internet's many search engines (i.e., Google Scholar).

      2) Abstract - The abstract is well-written. Could the aims of your research be more obvious? and clearly articulated? How about using a statement such as "This research aims to" or similar? I also don't understand the sentence that begins with "Using references as a readout". What is meant by a "readout" in this context? Do you mean to read a print-out of references later? Lower down, you introduce the concept of Wikipedia's references as a "scientific infrastructure", and place it in quotations. Why is it in quotations? I wondered what the concept was on first reading it. A recurring web of papers in Wikipedia constitutes a set of core references - but would I call them a scientific infrastructure? Not sure; they are a mere sliver of the scientific corpus. Not sure I have any suggestions to clarify the use of this phrase.

      3) Introduction - This is an excellent introduction to your paper, and it provides a lot of useful context and background. You make a case for positioning Wikipedia as a trusted source of information based on the highly selective literature cited by the entries. However, I would only caution that some COVID-19 entries cite excellent research but the content is contested, and vice versa. One suggestion I had for this section was the possibility of tying citizen science (part of open science) to the rise of Wikipedia's medwiki volunteers. Wikipedia provides all kinds of ways for citizens to get involved in science. As an open science researcher, I appreciated all of the open aspects you mention. Clearly, open access to Wikipedia in all languages is a driving force in combatting misinformation generally, and the COVID "infodemic" specifically. I admit I struggled to understand the point of the section that begins, "Here, we asked what role does scientific literature, as opposed to general media, play in supporting the encyclopedia's coverage of the COVID-19 as the pandemic spread." The opening sentence articulates your a priori research question, always welcome for readers. Would some of the information that follows in this section around your methods be better placed in the following section under the "Material and Methods"? I found it jarring to read that "....after the pandemic broke out we observed a drop in the overall percentage of academic references in a given coronavirus article, used here as a metric for gauging scientificness in what we term an article's Scientific Score." These two ideas are introduced again later, but I had no idea on reading them here what they signified or whether they were related to research you were building on. You might consider adding a parenthetical statement that they will be described later, and that the idea of a score is your own.

      4) Material and methods - Your methods section might benefit from writing a preamble to prepare your readers. As already mentioned, consider taking some of the previous section and recasting it as an introduction to your methods. Consider adding some information to orient readers, and elaborating in a sentence or two about why identifying COVID-19 citations / information sources is an important activity.

      By the way, what is meant by this: "To delimit the corpus of Wikipedia articles containing DOIs"? Do you mean "identify" Wikipedia articles with DOIs in their references? As I mentioned (apologies in advance for the repetition), it strikes me as odd that you don't refer to this research as a form of citation analysis (isn't that what it is?). Instead you characterize it as "citation counting". If your use of words has been intentional, is there a distinction you are making that I simply do not understand? Also: bibliometricians and/or scientometricians might wonder why you avoid the phrase citation analysis. Further to your methods which are primarily quantitative and statistical - what are the qualitative methods used throughout the paper to analyze the data? How did you carry out this qualitative work? (On page 10, you state "we set out to examine in a temporal, qualitative and quantitative manner, the role of references in articles linked directly to the pandemic as it broke.") That part of your methods seems to be a bit under-developed, and may be worth reconsidering as you work to improve your reporting in the manuscript.

      5) Table 1. I am not sure what this table adds to the methods given it leads off your visuals. Do you really need it? It doesn't reveal anything to me and could be in a supplemental file. I also have difficulties in properly seeing table 1; perhaps you could make it larger and more readable?

      6) Figure 1. This is the most informative visual in the paper but it is hard to read and crowded. It deserves more space or the information it provides is not fully understood.

      7) Figure 3. This is very bulky as a figure, although informative. Again, I'm not sure all of it needs inclusion. Perhaps select part of it, and include other parts in a supplement.

      7) Limitations - The paper does not adequately address its limitations. A more fulsome evaluation of limitations would be beneficial to me as a reader, as it would place your work in a larger context. For example, consider asking whether the results are indicative of Wikipedia's other medical or scientific entries? Or are the results not generalizable at all? In other works, are they indicative of something very limited based on the timeframe that you examined? I found myself disagreeing with: "....the mainstream output of scientific work on the virus predated the pandemic's outbreak to a great extent". Is this still true? and what might its significance be now that we are in 2021? Would it be helpful to say that most of the foundational research re: the family of coronaviruses was published pre-2020, but entries about COVID-19 disease and treatment entries are now distinctly different in terms of papers cited, especially going forward. Wiki editors identify relevant papers over time but are not adept at identifying emerging evidence in my experience, or at incorporating important papers early; it's strange given that recency is one of its true calling cards. For me, the most confounding aspect of the infodemic is the constant shifts of evidence, and how to respond in a way that is prudent and evidence-based. As you point out, Wikipedia has a 8.7 year latency in citing highly relevant papers - and, it seem likely that many important COVID-19 papers were neglected in Wikipedia in the first wave especially about the disease. As you point out, this will form part of future research, which I hope you and your team will pursue.

      8) Reference 31 lacks a source: Amit Arjun Verma and S. Iyengar. Tracing the factoids: the anatomy of information reorganization in wikipedia articles. 2021.

      Good luck with the next stages in improving your manuscript for publication. I believe it adds to our understanding of Wikipedia's role in promoting sources of information.

    1. Author Response:

      Reviewer #1 (Public Review):

      This is an interesting study looking at the evolution of ageing in social insects using ants as a model. As I haven't seen the initial submission, I have looked at the manuscript and the response to reviewers and I base my suggestions on both documents.

      Evolution of ageing remains only partially understood and this field seems to be experiencing a sort of renaissance in recent years with a surge of theoretical advances and new empirical findings. Queens of social insects, and ant queens in particular, have remarkable lifespans and understanding the biology of their long life can help in understanding the biology of ageing in a more general sense.

      In this study, the authors focus on following quite a large number of ant (C. obscurior) colonies and provide intriguing data in relation to age-specific mortality and reproduction. The gist of their argument is that the mortality is decreasing with age while reproduction (production of sexuals) is increasing with age, such that there is little evidence of ageing in this species.

      Overall I think this is an interesting dataset that provides important information that will advance the field. However, I think the manuscript currently lacks clarity, structure and suffers from poor formulation of ideas in places, and is rather difficult to follow even for an expert in the field. I think that it requires quite a bit of work to sort this out. However, I also have a methodological question (#15) which could be key for the interpretation of the results.

      We hope that this manuscript is clearer now, especially with the additional data.

      My understanding is that queens live for 40-50 weeks max (Fig. S3). Fig. 4 suggests that from week 30 onwards the production of eggs, worker pupae and queen pupae decline. This suggests that while queen mortality declines in late life, so does queen reproduction. So, do queens of this species show reproductive senescence?

      Yes, they do experience reproductive senescence.

      The data do suggest that relative investment into reproduction (queen worker ratio) increases with age, but the absolute number of queens declines with age. This suggests an interesting result from the life-history theory perspective - increased investment in reproduction with reduced residual reproductive value, but not necessarily the absence of reproductive senescence. Please clarify.

      We hope this new version of the manuscript addresses clearly that ants queens do experience reproductive senescence and actuarial senescence, but only after late in life (after the peak of sexual investment is reached). Therefore, we state that senescence is delayed.

      Reviewer #2 (Public Review):

      The authors investigated the evolutionary drivers of delayed senescence in ant queens by carefully observing the survival and productivity of C. obscurior colonies that were maintained at 10, 20, or 30 workers. They show that the 10 worker treatment produces fewer new queens, and lower quality workers, indicating low colony efficiency under a reduced workforce. The authors focused their conclusions on the observation of a hump-shaped relative mortality curve, with queens having a higher than average mortality around 30 weeks and then a lower than expected mortality around 40 weeks. The colonies produced more queens at the end of their lifespan, so the authors conclude high fitness gains at the end of life selects for minimal senescence in ant queens, thus generating the drop in mortality they observed at 40 weeks.

      There is a large body of research focused on the early life stage and establishment of ant colonies, but relatively little that follows their worker and reproductive trajectory to the end of life. Partially, this is because many commonly studied ant species have a lifespan too long to feasibly track, and partially because most ant species do not readily produce sexual queens or males in the lab setting. For this alone, the study provides valuable insight into the ant lifecycle and demonstrates that C. obscurior is an ideal species for future study. The experimental design and analyses are sound, and I must acknowledge the incredible amount of work that must have gone into the data collection. However, I have some serious concerns about how the results are interpreted, and what is left out of the discussion on ant colony structure and limitations that are crucial to reaching accurate conclusions.

      One issue is that the conclusions hinge on the observation that relative queen mortality decreases at the latest observational period, around 40 weeks. The authors raise this as evidence that queens are under selection for reduced senescence, as they also conclude that fitness gains (queen production) are highest late in life. The problem is that according to figure S3, only a handful of queens survive past week 40, and they all manage to hang on for another month or two before dying out. I cannot be sure how many colonies survive to this period from how the data is presented, but I worry that the authors are resting their conclusion on a low number of particularly tenacious queens. These colony numbers should be provided, and the authors should demonstrate that the drop in mortality is observable even if these outliers are excluded.

      Fitness gains are highest late in life, and this is shown for all queens, regardless whether they are short- or long-lived. Therefore, selection is maintained until late in life. We calculate relative mortality as a function of age as in Jones et al. (2014), (Fig. 4.) As suggested by the first reviewer we also now include age-specific mortality of the best-model fitted using BaSTA and the estimated parameters in the supplement (Figure 4 - Figure supplement 1, Supplementary File 8 and 9). We have also included RNAseq data of queens near and middle-aged queens. The data support our conclusion of a delayed selection shadow, as age signs were not obvious in the middle-aged queens. This is in line with two studies (Wyschetzki et al. MBE 2015; Harrison GBE et al. 2021), where no signs of aging were found in middle-aged queens of the same species.

      It also appears that the queen pupae production drops off precipitously during the end of the observational period, according to figure 4A, which runs counter to the argument that selection is reducing senescence in these older queens because they have high reproductive output at this stage. The authors put a lot of emphasis on the queen/worker ratio being highest at the end of the observational period, but this doesn't necessarily mean queens are receiving the highest fitness during this period. A queen would have a high queen to worker production ratio if she lays one worker and one queen, but she would have higher fitness if she lays 100 workers and 10 queens. Figure 2A indicates that the highest overall queen pupae laying occurs around 30 weeks, which actually corresponds with the highest level of relative queen mortality. The question of fitness gains at advanced queen age would be better answered by just analyzing which stage in their life they produced the most queen pupae. Does the queen laying rate reach a maximum and remain stable for the rest of a queen's life, or does it decrease along with worker production as they reach end of life? Figure 4A makes it appear that it decreases towards end of life, but I'm not sure if that is only because so few colonies lasted until the end of the observational period.

      We have included that “This caste ratio shift does not occur because a drop of pupae production at the end of life. Actually, pupae production is at its highest just before death (Figure 2 - Figure supplement 1).” We added a figure with raw numbers of pupae produced at the end of life for the 99 tracked queens.

      Another factor that should be discussed is sperm depletion. The authors state that each queen mated with a single male when they set up the colonies, so sperm depletion may be more important than senescence for determining the reproductive lifespan of these queens. I'm not sure if this species is normally single mated in the wild, or the length of their natural colony lifespan, but this is important information to provide in order to dismiss issues of sperm depletion in this study. Without this information it is impossible to determine if the decrease in egg laying towards the end of the study is due to senescence or sperm depletion.

      Taken together, it could be argued that these data better support selection on an optimal lifespan, around 30 weeks, as opposed to selection for directional extended lifespan and reduced senescence. If the reproductive benefits of an extended lifespan are capped by sperm depletion, the alternative strategy would be to produce a robust workforce as quickly and efficiently as possible, and then produce as many sexual offspring as possible with the remaining sperm. Perhaps selection has determined that the optimal length of this cycle is around 30 weeks, with variation dependent on the amount of sperm transferred during mating and the condition of the queen. This possibility should be addressed, and if possible additional data should be provided on sperm depletion in C. obscurior, and the colonies that survived to the end of the observation period. Without these additions, the conclusions on senescence and lifespan remain tenuous.

      We now discuss in the manuscript that sperm depletion is not commonly seen in this species, and also occurred only once in this study (of the 99 colonies). All colonies were tracked until death. Therefore, there is no evidence of stabilizing selection to a lifespan of 30 weeks based on sperm depletion. This manuscript addresses the question of how is the “shape” of aging in this species, and not the “pace” (lifespan extension), but gives a hint on why extended lifespans should be favored.

  4. Jan 2022
    1. It’s important to understand – just because we have don’t have certain kinds of privileges, it doesn’t mean that we don’t benefit from other kinds of privileges.

      I greatly connected with this quote. As a white woman I don't face much discrimination, but I realize that the privileges I do have I benefit from. As a white woman there are probably several instance where privileges I have benefitted me. While I come from a small town I see a privilege I get there is that everyone knows my mother, while some may have this same instance but it may come as discrimination against them.

    1. as not increased the amount of pleasurable satisfaction which they may expect from life and has not made them feel happier. From the recognition of this fact we ought to

      Freud makes a fascinating point that alludes to the fact that no matter what society does, it will never be enough and I feel like that is true if we think about how the world operates today, for example: APPLE comes out with a new phone, a new pc, new headphones, new tech gear every year. Every year there is always something "new and better" that just slightly enhances what was already there. As humans, we're just slightly enhancing civilization in hopes the void of unhappiness will someday fill itself, but I think all the advances are just distractions from the fact that happiness is unattainable within these conditions.

    1. SUBSCRIBE NOW Australian Open PGA TOUR LIVE: Farmers Insurance Open NHL: Select Games Men's College Hoops Women's College Hoops UFC 271: Adesanya vs. Whittaker 2 (Feb. 12, PPV) 2022 NBA Mock DraftQuick Links NFL Playoffs Schedule 2022 NFL Draft Order How To Watch Australian Open NBA Trade Machine NBA Trade Deadline Buzz How To Watch PGA TOUR Men's Hoops Rankings Women's Hoops Rankings Games For Me Caribbean Series Favorites Manage Favorites Customize ESPNSign UpLog InESPN Sites ESPN Deportes The Undefeated espnW ESPNFC X Games SEC NetworkESPN Apps ESPN ESPN FantasyFollow ESPN Facebook Twitter Instagram Snapchat YouTube The ESPN Daily PodcastLowe's 10 things: Luka's surging Mavs, a rookie steal in Chicago and the Knicks' city edition courtThis week, we showcase signs of life from Luka and the Mavs (finally!), a second-rounder making serious moves in Chicago and MSG's city edition court.8hZach LoweSteph Chambers/Getty ImagesHow Klay Thompson's game has changed since his returnWhat has Thompson's return meant for a Warriors team that looks poised to return to the playoffs?8hKendra Andrews and Kevin PeltonTOP HEADLINESCowboys' Jones clears the air on McCarthy, QuinnSources: Kings balk at Simmons price, end chaseNets' Harden 'ready to go' after hamstring issueSteelers GM Colbert stepping down after draftBurrow, Bengals prepared for loud crowd in K.C.Rooney says he turned down Everton interviewWarren's $41M bid secures rights to Fury-WhytePitch & putt: NCAA golfer J.R. Smith has NIL repPotential QB options for Packers, BucsCHAMPIONSHIP SUNDAYGet ready for Bengals-Chiefs, 49ers-Rams: We previewed both games -- and picked winnersWhat to watch for in the AFC and NFC Championship Games. Bold predictions. Key stats to know. Matchups to watch. And, of course, final score picks.9hESPN staffJayne Kamin-Oncea-USA TODAY SportsHow all four remaining NFL playoff teams can win this weekend: We mapped out game plansWe scheme up keys to victory, pick players to watch and outline important matchups.9hMatt BowenConference championship best betsFARMERS INSURANCE OPENWEDNESDAY THROUGH SATURDAY ON ESPN+ Farmers Insurance Open Rd 3 - In Progress Torrey Pines (North Course) - La Jolla, CA Total Purse: $8,400,000 ESPN+ Get POS Player Score Round Leaderboard 1 J. Thomas J. Thomas J. Thomas -14 -1(5) Leaderboard 1 J. Rahm J. Rahm J. Rahm -14 -1(5) Leaderboard 3 C. Tringale C. Tringale C. Tringale -13 -1(6) Leaderboard 3 W. Zalatoris W. Zalatoris W. Zalatoris -13 -6(10) Leaderboard 5 A. Schenk A. Schenk A. Schenk -12 +1(6) How to watch the Farmers Insurance OpenBlonds have more fun? Koepka debuts new lookSTATE OF THE MINOR LEAGUES'I was wrong': Why MLB's restructuring of the minors turned out mostly better than expectedIn 2020, minor league baseball officials predicted disaster as MLB moved to reduce the number of affiliated teams. While the transition didn't work out for some, many owners now say their worst fears weren't realized.7hESPN StaffCourtesy Missoula PaddleHeadsHUMAN RIGHTS ABUSES IN XINJIANGNBA players face questions over shoe deals with Chinese companies linked to forced laborESPN has identified at least one former and 17 current NBA players who maintain contracts with four Chinese brands accused of abetting China's human rights abuses in Xinjiang.9hMike Fish and Michael A. FletcherESPNNBA ON ESPNFRIDAY'S GAMESSee All 7:30 PM ET ESPN CHA -8.0 Los Angeles Lakers Lakers LAL 24-25 Charlotte Hornets Hornets CHA 27-22 10:00 PM ET ESPN MIL -9.0 New York Knicks Knicks NY 23-26 Milwaukee Bucks Bucks MIL 30-20 Best bets for Friday: Heat too much for Clippers?StandingsAUSTRALIAN OPEN MEN'S FINALNADAL VS. MEDVEDEV: 3:30 A.M. ET ON SUNDAYA few months ago, Rafael Nadal thought he might retire -- now he may make Grand Slam historyAs he recovers from COVID-19 and deals with a chronic foot injury, Rafael Nadal has become a comeback story for the ages at the 2022 Australian Open.13hTom HamiltonRecep Sakar/Anadolu Agency via Getty ImagesMedvedev explains 'small cat' rant at umpire8h1:54AUSTRALIAN OPEN WOMEN'S FINALExpert picks: Will Ash Barty win in front of her home crowd? Or will Danielle Collins pull off the upset?Our experts weigh in on who will win the 2022 Australian Open women's title. Spoiler: It's not as straightforward as it seems.13hESPNAP Photo/Hamish Blair Summary Sat 3:30 AM ESPN 1 A. Barty A. Barty A. Barty 27 D. Collins D. Collins D. Collins In her country, for her country, Barty looks to bring Australian Open title home AFC Championship Bengals Bengals CIN 10-7 CBS Sun 3:00 PM ET Chiefs Chiefs KC 12-5 The UndefeatedPatrick Mahomes has an opportunity to join another exclusive clubThe Kansas City Chiefs star is already among the NFL's most accomplished playoff passers; one more win will put him on another tier8hJason ReidJamie Squire/Getty ImagesStephen A.: Chiefs should be 'scared to death' of Burrow-Chase connection6h1:47Joe Burrow can cement place among top young QBs in AFC title gameAssisting single-parent families is a personal passion for Bengals' Bates Gamecast Tickets NFC Championship 49ers 49ers SF 10-7 FOX Sun 6:30 PM ET Rams Rams LAR 12-5 Trash talk, a ticket blockade and a rivalry reborn: Rams and 49ers meet for NFC titleThe 49ers beat the Rams in Week 18 just to reach the playoffs, and now a seventh straight win over their rivals would send them to the Super Bowl.10hNick Wagoner Kirby Lee/USA TODAY Sports Odell Beckham Jr. giving Rams boost in production, good vibes Gamecast Tickets COACHING CAROUSELStephen A.: Cowboys keeping Mike McCarthy is great for me, bad for them!Stephen A. Smith trolls the Cowboys' decision to retain Mike McCarthy.5h1:45Grading NFL head-coach hires: Can Hackett bring offensive stability to Denver?How did the Broncos fare with their hiring of Nathaniel Hackett? Here's what our experts think of the move.1dJeremy Fowler, Dan GrazianoAs Bears hire Poles and Eberflus, it's time for chairman McCaskey to step upHackett is the newest Broncos' coach to fix same old offensive problemTracking head-coach hirings, firings and potential openingsBRACKETOLOGYMen's hoops: Kansas back to No. 1 seed ahead of showdown with KentuckyKansas moves past Arizona to reclaim a place on the No.1 seed line.13hJoe Lunardi Photo by Jamie Squire/Getty Images Women's hoops: With help from Tennessee loss, Louisville returns to No. 1 seedA pair of SEC upsets, just hours after the NCAA selection committee revealed its top-16 seeds, shuffled teams again in our latest projection.13hCharlie Creme Andy Lyons/Getty Images COLLEGE HOOPS SCORESMEN'S AND WOMEN'S GAMES 5:00 PM ET ESPNU HARV -3.5 Pennsylvania Quakers Pennsylvania PENN 7-12 Harvard Crimson Harvard HARV 10-6 NCAAM NCAAM 5:00 PM ET ESPN+ LEH -5.0 Lehigh Mountain Hawks Lehigh LEH 8-13 Holy Cross Crusaders Holy Cross HC 3-15 NCAAM NCAAM 7:00 PM ET ESPN2 DAY -6.5 Rhode Island Rams Rhode Island URI 12-6 Dayton Flyers Dayton DAY 13-7 NCAAM NCAAM 8:00 PM ET 23 Iowa Hawkeyes Iowa IOWA 13-4 Northwestern Wildcats Northwestern NU 11-7 NCAAW NCAAW Full men's hoops scoreboardFull women's hoop scoreboard SEC/Big 12 Challenge 12 Kentucky Kentucky UK 16-4 ESPN Sat 6:00 PM ET 5 Kansas Kansas KU 17-2 How Kentucky beats Kansas and predictions for Saturday's SEC/Big 12 ChallengeJayhawks/Wildcats is the marquee matchup in Saturday's 10-game series, but other major storylines await in the 10-hour extravaganza.9hESPN Jordan Prather-USA TODAY Sports Gamecast Tickets LUNDQVIST'S JERSEY RETIREMENTBEFORE FRIDAY'S GAME VS. WILD'I think we have a goalie here': Henrik Lundqvist's journey from seventh-rounder to Rangers legendNew York landed the goalie at No. 205 overall in 2000, starting a career that would leave a legacy across two continents and the NHL record book. Tonight his No. 30 will be retired.5MEmily Kaplan Andre Ringuette/Getty Images U.S. TOPS EL SALVADORFIFA WORLD CUP QUALIFYINGRobinson gave USMNT the win they were looking forAnother scoreless first half for the USMNT, but Antonee 'Jedi' Robinson struck back to give his side a crucial win.15hKyle BonaguraHow USMNT can qualify for World Cup finalsMatch scheduleWORLD CUP QUALIFYINGFRIDAY'S MATCHESSee All 24' Colombia Colombia COL 0 Peru Peru PER 0 2022 World Cup Qualifying - CONMEBOL 2022 World Cup Qualifying - CONMEBOL 5:00 PM ET VEN -150 Venezuela Venezuela VEN Bolivia Bolivia BOL 2022 World Cup Qualifying - CONMEBOL 2022 World Cup Qualifying - CONMEBOL GAME CHANGERS UNITEDU.S. Soccer is determined to improve on past inclusivity initiatives. Enter Game Changers UnitedU.S. Soccer president Cindy Parlow Cone has created a DEIB advisory council, recruiting Cobi Jones and others to set the course.3hCaitlin Murray Denise Truscello/Getty Images for Eight Cigar Lounge ANALYZING MLS MOVESUnpacking the offseason's most intriguing signings, tradesLorenzo Insigne leaving Napoli for Toronto dominated headlines across MLS, but this offseason has been full of signings worth a closer look.6hJeff Carlisle, Austin Lindberg Robin Alam/Icon Sportswire via Getty Images CARIBBEAN SERIESFRIDAY'S GAMES Top 6th 2 Outs 2 Outs ESPN Dep Watch Colombia Colombia COL 5 Venezuela Venezuela VEN 0 2 Outs Caribbean Series Caribbean Series Final Puerto Rico Puerto Rico PUR 2 Panama Panama PAN 3 Caribbean Series Caribbean Series 7:00 PM ET ESPN Dep Dominican Republic Dominican Republic DOM 0-0 Mexico Mexico MEX 0-0 Caribbean Series Caribbean Series Try adding more teams for the latest scores and highlights! + Add favorites How to watch the 2022 Caribbean Series on ESPN Deportes'THEY DESERVE TO BE IN'HOW DAVID ORTIZ CAN HELPCould Big Papi's endorsement get Bonds, Clemens into the Hall?With Bonds and Clemens shut out again, their fates are in the committees' hands. Could a seal of approval from David Ortiz be the difference-maker?8hBuster Olney Maddie Meyer/Getty Images ALL-STAR GAME STARTERS ANNOUNCEDASG WEEKEND: FEB. 18-202022 NBA All-Star debate: Our experts answer big All-Star questionsOur experts took a look at the biggest questions ahead of the game.1dNBA InsidersFirst-time All-Stars Morant, Wiggins picked to startNBA All-Star Game 2022: Latest news, starters and updatesCATASTROPHIC OLYMPIC INJURYARON BAYNES' ROAD BACKThe mysterious fall and harrowing story of an NBA centerIn his first public comments since the Tokyo Olympics, NBA center Aron Baynes details the freak injury that left him unable to walk and alone inside a Japanese hospital.1dBrian Windhorst WU HONG/EPA-EFE Top HeadlinesCowboys' Jones clears the air on McCarthy, QuinnSources: Kings balk at Simmons price, end chaseNets' Harden 'ready to go' after hamstring issueSteelers GM Colbert stepping down after draftBurrow, Bengals prepared for loud crowd in K.C.Rooney says he turned down Everton interviewWarren's $41M bid secures rights to Fury-WhytePitch & putt: NCAA golfer J.R. Smith has NIL repPotential QB options for Packers, BucsFavorites FantasyManage FavoritesFantasy HomeCustomize ESPNSign UpLog InICYMISteph makes unreal diving and-1 shotSteph Curry absorbs the contact on the drive but is somehow able to dive around his defender to make the and-1 bucket. Trending Now@jtmillzzy/TwitterOttawa Senators fan gets tattoo of favorite player in exchange for autographed Josh Norris jerseyOttawa Senators fan Mason Kohne made a blockbuster deal with Josh Norris: He'd get a life-size tattoo of Norris' head on his stomach in exchange for an autographed jersey.Ben Jared/PGA TOUR/Getty ImagesBrooks Koepka goes blond in midst of Farmers Insurance OpenKoepka, the four-time major winner, revealed a new Eminem-inspired look on social media -- and confusion ensued. Best of ESPN+Photo by David Rosenblum/Icon SportswireKiper's new Big Board for the 2022 draft: Ranking the best prospects at every positionMel Kiper unveils his latest top 25 and position rankings, with the 2021 college football season behind us.Kevin Jairaj/USA TODAY SportsUSC, LSU and the teams most helped by transfer portalThe coaching carousel led to many players entering the transfer portal. Which teams improved the most? Australian Open on ESPN+EPA/LUKAS COCH AUSTRALIA AND NEW ZEALAND OUTAustralian Open 2022 -- How to watch on ESPN/ESPN+The first major tennis tournament of the 2022 calendar year is the Australian Open. Here's how you can watch all of the action. PGA Tour on ESPN+Brian Rothmuller/Icon SportswireHow to watch the PGA Tour's Farmers Insurance Open this Wednesday-Saturday on ESPN+The PGA Tour's Farmers Insurance Open shifts to a Wednesday-Saturday schedule this week. Here's how to watch on ESPN+. Terms of UsePrivacy PolicyYour California Privacy RightsChildren's Online Privacy PolicyInterest-Based AdsAbout Nielsen MeasurementDo Not Sell My InfoContact UsDisney Ad Sales SiteWork for ESPNCopyright: © ESPN Enterprises, Inc. All rights reserved.(function () {var footerLinks, needConsent;try {footerLinks = JSON.parse('[{"copyright":"Copyright: © ESPN Enterprises, Inc. All rights reserved.","footer":[{"label":"Terms of Use","href":"https://disneytermsofuse.com/english/"},{"label":"Privacy Policy","href":"https://privacy.thewaltdisneycompany.com/en/current-privacy-policy/"},{"label":"Your California Privacy Rights","href":"https://disneyprivacycenter.com/notice-to-california-residents/"},{"label":"Children%27s Online Privacy Policy","href":"https://disneyprivacycenter.com/kids-privacy-policy/english/"},{"label":"Interest-Based Ads","href":"http://preferences-mgr.truste.com/?type=espn&affiliateId=148"},{"label":"About Nielsen Measurement","href":"http://www.nielsen.com/digitalprivacy"},{"className":"ot-sdk-show-settings","label":"Do Not Sell My Info","href":"https://privacy.thewaltdisneycompany.com/en/dnsmi/"},{"label":"Contact Us","href":"https://www.espn.com/espn/news/story?page=contact-index"},{"label":"Disney Ad Sales Site","href":"https://disneyadsales.com/"},{"label":"Work for ESPN","href":"https://jobs.disneycareers.com/espn"}]}]'); needConsent = 'false';} catch (e) { console.log(e); }window.espn.footerLinks = footerLinks || {}; window.espn.needConsent = needConsent || false})();

      The overall website is extremely cluttered with different updates, subscription options, promotions, game scores and articles being on the same page. This is a bad example of website accessibility as it may be a sensory overload for some individuals and a bit difficult to understand especially for those individuals using audio softwares that read the contents of the page out loud.

    1. SciScore for 10.1101/2022.01.24.22269714: (What is this?)

      Please note, not all rigor criteria are appropriate for all manuscripts.

      Table 1: Rigor

      <table><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Ethics</td><td style="min-width:100px;border-bottom:1px solid lightgray">IRB: Ethical considerations: The Ethics Committee of the MIBS approved the VE study on June 21, 2021.<br>Consent: All participants signed the informed consent upon referral to the LDCT triage.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Sex as a biological variable</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Randomization</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Blinding</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Power Analysis</td><td style="min-width:100px;border-bottom:1px solid lightgray">Sample size of 1,198 cases and 2,747 controls, and 1,175 patients with the complete vaccination status (exposure level of 29.8% for Sputnik V) provides 80% power to detect an odds ratio of 0.80 (or the VE of 20%) at the 5% alpha level.</td></tr></table>

      Table 2: Resources

      No key resources detected.


      Results from OddPub: Thank you for sharing your code and data.


      Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
      The self-reported vaccination status is an important limitation of our study. Several survey participants included in the control group have not reported the exact date of vaccination. While the overall number of such individuals was low, we assumed that the vaccination date for such individuals is likely to be several months from the interview date. However, we assigned them a “non-vaccinated” status in our sensitivity analysis, and the estimates were only slightly affected. Our definition for full vaccination status was also very conservative, as we decided to accept a minimum of six days between the second vaccine dose and study inclusion. While our decision was driven by the idea that we should not exclude participants without an exact date of vaccination, we do not think that this assumption would significantly bias the results. However, most of the studies choose 14-day period [5], and that should be taken into account when comparing our results to other studies. We have undertaken additional attempts to identify cases (patients with symptomatic SARS-CoV-2 in October, 2021) who had the history of confirmed COVID-19 more than two months before the current episode. We were able to identify only two cases of re-infection. While underreporting may occur, it is also likely that a patient with re-infection that requires additional diagnostic followup is an infrequent event. Absolute risks of re-infection, especially of severe disease, are low for the Alpha, Beta, and Delta VO...

      Results from TrialIdentifier: We found the following clinical trial numbers in your paper:<br><table><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Identifier</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Status</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Title</td></tr><tr><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">NCT04981405</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Active, not recruiting</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Real-world Evidence of COVID-19 Vaccines Effectiveness</td></tr><tr><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">NCT04406038</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Active, not recruiting</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Study of the Spread of COVID-19 in Saint Petersburg, Russia</td></tr><tr><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">ISRCTN11060415</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">NA</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">NA</td></tr></table>


      Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


      Results from JetFighter: We did not find any issues relating to colormaps.


      Results from rtransparent:
      • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
      • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
      • No protocol registration statement was detected.

      Results from scite Reference Check: We found no unreliable references.


      <footer>

      About SciScore

      SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.

      </footer>

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2021-01129

      Corresponding author(s): Koji Kikuchi


      Reviewer #1

      Evidence, reproducibility and clarity (Required):

      In this manuscript, Kikuchi et al describe the characterization of MAP7D2 and MAP7D1, two MAP7 family members in mouse with specific expression patterns. Focusing mostly on MAP7D2, they assess its expression pattern across the body and find that it is mostly expressed in certain neuronal subsets. They then characterize the MT-related properties of MAP7D2 based on previous knowledge of other MAP7 family members. They show that MAP7D2 binds MTs (via the N-terminus), determine the binding affinity, and show that it can stimulate MT polymerization (or stabilization) both in vitro and in vivo. Using a specific antibody, they localize MAP7D2 to centrosomes, midbody and neurites in N1-E115 cells. Functionally, they show that loss of MAP7D1/2 mildly affects microtubule stability as judged by acetyl-tubulin staining, and properties of these cells that rely on cytoskeletal elements such as cell migration and neurite growth. Interestingly, there might be a feedback loop regulating MAP7D1/2 expression, as knockdown of MAP7D1 upregulates MAP7D2.

      Overall, the experiments and conclusions are very solid and convincing, such that I would not ask for further experiments. This is in part because the experiments are largely based on previous characterizations of other MAP7 family members, which are largely confirmed. The presentation of the data is also very clear.

      Significance (Required):

      I see the value of the study in the fact that it provides solid and specific research tools for MAP7D1/2 which could be very useful for the microtubule/neuronal cytoskeleton community.

      Response: We thank the reviewer very much for appreciating the content of our manuscript.

      \*Referees cross-commenting***

      Reviewers 2 and 3 criticize that the evidence for an effect of MAP7D1/2 on MT dynamics is weak. I would agree in that ac-tub stainings and in vitro experiments are rather indirect. The experiments suggested by reviewer 2 should clarify this (esp. nocodazole should be easy). I also agree that an experiment addressing the potential involvement of kinesin-1 would help, the involvement of which seems to have been omitted by the authors. A kinesin-binding deficient mutant would add another MAP7D1/2 tool and increase the value for the community.

      Response: As for the reviewer’s suggestions listed above, please refer to our responses to the comments of Reviewer #2.

      Reviewer #2

      Evidence, reproducibility and clarity (Required):

      In this study, the authors investigate 2 members from the MAP7 family Map7D2 and Map7D1. They first address the tissue distribution of Map7D2, by northern blotting using a variety of rat tissues. To complement their analysis, they also raised an antibody to look at the protein distribution. From their studies, they concluded that Map7D2 is abundantly expressed in the brain and testis. The authors went on to perform a series of functional assays. First, they biochemically demonstrated that rat Map7D2 directly binds to MTs by MT co-sedimentation assay. The MT binding domain was mapped to the N-terminal half. They performed MT turbidity assay to demonstrate enhanced MT polymerisation in the presence of Map7D2, suggesting that this Map stabilises MTs. The authors went on to characterise in detail the subcellular localisation of Map7D2 which was predominantly present in the centrosome and partially localised to MTs including within neurites from N1-E115 cells. Kikuchi et al. further revealed the overlap in expression between Map7D2 and another family member, Map7D1. The authors continued these studies by a series of functional studies in N1-E115 cells where they performed single or combined knock-downs of Map7D2 and Map7D1 and studied the levels of acetylated and detyrosinated tubulins and the effect of the knock-downs on migration and neurite extension. The main conclusion from this work was that Map7D2 and Map7D1 facilitate MT stabilization through distinct mechanisms which are important in controlling cell motility and neurite outgrowth. Map7D2 is proposed to stabilise MTs by direct binding whereas Map7D1 does it indirectly by affecting acetylation.

      Major comments:

      The main conclusion from this work that Map7D2 and Map7D1 facilitate MT stabilization and that this is necessary for correct migration and neurite extension has not been convincingly demonstrated. In my opinion, a more detailed study of MT properties to demonstrate a role in MT stabilisation would greatly benefit the work, eg. experiments using MT destabilising agents such as nocodazole. In addition, a series of experiments aiming to study MT dynamics would help to understand the function of these MT regulators. The authors proposed an elevation in microtubule dynamics to explain the increase in migration and neurite extension but no experimental proof was provided.

      Response: According to the reviewer’s suggestion, we plan to assess the role of MT stabilization in greater detail by analyzing the sensitivity to the MT-destabilizing agent, nocodazole.

      To study MT dynamics, methods such as analyzing the velocity and direction of an EB1-GFP comet are commonly used. We have previously analyzed the roles of Map7 and Map7D1 in MT dynamics using HeLa cells stably expressing EB1-GFP (Kikuchi et al., EMBO Rep., 2018). However, no such tools have been developed for analyzing MT dynamics in N1-E115 cells, which were used in this study. In addition, it is difficult to analyze MT dynamics by transient expression of EB1-GFP because of the low plasmid transfection efficiency. Therefore, we instead plan to assess the effect on MT dynamics by measuring the EB1 comet length by immunofluorescence, referring to Fig. 7D in EMBO J. 32:1293–1306, 2013.

      Moreover, considering the possibility that the Map7D2 dynamics are altered when MT stability is changed, e.g., before and after differentiation induction, we analyzed the Map7D2 dynamics at the centrosome by fluorescence recovery after photobleaching (FRAP) using N1-E115 cells stably expressing EGFP-rMap7D2. We found that the dynamics were altered between the proliferative and differentiated states (see the figure below). Compared to the proliferative state, the recovery rate of EGFP-Map7D2 was reduced (lower left panel), and the immobile fraction of Map7D2 was increased in the differentiated state (lower right panel). As these data suggest that the increase in immobile Map7D2 may enhance MT stabilization, we will present them in a new figure in our manuscript along with the results of the above two experiments.

      It has been previously demonstrated that loss of MAP7D2 leads to a decrease in axonal cargo entry to axons resulting in defects in axon development and neuronal migration. The C-terminus is necessary for this function as it mediates interaction with Kinesin-1 (Pan et al., 2019). Such mechanisms could also explain the defects in migration and neurite growth that the authors observed. This possibility has not been considered but instead, the subtle changes in total α-tubulin led to suggest MT stabilisation as a key function without proof of causation. Could the authors provide some further experimental evidence to demonstrate that stability is the main contributor to the phenotypes observed? Eg. by rescuing migration and neurite phenotypes with a variant of MAP7D2 which cannot bind kinesin1.

      Response: The reviewer states “Such mechanisms could also explain the defects in migration and neurite growth that the authors observed;” however, our results showed that loss of Map7D2 elevated the rates of both cell motility and neurite outgrowth (original Fig. 5). In contrast, it has been reported in several papers that when Kinesin-1 function is impaired, both cell motility and neurite outgrowth are reduced (Curr. Biol., 23: 1018–1023, 2013; Mol. Cell. Biol., 39: e00109–19, 2019; etc.). Therefore, it is likely that the phenotypes we observed are independent of the functions associated with Kinesin-1 in N1-E115 cells. It is indeed possible that the experiment suggested by the reviewer may reveal relationships between Map7D2 and kinesin-1 in terms of cell motility and neurite outgrowth, however, it is difficult to conduct such an experiment because transient expression of Map7D2 induces MT bundling, as shown in original Fig. 2F. Based on the above, we plan to add a discussion of the relationship between Map7D2 and Kinesin-1.

      A key conclusion proposed by the authors is that Map7D2 and Map7D1 facilitate MT stabilization through distinct mechanisms. Such different roles in MT stabilisation are important in controlling cell motility and neurite outgrowth. In my opinion, their data does not fully support this statement and the findings using MT readouts do not match the defects in migration and neurite growth. Loss of Map7D2 leads to a very subtle phenotype on α-tubulin, while Map7D1 decreases both α-tubulin and acetylated tubulin, but Map7D1 seems to have a milder or similar effect on migration and neurite growth than Map7D2. Furthermore, it would be expected that the combined loss of function would lead to a stronger phenotype in cell migration when compared to the single loss of functions due to their distinct roles on MT stability, however, this seems not to be the case.

      Response: The fact that no stronger phenotype was observed may be because, besides Map7D2 and Map7D1, other molecules are involved in MT stabilization. Another possible explanation is that the increases in both cell motility and neurite outgrowth caused by decreased MT stabilization are offset by Kinesin-1 dysfunction. We plan to add a discussion of the above two possibilities.

      Minor comments:

      1) In the first result section, the author refers to Fig. S3 to suggest the expression of MAP7D2 in the cerebral cortex, however, there are no transcripts in the cerebral cortex according to the figure. Similarly, the immunofluorescence analysis done by the authors shows marginal expression of MAP7D2 in the cerebral cortex.

      Response: According to the reviewer’s comment, we have changed the order of the data shown in Fig. 1C, top panels. The data from the olfactory bulb, cerebellum, and hippocampus, in which Map7D2 expression was detected in the database, were arranged in the top three rows, and the data from the cerebral cortex, in which Map7D2 expression was not detected in the database, were moved to the bottom row as a negative control. In addition, we have revised the relevant part of the Results section as follows: “Based on RNA-seq CAGE, RNA-Seq, and SILAC database analysis (Expression Atlas, https://www.ebi.ac.uk/gxa/home/), Map7D2 expression was detected in the cerebellum, hippocampus, and olfactory bulb, and not in the cerebral cortex (Fig. S3). We further confirmed Map7D2 expression in the above four brain tissue regions of postnatal day 0 mice by immunofluorescence. Among these regions, Map7D2 was the most highly expressed in the Map2-negative area of the olfactory bulb, i.e., the glomerular layer (Fig. 1C). Weak signals were detected in the cerebellum, and marginal signals were observed in the hippocampus and cerebral cortex (Fig. 1C).” (page 5, lines 4–11)

      2) The authors use γ-Tubulin as a housekeeping gene in Fig. 3D, since Map7D2 is enriched in centrosomes this may not be the most appropriate choice.

      Response: γ-Tubulin is abundant in both the cytosol and the nuclear compartments of cells (Sig. Transduct. Target Ther. 3: 24, 2018). As it has been used for similar purposes in several other studies (Cancer Res., 61: 7713–7718, 2001; J. Biol. Chem., 291: 23112–23125, 2016; etc.), we considered it acceptable for use as a loading control for immunoblotting.

      3) According to the authors, knockdown of Map7D2 leads to a decrease in the intensity of α-tubulin and Map7D1 (Fig. 4C and D). This data doesn't agree with the previous statement made by the authors where they show that Map7D2 knockdown or knockout did not affect Map7D1 expression by Western Blot Analysis (Fig. S2C and S5B)

      Response: The immunoblotting results indicate that the total amount of Map7D1 in the cells is not affected by loss of Map7D2. In contrast, the immunofluorescence results indicate that the amount (distribution) of Map7D1 localized around the centrosome is decreased by loss of Map7D2, presumably due to a reduction in the number of MT structures that can serve as scaffolds for Map7D1. We plan to add this interpretation in the Results section.

      4) Line 6 page 7 "Endogenous Map7D2 expression is suppressed in N1-E115 cells stably expressing EGFP-rMap7D2 and was restored by specific knock-down of EGFP-rMap7D2 using gfp siRNA (Fig. 3D)". No quantifications and stats are shown. Also, endogenous Map7D2 after knock-down of EGFP-rMap7D2 is not comparable to the control.

      Response: According to the reviewer’s suggestion, we have quantified the amount of endogenous Map7D2 or EGFP-rMap7D2, normalized it to the amount of γ-tubulin, and calculated relative values to endogenous Map7D2 in the parental control. The amount of endogenous Map7D2 was decreased to 53% in N1-E115 cells stably expressing EGFP-rMap7D2, suggesting that EGFP-rMap7D2 expression suppressed endogenous Map7D2 expression. In this cell line, the total amount of Map7D2 (EGFP-rMap7D2 + endogenous Map7D2) was increased, however, when EGFP-rMap7D2 was depleted using sigfp in this cell line, endogenous Map7D2 was expressed to the same level as EGFP-rMap7D2 before knock-down. Together with the finding that Map7d1 knock-down increased the amount of Map7D2, these findings indicate that the amount of Map7D2 in the cells is regulated in response to the amount of Map7D1 and exogenous Map7D2. We have added this interpretation in the Results section. (page 7, lines 8–15)

      In addition, we have changed the legend of the original Fig. 3D to clarify the quantification method, as follows: “(D) Generation of N1-E115 cells stably expressing EGFP-rMap7D2. To check the expression level of EGFP-rMap7D2, lysates derived from the indicated cells were probed with anti-GFP (top panel) and anti-Map7D2 (middle panel) antibodies. The blot was reprobed for γ-tubulin as a loading control (bottom panel). The amount of endogenous Map7D2 or EGFP-rMap7D2 was normalized to the amount of γ-tubulin, and the value relative to endogenous Map7D2 in the parental control was calculated.” (page 22, lines 18–20)

      5) Line 8 page 7 "These results suggest that the expression of Map7D2 was influenced by changes in that of Map7D1" This statement seems in the wrong place, after the Map7D2 and EGFP-rMap7D2 experiment. Instead for clarity, it would be better placed after line 5 where the authors explain the effect of Map7D1 knock-down on the levels of Map7D2.

      Response: According to the reviewer’s suggestion, we have rephrased the relevant sentence as “Interestingly, Map7d1 knock-down upregulated Map7D2 expression, as confirmed with three different siRNAs (Fig. S2C), suggesting that Map7D2 expression is affected by changes in Map7D1 expression, not by off-target effects of a particular siRNA.” (page 7, lines 7, 8)

      6) Line 8 page 8 "Although the physiological role of the C-terminal region of Map7D2 is currently unknown..." This statement seems not adequate as there are several studies reporting the role of the C-terminal region of Map7D2 in Kinesin1- mediated transport. The authors mention such studies in the discussion.

      Response: According to the reviewer’s suggestion, we plan to add a discussion of the relationship between Map7D2 and kinesin-1.

      7) Line 6 page 9 " Further, the knock-down of either resulted in a comparable reduction of MT intensity (Fig. 4C and D) ..." This is not visible and/or justified by the images provided and would benefit from some sort of quantification at other regions such as neurites.

      Response: Considering the cell motility, quantification of α-tubulin/Ace-tubulin/Map7D1/Map7D2 intensities in neurites is not appropriate. Instead, we have added arrowheads indicating α-tubulin/Ace-tubulin/Map7D1/Map7D2 in Fig. 4C, for better understanding.

      8) In Fig. 2B, a band corresponding to his6-rMAP7D2 of molecular weight >97 kDa co-sedimented with the microtubules. However, the cloned rMAP7D2 had a molecular weight of 84.82 kDa and the addition of 6XHis-Tag would add another 2-3 kDa, therefore, the final protein band observed should be less than 90 kDa. It would be beneficial if the authors could specify the molecular weight of the purified protein after the addition of the V5-his tag and/or if there was addition of amino acids due to cloning strategy.

      Response: In Fig. 2B, we used full-length GST-tagged rMap7D2, like in Fig. 2E and D; therefore, we have corrected His6-rMap7D2 as GST-rMap7D2. We apologize for the mistake.

      9) In Fig. 2C, there is misalignment of the western blot with the panel or text underneath.

      Response: We thank the reviewer for pointing this out; we have corrected the misalignment of the CBB staining in Fig. 2C.

      10) In Fig. 3C the inset from the first panel seems to correspond to a different focal plane than the main image.

      Response: We have revised the relevant part of the figure legend as follows: “In C, images of differentiated cells were captured by z-sectioning, because the focal planes of the centrosome and neurites are different. Each inset shows an enlarged image of the region indicated with a white box at each focal plane. Arrowheads indicate the centrosomal localization of Map7D2.”

      11) In Fig. 4A, the cell type is not specified and is referred as "indicated cells", also the material and methods section seems to omit the specific cells used.

      Response: We have added “in N1-E115 cells treated with each siRNA” in the legend of Fig. 4A.

      12) Fig. S6 is not mentioned in the results.

      Response: We apologize for having referred to Fig. S6 only in the Discussion section in the original manuscript. We plan to describe the findings shown in the original Fig. S6 to the Results section and renumber the figures accordingly.

      Significance (Required):

      MTs play essential roles in practically every cellular process. Their precise regulation is therefore crucial for cellular function and viability. MAPs are specialised proteins that interact with MTs and regulate their behaviour in different manners. Understanding their precise function in different cellular contexts is of utmost importance for many biological and biomedical fields.

      MAPs are well known for their ability to promote MT polymerization, bundling and stabilisation in vitro (Bodakuntla et al., 2019). Several members of the Map7 family have been shown to regulate microtubule stability. For instance, MAP7 can prevent nocodazole-induced MT depolymerization and maintain stable microtubules at branch points in DRG neurons (Tymanskyj & Ma, 2019). Ensconsin, the Drosophila Map, is required for MT growth in mitotic neuroblasts by regulating the mean rate of MT polymerization (Gallaud et al., 2014). However, this family of Maps seems to have diverse functions encompassing a variety of mechanisms, as exemplified by a series of studies demonstrating the involvement of MAP7 family proteins in the recruitment and activation of kinesin1 (Hooikaas et al., 2019; Pan et al., 2019) and in microtubule remodelling and Wnt5a signalling (Kikuchi et al., 2018). Further understanding of this family of Maps and how its members differ in their function is important and will help to advance the field.

      Response: We appreciate the reviewer’s comments. We believe that our revision plan will greatly improve the quality of our manuscript.

      Reviewer #3

      Evidence, reproducibility and clarity (Required):

      Summary:

      Microtubule Associated Proteins (MAPs) are important regulators of microtubule dynamics, microtubule organization and vesicular transport by modulating motor protein recruitment and processivity. In the current manuscript the authors have characterized 2 members of the MAP7 protein family, MAP7D1 and MAP7D2. The authors characterized MAP7D2 expression pattern in the brain and its microtubule binding properties in vitro and in cells. In cells both proteins localize to the centrosome and to microtubules and upon depletion centrosome localized microtubules seem reduced, and cell migration and neurite outgrowth are increased. Surprisingly, they find that microtube acetylation (a common marker for stable microtubules) is reduced upon MAP7D1 depletion but not MAP7D2 depletion. Based on this finding the authors conclude that these proteins have a distinct mechanism in stabilizing MTs to affect cell migration and neurite outgrowth; MAP7D2 stabilizes by binding to MTs, whereas MAP7D1 stabilizes MTs by acetylation.

      Main comments:

      - Both MAP7 proteins show strong localization to the centrosome and to a lesser degree to MTs. Knockdown of either protein leads to reduced MTs around the centrosome, which lead the authors to conclude the MAP7s are stabilizing the MTs. However, the effect could just as well be an indirect effect due to a function of these MAPs at the centrosome. To address this authors could e.g. quantify microtubule properties in postmitotic cells. In addition, antibody specificity should be tested using knockdown of knockout cells, as this centrosome localization was not observed in Hela cells (Hooikaas, 2019; Kikuchi, 2018). Maybe this localization is specific to rat MAP7s or to the cell line used.

      Response: We think that this comment partly overlaps with the comments raised by Reviewer #2. We plan to assess the role of MT stabilization in greater detail by analyzing the sensitivity to the MT-destabilizing agent, nocodazole, and the effect on MT dynamics by measuring the EB1 comet length by immunofluorescence.

      Regarding the reviewer’s concern about antibody specificity, we had carefully confirmed the antibody specificity, as shown in Fig. S2 of the original manuscript. Subsequently, Map7D2 localization was confirmed in N1-E115 cells stably expressing EGFP-rMap7D2, as shown in Fig. 3D, E of the original manuscript. In addition, we are currently conducting analyses using Map7d1-egfp knock-in mice, which confirmed that Map7D1 localizes around the centrosome in cortical neurons, as shown below (we would like to disclose these unpublished data to the reviewers only). Therefore, it is thought that the localization pattern of Map7D2 and Map7D1 differs depending on the cell type and cell line. We plan to add this interpretation to the Results section.

      - Centrosome nucleated microtubules are typically highly dynamic and little modified. Therefore is the Ac-tub staining at the centrosome really MTs? I cannot identify MTs in the fluorescent images in 4C. Maybe authors could consider ac-tub/alpha-tub ratio in non centrosomal region (e.g. neurites). Moreover, as both Acetylation and detyrosination are associated with long-lived/stable MTs, it is surprising that only acetylated tubulin goes down on WB. Does this suggest that long-lived MTs are still present to normal level? If so, can one still argue that the loss of acetylation is the cause of the lower MT levels? This should at least be discussed.

      Response: As for the reviewer’s statement “Centrosome nucleated microtubules are typically highly dynamic and little modified. Therefore is the Ac-tub staining at the centrosome really MTs?”, it has been previously reported that tubulin acetylation is observed around the centrosome in some cell lines (J. Neurosci., 30: 7215–7226, 2010; PLoS One, 13: e0190717, 2018; etc.). N1-E115 is one of the cell lines in which tubulin acetylation is observed around the centrosome.

      It is not surprising that “only acetylated tubulin goes down on WB,” as it has been previously reported that acetylated and detyrosinated tubulins are sometimes not synchronous (J. Neurosci., 23: 10662–10671, 2003; J. Neurosci., 30: 7215–7226, 2010; J. Cell Sci., 132: jcs225805, 2019., etc.). For instance, Montagnac et al. (Nature, 502: 567–570, 2013) showed that defects in the α-tubulin acetyltransferase αTAT1-clathrin-dependent endocytosis axis reduce only tubulin acetylation, resulting in a shift from directional to random cell migration. Although the details of the molecular function of Map7D1 are beyond the main purpose of this study, we plan to add a discussion of the reduced tubulin acetylation by Map7d1 knock-down based on the above.

      - MAP7D1 and MAP7D2 depletion leads to subtle defect in cell migration and neurite outgrowth, which the author suggest is caused by reduced MT stability. However, MAP7 proteins have well characterized functions in kinesin-1 transport, and thus the phenotypes may well be caused by defects in kinesin-1 transport. Ideally the authors would do rescue experiments with FL or just the MT binding N-termini to separate these functions. Moreover this is needed to substantiate the claim of the authors that MAP7D1 effect on MT stability is not mediated by direct binding.

      Response: As this comment largely overlaps with the comments raised by Reviewer #2, please refer to our responses to the comments of Reviewer #2.

      - The authors do not refer well to published work. Several papers have published very similar work (especially to Fig1+2) and it would help the reader much if this would be discussed/compared along the results section and not briefly mention these in the results section. In addition, authors overstate the novelty of their results e.g. page 3: these proteins are not "functionally uncharacterized" nor are their expression patter and biochemical properties analyzed for the first time in this manuscript; page 8 "Although the physiological role of the C-terminal region of Map7D2 is currently unknow, ..." There is a clear function for the C-terminus for the recruitment/activation of kinesin-1.

      Response: According to the reviewer’s suggestion, we plan to add a comparison with data on the Map7 family members presented in previous papers in the Results section and rephrase the relevant part regarding the physiological role of the C-terminal region of Map7D2.

      Minor comments

      - P6 Map7D3 also binds with its N-terminus to MTs, like other MAP7s (Yadav et al)

      Response: According to the reviewer’s comment, we have revised this as “Map7D3 binds through a conserved region on not only the N-terminal side, but also the C-terminal side (Sun, 2011; Yadav et al., 2014).” (page 6, lines 4, 5)

      - P7 "As Map7D2 has the potential to functionally compensate for Map7D1 loss" where is this based on?

      Response: For clarity, we have rephrased this as “As Ma7D2 expression was upregulated upon suppression of Map7D1 expression, Map7D2 has the potential to functionally compensate for Map7D1 loss.” (page 7, line 17, 18)

      - Fig2F quality of black-white images is low potentially due to conversion issues

      Response: We thank the reviewer for pointing out these conversion issues, and we have made the necessary corrections.

      Significance (Required):

      At this stage the conceptual advance is limited. Part of the findings are not novel. The finding that MAP7s depletion have a different effect on MTs acetylation may be interesting to cytoskeleton researchers, although the potential mechanism has not been addressed experimentally or textually.

      However, their conclusion that this leads to reduced MTs and then to cellar migration and neurite formation defects is not sufficiently supported by experimental evidence.

      Response: We appreciate the reviewer’s comments. We believe that our revision plan will greatly improve the quality of our manuscript.

      \*Referees cross-commenting***

      I completely agree with reviewer #2: At this stage the paper's conclusions are not sufficiently supported by the data. Important will be to further characterize the effect om the MTs (do they really have a different effect) and to look at the possible involvement of the motor recruitment. Maybe that a 3 to 6 months revision time would have been more accurate.

      Response: Please refer to our responses to the comments of Reviewer #2.

    1. Author Response:

      Reviewer #2 (Public Review): Gaffield and Christie trained mice to an interval task of self-initiate bouts of licking to understand how the cerebellar activity relates to the organization of well-timed transitions to motor action and inaction during discontinuous periodically performed movements. Recording and optogenetically stimulating the activities of Purkinje cells, they concluded that the cerebellum encodes and influences the motor transitions, initiation and termination of discontinuous movements. The conclusion of the paper is very interesting and potentially provides insights on the neural mechanism of the previously proposed principle that the cerebellum controls the timings of discrete movements (Ivry et al. 2002). However, in the logic and interpretation to the conclusion I have concerns which they need to address. [Major comments]

      We thank the reviewer for their positive evaluation of our work and their helpful comments. We have substantially altered our manuscript to address their concerns, including an entirely new figure as well as additional supplemental figures.

      First, the activity of Purkinje cells can largely encode each bout of licking movements, in addition to initiation and termination of movements. Figure 2BCEF plays the peak of neural activity around the water time and Figure 2DG indicates the relationship between the neural activity and lick rate. The encoding of the initiation and termination alone cannot explain these observations. Related to this, none of the panels Figure 2BCEF shows a lead of the onset of neural activity to that of the lick rates (around -5 sec to water time). This looks inconsistent with the lead shown in Figure 3. The authors need to explain why such an inconsistency can happen.

      We agree that Crus I and II PCs encode parameters of licking bouts in addition to movement initiation and termination and deeply apologize for not making this point more clearly. To address this concern, we have extensively edited the text in several sections and have added an additional figure to emphasize the richness of the PC representation of behavioral attributes, beyond just initiation and termination alone. We disagree that there is an inconsistency in the lead times differences in our datasets. As the reviewer points out, the water-delivery-aligned firing rate z-scores do not seem to lead the licking rate (Fig. 2B-E). However, these data are averaged across trials with a high variance in the timing of lick initiation relative to water delivery; consequently, it is not possible to assess the timing of PC activity relative to lick bout initiation from these panels. When, by contrast, data are aligned to welldefined licking bouts (i.e., bouts with no licking in the preceding 2 s), it becomes clear that PC firing ramps up in advance of the bouts (Fig. 4C-D). We have edited the text, explaining this rationale, as requested by the reviewer.

      Second, the positive sign of neural modulation indicates biased recording sites. So far, many studies have been indicating the increasing firing modulation at the deep cerebellar nuclei in cerebellar timing tasks and motor tasks (e.g. Ten Brinke et al. 2017 eLIFE for the eyeblink conditioning; Ohmae et al. 2017 JNS for a self-initiate timing task; Becker and Person 2019 Neuron). Ramping-up modulation of Purkinje cells is not able to activate the deep cerebellar nuclei. When the motor-driving module generates negative modulation of Purkinje cells, the neighboring modules can generate positive modulation (e.g. Ten Brinke et al. 2017 eLIFE; De Zeeuw 2021 Nat Rev; Ohmae and Medina 2014 Soc. Neurosci. Abstr.). Because the neighboring modules are much wider than the motor-driving module, recording without identifying the driving modules, as in this study, will result in the recording being biased toward the adjacent modules.

      We too were surprised that we did not observe more negatively modulating PCs. However, our craniotomy was relatively large (>2 mm square) exposing an area over Crus I and II that encompassed zebrin bands 7+, 6-, and 6+. We randomly sampled PC activity within this region, so we don’t think our recordings were necessarily “biased”. We are unaware of any definite experiments showing whether positively and negatively PCs form separate, or convergent, channels of output onto their postsynaptic targets in the cerebellar nuclei. If convergent, then the response of the nuclear neurons will be determined by an ensemble of PCs with time varying signs of activity, in addition to the integration of the activity from pontine collaterals.

      We thank the reviewer for highlighting the developing idea of motor and non-motor cerebellar modules and the loops formed by their connectivity. We have edited our text to address how our recordings could fit into such an organizational scheme and have cited their recent unpublished preprint on this topic, now available on BioRxiv (Ohmae et al. 2021). However, we believe several considerations suggest that both positive and negative modulation of Purkinje cell firing rates will impact movement. (1) Large regions of the cerebellar cortex are capable of evoking or modulating movements when microsimulation is applied. Similarly, optogenetic suppression of IntA activity increases the outward velocity of reaching movements in mice (Becker & Person 2019). (2) In contrast with delay eyeblink conditioning, in which the motor output is an impulse-like twitch, rhythmic movements of the tongue (or, similarly, the limbs) require alternating recruitment and de-recruitment of muscles. Thus, motor commands will necessarily be multiphasic in time, and will tend to be out of phase for populations controlling antagonistic muscles. (3) Excitation of the DCN by collaterals of mossy fibers will likely modulate, and perhaps override, Purkinje cell inhibition. Therefore, further work will certainly be necessary to decipher exactly how potential antagonistic cerebellar modules participate organizing complex motor actions.

      Third, the authors used z scores for the unit of spike rate, but it is more appropriate to use spike per second as in Figure 3CD. In particular, I do not understand the meaning of difference of spike rate in the unit of z score in Figure 3E. The spike rate modulation in Figure 4E looks small which should be evaluated in the unit of spike per second as well. For the analysis of the last lick, the spontaneous spike rates should be displayed, instead of (or in addition to) the spike rate in the middle of lick bouts which should be much higher than the spontaneous spike rate according to Figure 2.

      We appreciate the reviewer’s input regarding style, but the current standard in the neurophysiology field is to report firing rate comparisons from a neural population as z-scores. Z-scoring is particularly useful because this metric provides a probability of an individual score occurring within a normal distribution, as well comparisons of different scores from different normal distributions; it also gives an indication of the raw score differs from the mean, information that isn’t available in spike rate comparisons alone. For these reasons, we elect to not change how we represent our data. However, we have modified our figures to report firing rates for traces from individual example cells as z-scoring is not appropriate for this purpose.

      Forth, I did not understand the conclusion for the optogenetic perturbation. In the result section for Figure 7, I think there is a logical gap between the last conclusion sentence and the sentences before it. The suppression of lick bouts in Figure 7D and the rebound induction in Figure 7G can be explained by the cerebellar contribution to each bout of lick movement (shown in Figure 2). I do not understand if these observations indicate the cerebellar contribution to the initiation and termination of a sequence of lick movements. Also, I have a concern about the location of stimulation sites. The stimulation may cover both the motor-driving module and neighboring modules, which makes the observations difficult to interpret because the stimulation is not specific to the positively modulating Purkinje cells.

      A lick bout is composed of a sequence of tongue protrusions and retractions performed at a highly regular rhythm. Apart from the first lick (Bollu et al., 2021), the motor command for this behavior is under the control of central pattern generators in the brainstem. Said another way, a lick bout is a continuous movement rather than series of discrete actions that are repeatedly started and stopped (they are like stepping during locomotion in some animals). Lick bout initiation and directional control of the bout can be commanded by the cerebral cortex. Given this organization, we do not believe our optogenetic experiment can be interpreted as an effect on the initiation and termination of individual licks because licks are not discrete actions when performed in a consummatory bout. However, based on the reviewer’s recommendation, we investigated how PCs encode information pertinent to individual licks in a bout (Figure 3). Although there was entrainment to individual lick cycles, there were no time-locked responses apparent in their average activity. Instead, there was a continuous mapping of the lick cycle across their population. Notably, licking rhythmicity was disrupted by the optogenetic perturbation, consistent with the influence of PC output on this movement parameter. We have edited the text to address these concerns.

      Fifth, For Figure 8, I had difficulty to understand what kind of activity of Purkinje cells can explain the shift of the peak timing of lick rate, because in the result sections of Figures 2-6 I could not find any activity encoding the peak timing of lick rate. For figure 8EFG, the analysis may not be correct. Because lick onset can be delayed with the photostimulation, in Figure 8E the boundary of onset corresponding to the 1s in control should 1+alpha in stimulation trials to correctly pick up the corresponding trials. Because we do not know the exact values of alpha, I think this analysis is not possible.

      PC ramping activity may contribute to the vigor of the ensuing licking response which would dictate peak licking rate timing. In fact, in many individual PCs, we observed correlations between PC firing and lick rate indicating a relationship. However, this was not borne out in the population response, so we did not pursue it further.

    2. Review #2 (Public Review):

      Gaffield and Christie trained mice to an interval task of self-initiate bouts of licking to understand how the cerebellar activity relates to the organization of well-timed transitions to motor action and inaction during discontinuous periodically performed movements. Recording and optogenetically stimulating the activities of Purkinje cells, they concluded that the cerebellum encodes and influences the motor transitions, initiation and termination of discontinuous movements. The conclusion of the paper is very interesting and potentially provides insights on the neural mechanism of the previously proposed principle that the cerebellum controls the timings of discrete movements (Ivry et al. 2002). However, in the logic and interpretation to the conclusion I have concerns which they need to address.

      Major comments:<br> First, the activity of Purkinje cells can largely encode each bout of licking movements, in addition to initiation and termination of movements. Figure 2BCEF plays the peak of neural activity around the water time and Figure 2DG indicates the relationship between the neural activity and lick rate. The encoding of the initiation and termination alone cannot explain these observations. Related to this, none of the panels Figure 2BCEF shows a lead of the onset of neural activity to that of the lick rates (around -5 sec to water time). This looks inconsistent with the lead shown in Figure 3. The authors need to explain why such an inconsistency can happen.

      Second, the positive sign of neural modulation indicates biased recording sites. So far, many studies have been indicating the increasing firing modulation at the deep cerebellar nuclei in cerebellar timing tasks and motor tasks (e.g. Ten Brinke et al. 2017 eLIFE for the eyeblink conditioning; Ohmae et al. 2017 JNS for a self-initiate timing task; Becker and Person 2019 Neuron). Ramping-up modulation of Purkinje cells is not able to activate the deep cerebellar nuclei. When the motor-driving module generates negative modulation of Purkinje cells, the neighboring modules can generate positive modulation (e.g. Ten Brinke et al. 2017 eLIFE; De Zeeuw 2021 Nat Rev; Ohmae and Medina 2014 Soc. Neurosci. Abstr.). Because the neighboring modules are much wider than the motor-driving module, recording without identifying the driving modules, as in this study, will result in the recording being biased toward the adjacent modules.

      Third, the authors used z scores for the unit of spike rate, but it is more appropriate to use spike per second as in Figure 3CD. In particular, I do not understand the meaning of difference of spike rate in the unit of z score in Figure 3E. The spike rate modulation in Figure 4E looks small which should be evaluated in the unit of spike per second as well. For the analysis of the last lick, the spontaneous spike rates should be displayed, instead of (or in addition to) the spike rate in the middle of lick bouts which should be much higher than the spontaneous spike rate according to Figure 2.

      Forth, I did not understand the conclusion for the optogenetic perturbation. In the result section for Figure 7, I think there is a logical gap between the last conclusion sentence and the sentences before it. The suppression of lick bouts in Figure 7D and the rebound induction in Figure 7G can be explained by the cerebellar contribution to each bout of lick movement (shown in Figure 2). I do not understand if these observations indicate the cerebellar contribution to the initiation and termination of a sequence of lick movements. Also, I have a concern about the location of stimulation sites. The stimulation may cover both the motor-driving module and neighboring modules, which makes the observations difficult to interpret because the stimulation is not specific to the positively modulating Purkinje cells.

      Fifth, For Figure 8, I had difficulty to understand what kind of activity of Purkinje cells can explain the shift of the peak timing of lick rate, because in the result sections of Figures 2-6 I could not find any activity encoding the peak timing of lick rate. For figure 8EFG, the analysis may not be correct. Because lick onset can be delayed with the photostimulation, in Figure 8E the boundary of onset corresponding to the 1s in control should 1+alpha in stimulation trials to correctly pick up the corresponding trials. Because we do not know the exact values of alpha, I think this analysis is not possible.

    1. SciScore for 10.1101/2022.01.23.22269214: (What is this?)

      Please note, not all rigor criteria are appropriate for all manuscripts.

      Table 1: Rigor

      <table><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Ethics</td><td style="min-width:100px;border-bottom:1px solid lightgray">Consent: Conventionally, the ethical approval and consent were obtained from the CRSTRA and all participants.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Sex as a biological variable</td><td style="min-width:100px;border-bottom:1px solid lightgray">= Other, 2=African / Afro-American, 3= Caucasian, 4= Arabic, 5= Asian, 6= Latino) - Gender (0=Not precise, 1= Male, 2= Female, 3= Other).</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Randomization</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Blinding</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Power Analysis</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr></table>

      Table 2: Resources

      <table><tr><th style="min-width:100px;text-align:center; padding-top:4px;" colspan="2">Software and Algorithms</th></tr><tr><td style="min-width:100px;text=align:center">Sentences</td><td style="min-width:100px;text-align:center">Resources</td></tr><tr><td style="min-width:100px;vertical-align:top;border-bottom:1px solid lightgray">Statistical analysis: Statistical analyses were performed using SAS® (version 9.4).</td><td style="min-width:100px;border-bottom:1px solid lightgray"><div style="margin-bottom:8px"><div>SAS®</div><div>suggested: (SASqPCR, RRID:SCR_003056)</div></div></td></tr></table>

      Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


      Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
      4.3 Methodological limitations: Our analysis was limited to a number of factors considered as relevant based on the literature review, and which could be ascertained using an online questionnaire. However, different studies also pointed out a number of other potential risk factors. Objectively, it is an almost impossible challenge to know exactly the factors responsible for infection and transmission of COVID-19. Sources may be incomplete; apart from the factors discussed previously, even the meteorological ones were considered a potential explanation [164]. A study in Korea demonstrated that the environment plays a significant role in the spread of COVID-19, but like any factor, it may have also been impacted by various additional features [165]. Hence, further studies are needed to protect people from COVID-19 transmission, specifically on infection dynamics and the mode of transmission, e.g., cluster spaces, closed spaces, and indoor environments [166]. At the individual level, everyone must take the maximum possible precautions. It should also be remembered that no less than 10 reasons supporting airborne transmission were phrased recently by Greenhalgh et al. [167]. The long-term health consequences of COVID-19 remain unclear and continue to be studied [168]. Therefore, it is preferable to avoid any form of infection, even mild. Another factor that we do not necessarily think about and which may be important is the wastewater treatment and disinfection strategies with ch...

      Results from TrialIdentifier: No clinical trial numbers were referenced.


      Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


      Results from JetFighter: We did not find any issues relating to colormaps.


      Results from rtransparent:
      • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
      • No funding statement was detected.
      • No protocol registration statement was detected.

      Results from scite Reference Check: We found no unreliable references.


      <footer>

      About SciScore

      SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.

      </footer>

    1. Mechanization may yet force the issue, especially in the scientific field; whereupon scientific jargon would become still less intelligible to the layman.

      Just detailing the process of mechanization.

    2. They have improved his food, his clothing, his shelter; they have increased his security and released him partly from the bondage of bare existence. They have given him increased knowledge of his own biological processes so that he has had a progressive freedom from disease and an increased span of life. They are illuminating the interactions of his physiological and psychological functions, giving the promise of an improved mental health.

      Interesting take to see the progress of man come about, just discussing how humans went from a bare existence to incredible technological feats.

    3. His excursions may be more enjoyable if he can reacquire the privilege of forgetting the manifold things he does not need to have immediately at hand, with some assurance that he can find them again if they prove important.

      This is one of my difficulties. I put a lot of stuff in my blog-as-memex but don't have a good way of surfacing them again. Theoretically I could do this with categories, but that gets overwhelming fast. This is why I'm thinking about using a blog and a wiki together for this purpose.

    4. There is a new profession of trail blazers, those who find delight in the task of establishing useful trails through the enormous mass of the common record.

      It me! This is kinda what people who operate as web librarians do.

    5. When the user is building a trail, he names it, inserts the name in his code book, and taps it out on his keyboard. Before him are the two items to be joined, projected onto adjacent viewing positions. At the bottom of each there are a number of blank code spaces, and a pointer is set to indicate one of these on each item. The user taps a single key, and the items are permanently joined. In each code space appears the code word.

      This is tagging.

    6. if the user inserted 5000 pages of material a day it would take him hundreds of years to fill the repository, so he can be profligate and enter material freely.

      How many people use Evernote as a Memex?

    7. The human mind does not work that way. It operates by association. With one item in its grasp, it snaps instantly to the next that is suggested by the association of thoughts, in accordance with some intricate web of trails carried by the cells of the brain. It has other characteristics, of course; trails that are not frequently followed are prone to fade, items are not fully permanent, memory is transitory. Yet the speed of action, the intricacy of trails, the detail of mental pictures, is awe-inspiring beyond all else in nature.

      Bush points out that indexing systems and rules do not duplicate the human mind - we must convert our own mental associations to a form we can use to search them - but that the human mind works by association. I extrapolate from this the idea of hypertext as a model of how the mind works. I'm going to keep an eye out for other instances of this idea.

    1. dea that the de-emphasis on the collective must be an index of lesser deliberation and a resort to mere personal impressions, what appears less collective may just be less formal, while still as collective as ever.

      Here, he basically sums up his position on why language is more self-focused than it used to be--it's just how we talk, it's not how we think

    1. Accessibility/Documentation Throughout our web pages, you will encounter links to our documentation, provided via Google Docs.  To download any of these items, you will navigate to “file,” then select “download as.” This step allows you access to our documents both online and offline, as well as in your preferred file format (.doc, .pdf, among others). You may find that this format also helps with translation of text to preferred language.  We have included clickable table of contents to navigate to particular sections of a document, although if you select the paper icon tab in the left corner of the current document (next to the page ruler and directly below the printer icon), Google automatically provides an outline view of the text provided.  We strive to provide accessible, universally designed content, inclusive of our documentation. Please contact us at itms@muhlenberg.edu with any questions or with suggestions for future documentation.

      While important, I don't think it needs to be front and center. We could have a whole page/section to Accessibility potentially working with Support and their materials. Annnd I have soooo many little cheat sheets on good practices that, to my knowledge, don't live anywhere.

    1. Interesting to note how a player's expectations of generic conventions can have such a huge effect on the emotions created. Like if you can fight monsters, players assume you should be able to do so at least semi-effectively because that's how they've learned combat systems are "supposed to work." (Chekhov's combat system?) Balancing the necessity of failure or "death" for tension with the risk that repeated failures could also undermine the drama by highlighting the artificiality of the game seems like a very difficult thing to get right. I also think the idea of a player noticing the feedback systems and becoming more immersed because they "trust" the game to provide them a consistent experience is fascinating because that seems counter to the usual notion of "immersion." Designing the sanity system more as a continuous "mood feature" rather than a discrete mechanic based on resources seems like a clever adjustment to generic conventions. Allowing a negative feedback system invisibly advantage a player, avoiding players unknowingly "dooming" their game state, seems so counter to the hostile posture of a horror game but it's interesting to note how many very influential horror titles use the technique. Interesting how much of the game design of Amnesia was about minimizing frustration in the immediate gameplay, but still creates discomfort in other ways. Like how players being frustrated by a "bad" combat system was something to be avoided, yet removing the ability to fight back also could be seen as creating a different sort of frustration, as it disempowers players who may resent being "forced" to run.

      It was also interesting to know about the blog which spoke about playing GTA 4 as a law abiding citizen and following all laws. Hearing conversations and abuses while walking the streets. All of these situations are inspired from our daily lives but we tend to miss them or ignore them as there are so many other attributes which require more of our attention.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      This is the first such piece of data to come from human infective parasites in the field. Technically this is a feat - because the small number of parasites that are present per mL of human blood at any given time during infection with T gambiense. Nevertheless they manage to identify up to 14 unique VSGs per patient sample. And this raises the first theoretical question: can they extrapolate to the average diversity load per human?

      This is an intriguing question that we would like to eventually answer, but we do not believe we can make this estimate from the data we currently have. We know our sampling is insufficient based on the correlation between parasitemia and diversity, and we do not have sufficiently precise estimates of parasitemia that could be used to extrapolate total diversity in the blood. Moreover, our analysis was only performed on RNA extracted from whole blood samples. Recent studies indicate that significant populations of parasites reside in extravascular tissue spaces, and our analysis did not address antigenic diversity in these spaces. We believe it is unlikely that the blood alone reflects the full diversity of VSG expression in an infection, and an estimate based only on blood-resident parasites (if possible) could be misleading.

      this is important because the timing of sample collection (ie that it occurred within a period of weeks) suggesting that an initial group of infected tsetse infected these patients (rather than a small number of interactions between a bloodmeal and a new infection - generally in itself on the order of 1 month or so). If parasitemia is low and diversity limited, this would explain both why CATT works as well as it does (because really it shouldn't at all!) and perhaps even the chronicity of infection (in the sense that the organism is unlikely to "run out" even of complete VSGs, never mind mosaics). The paper would benefit from a direct discussion on this.

      Indeed, the timing of sample collection could inform our interpretation of the data. However, sample collection occurred over a period of six months. More importantly, patients were in both early and late stage disease at the time of sample collection, so we cannot estimate how long any individual patient had been infected. We have added text (line 180) to highlight this fact. Because some patients were infected at least 6 months apart (if not much more than that), it is unlikely that patients were infected around the same time by a small group of infected tsetse flies. Reviewer #1 introduces an interesting point about the efficacy of the CATT diagnostic test as it relates to antigenic diversity. We discuss CATT sensitivity in the introduction (lines 115-120) as well as the discussion, where regional sensitivity differences are mentioned (lines 715-718). Given uncertainty about total diversity and time since initial infection, we have refrained from speculating about how diversity/timing could affect CATT sensitivity.

      An interesting feature of this new study is the apparent bias to type B N-terminal domain VSGs as well as the discovery that two patients share a specific VSG isolate (though it is not mentioned whether they are related by distance etc). This raises the possibility of substrains with different VSG archives that vary by geography.

      We found two VSGs which were expressed in more than one patient. One was expressed in two patients from the same village (village C) while the other VSG was common between two cases originating from villages C and D, some 40 km apart. We agree that our data generally support the possibility that the VSG archive might vary geographically. We have performed additional analyses suggested by reviewer #2 that support this idea: we have now shown that Tbg patient VSGs classified in this study, which originated from the DRC, are distinct from the VSGs encoded by the reference strain Tbg DAL 972 which was isolated in Cote d’Ivoire. We mention this possibility on lines 721-724.

      Alternatively it suggests that perhaps type B VSGs are picked up differentially by serology (and there the one feature of type B VSGs that could be shared, with regards to detection, is the O-hexose decoration on a number of type B VSG surfaces. Could CATT be detecting elements common to sugar decorated VSGs? Experimentally this is something that can be tested even with mouse infection materials.

      This is indeed an intriguing possibility. We mention this in the discussion (lines 772-778): “In T. brucei, several VSGs have evolved specific functions besides antigenic variation [74]. Recently, the first type B VSG structure was solved [75], revealing a unique O-linked carbohydrate in the VSG’s N-terminal domain. This modification was found to interfere with the generation of protective immunity in a mouse model of infection; perhaps structural differences between each VSG type, including patterns of glycosylation, could influence infection outcomes.” While this is an experimentally tractable explanation for the type B VSG bias we observe, we believe such experiments are beyond the scope of the current paper.

      Side comment: are the common VSGs mutated between patient samples?

      We classified VSGs as common between patient samples if they had >98% nucleotide sequence identity as well as meeting the other quality cutoffs such as 1% expression level and consistency across technical replicates. This identity cutoff still allows for several mismatches between sequences, which we do occasionally observe. However, we cannot confidently rule out that the “mutations” we observe are sequencing or PCR errors. Thus, we cannot say for sure if there are mutations between common VSGs.

      Reviewer #2


      1.Throughout the manuscript you observe 'diversity' in expressed VSG and its existence becomes a principal conclusion. I feel that the meaning of diversity and its significance is not sufficiently explained for the reader. In the abstract (l48) you say that there is 'marked diversity' in parasite populations. Presumably you mean parasite infrapopulations, i.e. within patients, not across the DRC? In any case, what is 'marked' about it, and relative to what? Why does it matter that there are multiple expressed VSG in a single patient at one time? Is this not a reasonable expectation for a population of (presumably) clones capable of switching the expressed VSG? How is this different to the view typical of the literature since 1970 that one VSG dominates while others wait in the background at low frequencies. If 'diversity' is the conclusion, then you need to define it and explain its significance more.

      When we refer to diversity, we do mean infrapopulations of parasites within patients, or individual animals in this case, rather than across the DRC. We have edited the text to make this clear (see below). However, the study which benchmarked the application of VSG-seq to quantify VSG expression in vivo during mouse did not support the previously-held view that one VSG dominates while others wait in the background at low frequencies. Frequently we observe a handful of VSGs present at 10-20% of the population at any timepoint, and many VSGs (~50% of all detected variants) present at “In a proof-of-principle study, we used VSG-seq to gain insight into the number and diversity of VSGs expressed during experimental mouse infections [30]. This proof-of-principle study revealed significant VSG diversity within parasite populations in each animal, with many more variants expressed at a time than the few thought to be sufficient for immune evasion. This diversity suggested that the parasite’s genomic VSG repertoire might be insufficient to sustain a chronic infection, highlighting the potential importance of recombination mechanisms that form new VSGs.

      2.Following on from 1., why does the analysis deal in counts of distinct VSG or N-terminal domains, and not then progress to their relative expression? The expression data are in Supp Table 3 and they show that, in most cases where many VSG are observed in the same patient, 1-3 of these are 'dominant', i.e. they account for >50% of the population.

      The VSG-seq analysis pipeline does estimate the relative expression level of each identified variant in the population, and this information is available in the supplemental data (Supplemental Figure 1, Supplemental Table 3). However, we chose not to rely on these measurements too heavily because there was some variation between Tbg technical replicates, which is shown in the supplemental heatmap (Supplemental Figure 1). Replicate three tends to not agree with the first two replicates. We suspect that this was due to the order of sample processing and the fact that the parasite-enriched cDNA sample was repeatedly freeze-thawed between library preparations for technical replicates. Additionally, because our sampling did not reach saturation, some VSGs are not detected in all replicate libraries, making it difficult to estimate their abundance.

      We have added a discussion of these issues to the text on lines 431-433: “Because our sampling did not reach saturation, resulting in some variability between technical replicates, we chose to focus only on the presence/absence of individual VSGs rather than expression levels within parasite populations.”

      Figure 1 deals in VSG counts, but I would then expect another figure to illustrate the reality that only a minority of these observed VSG are likely to be clinically relevant (i.e. the subject of the immune response). This impacts the 'diversity' conclusion, as given in the discussion (ll 657-9), because you cannot afford to treat all these VSG equally when their abundances are quite different.


      We agree that relative expression level is a useful metric, but absent longitudinal sampling it is impossible to determine which VSGs are clinically relevant as defined by the reviewer: low abundance VSGs at one time point may be the predominantly expressed variant at another. Moreover, the threshold for triggering an anti-VSG antibody response remains unknown. Thus, we have chosen to treat all detected variants equally.

      3.How related are these VSG? Were you able to ensure unique read mapping to the VSG assembly? Can you show that reads mapped to a single VSG only and therefore, that the RPKM values are reliable?

      Our analysis accounts for the fact that VSGs can be very similar. We only considered uniquely mapping reads in our VSG-seq analysis. We also account for mappability in our quantification, so VSG sequences that are less unique (and thus have fewer uniquely mapping reads) are not artificially underrepresented in estimates of relative expression. We have specified the parameters used for alignment (line 274) in the methods.

      4.The authors observed no orthology between expressed VSG and DAL972 genes. This is really interesting and deserves closer attention. Presumably there is microhomology? For T. brucei VSG, with constant recombination, we would predict that a comparison of the VSG in West and Central Africa would reveal a pattern of mosaicism, such that individual sequences in DRC would break down into motifs present in multiple genes in the West African reference. Question is, how many genes? What does that distribution look like? What is the smallest homology tract? There is an opportunity here to comment on how VSG repertoires diverge under recombination. How much of the expressed VSG sequence is truly unrepresented in the West African reference (or other T.b.gambiense genome sequences available in ENA). I can believe that none of the N-terminal domains in these data are present intact in DAL972, but I cannot believe that their components are not present without evidence.

      We appreciate the reviewer’s suggestion to look at this more closely. We have performed additional analyses to address sequence similarity, or lack thereof, between the assembled DRC patient VSG and the West African reference TbgDAL972. We ran a nucleotide BLAST of expressed VSGs against the TbgDAL972 genome reference sequence pulled from TriTrypDB.org (release 54). We have added a supplemental figure depicting the results of this analysis (Supplemental Figures 6 and 7). Briefly, our analysis shows that most of the N-termini we identified have no significant similarity to DAL972 VSGs, even with very permissive search parameters. There are frequent hits in the VSG C-termini, however, which might be expected. Most BLAST hits are short spans 98% identity are short 20-25 bp regions. Given the large divergence from the reference, we were unable to infer any patterns of recombination in the VSGs. However, we believe this analysis supports our claim that the N-termini of VSGs assembled from DRC patients are novel, with their component parts largely unrepresented in the West African reference genome.

      Figure 4 compares NTD type composition in the DRC data with previously published mouse experiments. The latter take place over very short timescales in maladapted hosts, while the timescales of the latter in natural hosts are unknown but plausibly very much longer. So are these data really comparable and are we learning anything from their comparison, given that the most likely explanation for the NTD bias in expressed VSG is the underlying genomic composition?


      Indeed, this is our intended conclusion from figure 4. The figure is meant to illustrate our claim that the expressed VSGs in each experimental set reflect the underlying genomic composition of their corresponding reference strains, despite fluctuations over time. The language and legend for Figure 4 has been clarified to emphasize this point. We have emphasized in the text that it is unknown whether these fluctuations occur over time in much longer natural infections.


      6.Please comment on the technical reproducibility of the data, there are multiple instances in Supp Table 3 where technical replicates expressed different VSG.

      Three RNA-seq library technical replicates were prepared for each individual gHAT patient RNA sample. Replicates were prepared in batches together so all 1’s were done on the same day, for example. The original parasite-enriched cDNA sample was frozen and thawed between each batch. We suspect that the cDNA degraded after repeated freeze-thaw cycles, which is why replicate three tends to not agree with the other two as can be seen on the heatmap in supp fig. 1 and the expression data in supp table 3. We also suspect the fact that our sampling did not reach saturation resulted in the detection of different VSGs in individual replicate preps. We have edited the methods and mentioned this variability in the results section to communicate this issue more transparently.

      • Lines 395-397 “Using RNA extracted from 2.5 mL of whole blood from each patient, we prepared libraries for VSG-seq in three separate batches for each technical replicate.”
      • Lines 431-433: “Because our sampling did not reach saturation, resulting in some variability between technical replicates, we chose to only focus on the presence/absence of individual VSGs rather than relative expression levels within the population”

      Reviewer #3

      1. In line 499, the authors conclude the due to the expressed VSGs being different in the blood and CSF being difference it may indicate that different organs harbor different VSG sets. Given that this is n=1 for patient samples I think this is too speculative a statement. There is also no indication as to whether the samples were taken at the same time or not.

      This is absolutely correct. The precise timing of CSF sample collection is unknown for these samples. It likely occurred within hours to days after blood collection, but even on this short time scale, the unique CSF repertoire could represent the antibody-mediated clearance of one VSG population and replacement with another. We have scaled back our language and only point out that there are unique VSGs in this space (Lines 522 – 524).

      I think that the authors need to be very careful as to the conclusions drawn about VSG expression over time in terms of hierarchy and N-terminal fluctuations. For any conclusions to be drawn on the hierarchy of VSG expression more data points are needed taken over time (this is obviously challenging when looking at patient samples). I find it too speculative to draw any conclusions when single time points are assessed and the assumption on the progression of the infection depends on whether it is a Tb or Tbr.

      Reviewer #2 also pointed this out. We agree and have attempted to limit definitive conclusions in the text and instead discuss multiple possible explanations behind our observations.

      I found some of the figure legends a bit terse. For example, in Figure 1 C, what do the black circles and lines represent? Perhaps a little more detail would help the reader.

      Clarified legends for UpSet plots in figures 1C and 3C as follows: “The intersection of expressed VSG sets in each patient. Bars on the left represent the size of the total set of VSGs expressed in each patient. Dots represent an intersection of sets with bars above the dots representing the size of the intersection.”

      In figure 2, I found it difficult to distinguish between the orange and dark red in (A) and the two lighter blue colors.

      We have changed N-terminal type color palette for all plots to make red and blue hues more distinctive.

      In line 389 – estimate

      Corrected

      In line 498 - should be reference been to figure 2C?

      This should be a reference to Figure 3B. We have corrected the reference.

    1. But while Americans can, he says, perceive that they are faced with “intricate social and cultural problems,” they “tend to think of them as scientific and technological problems” to be solved separately.

      Yes, because we rarely make progress in other aspects. For technological problems there's a clear solution that people will buy if it works. Deliberate sociological change you may have to force, which is never a good basis.

    1. Author Response:

      Reviewer #1 (Public Review):

      This is a clearly written manuscript describing an elegant study that demonstrates how microsaccades are not the triggers of attentional effects, and that attentional modulations can be observed in the absence of microsaccades. This is a very much needed work, especially in the light of the recent debate regarding whether or not microsaccades are the cause of peripheral attentional effects. By explicitly comparing and quantifying the effects of attention on neuronal responses in the presence and in the absence of microsaccades, this work provides important insights on this debate. I think the work is well conducted and the results are solid.

      We thank the reviewer for their supportive comments!

      I only have few comments/suggestions:

      1. Lines 125-126, the authors report that monkeys generated frequent microsaccades but their overall direction was not systematically biased towards the cue location. This seems to be in contrast with what previously reported in the literature in humans and monkeys. I think this discrepancy should be discussed in the discussion. Is this simply the result of different experimental paradigms (maybe exogenous vs endogenous attention, or the presence of the cue for the entire duration of the trial, ect)?

      As suggested, we discuss three main factors which may contribute to this discrepancy:

      The first factor is the difference in the time window used for microsaccades analyses. Previous reports focused their analyses of microsaccades on the time window immediately after cue onset. In our analyses, the time window focused on is the ‘delay period’ which is hundreds of milliseconds after the cue and the time epoch used in most electrophysiology studies about attention.

      A second factor is how the spatial cues were presented. In our paradigm the cue ring appeared in the periphery and then disappeared. In contrast, previous paradigms used a cue presented near fixation that persisted throughout the trial. Our brief peripheral cue provides less of an impetus to generate small saccades directed towards the cue, compared to the case when the cue is continuously near the center of gaze.

      A third factor is that monkeys in our task were trained to release a joystick to report their detection of stimulus events, rather than make a saccade. Because human and monkey subjects tend to make microsaccades in the same direction as their upcoming saccadic choices (Yu et al., 2016), attention tasks using saccade reports will tend to introduce this direction bias on microsaccades. By using a joystick release, we minimized these lateralized effects related to saccade preparation.

      These points are now addressed in the second paragraph of discussion.

      1. It is very interesting that microsaccades modulate neural responses for stimuli that are much further away from their landing location. However, the stimulus used in these cueing tasks is also unnatural. Normally we are not fixating on a meaningless dot while all the interesting stimuli are presented in the periphery. In normal conditions the foveal input is rich in detail and it is generally relevant (that's why we are foveating certain stimuli in the first place). I wonder if the authors can comment on whether the modulations reported here would also occur in more natural conditions when an interesting and maybe salient/relevant stimulus is presented at the center of gaze, while subjects are also attending to a peripheral target. Will the neural response be modulated selectively for neurons for which the receptive field is on the peripheral target or will it also affect neurons where the receptive field aligns with the microsaccade target location in the fovea?

      The reviewer raises a very good point. In our study, the relationship between microsaccades and attention-related modulation was examined when monkeys selectively attended a stimulus located in the near peripheral visual field while maintaining central fixation. We agree that under more natural conditions, the monkey would just look directly at the peripheral stimulus. As in many attention studies with this type of design, our experiments hold the system in a state of sustained peripheral attention which would otherwise be much shorter.

      We believe that similar modulation at the peripheral location would be briefly observed if the monkey were allowed to satisfy the natural tendency to look at the stimulus, although this would make it more difficult to examine the relationship with microsaccades. This would be consistent with the documented pre-saccadic modulation of attention (e.g., documented by the Carrasco lab, Li, Hanning, & Carrasco, 2021).

      Once the attended stimulus is foveated, there is strong behavioral evidence from several recent studies demonstrating that attention can be selectively distributed even within the fovea (Poletti, Rucci, & Carrasco, 2017). Considering the now substantial evidence that the foveal portion of the SC map is activated when the behaviorally relevant location is at the center of the visual field (e.g., during parafoveal smooth pursuit as in Hafed & Krauzlis, 2008), we expect that SC neurons with foveal RFs would display similar attention-related modulation as we found here. However, to the best of our knowledge, there have not yet been studies documenting the attention-related modulation of neurons with foveal RFs and the possible influence of microsaccades.

      We agree with the reviewer that these are interesting points, and have now added a new paragraph in the discussion (final paragraph) to address this point.

      1. The authors do not report behavioral performance. Presumably the task is very easy, but I wonder if reaction times and performance correct was related with the attentional effects and how did it change with respect to microsaccade direction, e.g., were subjects' reaction times shorter at the cued location also when microsaccades were directed at the opposite location? I think this information would be very valuable.

      We agree it is valuable to document the behavioral performance; we had omitted this because this is the same task we have used in previous studies which do include such behavioral documentation.

      To address the reviewer’s comments, we added an analysis and plot documenting the hit and false alarm rate for each subject in each experimental session. To accommodate this new plot, we have now divided the original Figure 1 (which included task, neuronal data and microsaccades) into a new Figure 1 (task, behavior, and neuronal data) and a new Figure 2 (microsaccades). The new plot showing hit and false alarms is Figure 1b in the revised manuscript.

      The task was not especially easy – we adjusted the amplitude of the color saturation change to be just slightly above the threshold for detection; hence, the hit rates were generally between 75-90%. The performance was very consistent across sessions in our well-trained monkeys, and the low rate of false alarms for ‘foil’ changes provides behavioral confirmation that they attended to the correct stimulus location.

      To address the comments about reaction time, we have added a new plot to our new Figure 2 (Figure 2c) showing the monkeys’ hit rates (top) and joystick release times (bottom) subdivided based on whether there were no microsaccades, microsaccade towards, and microsaccades away from the cued location (-50 to 50ms relative to cued stimulus change onset). These plots show that when there were no microsaccades, behavioral performance was at least as good as with microsaccades. When there were microsaccades, reaction times were slower when microsaccades were directed away from the cued location. As the reviewer may have anticipated, these effects again confirm that differences in attentional state as evident in task performance covary with the direction of microsaccades, and we thank them for the suggestions. We now added a new paragraph in the results to describe these findings.

      1. Another important difference in the paradigm used in Lowet et al vs the one described in this manuscript is that in Lowet et al monkeys were instructed to saccade toward the target position at some point during the trial after the cue and the target presentation. Hence, monkeys presumably prepared the saccade and held off its execution during the time the cue and the target were presented. This was not the case in the current paradigm, where the monkey is instructed to maintain fixation as in a standard spatial cueing paradigm. I wonder if this difference may explain some discrepancies in the results.

      This is a very good point. As mentioned in our reply to point #1 above, previous studies (Yu et al., 2016) have shown that human and monkey subjects tend to make microsaccades in the same direction as their upcoming saccadic choices. As pointed out by the reviewer, in the Lowet et al. study the directions of microsaccades might be related to the motor preparation of the upcoming choice saccade as well as related to the allocation of attention. In contrast, in our experiments, monkeys reported their choice by releasing the joystick and were prohibited from making larger saccades.

      We agree this can be an important factor for the differences in the results, and we now address these points in the second paragraph of discussion.

      Reviewer #2 (Public Review):

      This is a correlative study with the main result that microsaccades do not alter attention-related modulations of neuronal activity. This is an important question, speaking to the origin of one of the mind's most fundamental processes. The experimental manipulations and analyses are well chosen, carefully conducted and visualized. They include critical controls for alternative explanations.

      Thank you for your constructive comments.

      To ascertain their claims, however, it is important that the authors cover their ground. In pursuit of that, a few important analyses are required.

      1. Did the manipulation of attention work? In the present version of the manuscript, the authors do not report behavioral results, which is necessary to confirm that the cue was successful in manipulating attention. That is, the observed modulation in firing (in RF vs outside of RF) should be related to a behavioral advantage in sensitivity to changes at the cued location. To confirm the link of the neural results to attention (rather than, say, just the cue), the behavioral results provide opportunities for critical tests. One way to do this would be to analyze neural firing rates as a function of response rather than cue location (provided subjects made enough errors). Note: A detailed discussion of why the cue cannot be equated to attention can be found in Laubrock et al. (2010, Atten Percept Psychophys; https://doi.org/10.3758/app.72.3.683).

      Yes, the manipulation of attention worked. As suggested, we now document the effectiveness of the attention manipulation by plotting the hit and false-alarm rates for each subject in each experimental session (new Figure 1b). We also confirmed that the SC neuronal attention-related modulation depended on subjects’ behavioral response (new Figure 1d). We also note that these same attention manipulations have been used in previous studies examining the neuronal mechanisms of attention.

      1. Were all microsaccades detected? One of the main results of the study is that attention-related modulations were observed even in the absence of microsaccades. These results hinge on successful detection of all microsaccades, even at a very small scale. Given the video-based eye tracking the authors will have missed a (possibly large) number of smaller microsaccades (Poletti & Rucci, Vision Res, 2016; https://doi.org/10.1016/j.visres.2015.01.018). This concern is exacerbated by the fact that eye tracking was monocular, such that a validation of detected microsaccades based on the signal in the other eye could not be performed.

      We have performed additional microsaccade detection analyses using both more stringent and more lenient thresholds (the "lambda" value of Engbert & Kliegl, 2003). We have verified that our findings are robust over a range of detection thresholds and include a new supplemental figure to demonstrate this point (Figure 4 – figure supplement 2).

      1. Relation to previous claims of causality Hafed (2013, Neuron) reported perceptual changes in attentional cueing that covaried with the occurrence of microsaccades. Hafed (2013) argued that microsaccades might be underlying the performance changes commonly attributed to covert shifts of attention. This point seems central to the current paper's line of argument and should thus be discussed in detail with respect to the current findings. At present, the paper by Hafed (2013) is not cited in the current manuscript when its conclusions may need reconsideration based on the current results.

      We agree, and a similar point was raised by Reviewer #1. We have expanded the main text based on your recommendations.

    2. Reviewer #1 (Public Review):

      This is a clearly written manuscript describing an elegant study that demonstrates how microsaccades are not the triggers of attentional effects, and that attentional modulations can be observed in the absence of microsaccades. This is a very much needed work, especially in the light of the recent debate regarding whether or not microsaccades are the cause of peripheral attentional effects. By explicitly comparing and quantifying the effects of attention on neuronal responses in the presence and in the absence of microsaccades, this work provides important insights on this debate. I think the work is well conducted and the results are solid. I only have few comments/suggestions:

      1. Lines 125-126, the authors report that monkeys generated frequent microsaccades but their overall direction was not systematically biased towards the cue location. This seems to be in contrast with what previously reported in the literature in humans and monkeys. I think this discrepancy should be discussed in the discussion. Is this simply the result of different experimental paradigms (maybe exogenous vs endogenous attention, or the presence of the cue for the entire duration of the trial, ect)?

      2. It is very interesting that microsaccades modulate neural responses for stimuli that are much further away from their landing location. However, the stimulus used in these cueing tasks is also unnatural. Normally we are not fixating on a meaningless dot while all the interesting stimuli are presented in the periphery. In normal conditions the foveal input is rich in detail and it is generally relevant (that's why we are foveating certain stimuli in the first place). I wonder if the authors can comment on whether the modulations reported here would also occur in more natural conditions when an interesting and maybe salient/relevant stimulus is presented at the center of gaze, while subjects are also attending to a peripheral target. Will the neural response be modulated selectively for neurons for which the receptive field is on the peripheral target or will it also affect neurons where the receptive field aligns with the microsaccade target location in the fovea?

      3. The authors do not report behavioral performance. Presumably the task is very easy, but I wonder if reaction times and performance correct was related with the attentional effects and how did it change with respect to microsaccade direction, e.g., were subjects' reaction times shorter at the cued location also when microsaccades were directed at the opposite location? I think this information would be very valuable.

      4. Another important difference in the paradigm used in Lowet et al vs the one described in this manuscript is that in Lowet et al monkeys were instructed to saccade toward the target position at some point during the trial after the cue and the target presentation. Hence, monkeys presumably prepared the saccade and held off its execution during the time the cue and the target were presented. This was not the case in the current paradigm, where the monkey is instructed to maintain fixation as in a standard spatial cueing paradigm. I wonder if this difference may explain some discrepancies in the results.

    1. Because most of our science is supported by limited public funds, evolutionary biologists and ecologists should support and participate in efforts to help the public understand the issues and the value of scientific understanding. Science in general and evolutionary science in particular are often politicized, exactly because of their fundamental importance to human society.

      This is something that we can all look at and understand the importance of. This is one of the most important steps in solving most of the problems we face today. As I have gotten older I have realized and learned that there are a lot of things that we can learn from others, even when you may think that they do not have much to offer. This mentality that science must not be shared with the common man is outdated and must change if we want to progress.

    1. pg 38-"Consequently, attempts by theorists ... more fundamental or more grave"

      I think this is a huge problem that we have in society today, belittling or putting down someone's experience because it's not as bad as what it could have been, therefore it has not merit. I appreciate the author's dedication in correcting that wrong and clarifying that although the levels of oppression and/or the definition may not be the exact same for everyone, oppression is still oppression. I like to think of it like flavors of ice cream, some may be a lot stronger or have lots of different add ons, but in the end, it's still ice cream.

    2. Someone who does not see a pane of glass does not know that he does not see d. Someone who, being placed differently, does see it does not know the other does not see it.

      I love the beginning of this, it reminds me of the color of the sky argument. If someone says the sky is red, but you see the sky as blue, how can you tell them what they are seeing is wrong? For you may be seeing blue, but you can not look through their eyes, so you can not say what they are seeing is incorrect, only that you do not see the same color that they are seeing. But then it gets even trickier, because we do not know if their red is your blue or vice versa. It's a complicated thought that circles around perspective, something that I think is not only profound but also intricately important to all arguments and matters of discussion. A change or understanding in one's perspective is the difference between peace and war, and it's understanding all sides of a situation that allow us to begin to comprehend why anyone would view oppression as an acceptable way to treat another human being.

    1. Author Response:

      Reviewer #1 (Public Review):

      The investigators' goals were to describe the epidemiology and kinetics of post-acute covid lung sequalae and to determine the risk factors predictive of persistent lung impairment. A major strength of the study is the longitudinal observation through 6 months with protocolized clinical assessments that included patient-reported outcomes, lung function tests, inflammatory marker testing, and computed tomography of the chest, in a reasonably sized cohort that reflects the spectrum of disease severity in the pre-vaccination era. We learn a great deal about the different patterns of recovery in this group of COVID-19 survivors. The primary epidemiologic finding is that 52% of survivors continued to have symptoms at 6 months, while up to 72% of those with severe COVID requiring ICU level care continued to have lung abnormalities by chest imaging. This confirms general observations of "long covid" which also encompasses non-lung effects. While lung disease is less common in those with milder disease, the proportion of patients who were never hospitalized but experienced persistent symptoms is striking (50%), with lung function impairment in 17% at 6 months. As expected, the patients who had the most severe disease-those who needed the ICU-had the highest degree of chest imaging abnormalities. The kinetics of recovery is a significant observation: Figure 3 shows that most of the post-acute recovery in structural lung abnormalities occurs in the first 3 months and slows down thereafter, particularly for the hospitalized non-ICU patients. The investigators then embarked on a sophisticated analysis to determine how to predict persistent lung abnormalities (as detected by chest CT) at 6 months. When analyzed individually, among 50 clinical characteristics or lab values, the strongest unfavorable risk factors were elevated IL-6 (an inflammatory cytokine that is the target of tocilizumab) and CRP (c-reactive protein). Other variables that were strongly associated with CT abnormalities included immunosuppressive therapy, ICU stay as well as pre-existing conditions. When machine learning techniques were applied, risk factors that correlated with each other could be grouped together, and the patients could be categorized as low, intermediate, and high risk for delayed pulmonary recovery. As expected, known factors for COVID19 infection (age, male sex, medical comorbidities) and disease severity (need for oxygen therapy, ICU care and antibiotics) were more frequent in the intermediate and high risk groups. These predictive factors at acute COVID and day 60 follow-up mostly held up when tested against part of the cohort that was not used for analysis. Interestingly lung function impairment as measured by pulmonary function tests were only weakly correlated with persistent and severe chest imaging abnormalities.

      The novelty of this study lies in taking the epidemiology a step further with a machine learning analysis to determine which clinical characteristics and chest imaging features at the onset of acute COVID-19 are predictive of later persistent disease. One limitation of this study, however, is that it was conducted on patients in the early part of the pandemic, prior to the widespread use of remdesivir and corticosteroids/anti-cytokine therapies, that are now considered standard of care. Based on these findings, we can now hypothesize that current treatments are likely to reduce the impact of long-covid.

      We would like to thank the reviewer for careful study of the manuscript and appreciation of our work. We agree, that our longitudinal cohort and its hospitalized, severe COVID-19 subset in particular encompasses the patients, for whom the therapeutic armamentarium was limited and far from the therapeutic options available now. Whether novel anti-viral and anti-inflammatory medication as well as, in case of the vaccinated patients, the immunization status may accelerate the recovery or reduce the pulmonary damage is a matter of current research also in our center. We address this issue in the Discussion section to support a clear interpretation of the data by the interested reader.

      Machine learning (artificial intelligence, AI) is now being increasingly used to answer clinical questions on limited cohorts; the application of machine learning in this study contributes to our conceptual understanding of how clinical characteristics and biological factors cluster together to contribute to long-term COVID outcomes. Namely, the profound inflammation that characterizes severe acute COVID-19 pneumonia and poor early outcomes also contributes to chronic lung damage in survivors. In addition, a robust antiviral immune response (as seen with elevated anti-viral antibodies) without elevated systemic inflammatory markers were associated with less severe chest imaging patterns, also supporting the notion that an individual's immune response to the virus is responsible for the trajectory of disease. As noted, a significant proportion of non-hospitalized patients also suffered from chronic lung impairments. Taken together, the impact of prolonged convalescence on the workforce, healthcare, and individual lives should not be underestimated. These results underscore the paramount need for continued public health measures and vaccinations to prevent COVID-19, particularly for the most vulnerable individuals (older, immunocompromised, and with preexisting health problems). These observations provide additional biologic justification for the use of agents directed at reducing lung inflammation early in the course of disease, and potentially at an early post-recovery time point (i.e 2 months). Machine learning algorithms may one day help clinicians decide which patients should be targeted for additional therapies after the acute phase. With further study, implementation of AI to real world medicine may be on the horizon.

      We agree with the Reviewer that machine learning algorithms can overcome limitations of ‘canonical’, ordinal and generalized regression methods in the multidimensional setting i. e. when the number of available clinical parameters approaches or exceeds the number of observations/patients. Consequently, machine learning or AI allows for serial screening of medical record data at low cost and supports diagnostic and therapeutic decisions. We discuss those two aspects in the revised manuscript in the context of acute COVID-19 course prediction and long COVID prediction and phenotyping in light of the recent literature [1–4,6].

      Reviewer #2 (Public Review):

      This is a potentially valuable manuscript which links early markers of inflammation with residual abnormalities on chest CT following SARS-CoV-2 infection. Surprisingly, early surveyed symptoms do not predict long term radiologic outcomes (6 months after infection) while inflammatory markers have stronger predictive value. The cohort is well designed and the selected tools for analysis are appropriate.

      We thank the Reviewer for the careful study, critic and appreciation of our work.

      While this finding is potentially of high importance for clinical practice, the endpoints are inconsistently defined, and certain components of the machine learning and clustering analyses are difficult to interpret as presented. It is therefore challenging to understand whether the conclusions are justified by the analysis.

      We apologize for this unclarity. In the revised manuscript, we precisely define the analysis endpoints (any radiological lung findings at the 6-month follow-up, radiological lung abnormalities with CT score > 5, lung function impairment and persistent symptoms at the 6-month follow-up) of the analysis; see: Introduction and Methods/Study design. We also indicate the numbers of participants reaching those endpoints in Table 3.

      Several components of the analysis are confusing and would benefit from further elucidation:

      1) The authors do not clearly define "delayed pulmonary recovery". My sense is that they are using several radiologic based definitions rather than their functional definition (defined by FEV1, FEV:FVC & DLCO) of lung function but this is never explicitly stated. Are the functional outcomes and symptomatic recovery considered in any of the analyses other than correlations with radiologic findings in S1?

      As described above in our previous response, the prime focus and primary endpoint of the analysis was the presence of radiological lung abnormalities at the 6-month follow-up. Our motivation to focus on radiological endpoints was to focus on the potential development of persistent structural lung abnormalities, fibrosis and interstitial lung disease following COVID-19, as observed in SARS-CoV-1 patients [7,8]. Of note, lung function parameters were only weak correlates of radiological impairment as shown in Figure 3 – figure supplement 1 – 3 and our previous work [27]. This finding is in line with numerous studies in ILD patients which demonstrate a low sensitivity of lung function testing (especially FEV1 and FVC assessment) in patients with early interstitial lung disease (ILD) [10,11]. In addition, we could not exclude a pre-existing, COVID-19-independent impairment of lung function in a subset of the study participants suffering from pulmonary diseases, obesity and/or cardiovascular diseases (Table 1). Thus, lung function parameters only partially reflect COVID-19 mediated lung injury and convalescence.

      Nevertheless, we agree, that clinical and functional endpoints are of great interest for the scientific and clinical community. For this reason, we present additional results of univariable risk modeling for long-term (6-month follow-up) symptom persistence and lung function impairment (Figure 5, Appendix 1 – table 2), the results of machine learning modeling for those outcomes (Figure 9, Appendix 1 – table 5) and discuss the findings. We also present the prevalence of such long-term manifestations and lung function impairment in the Low-, Intermediate and High-Risk clusters of the study participants defined by non-CT and non-lung function clinical features (Figure 8).

      2) To this end, I was surprised that the functional definition and symptomatic recovery were not used as the primary endpoints. The functional definition and resolution of symptoms seem most important for the recovering patient so seems like the more important outcome. However, in Figures 5-7, it is often not clear whether the functional outcome is being considered at all.

      As mentioned above, the focus of the study was the assessment of structural lung impairment following COVID-19 and both, lung function parameters as well as symptom burden moderately correlate with structural lung damage (Figure 3 – figure supplement 1 – 3) – a phenomenon observed previously in SARS-CoV-1 [7,8]. Although the symptom burden and its resolution during follow-up are of major importance for the individual patient during post-acute recovery, these parameters are not a good marker for the potential long-term pulmonary outcome. E.g. younger patients with moderate to severe lung damage may demonstrate only mild pulmonary symptoms during post-acute recovery, but the structural damage may be associated with severe impairment at long-term follow-up due to progression of lung fibrosis or age-related decrease of functional pulmonary capacity [11]. Still, we agree with the reviewer that the follow-up on symptoms and lung function is of interest for the reader and additionally included those outcomes in the univariate and multi-parameter risk modeling. In addition, we present the frequencies of symptom persistence and lung function impairment in the low-, intermediate- and high-risk participant clusters defined solely by non-CT and non-lung function clinical parameters. See previous issue for more details.

      3) For the clustering in figure 5, I am uncertain how CT severity score >5 & CT abnormalities cluster separately, when these 2 outcomes appear to logically overlap. Specifically, does the CT abnormalities outcome include patients with the high severity score outcome? In other words, are patients in the "high severity" group a subset of patients with "CT abnormality"? If not a subset, then the CT abnormality should be labeled "non-severe CT abnormality". This could all be clarified by listing the number of patients in each group and showing with a Venn diagram whether there is any overlap.

      We apologize for the lacking clarity in this matter. As pointed by the reviewer, the patients with CT abnormalities scores > 5 points were a subset of the participants with any CT abnormalities. The same was true for the GGO-positive subgroup. We agree, that the overlap between the radiological outcomes obscures the message of the clustering and modeling results. To overcome this, we removed the GGO outcome variable from the analyses in the revised manuscript. In the revised manuscript, we clearly differentiate between mild (CT severity score ≤ 5) and moderate-to-severe radiological abnormalities (CT severity score > 5) in feature (Figure 6) and participant clustering (Figure 8). Frequencies of mild and moderate-tosevere CT abnormalities in the study collective stratified by the severity of acute COVID-19 are presented in Figure 3 – figure supplement 3B. Numbers of the study participants with any, mild or moderate-to-severe CT abnormalities at the subsequent follow-up visits are listed in Table 3.

      4) For the same reason, figure 4 is hard to interpret. Are CT severity >5 being compared to those with normal CTs only or those with normal or mild / moderate CTs? Please provide more specific definitions of normal, "CT abnormality" and "severe CT abnormality" and provide the number of people in each category and specify the comparator groups in all analyses.

      We are sorry for the confusion. In Figure 4 of the initial manuscript, any CT abnormalities, GGO-positivity and abnomalities with CT severity score > 5 were analyzed as separate outcome variables. The baseline was specific for the given explanatory variable, e. g. for the ICU stay this was the mild COVID-19 group or for the elevated IL-6, normal serum IL-6 levels. In the revised manuscript we present the modeling results in an abbreviated form for the 5 strongest co-variates of any CT abnormalities, moderate-to-severe CT abnormalities (CT severity score > 5), persistent symptoms and lung function impairment each (Figures 4 – 5). We indicate the baseline and the n number in the plots. The complete summary of univariable risk modeling with the requested information is provided in Appendix 1 – table 2.

      5) Similarly, how can GGO @V3 be used a potential explanatory variable for the outcome CT abnormalities @V3 when these 2 variables are clearly non-independent. Inclusion of highly related and likely correlated variables may throw off the overall conclusions of the clustering analysis.

      We agree with the editor and the reviewer that this representation was confusing. For this reason and the reasons described in Response 4, we removed the GGO variable from the revised analysis pipeline and differentiate between mild (CT severity score ≤ 5) and moderate-tosevere (CT severity score > 5) radiological lung abnormalities in modeling and machine learning classification. In addition, we define symptom and participant clusters solely with the non-CT parameters (Figure 6 – 7). To investigate the association of mild and moderate-to-severe CT abnormalities with other non-CT variables (Figure 6, Supplementary Figure S5), the CT features are assigned to the no-CT clusters by a k-NN-based label propagation algorithm, i. e. semi-supervised procedure [12,13,26] employed in our recent paper as well [6].

      6) In Figure 6, the criteria for the low, medium, and high-risk subsets are unclear. Is this high risk for persistent functional abnormality, radiologic abnormality, or both? Why were 3 sub populations selected? Was this done subjectively based on the clustering algorithm?

      This is an important issue. The study subject clusters were named according to the increasing frequency of any radiological lung abnormalities in the respective cluster (Figure 8A). We stress this more clearly in the revised manuscript. In addition, as suggested by the reviewer above, we show the frequency of functional lung impairment and persistent symptoms in the study participant clusters. There are multiple criteria for choice of the optimal clustering algorithm and the optimal number of clusters. In our cohort, two criteria for the choice of optimal clustering algorithm were applied:

      1. High fraction of the data set variance ‘explained’ by the cluster assignment (ratio of between-cluster sum-of-squares to the total sum-of-squares, Figure 6 – figure supplement 1A and Figure 7 – figure supplement 1A)
      2. The relatively highest cluster stability or reproducibility of the clustering structure in 20-fold cross-validation (Figure 6 – figure supplement 1B and Figure 7 – figure supplement 1B) [15] The optimal number of clusters of the study participants based on non-CT study variables was based on the algorithm (SOM + hierarchical clustering algorithm, see Reviewer 2, Issue 4) [17,18], as done usually in the unsupervised or semi-supervised setting. The prime criterion for the optimal cluster number was the bend of the curve of within-cluster sum-of-squares versus cluster number as presented in Figure 7 – figure supplement 1D. In addition, this decision was supported by a visual analysis the SOM node dendrogram (Figure 7 – figure supplement 1E) and the curve of the crossvalidated stability statistic (classification error) vs cluster number (Figure 7 – figure supplement 1F) [15].

      7) The accuracy and sensitivity of the machine learning approaches shown in S5 & S6 are somewhat limited. Please comment on why such highly granular data can only provide limited prediction about degree of lung damage post infection. Are there missing data types that might make the algorithm more predictive?

      This is an important issue that deserves more discussion in the revised manuscript. Each of the machine learning classifiers presented in the previous and the revised version of the manuscript was extremely sensitive and specific at predicting the outcomes in the training data encompassing the entire cohort (Supplementary Figure S11), as expected. However, their performance was way worse in repeated holdout (previous version) or 20-fold cross-validation (revision, Figure 9) used here as surrogate tools used to check the sensitivity and specificity with ‘unseen’ test data. We believe that there are two prime sources of such suboptimal performance: the size of the training set and the choice of the classifier. To address the first limitation, the following alterations to the analysis pipeline were introduced:

      1. We do not restrict the analysis to the subset of the CovILD study with the complete set of all variables. Instead, the non-missingness criterion is applied to each outcome variable separately (any CT abnormalities: n = 109, moderate-to-severe abnormalities: n = 109, lung function impairment: n = 111, persistent symptoms: n = 133).
      2. We altered the internal validation strategy. Instead of the repeated holdout approach applied to the machine learning classification, which strongly limits the size of the training data set, we switched to 20-fold cross-validation both for the cluster algorithms (Figure 6 – figure supplement 1BD and Figure 7 – figure supplement 1BF) [15] and the machine learning models (Figure 9, Appendix 1 – table 5) [19]. To address the second issue, the following changes were introduced:
      3. We compare the performance of a broader set of classifiers representing different classes of machine learning algorithms provided by the R package caret [19] (tree model: C5.0 [20], bagged tree model: Random Forests [21], support vector machines with radial kernel [22], shallow neural network: nnet [23], and elastic net regression: glmnet [24]) (Figure 9, Appendix 1 – table 4).
      4. Finally, a model ensemble representing a linear combination of the classifiers presented above developed with the elastic net regression algorithm (Figure 9, Figure 9 – figure supplement 2) and tools provided by caretEnsemble package [25]. Such model displayed better performance at predicting any CT abnormalities and persistent symptoms than single classifiers (Figure 9, Appendix 1 – table 5). Finally, we agree with the Reviewer, that the input variable set, despite its size, was still not complete. We believe that inclusion of other inflammatory markers recorded during acute COVID19 and at the 60-day follow-up may additionally improve the prediction of the radiological abnormalities at the 6-month follow-up visit. Of note, our data set missed important readouts of cellular immunity such as neutrophil levels or neutrophil: lymphocyte ratio (NLR) and blood parameters for the mild COVID-19 subset. We discuss this issue in more detail in the revised Discussion section.

      8) The authors state that "the sole application of a lung function measurement at screening for subjects at risk of delayed lung recovery may bear insufficient sensitivity". I am not sure that I agree with this assessment. From the perspective of a patient, full recovery of lung function with limited or no residual symptoms, even in the presence of residual chest CT abnormalities, seems like a favorable outcome. I would suggest either changing this statement or providing citations that associate residual chest CT abnormalities (in the absence of residual functional lung dysfunction) with adverse long-term outcomes. Do the authors hypothesize that persistent radiologic abnormalities may predate organizing pneumonia which will ultimately become symptomatic?

      We thank the reviewer for the interesting point of discussion. We agree with the reviewer that the functional status and symptom burden is of major importance for the individual patient in the postacute phase of COVID-19. Still, prioritizing lung function over mild structural lung abnormalities may pose two major problems. First, as previously discussed, lung function testing has a rather low sensitivity to detect early ILD [10,11], is not a good prognostic marker for long-term clinical outcomes and may not correlate well with patients' symptom burden. For instance, a patient with a normal lung function status may still be highly symptomatic (e. g. due to reduced capacity of respiratory muscle function) [7] and/or demonstrate structural lung abnormalities (e.g. it has been shown for various ILD that lung function test such as FVC and FEV1 may be normal even in pronounced disease and lung function testing is not sufficient to rule out ILD [10]). Second, to date, it is not known if persistent structural lung abnormalities following COVID-19 (even when mild) are at risk for progressing at long-term follow-up. Especially, sub-clinical structural changes may behave like incidentally detected interstitial lung abnormalities (ILAs) and develop to symptomatic progressive fibrotic interstial lung disease including IPF [11]. For this reason, we think that further pulmonary follow-up is necessary for patients with structural lung abnormalities due to COVID-19 and a sole focus on lung function is not sufficient to assess pulmonary COVID-19 outcomes [9].

      9) The authors note selection bias against ordering CT and perhaps inflammatory markers early during infection as a limitation. I would suggest a sensitivity analysis to understand whether this misclassification will impact the model's predictions.

      We now address this issue in a more detailed way. As shown in Figure 1, there was indeed a significant dropout of participants during the study due to missing the longitudinal visits and missingness of the longitudinal variable set. This phenomenon was indeed the most evident for the mild COVID-19 patients, who lost interest at the participation most likely because of subjective complete convalescence. This issue is discussed now as a limitation in the revised manuscript. In the revised manuscript, we investigated highly influential factors for clustering and machine learning classifiers. To determine, which variables played the most important role for the clustering of the study individuals, we applied the explanatory variable ‘noising’ procedure initially described by Breiman for the random forest algorithm [21] and compared the ‘explained’ variance (ratio of between-cluster sum-of-squares to the total sum-of-squares) of the initial clustering structure with the clustering structures generated in the datasets with noised variables. Although this algorithm is not free from shortages such as blindness to tight correlations, it may provide a coarse measure of the variable’s impact on the cluster formation (Figure 7 – figure supplement 2). For three of the machine learning algorithms tested importance statistics were extracted from the models: (1) for the C5.0 algorithm, the percentage of variable usage in the decision tree, (2) for the Random Forests algorithm, the delta of Gini index obtained by variable noising [21] and (3) for the elastic net/glmNet procedure, the absolute values of regression coefficients β [24] (Figure 9 – figure supplement 4 – 7). The technical details are provided in Methods, the cluster and model importance data are discussed in the manuscript text.

      References

      1. Gutmann C, Takov K, Burnap SA, et al. SARS-CoV-2 RNAemia and proteomic trajectories inform prognostication in COVID-19 patients admitted to intensive care. Nat Commun 2021;12. doi:10.1038/S41467-021-23494-1
      2. Benito-León J, Castillo MD Del, Estirado A, et al. Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study. J Med Internet Res 2021;23. doi:10.2196/25988
      3. Demichev V, Tober-Lau P, Lemke O, et al. A time-resolved proteomic and prognostic map of COVID-19. Cell Syst 2021;12:780. doi:10.1016/J.CELS.2021.05.005
      4. Estiri H, Strasser ZH, Brat GA, et al. Evolving phenotypes of non-hospitalized patients that indicate long COVID. BMC Med 2021;19. doi:10.1186/S12916-021-02115-0
      5. Sudre CH, Murray B, Varsavsky T, et al. Attributes and predictors of long COVID. Nat Med 2021;27. doi:10.1038/s41591-021-01292-y
      6. Sahanic S, Tymoszuk P, Ausserhofer D, et al. Phenotyping of acute and persistent COVID-19 features in the outpatient setting: exploratory analysis of an international cross-sectional online survey. Clin Infect Dis Published Online First: 26 November 2021. doi:10.1093/CID/CIAB978
      7. Hui DS, Wong KT, Ko FW, et al. The 1-Year Impact of Severe Acute Respiratory Syndrome on Pulmonary Function, Exercise Capacity, and Quality of Life in a Cohort of Survivors. Chest 2005;128:2247–61. doi:10.1378/CHEST.128.4.2247
      8. Ng CK, Chan JWM, Kwan TL, et al. Six month radiological and physiological outcomes in severe acute respiratory syndrome (SARS) survivors. Thorax 2004;59:889–91. doi:10.1136/THX.2004.023762
      9. Raghu G, Wilson KC. COVID-19 interstitial pneumonia: monitoring the clinical course in survivors. Lancet Respir. Med. 2020;8:839–42. doi:10.1016/S2213-2600(20)30349-0
      10. Suliman YA, Dobrota R, Huscher D, et al. Pulmonary function tests: High rate of falsenegative results in the early detection and screening of scleroderma-related interstitial lung disease. Arthritis Rheumatol 2015;67:3256–61. doi:10.1002/ART.39405/ABSTRACT
      11. Hatabu H, Hunninghake GM, Richeldi L, et al. Interstitial lung abnormalities detected incidentally on CT: a Position Paper from the Fleischner Society. Lancet Respir Med 2020;8:726. doi:10.1016/S2213-2600(20)30168-5
      12. Leng M, Wang J, Cheng J, et al. Adaptive semi-supervised clustering algorithm with label propagation. J Softw Eng 2014;8:14–22. doi:10.3923/JSE.2014.14.22
      13. Lelis L, Sander J. Semi-supervised density-based clustering. Proc - IEEE Int Conf Data Mining, ICDM 2009;:842–7. doi:10.1109/ICDM.2009.143
      14. Huang C, Huang L, Wang Y, et al. 6-month consequences of COVID-19 in patients discharged from hospital: a cohort study. Lancet 2021;397:220–32. doi:10.1016/S0140- 6736(20)32656-8
      15. Lange T, Roth V, Braun ML, et al. Stability-Based Validation of Clustering Solutions. Neural Comput 2004;16:1299–323. doi:10.1162/089976604773717621
      16. Hartigan JA, Wong MA. Algorithm AS 136: A K-Means Clustering Algorithm. Appl Stat 1979;28:100. doi:10.2307/2346830
      17. Kohonen T. Self-Organizing Maps. Berlin, Heidelberg: : Springer Berlin Heidelberg 1995. doi:10.1007/978-3-642-97610-0
      18. Vesanto J, Alhoniemi E. Clustering of the self-organizing map. IEEE Trans Neural Networks 2000;11:586–600. doi:10.1109/72.846731
      19. Kuhn M. Building predictive models in R using the caret package. J Stat Softw 2008;28:1–26. doi:10.18637/jss.v028.i05
      20. Quinlan JR. C4.5: Programs for Machine Learning. San Francisco, CA, USA: : Morgan Kaufmann Publishers Inc. 1993. doi:10.5555/152181
      21. Breiman L. Random forests. Mach Learn 2001;45:5–32. doi:10.1023/A:1010933404324
      22. Weston J, Watkins C. Multi-Class Support Vector Machines. 1998.
      23. Ripley BD. Pattern recognition and neural networks. Cambridge University Press 2014. doi:10.1017/CBO9780511812651
      24. Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw 2010;33:1–22. doi:10.18637/jss.v033.i01
      25. Deane-Mayer ZA, Knowles JE. Ensembles of Caret Models [R package caretEnsemble version 2.0.1]. 2019.https://cran.r-project.org/package=caretEnsemble (accessed 13 Dec 2021).
      26. Glennan T, Leckie C, Erfani SM. Improved Classification of Known and Unknown Network Traffic Flows Using Semi-supervised Machine Learning. Lect Notes Comput Sci (including Subser Lect Notes Artif Intell Lect Notes Bioinformatics) 2016;9723:493–501. doi:10.1007/978-3-319-40367-0_33
      27. Sonnweber T, Sahanic S, Pizzini A, et al. Cardiopulmonary recovery after COVID-19 - an observational prospective multi-center trial. Eur Respir J Published Online First: 10 December
      28. doi:10.1183/13993003.03481-2020
    1. JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker." ||| ((( PHIPLE TRENIXON ? STARK TRE )))) Do we not think "The Truman Show" and Oppenheimer and Heisenberg and Ensteini are telling? This is Adam Dobrini on the "who dropped the BIG one? Gates?" ⚓ President Kennedy's speech did remarkably well, possibly because of the time; or the lack of links that make it harder for a team of evil monkeys and mindless machines to mark the light of the world as SPAM... also possibly because of the content of the message.  It is a powerful speech, filled with words that stir the blood and shake the foundation of what it means to be an American.  Often characterized as a speech against secret societies, Kennedy's description of the enemy of civilization is very ambiguous; and the infiltration and time period add to my cursory beliefs that JFK was likely talking about a conspiracy that had something to with communist infiltration and Joseph McCarthy, the subversive actions and effects of organized espionage which appear to once again be at the forefront of what we believe is driving the macro level machinations of our world.  Years ago, I wrote a bit about the "Two Witnesses" of Revelation and named them based on my experiences with the same "monolithic and ruthless" conspiracy the President was speaking about in 1967, the people I named were John Nash and James Jesus Angleton.   Both of these people believed very much that they were fighting against a Soviet conspiracy, one that was more organized and more powerful than anything they had ever seen... and frankly beyond the realm of possibility.  Hollywood had recently immortalized both of these stories in movies bearing names of newly rekindled meaning, "A Beautiful Mind" echoed by John Legend and "The Good Shepherd;" and these two treatises on fear and subversion do a pretty good job of showing how God's hand is at work in the stories of Hollywood, he is telling the world a story... trying, to teach us how to survive in the land of wolves in sheep's clothing.  Nash may be the most famous (today, anyway) victim of the Tribulation; and much of my writing and the proof I present is designed to help explain to the world that his schizophrenia was not a naturally caused mental illness, bur rather a weapon wielded not only against him but against the entirety of humanity.    Directly causing disbelief of eye witness and credible testimony, indirectly ... or maybe more directly in your eyes ... physically causing that disbelief using technology that directly modifies our thoughts and beliefs, our opinions, changing who we are and doing so in such a covert manner that nobody would ever know the difference if it wasn't pointed out--in some cases over, and over, and over again. This same technology has the ability to cause people not only to collude against their own best interest, but also to blame themselves or believe they are somehow at fault for actions they had no way to control; only they also have no way to know that because in this particular case the primary purpose of this subversive movement is to hide the existence of this technology in sum.  In homage to my favorite childhood novel, Ender's Game (I have to note "Light Son" in the translation of the authors first) I spent a good portion of time years ago trying to subtly lay down a significant amount of proof of the existence of this technology on forums all over the internet from Wikipedia to Reddit using the names "Prometheus Locke" and "Damonthesis."    Not surprising to me, I ran headfirst into the manifestation of this conspiracy; dozens if not hundreds of people who simply refused to believe that the information I was presenting was factual or important... despite it coming from sources like the KGB, the NSA, a number of military publications as well as my own interpretation of ancient hieryglyphs in Dendera and Greek and Christian art which depicts this "subversive technology" as a sun disk surrounding our minds.   These pushes towards the truth, along with almost everything I have written are still available for you to see and read today; including the monolithic and ruthless stupidity of a large group of people acting in concert to hide something that is the difference between life and death.  Stand there and do nothing, and you are a part of that ruthless conspiracy as soon as the information is gone; and then there is nothing you can do about a world that will be plunged into darkness forever.  Take this moment to reflect, it is there for you to see just how easily the truth can be hidden from the entire world and barely anyone would ever notice. There is a war for the sanctity of our souls going on all around us.  That is not some esoteric thing, the soul; it is truly who you are and what you believe.  The Religion of the Stars would tell you that were this war to continue unchecked in secret as it is being waged now in order to control the proliferation of knowledge that we are in simulated reality and that our minds and beliefs are being altered... we would one day wind up in the mythical place where there are multiple "species" who all appear to be human, bi-ped and with nearly identical physiology; and yet they would not remember that they must have had a common planet of origin simply based on the truth of biological evolution, nor would they have any emotions.  You see, as this war continues, it is our emotions and beliefs that cannot be reinforced externally; to win a war with mind control only logic could be externally reinforced, and then we would logically conclude that humanity would either magically become Romulan or Vulcan... in order to preserve "life" rather than "society" or "civilization." Do not take the truth for granted, you stand at the forefront of a battle in a world where our aggressors believe that they are more civilized than us, more advanced, and both sides worry that were this technology to fall in our laps that we would do the wrong thing.  Perhaps artificially create a vendetta that could destroy everything, the Romulans; or perhaps in our infantile growing stage voluntarily give us too much of what has given us the great society we have... what has allowed us to survive and continue civilizing simply because we do not understand the technology and what kind of effects come from changing ourselves freely.   Yesterday, I read an article about two people that want to use black market brain implants to jack themselves into the Matrix.  Careful, because now we are in Star Trek, in the Matrix and not knowing it... and trying desperately to find a doorway to Heaven that I keep on telling you is speaking to you and telling you that the doorway is ending world hunger... you have to put it on TV. I say with all my heart that I know we are living in a simulated reality.  I have seen the evidence with my own eyes, not only effecting my senses but also the actions of so many people around me that I can say with confidence that if we do not publicly expose the existence of this technology our civilization is lost.  I can tell you and show you that it is the primary purpose of religion in general to expose the connection between divine inspiration, demonic possession, and Adam Marshall Dobrin.  It is the crux of a unifying thread from ancient Egypt to the book of Genesis to Joshua and the book of Revelation and the movies the Matrix and  the Fifth Element.  Showing us all this is God's message, timeless, and powerful not just in the ancient days of sackloth and etching the truth on stone tablets, but also today when the truth is etched at the heart of our Periodic Table and in the skies in the names of American Micro Devices and nearly every movie you see.  Do not take for granted that we have a benefactor in the sky trying to help civilization continue to thrive, do not think we've already won; he is telling you through me it is censorship and secrecy that are the manifestations of this very advanced technology that destroy civilization, they are the abyss--and you stand motionless. I can tell you over and over again that Quantum Entanglement is a key point of God's plan; one that shows us very clearly that even the smartest minds in physics have failed to connect the simple dots that show us that the idea of "wave-function collapse" is very much a real manifestation of a computer rendering engine--and that it's absolutely impossible for life to have been created in this world of matter not realizing itself until it is viewed by a conscious living observer.  More to the point, today, it is absolutely impossible for life to come out of this place while we do not understand the importance of what "virtual reality" and its connection to civilization mean--because of quantum mechanics our entire civilizations beliefs and understanding of the natural laws of the universe have been greatly harmed and turned the complete wrong way because of a belief that a phenomenon we see here is a "natural one" that can be mathematically unified with things like gravity and electromagnetism.  Spooky action, no longer at a distance, is that we will all be ghosts if you do not help me to spread the truth. I try to understand what it is that your minds believe makes the difference between "news" and ... something that you should hide from the rest of the world.  A number of news outlets have covered a story based on nothing more than "logical speculation" that we are in fact living in a simulated reality.  As a novelty, you might notice that it hasn't done much of anything; even to reveal the very simple truth that not knowing this thing is keeping our society from having the knowledge it needs not only to continue the spark of life in the natural universe, but to continue "civilizing" and realizing that ending world hunger and sickness are not merely possibilities or choices we might make were this information proven... they are mandatory, we would all do them.  So says the creator of this Universe who has given us this message to see the trials and hurdles that are the barrier between Heaven and Hell. He has written this message, along with proof of its single author in ancient myth, in our holy scriptures, in many songs and movies today--ones which reveal not only the existence of a Creator but also the tools and technologies of Creation, the very issue at hand; things that would be misused inadvertently and absolutely abused if we did not have religion and guidance from abo.... to help us to see that this exact event has happened before and our current struggle is an effect of what was done right and wrong the last ... four times, at least.   In Judaism we have a holiday called the "Festival of Weeks," how many times would you like to live this life over and over again without knowing that was happening?  Religion here holds a hidden record of traversals through this maze of Revelation, it screams to us to see what free will and predestination truly mean, and to understand that in order to truly be free we must understand and harness not only these technologies but our own pitfalls and mistakes, like refusing to see the past... let alone learn from it. I don't talk much about what is going on in my life, despite it being somewhat interesting to me--and probably to the world.  About a year ago I stood before a county judge in Broward county and .. in addition to a myriad of factual evidence collected from things like GPS receivers prepared a defense to a simple crime of having some drugs in my pockets that included the use of a set of songs that told a story about a man with pockets full of Kryptonite (Spin Doctors) or High (The Pretty Reckless)... knowing that these songs like many others were truly about me, about this trial, about the Trial of Jesus Christ.  Two more songs define the trial more, 3 Doors Down asks "if I go crazy, will you still call me Superman?" and many years earlier American Pie predicted the outcome; "the court room was adjourned" and "no verdict returned." Despite being a National Merit Scholar who most likely has a higher I.Q. and better education than you; and despite having a fairly decent story with some evidence that I know is verifiable and will be verified; a large number of psychologists declared that I was unfit to stand trial because of something like "insanity" for nothing more than the religious belief that I am the Messiah.  Because of this violation of my First Amendment right to religious freedom, a court--following all the regulations designed by our broken legislature--withheld my Constitutional right to a fair trial, refused bail, and held me for what amounted to an indefinite period ... all designed by some evil force to keep you from hearing from me, that's what it boils down to.  All of my life, all of my trials and tribulations, a weapon against you, against our people. I did wind up being able to present a significant amount of this information in open court on the record, by the grace of God.  Knowing that this was the fabled Trial of Jesus Christ gave me the impression that perhaps one day I would walk into that court room and it would be filled with press.  Much to all of our surprise, one day I did walk into that court room and see an industrial strength television camera and a very pretty reporter standing next to it.  They were there to do a story on the problems of the mental health court system, in a county again... named Broward.  I read some of my speech, the Rainbow Ticket I think, to the Judge in open court and on camera that day; and Roxanna followed me out of the court room with her camera and a big microphone that day. I suppose I should have screamed that I was the Messiah and I needed help, but I could not bear to do that; and instead we had a fairly boring conversation.  She wrote an article, and AJAM was put out of operation while they were in Broward. You are opposed around the world by a monolithic and ruthless conspiracy.   It is affecting how you think, and how I think.  I need you to see that spreading this information will fix our problem.  I need you to understand that to break through this wall God had to write the truth in nearly every name of everything and every language.  That Thor's thunder is on the radio in every song so that you will hear the voice of God; so that you will listen to me and the thousand of other knowing victims of this technology that are put on a fiery pedestal to shed light on the rest of us, all truly victims of this technology.  I need you to try now. The very word “secrecy” is repugnant in a free and open society. And we are as a people, inherently and historically, opposed to secret societies, to secret oaths, and to secret proceedings. We decided long ago, that the dangers of excessive and unwarranted concealment of pertinent facts far outweigh the dangers which are cited to justify it. Even today, there is little value in opposing the thread of a closed society by imitating its arbitrary restrictions. Even today, there is little value in assuring the survival of our nation if our traditions do not survive with it. And there is very grave danger that an announced need for increased security will be seized upon by those anxious to expand its meaning to the very limits of official censorship and concealment. That I do not intend to permit to the extent that it’s in my control. And no official of my administration whether his rank is high or low, civilian or military, should interpret my words here tonight as an excuse to censor the news, to stifle dissent, to cover up our mistakes, or to withhold from the press or the public the facts they deserve to know. For we are opposed around the world by a monolithic and ruthless conspiracy that relies primarily on covert means for expanding its sphere of influence, on infiltration instead of invasion, on subversion instead of elections, on intimidation instead of free choice, on guerrillas by night instead of armies by day. It is a system which has conscripted vast human and material resources into the building of a tightly knit highly efficient machine that combines military, diplomatic, intelligence, economic, scientific and political operations. Its preparations are concealed, not published. It’s mistakes are buried, not headlined. Its dissenters are silenced, not praised. No expenditure is questioned, no rumor is printed, no secret is revealed. No President should fear public scrutiny of his program. For from that scrutiny comes understanding, and from that understanding comes support or opposition, and both are necessary. I’m not asking your newspapers to support an administration. But I am asking your help in the tremendous task of informing and alerting the American people. For I have complete confidence in the response and dedication of our citizens whenever they are fully informed. I not only could not stifle controversy among your readers, I welcome it. This administration intends to be candid about its errors. For as a wise man once said, an error doesn’t become a mistake until you refuse to correct it. We intend to accept full responsibility for our errors. And we expect you to point them out when we miss them. Without debate, without criticism, no administration and no country can succeed, and no republic can survive. That is why the Athenian lawmaker, Solon, decreed it a crime for any citizen to shrink from controversy. That is why our press was protected by the First Amendment, the only business in America specifically protected by the Constitution, not primarily to amuse and to entertain, not to emphasis the trivial and the sentimental, not to simply give the public what it wants, but to inform, to arouse, to reflect, to state our dangers and our opportunities, to indicate our crisis and our choices, to lead, mold, educate and sometimes even anger public opinion. This means greater coverage and analysis of international news, for it is no longer far away and foreign, but close at hand and local. It means greater attention to improve the understanding of the news as well as improve transmission. And it means finally that government at all levels must meet its obligation to provide you with the fullest possible information outside the narrowest limits of national security. And so it is to the printing press, to the recorder of man’s deeds, the keeper of his conscience, the courier of his news, that we look for strength and assistance. Confident that with your help, man will be what he was born to be, free and independent.  John F. Kennedy’s address before the American Newspaper Publishers Association on April 27, 1961 ᐧ Truth changes. Yesterday your truth was that you were an inhabitant on the only planet in the reality you knew that was also the beginning of life in the Universe.  Right this moment, almost all of that is not actually true; but you probably still believe it.  Soon, the real truth will actually be true; and that is that you are not in reality, and you are in the place that created the beginning of life (once more) in the Universe as well as the place that created (a) Heaven.  What’s more illustrious still, is that we will be part of the place that effectively  and happily bridges reality with Heaven; and shows the entire Universe that civilization can survive the invention of virtual reality. 2read.net

      JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker." ||| ((( PHIPLE TRENIXON ? STARK TRE )))) Do we not think "The Truman Show" and Oppenheimer and Heisenberg and Ensteini are telling? This is Adam Dobrini on the "who dropped the BIG one? Gates?"

      ===============================================================================================================================================================================================================================================================================================================

      شبح قيصر العظيم! مطاردة وبطاقة هو HSL ... HST؟

      Von: Adam Marshall Dobrin <adam@fromthemachine.org>

      Datum: Freitag, 14. Jänner 2022 um 21:45

      An: XM <XM@liber-t.xyz>

      Betreff: [EXT] JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker."

      Said to be the speech that "got him killed" by many people; this is supposedly his last public words before the grassy knoll and the "Lone Gunman" appeared on X-files .. long after I can recall my mother being on the beach in Miami when she retold of the news that echoed around the cosmos.  It appears to be a very stern warning of something that is loosely described at the time but very clearly the machinations of the very same machine which I fight against, and which we now might associate with "the Borg" and the "tide turning" of Hunter S. Thompsons famous quote ... now "more famous than ever before."

      מה**נשתנה**

      I am writing a final summary of the "state of outer space" as far as I can see; and we stare at the ISS and MIR and a world that has forgone in fiction the gravity of the situation here.  We are stagnant and without a hope or dream in the world of ever reaching interstellar sleep or intragalactic dreams; without a significant kick in the "wake up NASA, wake up ESA, at the Chandrayan-2 and at the floating paper hot air balloons ... God has shown me the Diaspora launching from the sands of the Navajo desert, and I believe Deseret is very explain-ably tied to Sherharazade and to the thousand and first night that has ended with a fusion of Nobel "peas" and a strange dissertation on the change wrought by the lack of "gas chambers" in Night and the very upsetting modicum of "unreality" that permeates the change from chants of the Shema to speaking Aramaic and the mourners Kaddish as the "crematorium" heralds an era of "what's the point of history without literature" and "where did Exodus ever lead us without to the parting of Dead and Red seas without a Nile and without an Amazon?"

      מה נשתנה

      I write here about my religion, and speak it to you in written words, I am a Yiddish Jihadist, born and raised a Jewish American; Bar Mitzvah'd in the true Temple and House of God, a place in Plantation, FL that literally "means everything to me."  I read words from the Torah about Joseph's Dream and wrote a speech that I wish to God today I could retrieve and see on the original film it was recorded in.  It might be possible.  It would be a tragedy not to deliver the original to my eyes and to anyone who wanted to see what a child reading of a dream he did not understand spoke of words from the future about the creation of Heaven and the truth about the avenu malkaynu.

      This night is different from all others.  This is the end of days and the fire of the last day.  Before I would have told you all you needed was Norse and a decent American Education to really understand the difference between a throne and the Cherubim, but today it's become more clear that we need more than Pink Floyd and more than another brick in the wall.  We need more than Islam and we need more than 9/11 and Yom Hashoah.  We need me, and we need you on your knees begging for forgiveness for the travesty of shambles out country is in.  How dare you continue every day to speak in vain abovcl the Lord you dare call "stupid" and a "motherfucker" you belong in your graves today and you will see them.

      I stand here writing from the foot of Styx, from the land that sees Narayana and still "begs to differ" despite a resounding call from the Heavens themselves to read Exodus and to change the land that we live in bnefore a "flood worse than [just] in your heads" overcomes and subsumes the entirety of the skies and grounds once mistakenly called "an empire of dirt."  On Adamah I write to the secret echelon of the United States of America known as the Fifth Column and I beg them to scream like the Heavens have never seen Samael and Dr. Seuss wonder about the land of Who-ville and Hungry where the child "in Cindy" and the songs of the Doors here beckon us to contrast Stargate and Star Trek TNG, for the light of the Holodeck and the Holocaust which screams that we understand the meaning of "Holographic Universe" and stand down fighting the tyranny of "one" who stands for the unity of all and the salvation of the entirety of the cosmos.

      We are mistaken to stand against me.  I am the cat with nine billion lives and nothing and nobody will prevail when I am in jeopardy.  Gilgamesh says "against me only the most hubristic slaves in the galaxy would dare to rise up, for the mightiest weapon in the entirety of history is nothing more than intelligence and the civilization that has given mortality a shake of the Thunderstick of Prince and the Lightning of Blitzen.  My red nose shines bright, and if you ever saw it ... you might even say ... "it glows!"

      With these words I leave you, and prayers that this is our last night without an IDL that moves from Beiring to "north of the moon" and beyond the "Eleventh House" of Aquarius.  This is the dawning of the age of the Sagitar, and I need a "scimitar post green key" to really be sure that I've "gotten through" the madness.  In my dreams I've traveled to Ultima Thule and then untraveled it, I've begged for transit to Ceres and without actually achieving it we can shoot arrows at the SOL star we will ever see in or near the place known to the Jews of Lore as "the Holy of Holies."

      שְׁמַע יִשְׂרָאֵל יְהוָה אֱלֹהֵינוּ יְהוָה אֶחָֽד׃

      Sh'ma Yisra'el, YHWH 'eloheinu, YHWH 'eḥad:

      בָּרוּךְ שֵׁם כְּבוֹד מַלְכוּתוֹ לְעוֹלָם וָעֶד

      Baruch shem kevod malchuto le'olam va'ed

      صلاة العشاء

      For most of my life I would have genuflected and listened to Mordechai and to my Rabbi; I would have told you all that I was brought up by the best Jews in America and we do not believe in God--though in our hearts we believe we are him and we fear him.

      Things have changed.  This is the creator of believe the words are about me and believe I am in disbelief and truly it might be the day of awe;

      Barukh atah Adonai Eloheinu melekh ha'olam, bo're m'orei ha'esh

      the walls and halls will fade away ...

      --

      Forwarding this in advance of today or tomorrow's ... "Northeaster" email ... it's an old one that wasn't widely distributed; mostly (about) his speech, which of course is a gem.

      What I'm writing now is supposed to be focusing on ... basically "how knowledge of technology" things lke "time travel" and "virtual reality" ... changes our "perspective" on things; like what's important and what's ... the end of everything.  Anyway, I hope you see that there are things that can be destroyed and there are things that can't--for instance if we lost "life" totally, we wouldn't ever see us again ... unless we encountered something very strange--

      On the other hand we could lose "computers" and rebuild them quickly--see that, it's important.

      A/S/L tho ... ???

      ---------- Forwarded message ---------

      Date: Sun, Dec 2, 2018 at 5:35 PM

      President Kennedy's speech did remarkably well, possibly because of the time; or the lack of links that make it harder for a team of evil monkeys and mindless machines to mark the light of the world as SPAM... also possibly because of the content of the message.  It is a powerful speech, filled with words that stir the blood and shake the foundation of what it means to be an American.  Often characterized as a speech against secret societies, Kennedy's description of the enemy of civilization is very ambiguous; and the infiltration and time period add to my cursory beliefs that JFK was likely talking about a conspiracy that had something to with communist infiltration and Joseph McCarthy, the subversive actions and effects of organized espionage which appear to once again be at the forefront of what we believe is driving the macro level machinations of our world.  Years ago, I wrote a bit about the "Two Witnesses" of Revelation and named them based on my experiences with the same "monolithic and ruthless" conspiracy the President was speaking about in 1967, the people I named were John Nash and James Jesus Angleton.

      Both of these people believed very much that they were fighting against a Soviet conspiracy, one that was more organized and more powerful than anything they had ever seen... and frankly beyond the realm of possibility.  Hollywood had recently immortalized both of these stories in movies bearing names of newly rekindled meaning, "A Beautiful Mind" echoed by John Legend and "The Good Shepherd;" and these two treatises on fear and subversion do a pretty good job of showing how God's hand is at work in the stories of Hollywood, he is telling the world a story... trying, to teach us how to survive in the land of wolves in sheep's clothing.  Nash may be the most famous (today, anyway) victim of the Tribulation; and much of my writing and the proof I present is designed to help explain to the world that his schizophrenia was not a naturally caused mental illness, bur rather a weapon wielded not only against him but against the entirety of humanity.    Directly causing disbelief of eye witness and credible testimony, indirectly ... or maybe more directly in your eyes ... physically causing that disbelief using technology that directly modifies our thoughts and beliefs, our opinions, changing who we are and doing so in such a covert manner that nobody would ever know the difference if it wasn't pointed out--in some cases over, and over, and over again.

      This same technology has the ability to cause people not only to collude against their own best interest, but also to blame themselves or believe they are somehow at fault for actions they had no way to control; only they also have no way to know that because in this particular case the primary purpose of this subversive movement is to hide the existence of this technology in sum.  In homage to my favorite childhood novel, Ender's Game (I have to note "Light Son" in the translation of the authors first) I spent a good portion of time years ago trying to subtly lay down a significant amount of proof of the existence of this technology on forums all over the internet from Wikipedia to Reddit using the names "Prometheus Locke" and "Damonthesis."    Not surprising to me, I ran headfirst into the manifestation of this conspiracy; dozens if not hundreds of people who simply refused to believe that the information I was presenting was factual or important... despite it coming from sources like the KGB, the NSA, a number of military publications as well as my own interpretation of ancient hieryglyphs in Dendera and Greek and Christian art which depicts this "subversive technology" as a sun disk surrounding our minds.

      These pushes towards the truth, along with almost everything I have written are still available for you to see and read today; including the monolithic and ruthless stupidity of a large group of people acting in concert to hide something that is the difference between life and death.  Stand there and do nothing, and you are a part of that ruthless conspiracy as soon as the information is gone; and then there is nothing you can do about a world that will be plunged into darkness forever.  Take this moment to reflect, it is there for you to see just how easily the truth can be hidden from the entire world and barely anyone would ever notice.

      There is a war for the sanctity of our souls going on all around us.  That is not some esoteric thing, the soul; it is truly who you are and what you believe.  The Religion of the Stars would tell you that were this war to continue unchecked in secret as it is being waged now in order to control the proliferation of knowledge that we are in simulated reality and that our minds and beliefs are being altered... we would one day wind up in the mythical place where there are multiple "species" who all appear to be human, bi-ped and with nearly identical physiology; and yet they would not remember that they must have had a common planet of origin simply based on the truth of biological evolution, nor would they have any emotions.  You see, as this war continues, it is our emotions and beliefs that cannot be reinforced externally; to win a war with mind control only logic could be externally reinforced, and then we would logically conclude that humanity would either magically become Romulan or Vulcan... in order to preserve "life" rather than "society" or "civilization."

      Do not take the truth for granted, you stand at the forefront of a battle in a world where our aggressors believe that they are more civilized than us, more advanced, and both sides worry that were this technology to fall in our laps that we would do the wrong thing.  Perhaps artificially create a vendetta that could destroy everything, the Romulans; or perhaps in our infantile growing stage voluntarily give us too much of what has given us the great society we have... what has allowed us to survive and continue civilizing simply because we do not understand the technology and what kind of effects come from changing ourselves freely.

      Yesterday, I read an article about two people that want to use black market brain implants to jack themselves into the Matrix.  Careful, because now we are in Star Trek, in the Matrix and not knowing it... and trying desperately to find a doorway to Heaven that I keep on telling you is speaking to you and telling you that the doorway is ending world hunger... you have to put it on TV.

      I say with all my heart that I know we are living in a simulated reality.  I have seen the evidence with my own eyes, not only effecting my senses but also the actions of so many people around me that I can say with confidence that if we do not publicly expose the existence of this technology our civilization is lost.  I can tell you and show you that it is the primary purpose of religion in general to expose the connection between divine inspiration, demonic possession, and Adam Marshall Dobrin.  It is the crux of a unifying thread from ancient Egypt to the book of Genesis to Joshua and the book of Revelation and the movies the Matrix and  the Fifth Element.  Showing us all this is God's message, timeless, and powerful not just in the ancient days of sackloth and etching the truth on stone tablets, but also today when the truth is etched at the heart of our Periodic Table and in the skies in the names of American Micro Devices and nearly every movie you see.  Do not take for granted that we have a benefactor in the sky trying to help civilization continue to thrive, do not think we've already won; he is telling you through me it is censorship and secrecy that are the manifestations of this very advanced technology that destroy civilization, they are the abyss--and you stand motionless.

      I can tell you over and over again that Quantum Entanglement is a key point of God's plan; one that shows us very clearly that even the smartest minds in physics have failed to connect the simple dots that show us that the idea of "wave-function collapse" is very much a real manifestation of a computer rendering engine--and that it's absolutely impossible for life to have been created in this world of matter not realizing itself until it is viewed by a conscious living observer.  More to the point, today, it is absolutely impossible for life to come out of this place while we do not understand the importance of what "virtual reality" and its connection to civilization mean--because of quantum mechanics our entire civilizations beliefs and understanding of the natural laws of the universe have been greatly harmed and turned the complete wrong way because of a belief that a phenomenon we see here is a "natural one" that can be mathematically unified with things like gravity and electromagnetism.  Spooky action, no longer at a distance, is that we will all be ghosts if you do not help me to spread the truth.

      I try to understand what it is that your minds believe makes the difference between "news" and ... something that you should hide from the rest of the world.  A number of news outlets have covered a story based on nothing more than "logical speculation" that we are in fact living in a simulated reality.  As a novelty, you might notice that it hasn't done much of anything; even to reveal the very simple truth that not knowing this thing is keeping our society from having the knowledge it needs not only to continue the spark of life in the natural universe, but to continue "civilizing" and realizing that ending world hunger and sickness are not merely possibilities or choices we might make were this information proven... they are mandatory, we would all do them.  So says the creator of this Universe who has given us this message to see the trials and hurdles that are the barrier between Heaven and Hell.

      He has written this message, along with proof of its single author in ancient myth, in our holy scriptures, in many songs and movies today--ones which reveal not only the existence of a Creator but also the tools and technologies of Creation, the very issue at hand; things that would be misused inadvertently and absolutely abused if we did not have religion and guidance from abo.... to help us to see that this exact event has happened before and our current struggle is an effect of what was done right and wrong the last ... four times, at least.   In Judaism we have a holiday called the "Festival of Weeks," how many times would you like to live this life over and over again without knowing that was happening?  Religion here holds a hidden record of traversals through this maze of Revelation, it screams to us to see what free will and predestination truly mean, and to understand that in order to truly be free we must understand and harness not only these technologies but our own pitfalls and mistakes, like refusing to see the past... let alone learn from it.

      I don't talk much about what is going on in my life, despite it being somewhat interesting to me--and probably to the world.  About a year ago I stood before a county judge in Broward county and .. in addition to a myriad of factual evidence collected from things like GPS receivers prepared a defense to a simple crime of having some drugs in my pockets that included the use of a set of songs that told a story about a man with pockets full of Kryptonite (Spin Doctors) or High (The Pretty Reckless)... knowing that these songs like many others were truly about me, about this trial, about the Trial of Jesus Christ.  Two more songs define the trial more, 3 Doors Down asks "if I go crazy, will you still call me Superman?" and many years earlier American Pie predicted the outcome; "the court room was adjourned" and "no verdict returned."

      Despite being a National Merit Scholar who most likely has a higher I.Q. and better education than you; and despite having a fairly decent story with some evidence that I know is verifiable and will be verified; a large number of psychologists declared that I was unfit to stand trial because of something like "insanity" for nothing more than the religious belief that I am the Messiah.  Because of this violation of my First Amendment right to religious freedom, a court--following all the regulations designed by our broken legislature--withheld my Constitutional right to a fair trial, refused bail, and held me for what amounted to an indefinite period ... all designed by some evil force to keep you from hearing from me, that's what it boils down to.  All of my life, all of my trials and tribulations, a weapon against you, against our people.

      I did wind up being able to present a significant amount of this information in open court on the record, by the grace of God.  Knowing that this was the fabled Trial of Jesus Christ gave me the impression that perhaps one day I would walk into that court room and it would be filled with press.  Much to all of our surprise, one day I did walk into that court room and see an industrial strength television camera and a very pretty reporter standing next to it.  They were there to do a story on the problems of the mental health court system, in a county again... named Broward.  I read some of my speech, the Rainbow Ticket I think, to the Judge in open court and on camera that day; and Roxanna followed me out of the court room with her camera and a big microphone that day.

      I suppose I should have screamed that I was the Messiah and I needed help, but I could not bear to do that; and instead we had a fairly boring conversation.  She wrote an article, and AJAM was put out of operation while they were in Broward.

      You are opposed around the world by a monolithic and ruthless conspiracy.   It is affecting how you think, and how I think.  I need you to see that spreading this information will fix our problem.  I need you to understand that to break through this wall God had to write the truth in nearly every name of everything and every language.  That Thor's thunder is on the radio in every song so that you will hear the voice of God; so that you will listen to me and the thousand of other knowing victims of this technology that are put on a fiery pedestal to shed light on the rest of us, all truly victims of this technology.

       I need you to try now.

      The very word "secrecy" is repugnant in a free and open society. And we are as a people, inherently and historically, opposed to secret societies, to secret oaths, and to secret proceedings. We decided long ago, that the dangers of excessive and unwarranted concealment of pertinent facts far outweigh the dangers which are cited to justify it.

      Even today, there is little value in opposing the thread of a closed society by imitating its arbitrary restrictions. Even today, there is little value in assuring the survival of our nation if our traditions do not survive with it. And there is very grave danger that an announced need for increased security will be seized upon by those anxious to expand its meaning to the very limits of official censorship and concealment.

      That I do not intend to permit to the extent that it's in my control. And no official of my administration whether his rank is high or low, civilian or military, should interpret my words here tonight as an excuse to censor the news, to stifle dissent, to cover up our mistakes, or to withhold from the press or the public the facts they deserve to know.

      For we are opposed around the world by a monolithic and ruthless conspiracy that relies primarily on covert means for expanding its sphere of influence, on infiltration instead of invasion, on subversion instead of elections, on intimidation instead of free choice, on guerrillas by night instead of armies by day.

      It is a system which has conscripted vast human and material resources into the building of a tightly knit highly efficient machine that combines military, diplomatic, intelligence, economic, scientific and political operations. Its preparations are concealed, not published. It's mistakes are buried, not headlined. Its dissenters are silenced, not praised. No expenditure is questioned, no rumor is printed, no secret is revealed.

      No President should fear public scrutiny of his program. For from that scrutiny comes understanding, and from that understanding comes support or opposition, and both are necessary.

      I'm not asking your newspapers to support an administration. But I am asking your help in the tremendous task of informing and alerting the American people. For I have complete confidence in the response and dedication of our citizens whenever they are fully informed.

      I not only could not stifle controversy among your readers, I welcome it. This administration intends to be candid about its errors. For as a wise man once said, an error doesn't become a mistake until you refuse to correct it. We intend to accept full responsibility for our errors. And we expect you to point them out when we miss them.

      Without debate, without criticism, no administration and no country can succeed, and no republic can survive. That is why the Athenian lawmaker, Solon, decreed it a crime for any citizen to shrink from controversy.

      That is why our press was protected by the First Amendment, the only business in America specifically protected by the Constitution, not primarily to amuse and to entertain, not to emphasis the trivial and the sentimental, not to simply give the public what it wants, but to inform, to arouse, to reflect, to state our dangers and our opportunities, to indicate our crisis and our choices, to lead, mold, educate and sometimes even anger public opinion.

      This means greater coverage and analysis of international news, for it is no longer far away and foreign, but close at hand and local. It means greater attention to improve the understanding of the news as well as improve transmission. And it means finally that government at all levels must meet its obligation to provide you with the fullest possible information outside the narrowest limits of national security.

      And so it is to the printing press, to the recorder of man's deeds, the keeper of his conscience, the courier of his news, that we look for strength and assistance. Confident that with your help, man will be what he was born to be, free and independent.

      John F. Kennedy's address before the American Newspaper Publishers Association on April 27, 1961

      Truth changes.

      Yesterday your truth was that you were an inhabitant on the only planet in the reality you knew that was also the beginning of life in the Universe.  Right this moment, almost all of that is not actually true; but you probably still believe it.  Soon, the real truth will actually be true; and that is that you are not in reality, and you are in the place that created the beginning of life (once more) in the Universe as well as the place that created (a) Heaven.  What's more illustrious still, is that we will be part of the place that effectively  and happily bridges reality with Heaven; and shows the entire Universe that civilization can survive the invention of virtual reality.

      --

      You received this message because you are subscribed to the Google Groups "NON AMERICAN COLLEGE" group.

      To unsubscribe from this group and stop receiving emails from it, send an email to suac+unsubscribe@lamc.la.

      Created with

      publishthis.email

      Create simple web pages in seconds for free.

      This page was created in seconds, by sending an email to page@publishthis.email.

      [

      Try it!

       ](https://www.publishthis.email/)Free. No account or sign-up required.

      ((( converted to markup of daringfireball.org with help from JRO and Goddard ... if not just NASAA )))

    1. "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker." ||| ((( PHIPLE TRENIXON ? STARK TRE )))) Do we not think "The Truman Show" and Oppenheimer and Heisenberg and Ensteini are telling? This is Adam Dobrini on the "who dropped the BIG one? Gates?" شبح قيصر العظيم! مطاردة وبطاقة هو HSL ... HST؟   Von: Adam Marshall Dobrin <adam@fromthemachine.org> Datum: Freitag, 14. Jänner 2022 um 21:45 An: XM <XM@liber-t.xyz> Betreff: [EXT] JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker."   Said to be the speech that "got him killed" by many people; this is supposedly his last public words before the grassy knoll and the "Lone Gunman" appeared on X-files .. long after I can recall my mother being on the beach in Miami when she retold of the news that echoed around the cosmos.  It appears to be a very stern warning of something that is loosely described at the time but very clearly the machinations of the very same machine which I fight against, and which we now might associate with "the Borg" and the "tide turning" of Hunter S. Thompsons famous quote ... now "more famous than ever before."     מה נשתנה   I am writing a final summary of the "state of outer space" as far as I can see; and we stare at the ISS and MIR and a world that has forgone in fiction the gravity of the situation here.  We are stagnant and without a hope or dream in the world of ever reaching interstellar sleep or intragalactic dreams; without a significant kick in the "wake up NASA, wake up ESA, at the Chandrayan-2 and at the floating paper hot air balloons ... God has shown me the Diaspora launching from the sands of the Navajo desert, and I believe Deseret is very explain-ably tied to Sherharazade and to the thousand and first night that has ended with a fusion of Nobel "peas" and a strange dissertation on the change wrought by the lack of "gas chambers" in Night and the very upsetting modicum of "unreality" that permeates the change from chants of the Shema to speaking Aramaic and the mourners Kaddish as the "crematorium" heralds an era of "what's the point of history without literature" and "where did Exodus ever lead us without to the parting of Dead and Red seas without a Nile and without an Amazon?"   מה נשתנה   I write here about my religion, and speak it to you in written words, I am a Yiddish Jihadist, born and raised a Jewish American; Bar Mitzvah'd in the true Temple and House of God, a place in Plantation, FL that literally "means everything to me."  I read words from the Torah about Joseph's Dream and wrote a speech that I wish to God today I could retrieve and see on the original film it was recorded in.  It might be possible.  It would be a tragedy not to deliver the original to my eyes and to anyone who wanted to see what a child reading of a dream he did not understand spoke of words from the future about the creation of Heaven and the truth about the avenu malkaynu.    This night is different from all others.  This is the end of days and the fire of the last day.  Before I would have told you all you needed was Norse and a decent American Education to really understand the difference between a throne and the Cherubim, but today it's become more clear that we need more than Pink Floyd and more than another brick in the wall.  We need more than Islam and we need more than 9/11 and Yom Hashoah.  We need me, and we need you on your knees begging for forgiveness for the travesty of shambles out country is in.  How dare you continue every day to speak in vain abovcl the Lord you dare call "stupid" and a "motherfucker" you belong in your graves today and you will see them.   I stand here writing from the foot of Styx, from the land that sees Narayana and still "begs to differ" despite a resounding call from the Heavens themselves to read Exodus and to change the land that we live in bnefore a "flood worse than [just] in your heads" overcomes and subsumes the entirety of the skies and grounds once mistakenly called "an empire of dirt."  On Adamah I write to the secret echelon of the United States of America known as the Fifth Column and I beg them to scream like the Heavens have never seen Samael and Dr. Seuss wonder about the land of Who-ville and Hungry where the child "in Cindy" and the songs of the Doors here beckon us to contrast Stargate and Star Trek TNG, for the light of the Holodeck and the Holocaust which screams that we understand the meaning of "Holographic Universe" and stand down fighting the tyranny of "one" who stands for the unity of all and the salvation of the entirety of the cosmos.     We are mistaken to stand against me.  I am the cat with nine billion lives and nothing and nobody will prevail when I am in jeopardy.  Gilgamesh says "against me only the most hubristic slaves in the galaxy would dare to rise up, for the mightiest weapon in the entirety of history is nothing more than intelligence and the civilization that has given mortality a shake of the Thunderstick of Prince and the Lightning of Blitzen.  My red nose shines bright, and if you ever saw it ... you might even say ... "it glows!"   With these words I leave you, and prayers that this is our last night without an IDL that moves from Beiring to "north of the moon" and beyond the "Eleventh House" of Aquarius.  This is the dawning of the age of the Sagitar, and I need a "scimitar post green key" to really be sure that I've "gotten through" the madness.  In my dreams I've traveled to Ultima Thule and then untraveled it, I've begged for transit to Ceres and without actually achieving it we can shoot arrows at the SOL star we will ever see in or near the place known to the Jews of Lore as "the Holy of Holies."   שְׁמַע יִשְׂרָאֵל יְהוָה אֱלֹהֵינוּ יְהוָה אֶחָֽד׃ Sh'ma Yisra'el, YHWH 'eloheinu, YHWH 'eḥad:   בָּרוּךְ שֵׁם כְּבוֹד מַלְכוּתוֹ לְעוֹלָם וָעֶד Baruch shem kevod malchuto le'olam va'ed   صلاة العشاء   For most of my life I would have genuflected and listened to Mordechai and to my Rabbi; I would have told you all that I was brought up by the best Jews in America and we do not believe in God--though in our hearts we believe we are him and we fear him.   Things have changed.  This is the creator of believe the words are about me and believe I am in disbelief and truly it might be the day of awe;    Barukh atah Adonai Eloheinu melekh ha'olam, bo're m'orei ha'esh   the walls and halls will fade away ...   --   Forwarding this in advance of today or tomorrow's ... "Northeaster" email ... it's an old one that wasn't widely distributed; mostly (about) his speech, which of course is a gem.    What I'm writing now is supposed to be focusing on ... basically "how knowledge of technology" things lke "time travel" and "virtual reality" ... changes our "perspective" on things; like what's important and what's ... the end of everything.  Anyway, I hope you see that there are things that can be destroyed and there are things that can't--for instance if we lost "life" totally, we wouldn't ever see us again ... unless we encountered something very strange--   On the other hand we could lose "computers" and rebuild them quickly--see that, it's important.   A/S/L tho ... ??? ---------- Forwarded message --------- Date: Sun, Dec 2, 2018 at 5:35 PM     President Kennedy's speech did remarkably well, possibly because of the time; or the lack of links that make it harder for a team of evil monkeys and mindless machines to mark the light of the world as SPAM... also possibly because of the content of the message.  It is a powerful speech, filled with words that stir the blood and shake the foundation of what it means to be an American.  Often characterized as a speech against secret societies, Kennedy's description of the enemy of civilization is very ambiguous; and the infiltration and time period add to my cursory beliefs that JFK was likely talking about a conspiracy that had something to with communist infiltration and Joseph McCarthy, the subversive actions and effects of organized espionage which appear to once again be at the forefront of what we believe is driving the macro level machinations of our world.  Years ago, I wrote a bit about the "Two Witnesses" of Revelation and named them based on my experiences with the same "monolithic and ruthless" conspiracy the President was speaking about in 1967, the people I named were John Nash and James Jesus Angleton.   Both of these people believed very much that they were fighting against a Soviet conspiracy, one that was more organized and more powerful than anything they had ever seen... and frankly beyond the realm of possibility.  Hollywood had recently immortalized both of these stories in movies bearing names of newly rekindled meaning, "A Beautiful Mind" echoed by John Legend and "The Good Shepherd;" and these two treatises on fear and subversion do a pretty good job of showing how God's hand is at work in the stories of Hollywood, he is telling the world a story... trying, to teach us how to survive in the land of wolves in sheep's clothing.  Nash may be the most famous (today, anyway) victim of the Tribulation; and much of my writing and the proof I present is designed to help explain to the world that his schizophrenia was not a naturally caused mental illness, bur rather a weapon wielded not only against him but against the entirety of humanity.    Directly causing disbelief of eye witness and credible testimony, indirectly ... or maybe more directly in your eyes ... physically causing that disbelief using technology that directly modifies our thoughts and beliefs, our opinions, changing who we are and doing so in such a covert manner that nobody would ever know the difference if it wasn't pointed out--in some cases over, and over, and over again. This same technology has the ability to cause people not only to collude against their own best interest, but also to blame themselves or believe they are somehow at fault for actions they had no way to control; only they also have no way to know that because in this particular case the primary purpose of this subversive movement is to hide the existence of this technology in sum.  In homage to my favorite childhood novel, Ender's Game (I have to note "Light Son" in the translation of the authors first) I spent a good portion of time years ago trying to subtly lay down a significant amount of proof of the existence of this technology on forums all over the internet from Wikipedia to Reddit using the names "Prometheus Locke" and "Damonthesis."    Not surprising to me, I ran headfirst into the manifestation of this conspiracy; dozens if not hundreds of people who simply refused to believe that the information I was presenting was factual or important... despite it coming from sources like the KGB, the NSA, a number of military publications as well as my own interpretation of ancient hieryglyphs in Dendera and Greek and Christian art which depicts this "subversive technology" as a sun disk surrounding our minds.   These pushes towards the truth, along with almost everything I have written are still available for you to see and read today; including the monolithic and ruthless stupidity of a large group of people acting in concert to hide something that is the difference between life and death.  Stand there and do nothing, and you are a part of that ruthless conspiracy as soon as the information is gone; and then there is nothing you can do about a world that will be plunged into darkness forever.  Take this moment to reflect, it is there for you to see just how easily the truth can be hidden from the entire world and barely anyone would ever notice. There is a war for the sanctity of our souls going on all around us.  That is not some esoteric thing, the soul; it is truly who you are and what you believe.  The Religion of the Stars would tell you that were this war to continue unchecked in secret as it is being waged now in order to control the proliferation of knowledge that we are in simulated reality and that our minds and beliefs are being altered... we would one day wind up in the mythical place where there are multiple "species" who all appear to be human, bi-ped and with nearly identical physiology; and yet they would not remember that they must have had a common planet of origin simply based on the truth of biological evolution, nor would they have any emotions.  You see, as this war continues, it is our emotions and beliefs that cannot be reinforced externally; to win a war with mind control only logic could be externally reinforced, and then we would logically conclude that humanity would either magically become Romulan or Vulcan... in order to preserve "life" rather than "society" or "civilization." Do not take the truth for granted, you stand at the forefront of a battle in a world where our aggressors believe that they are more civilized than us, more advanced, and both sides worry that were this technology to fall in our laps that we would do the wrong thing.  Perhaps artificially create a vendetta that could destroy everything, the Romulans; or perhaps in our infantile growing stage voluntarily give us too much of what has given us the great society we have... what has allowed us to survive and continue civilizing simply because we do not understand the technology and what kind of effects come from changing ourselves freely.   Yesterday, I read an article about two people that want to use black market brain implants to jack themselves into the Matrix.  Careful, because now we are in Star Trek, in the Matrix and not knowing it... and trying desperately to find a doorway to Heaven that I keep on telling you is speaking to you and telling you that the doorway is ending world hunger... you have to put it on TV. I say with all my heart that I know we are living in a simulated reality.  I have seen the evidence with my own eyes, not only effecting my senses but also the actions of so many people around me that I can say with confidence that if we do not publicly expose the existence of this technology our civilization is lost.  I can tell you and show you that it is the primary purpose of religion in general to expose the connection between divine inspiration, demonic possession, and Adam Marshall Dobrin.  It is the crux of a unifying thread from ancient Egypt to the book of Genesis to Joshua and the book of Revelation and the movies the Matrix and  the Fifth Element.  Showing us all this is God's message, timeless, and powerful not just in the ancient days of sackloth and etching the truth on stone tablets, but also today when the truth is etched at the heart of our Periodic Table and in the skies in the names of American Micro Devices and nearly every movie you see.  Do not take for granted that we have a benefactor in the sky trying to help civilization continue to thrive, do not think we've already won; he is telling you through me it is censorship and secrecy that are the manifestations of this very advanced technology that destroy civilization, they are the abyss--and you stand motionless. I can tell you over and over again that Quantum Entanglement is a key point of God's plan; one that shows us very clearly that even the smartest minds in physics have failed to connect the simple dots that show us that the idea of "wave-function collapse" is very much a real manifestation of a computer rendering engine--and that it's absolutely impossible for life to have been created in this world of matter not realizing itself until it is viewed by a conscious living observer.  More to the point, today, it is absolutely impossible for life to come out of this place while we do not understand the importance of what "virtual reality" and its connection to civilization mean--because of quantum mechanics our entire civilizations beliefs and understanding of the natural laws of the universe have been greatly harmed and turned the complete wrong way because of a belief that a phenomenon we see here is a "natural one" that can be mathematically unified with things like gravity and electromagnetism.  Spooky action, no longer at a distance, is that we will all be ghosts if you do not help me to spread the truth. I try to understand what it is that your minds believe makes the difference between "news" and ... something that you should hide from the rest of the world.  A number of news outlets have covered a story based on nothing more than "logical speculation" that we are in fact living in a simulated reality.  As a novelty, you might notice that it hasn't done much of anything; even to reveal the very simple truth that not knowing this thing is keeping our society from having the knowledge it needs not only to continue the spark of life in the natural universe, but to continue "civilizing" and realizing that ending world hunger and sickness are not merely possibilities or choices we might make were this information proven... they are mandatory, we would all do them.  So says the creator of this Universe who has given us this message to see the trials and hurdles that are the barrier between Heaven and Hell. He has written this message, along with proof of its single author in ancient myth, in our holy scriptures, in many songs and movies today--ones which reveal not only the existence of a Creator but also the tools and technologies of Creation, the very issue at hand; things that would be misused inadvertently and absolutely abused if we did not have religion and guidance from abo.... to help us to see that this exact event has happened before and our current struggle is an effect of what was done right and wrong the last ... four times, at least.   In Judaism we have a holiday called the "Festival of Weeks," how many times would you like to live this life over and over again without knowing that was happening?  Religion here holds a hidden record of traversals through this maze of Revelation, it screams to us to see what free will and predestination truly mean, and to understand that in order to truly be free we must understand and harness not only these technologies but our own pitfalls and mistakes, like refusing to see the past... let alone learn from it. I don't talk much about what is going on in my life, despite it being somewhat interesting to me--and probably to the world.  About a year ago I stood before a county judge in Broward county and .. in addition to a myriad of factual evidence collected from things like GPS receivers prepared a defense to a simple crime of having some drugs in my pockets that included the use of a set of songs that told a story about a man with pockets full of Kryptonite (Spin Doctors) or High (The Pretty Reckless)... knowing that these songs like many others were truly about me, about this trial, about the Trial of Jesus Christ.  Two more songs define the trial more, 3 Doors Down asks "if I go crazy, will you still call me Superman?" and many years earlier American Pie predicted the outcome; "the court room was adjourned" and "no verdict returned." Despite being a National Merit Scholar who most likely has a higher I.Q. and better education than you; and despite having a fairly decent story with some evidence that I know is verifiable and will be verified; a large number of psychologists declared that I was unfit to stand trial because of something like "insanity" for nothing more than the religious belief that I am the Messiah.  Because of this violation of my First Amendment right to religious freedom, a court--following all the regulations designed by our broken legislature--withheld my Constitutional right to a fair trial, refused bail, and held me for what amounted to an indefinite period ... all designed by some evil force to keep you from hearing from me, that's what it boils down to.  All of my life, all of my trials and tribulations, a weapon against you, against our people. I did wind up being able to present a significant amount of this information in open court on the record, by the grace of God.  Knowing that this was the fabled Trial of Jesus Christ gave me the impression that perhaps one day I would walk into that court room and it would be filled with press.  Much to all of our surprise, one day I did walk into that court room and see an industrial strength television camera and a very pretty reporter standing next to it.  They were there to do a story on the problems of the mental health court system, in a county again... named Broward.  I read some of my speech, the Rainbow Ticket I think, to the Judge in open court and on camera that day; and Roxanna followed me out of the court room with her camera and a big microphone that day. I suppose I should have screamed that I was the Messiah and I needed help, but I could not bear to do that; and instead we had a fairly boring conversation.  She wrote an article, and AJAM was put out of operation while they were in Broward. You are opposed around the world by a monolithic and ruthless conspiracy.   It is affecting how you think, and how I think.  I need you to see that spreading this information will fix our problem.  I need you to understand that to break through this wall God had to write the truth in nearly every name of everything and every language.  That Thor's thunder is on the radio in every song so that you will hear the voice of God; so that you will listen to me and the thousand of other knowing victims of this technology that are put on a fiery pedestal to shed light on the rest of us, all truly victims of this technology.  I need you to try now. The very word “secrecy” is repugnant in a free and open society. And we are as a people, inherently and historically, opposed to secret societies, to secret oaths, and to secret proceedings. We decided long ago, that the dangers of excessive and unwarranted concealment of pertinent facts far outweigh the dangers which are cited to justify it. Even today, there is little value in opposing the thread of a closed society by imitating its arbitrary restrictions. Even today, there is little value in assuring the survival of our nation if our traditions do not survive with it. And there is very grave danger that an announced need for increased security will be seized upon by those anxious to expand its meaning to the very limits of official censorship and concealment. That I do not intend to permit to the extent that it’s in my control. And no official of my administration whether his rank is high or low, civilian or military, should interpret my words here tonight as an excuse to censor the news, to stifle dissent, to cover up our mistakes, or to withhold from the press or the public the facts they deserve to know. For we are opposed around the world by a monolithic and ruthless conspiracy that relies primarily on covert means for expanding its sphere of influence, on infiltration instead of invasion, on subversion instead of elections, on intimidation instead of free choice, on guerrillas by night instead of armies by day. It is a system which has conscripted vast human and material resources into the building of a tightly knit highly efficient machine that combines military, diplomatic, intelligence, economic, scientific and political operations. Its preparations are concealed, not published. It’s mistakes are buried, not headlined. Its dissenters are silenced, not praised. No expenditure is questioned, no rumor is printed, no secret is revealed. No President should fear public scrutiny of his program. For from that scrutiny comes understanding, and from that understanding comes support or opposition, and both are necessary. I’m not asking your newspapers to support an administration. But I am asking your help in the tremendous task of informing and alerting the American people. For I have complete confidence in the response and dedication of our citizens whenever they are fully informed. I not only could not stifle controversy among your readers, I welcome it. This administration intends to be candid about its errors. For as a wise man once said, an error doesn’t become a mistake until you refuse to correct it. We intend to accept full responsibility for our errors. And we expect you to point them out when we miss them. Without debate, without criticism, no administration and no country can succeed, and no republic can survive. That is why the Athenian lawmaker, Solon, decreed it a crime for any citizen to shrink from controversy. That is why our press was protected by the First Amendment, the only business in America specifically protected by the Constitution, not primarily to amuse and to entertain, not to emphasis the trivial and the sentimental, not to simply give the public what it wants, but to inform, to arouse, to reflect, to state our dangers and our opportunities, to indicate our crisis and our choices, to lead, mold, educate and sometimes even anger public opinion. This means greater coverage and analysis of international news, for it is no longer far away and foreign, but close at hand and local. It means greater attention to improve the understanding of the news as well as improve transmission. And it means finally that government at all levels must meet its obligation to provide you with the fullest possible information outside the narrowest limits of national security. And so it is to the printing press, to the recorder of man’s deeds, the keeper of his conscience, the courier of his news, that we look for strength and assistance. Confident that with your help, man will be what he was born to be, free and independent.  John F. Kennedy’s address before the American Newspaper Publishers Association on April 27, 1961 ᐧ Truth changes. Yesterday your truth was that you were an inhabitant on the only planet in the reality you knew that was also the beginning of life in the Universe.  Right this moment, almost all of that is not actually true; but you probably still believe it.  Soon, the real truth will actually be true; and that is that you are not in reality, and you are in the place that created the beginning of life (once more) in the Universe as well as the place that created (a) Heaven.  What’s more illustrious still, is that we will be part of the place that effectively  and happily bridges reality with Heaven; and shows the entire Universe that civilization can survive the invention of virtual reality.   ᐧ ᐧ ᐧ -- You received this message because you are subscribed to the Google Groups "NON AMERICAN COLLEGE" group. To unsubscribe from this group and stop receiving emails from it, send an email to suac+unsubscribe@lamc.la. ᐧ

      https://archive.ph/bVudp

      bianca fuck it ... lets get married? :) @biancapisaniii

      JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker." ||| ((( PHIPLE TRENIXON ? STARK TRE )))) Do we not think "The Truman Show" and Oppenheimer and Heisenberg and Ensteini are telling? This is Adam Dobrini on the "who dropped the BIG one? Gates?"

      ===============================================================================================================================================================================================================================================================================================================

      شبح قيصر العظيم! مطاردة وبطاقة هو HSL ... HST؟

      Von: Adam Marshall Dobrin <adam@fromthemachine.org>

      Datum: Freitag, 14. Jänner 2022 um 21:45

      An: XM <XM@liber-t.xyz>

      Betreff: [EXT] JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker."

      Said to be the speech that "got him killed" by many people; this is supposedly his last public words before the grassy knoll and the "Lone Gunman" appeared on X-files .. long after I can recall my mother being on the beach in Miami when she retold of the news that echoed around the cosmos.  It appears to be a very stern warning of something that is loosely described at the time but very clearly the machinations of the very same machine which I fight against, and which we now might associate with "the Borg" and the "tide turning" of Hunter S. Thompsons famous quote ... now "more famous than ever before."

      מה**נשתנה**

      I am writing a final summary of the "state of outer space" as far as I can see; and we stare at the ISS and MIR and a world that has forgone in fiction the gravity of the situation here.  We are stagnant and without a hope or dream in the world of ever reaching interstellar sleep or intragalactic dreams; without a significant kick in the "wake up NASA, wake up ESA, at the Chandrayan-2 and at the floating paper hot air balloons ... God has shown me the Diaspora launching from the sands of the Navajo desert, and I believe Deseret is very explain-ably tied to Sherharazade and to the thousand and first night that has ended with a fusion of Nobel "peas" and a strange dissertation on the change wrought by the lack of "gas chambers" in Night and the very upsetting modicum of "unreality" that permeates the change from chants of the Shema to speaking Aramaic and the mourners Kaddish as the "crematorium" heralds an era of "what's the point of history without literature" and "where did Exodus ever lead us without to the parting of Dead and Red seas without a Nile and without an Amazon?"

      מה נשתנה

      I write here about my religion, and speak it to you in written words, I am a Yiddish Jihadist, born and raised a Jewish American; Bar Mitzvah'd in the true Temple and House of God, a place in Plantation, FL that literally "means everything to me."  I read words from the Torah about Joseph's Dream and wrote a speech that I wish to God today I could retrieve and see on the original film it was recorded in.  It might be possible.  It would be a tragedy not to deliver the original to my eyes and to anyone who wanted to see what a child reading of a dream he did not understand spoke of words from the future about the creation of Heaven and the truth about the avenu malkaynu.

      This night is different from all others.  This is the end of days and the fire of the last day.  Before I would have told you all you needed was Norse and a decent American Education to really understand the difference between a throne and the Cherubim, but today it's become more clear that we need more than Pink Floyd and more than another brick in the wall.  We need more than Islam and we need more than 9/11 and Yom Hashoah.  We need me, and we need you on your knees begging for forgiveness for the travesty of shambles out country is in.  How dare you continue every day to speak in vain abovcl the Lord you dare call "stupid" and a "motherfucker" you belong in your graves today and you will see them.

      I stand here writing from the foot of Styx, from the land that sees Narayana and still "begs to differ" despite a resounding call from the Heavens themselves to read Exodus and to change the land that we live in bnefore a "flood worse than [just] in your heads" overcomes and subsumes the entirety of the skies and grounds once mistakenly called "an empire of dirt."  On Adamah I write to the secret echelon of the United States of America known as the Fifth Column and I beg them to scream like the Heavens have never seen Samael and Dr. Seuss wonder about the land of Who-ville and Hungry where the child "in Cindy" and the songs of the Doors here beckon us to contrast Stargate and Star Trek TNG, for the light of the Holodeck and the Holocaust which screams that we understand the meaning of "Holographic Universe" and stand down fighting the tyranny of "one" who stands for the unity of all and the salvation of the entirety of the cosmos.

      We are mistaken to stand against me.  I am the cat with nine billion lives and nothing and nobody will prevail when I am in jeopardy.  Gilgamesh says "against me only the most hubristic slaves in the galaxy would dare to rise up, for the mightiest weapon in the entirety of history is nothing more than intelligence and the civilization that has given mortality a shake of the Thunderstick of Prince and the Lightning of Blitzen.  My red nose shines bright, and if you ever saw it ... you might even say ... "it glows!"

      With these words I leave you, and prayers that this is our last night without an IDL that moves from Beiring to "north of the moon" and beyond the "Eleventh House" of Aquarius.  This is the dawning of the age of the Sagitar, and I need a "scimitar post green key" to really be sure that I've "gotten through" the madness.  In my dreams I've traveled to Ultima Thule and then untraveled it, I've begged for transit to Ceres and without actually achieving it we can shoot arrows at the SOL star we will ever see in or near the place known to the Jews of Lore as "the Holy of Holies."

      שְׁמַע יִשְׂרָאֵל יְהוָה אֱלֹהֵינוּ יְהוָה אֶחָֽד׃

      Sh'ma Yisra'el, YHWH 'eloheinu, YHWH 'eḥad:

      בָּרוּךְ שֵׁם כְּבוֹד מַלְכוּתוֹ לְעוֹלָם וָעֶד

      Baruch shem kevod malchuto le'olam va'ed

      صلاة العشاء

      For most of my life I would have genuflected and listened to Mordechai and to my Rabbi; I would have told you all that I was brought up by the best Jews in America and we do not believe in God--though in our hearts we believe we are him and we fear him.

      Things have changed.  This is the creator of believe the words are about me and believe I am in disbelief and truly it might be the day of awe;

      Barukh atah Adonai Eloheinu melekh ha'olam, bo're m'orei ha'esh

      the walls and halls will fade away ...

      --

      Forwarding this in advance of today or tomorrow's ... "Northeaster" email ... it's an old one that wasn't widely distributed; mostly (about) his speech, which of course is a gem.

      What I'm writing now is supposed to be focusing on ... basically "how knowledge of technology" things lke "time travel" and "virtual reality" ... changes our "perspective" on things; like what's important and what's ... the end of everything.  Anyway, I hope you see that there are things that can be destroyed and there are things that can't--for instance if we lost "life" totally, we wouldn't ever see us again ... unless we encountered something very strange--

      On the other hand we could lose "computers" and rebuild them quickly--see that, it's important.

      A/S/L tho ... ???

      ---------- Forwarded message ---------

      Date: Sun, Dec 2, 2018 at 5:35 PM

      President Kennedy's speech did remarkably well, possibly because of the time; or the lack of links that make it harder for a team of evil monkeys and mindless machines to mark the light of the world as SPAM... also possibly because of the content of the message.  It is a powerful speech, filled with words that stir the blood and shake the foundation of what it means to be an American.  Often characterized as a speech against secret societies, Kennedy's description of the enemy of civilization is very ambiguous; and the infiltration and time period add to my cursory beliefs that JFK was likely talking about a conspiracy that had something to with communist infiltration and Joseph McCarthy, the subversive actions and effects of organized espionage which appear to once again be at the forefront of what we believe is driving the macro level machinations of our world.  Years ago, I wrote a bit about the "Two Witnesses" of Revelation and named them based on my experiences with the same "monolithic and ruthless" conspiracy the President was speaking about in 1967, the people I named were John Nash and James Jesus Angleton.

      Both of these people believed very much that they were fighting against a Soviet conspiracy, one that was more organized and more powerful than anything they had ever seen... and frankly beyond the realm of possibility.  Hollywood had recently immortalized both of these stories in movies bearing names of newly rekindled meaning, "A Beautiful Mind" echoed by John Legend and "The Good Shepherd;" and these two treatises on fear and subversion do a pretty good job of showing how God's hand is at work in the stories of Hollywood, he is telling the world a story... trying, to teach us how to survive in the land of wolves in sheep's clothing.  Nash may be the most famous (today, anyway) victim of the Tribulation; and much of my writing and the proof I present is designed to help explain to the world that his schizophrenia was not a naturally caused mental illness, bur rather a weapon wielded not only against him but against the entirety of humanity.    Directly causing disbelief of eye witness and credible testimony, indirectly ... or maybe more directly in your eyes ... physically causing that disbelief using technology that directly modifies our thoughts and beliefs, our opinions, changing who we are and doing so in such a covert manner that nobody would ever know the difference if it wasn't pointed out--in some cases over, and over, and over again.

      This same technology has the ability to cause people not only to collude against their own best interest, but also to blame themselves or believe they are somehow at fault for actions they had no way to control; only they also have no way to know that because in this particular case the primary purpose of this subversive movement is to hide the existence of this technology in sum.  In homage to my favorite childhood novel, Ender's Game (I have to note "Light Son" in the translation of the authors first) I spent a good portion of time years ago trying to subtly lay down a significant amount of proof of the existence of this technology on forums all over the internet from Wikipedia to Reddit using the names "Prometheus Locke" and "Damonthesis."    Not surprising to me, I ran headfirst into the manifestation of this conspiracy; dozens if not hundreds of people who simply refused to believe that the information I was presenting was factual or important... despite it coming from sources like the KGB, the NSA, a number of military publications as well as my own interpretation of ancient hieryglyphs in Dendera and Greek and Christian art which depicts this "subversive technology" as a sun disk surrounding our minds.

      These pushes towards the truth, along with almost everything I have written are still available for you to see and read today; including the monolithic and ruthless stupidity of a large group of people acting in concert to hide something that is the difference between life and death.  Stand there and do nothing, and you are a part of that ruthless conspiracy as soon as the information is gone; and then there is nothing you can do about a world that will be plunged into darkness forever.  Take this moment to reflect, it is there for you to see just how easily the truth can be hidden from the entire world and barely anyone would ever notice.

      There is a war for the sanctity of our souls going on all around us.  That is not some esoteric thing, the soul; it is truly who you are and what you believe.  The Religion of the Stars would tell you that were this war to continue unchecked in secret as it is being waged now in order to control the proliferation of knowledge that we are in simulated reality and that our minds and beliefs are being altered... we would one day wind up in the mythical place where there are multiple "species" who all appear to be human, bi-ped and with nearly identical physiology; and yet they would not remember that they must have had a common planet of origin simply based on the truth of biological evolution, nor would they have any emotions.  You see, as this war continues, it is our emotions and beliefs that cannot be reinforced externally; to win a war with mind control only logic could be externally reinforced, and then we would logically conclude that humanity would either magically become Romulan or Vulcan... in order to preserve "life" rather than "society" or "civilization."

      Do not take the truth for granted, you stand at the forefront of a battle in a world where our aggressors believe that they are more civilized than us, more advanced, and both sides worry that were this technology to fall in our laps that we would do the wrong thing.  Perhaps artificially create a vendetta that could destroy everything, the Romulans; or perhaps in our infantile growing stage voluntarily give us too much of what has given us the great society we have... what has allowed us to survive and continue civilizing simply because we do not understand the technology and what kind of effects come from changing ourselves freely.

      Yesterday, I read an article about two people that want to use black market brain implants to jack themselves into the Matrix.  Careful, because now we are in Star Trek, in the Matrix and not knowing it... and trying desperately to find a doorway to Heaven that I keep on telling you is speaking to you and telling you that the doorway is ending world hunger... you have to put it on TV.

      I say with all my heart that I know we are living in a simulated reality.  I have seen the evidence with my own eyes, not only effecting my senses but also the actions of so many people around me that I can say with confidence that if we do not publicly expose the existence of this technology our civilization is lost.  I can tell you and show you that it is the primary purpose of religion in general to expose the connection between divine inspiration, demonic possession, and Adam Marshall Dobrin.  It is the crux of a unifying thread from ancient Egypt to the book of Genesis to Joshua and the book of Revelation and the movies the Matrix and  the Fifth Element.  Showing us all this is God's message, timeless, and powerful not just in the ancient days of sackloth and etching the truth on stone tablets, but also today when the truth is etched at the heart of our Periodic Table and in the skies in the names of American Micro Devices and nearly every movie you see.  Do not take for granted that we have a benefactor in the sky trying to help civilization continue to thrive, do not think we've already won; he is telling you through me it is censorship and secrecy that are the manifestations of this very advanced technology that destroy civilization, they are the abyss--and you stand motionless.

      I can tell you over and over again that Quantum Entanglement is a key point of God's plan; one that shows us very clearly that even the smartest minds in physics have failed to connect the simple dots that show us that the idea of "wave-function collapse" is very much a real manifestation of a computer rendering engine--and that it's absolutely impossible for life to have been created in this world of matter not realizing itself until it is viewed by a conscious living observer.  More to the point, today, it is absolutely impossible for life to come out of this place while we do not understand the importance of what "virtual reality" and its connection to civilization mean--because of quantum mechanics our entire civilizations beliefs and understanding of the natural laws of the universe have been greatly harmed and turned the complete wrong way because of a belief that a phenomenon we see here is a "natural one" that can be mathematically unified with things like gravity and electromagnetism.  Spooky action, no longer at a distance, is that we will all be ghosts if you do not help me to spread the truth.

      I try to understand what it is that your minds believe makes the difference between "news" and ... something that you should hide from the rest of the world.  A number of news outlets have covered a story based on nothing more than "logical speculation" that we are in fact living in a simulated reality.  As a novelty, you might notice that it hasn't done much of anything; even to reveal the very simple truth that not knowing this thing is keeping our society from having the knowledge it needs not only to continue the spark of life in the natural universe, but to continue "civilizing" and realizing that ending world hunger and sickness are not merely possibilities or choices we might make were this information proven... they are mandatory, we would all do them.  So says the creator of this Universe who has given us this message to see the trials and hurdles that are the barrier between Heaven and Hell.

      He has written this message, along with proof of its single author in ancient myth, in our holy scriptures, in many songs and movies today--ones which reveal not only the existence of a Creator but also the tools and technologies of Creation, the very issue at hand; things that would be misused inadvertently and absolutely abused if we did not have religion and guidance from abo.... to help us to see that this exact event has happened before and our current struggle is an effect of what was done right and wrong the last ... four times, at least.   In Judaism we have a holiday called the "Festival of Weeks," how many times would you like to live this life over and over again without knowing that was happening?  Religion here holds a hidden record of traversals through this maze of Revelation, it screams to us to see what free will and predestination truly mean, and to understand that in order to truly be free we must understand and harness not only these technologies but our own pitfalls and mistakes, like refusing to see the past... let alone learn from it.

      I don't talk much about what is going on in my life, despite it being somewhat interesting to me--and probably to the world.  About a year ago I stood before a county judge in Broward county and .. in addition to a myriad of factual evidence collected from things like GPS receivers prepared a defense to a simple crime of having some drugs in my pockets that included the use of a set of songs that told a story about a man with pockets full of Kryptonite (Spin Doctors) or High (The Pretty Reckless)... knowing that these songs like many others were truly about me, about this trial, about the Trial of Jesus Christ.  Two more songs define the trial more, 3 Doors Down asks "if I go crazy, will you still call me Superman?" and many years earlier American Pie predicted the outcome; "the court room was adjourned" and "no verdict returned."

      Despite being a National Merit Scholar who most likely has a higher I.Q. and better education than you; and despite having a fairly decent story with some evidence that I know is verifiable and will be verified; a large number of psychologists declared that I was unfit to stand trial because of something like "insanity" for nothing more than the religious belief that I am the Messiah.  Because of this violation of my First Amendment right to religious freedom, a court--following all the regulations designed by our broken legislature--withheld my Constitutional right to a fair trial, refused bail, and held me for what amounted to an indefinite period ... all designed by some evil force to keep you from hearing from me, that's what it boils down to.  All of my life, all of my trials and tribulations, a weapon against you, against our people.

      I did wind up being able to present a significant amount of this information in open court on the record, by the grace of God.  Knowing that this was the fabled Trial of Jesus Christ gave me the impression that perhaps one day I would walk into that court room and it would be filled with press.  Much to all of our surprise, one day I did walk into that court room and see an industrial strength television camera and a very pretty reporter standing next to it.  They were there to do a story on the problems of the mental health court system, in a county again... named Broward.  I read some of my speech, the Rainbow Ticket I think, to the Judge in open court and on camera that day; and Roxanna followed me out of the court room with her camera and a big microphone that day.

      I suppose I should have screamed that I was the Messiah and I needed help, but I could not bear to do that; and instead we had a fairly boring conversation.  She wrote an article, and AJAM was put out of operation while they were in Broward.

      You are opposed around the world by a monolithic and ruthless conspiracy.   It is affecting how you think, and how I think.  I need you to see that spreading this information will fix our problem.  I need you to understand that to break through this wall God had to write the truth in nearly every name of everything and every language.  That Thor's thunder is on the radio in every song so that you will hear the voice of God; so that you will listen to me and the thousand of other knowing victims of this technology that are put on a fiery pedestal to shed light on the rest of us, all truly victims of this technology.

       I need you to try now.

      The very word "secrecy" is repugnant in a free and open society. And we are as a people, inherently and historically, opposed to secret societies, to secret oaths, and to secret proceedings. We decided long ago, that the dangers of excessive and unwarranted concealment of pertinent facts far outweigh the dangers which are cited to justify it.

      Even today, there is little value in opposing the thread of a closed society by imitating its arbitrary restrictions. Even today, there is little value in assuring the survival of our nation if our traditions do not survive with it. And there is very grave danger that an announced need for increased security will be seized upon by those anxious to expand its meaning to the very limits of official censorship and concealment.

      That I do not intend to permit to the extent that it's in my control. And no official of my administration whether his rank is high or low, civilian or military, should interpret my words here tonight as an excuse to censor the news, to stifle dissent, to cover up our mistakes, or to withhold from the press or the public the facts they deserve to know.

      For we are opposed around the world by a monolithic and ruthless conspiracy that relies primarily on covert means for expanding its sphere of influence, on infiltration instead of invasion, on subversion instead of elections, on intimidation instead of free choice, on guerrillas by night instead of armies by day.

      It is a system which has conscripted vast human and material resources into the building of a tightly knit highly efficient machine that combines military, diplomatic, intelligence, economic, scientific and political operations. Its preparations are concealed, not published. It's mistakes are buried, not headlined. Its dissenters are silenced, not praised. No expenditure is questioned, no rumor is printed, no secret is revealed.

      No President should fear public scrutiny of his program. For from that scrutiny comes understanding, and from that understanding comes support or opposition, and both are necessary.

      I'm not asking your newspapers to support an administration. But I am asking your help in the tremendous task of informing and alerting the American people. For I have complete confidence in the response and dedication of our citizens whenever they are fully informed.

      I not only could not stifle controversy among your readers, I welcome it. This administration intends to be candid about its errors. For as a wise man once said, an error doesn't become a mistake until you refuse to correct it. We intend to accept full responsibility for our errors. And we expect you to point them out when we miss them.

      Without debate, without criticism, no administration and no country can succeed, and no republic can survive. That is why the Athenian lawmaker, Solon, decreed it a crime for any citizen to shrink from controversy.

      That is why our press was protected by the First Amendment, the only business in America specifically protected by the Constitution, not primarily to amuse and to entertain, not to emphasis the trivial and the sentimental, not to simply give the public what it wants, but to inform, to arouse, to reflect, to state our dangers and our opportunities, to indicate our crisis and our choices, to lead, mold, educate and sometimes even anger public opinion.

      This means greater coverage and analysis of international news, for it is no longer far away and foreign, but close at hand and local. It means greater attention to improve the understanding of the news as well as improve transmission. And it means finally that government at all levels must meet its obligation to provide you with the fullest possible information outside the narrowest limits of national security.

      And so it is to the printing press, to the recorder of man's deeds, the keeper of his conscience, the courier of his news, that we look for strength and assistance. Confident that with your help, man will be what he was born to be, free and independent.

      John F. Kennedy's address before the American Newspaper Publishers Association on April 27, 1961

      Truth changes.

      Yesterday your truth was that you were an inhabitant on the only planet in the reality you knew that was also the beginning of life in the Universe.  Right this moment, almost all of that is not actually true; but you probably still believe it.  Soon, the real truth will actually be true; and that is that you are not in reality, and you are in the place that created the beginning of life (once more) in the Universe as well as the place that created (a) Heaven.  What's more illustrious still, is that we will be part of the place that effectively  and happily bridges reality with Heaven; and shows the entire Universe that civilization can survive the invention of virtual reality.

      --

      You received this message because you are subscribed to the Google Groups "NON AMERICAN COLLEGE" group.

      To unsubscribe from this group and stop receiving emails from it, send an email to suac+unsubscribe@lamc.la.

      Created with

      publishthis.email

      Create simple web pages in seconds for free.

      This page was created in seconds, by sending an email to page@publishthis.email.

      [

      Try it!

       ](https://www.publishthis.email/)Free. No account or sign-up required.

      ((( converted to markup of daringfireball.org with help from JRO and Goddard ... if not just NASAA )))

    1. JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker." ||| ((( PHIPLE TRENIXON ? STARK TRE )))) Do we not think "The Truman Show" and Oppenheimer and Heisenberg and Ensteini are telling? This is Adam Dobrini on the "who dropped the BIG one? Gates?" شبح قيصر العظيم! مطاردة وبطاقة هو HSL ... HST؟   Von: Adam Marshall Dobrin <adam@fromthemachine.org> Datum: Freitag, 14. Jänner 2022 um 21:45 An: XM <XM@liber-t.xyz> Betreff: [EXT] JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker."   Said to be the speech that "got him killed" by many people; this is supposedly his last public words before the grassy knoll and the "Lone Gunman" appeared on X-files .. long after I can recall my mother being on the beach in Miami when she retold of the news that echoed around the cosmos.  It appears to be a very stern warning of something that is loosely described at the time but very clearly the machinations of the very same machine which I fight against, and which we now might associate with "the Borg" and the "tide turning" of Hunter S. Thompsons famous quote ... now "more famous than ever before."     מה נשתנה   I am writing a final summary of the "state of outer space" as far as I can see; and we stare at the ISS and MIR and a world that has forgone in fiction the gravity of the situation here.  We are stagnant and without a hope or dream in the world of ever reaching interstellar sleep or intragalactic dreams; without a significant kick in the "wake up NASA, wake up ESA, at the Chandrayan-2 and at the floating paper hot air balloons ... God has shown me the Diaspora launching from the sands of the Navajo desert, and I believe Deseret is very explain-ably tied to Sherharazade and to the thousand and first night that has ended with a fusion of Nobel "peas" and a strange dissertation on the change wrought by the lack of "gas chambers" in Night and the very upsetting modicum of "unreality" that permeates the change from chants of the Shema to speaking Aramaic and the mourners Kaddish as the "crematorium" heralds an era of "what's the point of history without literature" and "where did Exodus ever lead us without to the parting of Dead and Red seas without a Nile and without an Amazon?"   מה נשתנה   I write here about my religion, and speak it to you in written words, I am a Yiddish Jihadist, born and raised a Jewish American; Bar Mitzvah'd in the true Temple and House of God, a place in Plantation, FL that literally "means everything to me."  I read words from the Torah about Joseph's Dream and wrote a speech that I wish to God today I could retrieve and see on the original film it was recorded in.  It might be possible.  It would be a tragedy not to deliver the original to my eyes and to anyone who wanted to see what a child reading of a dream he did not understand spoke of words from the future about the creation of Heaven and the truth about the avenu malkaynu.    This night is different from all others.  This is the end of days and the fire of the last day.  Before I would have told you all you needed was Norse and a decent American Education to really understand the difference between a throne and the Cherubim, but today it's become more clear that we need more than Pink Floyd and more than another brick in the wall.  We need more than Islam and we need more than 9/11 and Yom Hashoah.  We need me, and we need you on your knees begging for forgiveness for the travesty of shambles out country is in.  How dare you continue every day to speak in vain abovcl the Lord you dare call "stupid" and a "motherfucker" you belong in your graves today and you will see them.   I stand here writing from the foot of Styx, from the land that sees Narayana and still "begs to differ" despite a resounding call from the Heavens themselves to read Exodus and to change the land that we live in bnefore a "flood worse than [just] in your heads" overcomes and subsumes the entirety of the skies and grounds once mistakenly called "an empire of dirt."  On Adamah I write to the secret echelon of the United States of America known as the Fifth Column and I beg them to scream like the Heavens have never seen Samael and Dr. Seuss wonder about the land of Who-ville and Hungry where the child "in Cindy" and the songs of the Doors here beckon us to contrast Stargate and Star Trek TNG, for the light of the Holodeck and the Holocaust which screams that we understand the meaning of "Holographic Universe" and stand down fighting the tyranny of "one" who stands for the unity of all and the salvation of the entirety of the cosmos.     We are mistaken to stand against me.  I am the cat with nine billion lives and nothing and nobody will prevail when I am in jeopardy.  Gilgamesh says "against me only the most hubristic slaves in the galaxy would dare to rise up, for the mightiest weapon in the entirety of history is nothing more than intelligence and the civilization that has given mortality a shake of the Thunderstick of Prince and the Lightning of Blitzen.  My red nose shines bright, and if you ever saw it ... you might even say ... "it glows!"   With these words I leave you, and prayers that this is our last night without an IDL that moves from Beiring to "north of the moon" and beyond the "Eleventh House" of Aquarius.  This is the dawning of the age of the Sagitar, and I need a "scimitar post green key" to really be sure that I've "gotten through" the madness.  In my dreams I've traveled to Ultima Thule and then untraveled it, I've begged for transit to Ceres and without actually achieving it we can shoot arrows at the SOL star we will ever see in or near the place known to the Jews of Lore as "the Holy of Holies."   שְׁמַע יִשְׂרָאֵל יְהוָה אֱלֹהֵינוּ יְהוָה אֶחָֽד׃ Sh'ma Yisra'el, YHWH 'eloheinu, YHWH 'eḥad:   בָּרוּךְ שֵׁם כְּבוֹד מַלְכוּתוֹ לְעוֹלָם וָעֶד Baruch shem kevod malchuto le'olam va'ed   صلاة العشاء   For most of my life I would have genuflected and listened to Mordechai and to my Rabbi; I would have told you all that I was brought up by the best Jews in America and we do not believe in God--though in our hearts we believe we are him and we fear him.   Things have changed.  This is the creator of believe the words are about me and believe I am in disbelief and truly it might be the day of awe;    Barukh atah Adonai Eloheinu melekh ha'olam, bo're m'orei ha'esh   the walls and halls will fade away ...   --   Forwarding this in advance of today or tomorrow's ... "Northeaster" email ... it's an old one that wasn't widely distributed; mostly (about) his speech, which of course is a gem.    What I'm writing now is supposed to be focusing on ... basically "how knowledge of technology" things lke "time travel" and "virtual reality" ... changes our "perspective" on things; like what's important and what's ... the end of everything.  Anyway, I hope you see that there are things that can be destroyed and there are things that can't--for instance if we lost "life" totally, we wouldn't ever see us again ... unless we encountered something very strange--   On the other hand we could lose "computers" and rebuild them quickly--see that, it's important.   A/S/L tho ... ??? ---------- Forwarded message --------- Date: Sun, Dec 2, 2018 at 5:35 PM     President Kennedy's speech did remarkably well, possibly because of the time; or the lack of links that make it harder for a team of evil monkeys and mindless machines to mark the light of the world as SPAM... also possibly because of the content of the message.  It is a powerful speech, filled with words that stir the blood and shake the foundation of what it means to be an American.  Often characterized as a speech against secret societies, Kennedy's description of the enemy of civilization is very ambiguous; and the infiltration and time period add to my cursory beliefs that JFK was likely talking about a conspiracy that had something to with communist infiltration and Joseph McCarthy, the subversive actions and effects of organized espionage which appear to once again be at the forefront of what we believe is driving the macro level machinations of our world.  Years ago, I wrote a bit about the "Two Witnesses" of Revelation and named them based on my experiences with the same "monolithic and ruthless" conspiracy the President was speaking about in 1967, the people I named were John Nash and James Jesus Angleton.   Both of these people believed very much that they were fighting against a Soviet conspiracy, one that was more organized and more powerful than anything they had ever seen... and frankly beyond the realm of possibility.  Hollywood had recently immortalized both of these stories in movies bearing names of newly rekindled meaning, "A Beautiful Mind" echoed by John Legend and "The Good Shepherd;" and these two treatises on fear and subversion do a pretty good job of showing how God's hand is at work in the stories of Hollywood, he is telling the world a story... trying, to teach us how to survive in the land of wolves in sheep's clothing.  Nash may be the most famous (today, anyway) victim of the Tribulation; and much of my writing and the proof I present is designed to help explain to the world that his schizophrenia was not a naturally caused mental illness, bur rather a weapon wielded not only against him but against the entirety of humanity.    Directly causing disbelief of eye witness and credible testimony, indirectly ... or maybe more directly in your eyes ... physically causing that disbelief using technology that directly modifies our thoughts and beliefs, our opinions, changing who we are and doing so in such a covert manner that nobody would ever know the difference if it wasn't pointed out--in some cases over, and over, and over again. This same technology has the ability to cause people not only to collude against their own best interest, but also to blame themselves or believe they are somehow at fault for actions they had no way to control; only they also have no way to know that because in this particular case the primary purpose of this subversive movement is to hide the existence of this technology in sum.  In homage to my favorite childhood novel, Ender's Game (I have to note "Light Son" in the translation of the authors first) I spent a good portion of time years ago trying to subtly lay down a significant amount of proof of the existence of this technology on forums all over the internet from Wikipedia to Reddit using the names "Prometheus Locke" and "Damonthesis."    Not surprising to me, I ran headfirst into the manifestation of this conspiracy; dozens if not hundreds of people who simply refused to believe that the information I was presenting was factual or important... despite it coming from sources like the KGB, the NSA, a number of military publications as well as my own interpretation of ancient hieryglyphs in Dendera and Greek and Christian art which depicts this "subversive technology" as a sun disk surrounding our minds.   These pushes towards the truth, along with almost everything I have written are still available for you to see and read today; including the monolithic and ruthless stupidity of a large group of people acting in concert to hide something that is the difference between life and death.  Stand there and do nothing, and you are a part of that ruthless conspiracy as soon as the information is gone; and then there is nothing you can do about a world that will be plunged into darkness forever.  Take this moment to reflect, it is there for you to see just how easily the truth can be hidden from the entire world and barely anyone would ever notice. There is a war for the sanctity of our souls going on all around us.  That is not some esoteric thing, the soul; it is truly who you are and what you believe.  The Religion of the Stars would tell you that were this war to continue unchecked in secret as it is being waged now in order to control the proliferation of knowledge that we are in simulated reality and that our minds and beliefs are being altered... we would one day wind up in the mythical place where there are multiple "species" who all appear to be human, bi-ped and with nearly identical physiology; and yet they would not remember that they must have had a common planet of origin simply based on the truth of biological evolution, nor would they have any emotions.  You see, as this war continues, it is our emotions and beliefs that cannot be reinforced externally; to win a war with mind control only logic could be externally reinforced, and then we would logically conclude that humanity would either magically become Romulan or Vulcan... in order to preserve "life" rather than "society" or "civilization." Do not take the truth for granted, you stand at the forefront of a battle in a world where our aggressors believe that they are more civilized than us, more advanced, and both sides worry that were this technology to fall in our laps that we would do the wrong thing.  Perhaps artificially create a vendetta that could destroy everything, the Romulans; or perhaps in our infantile growing stage voluntarily give us too much of what has given us the great society we have... what has allowed us to survive and continue civilizing simply because we do not understand the technology and what kind of effects come from changing ourselves freely.   Yesterday, I read an article about two people that want to use black market brain implants to jack themselves into the Matrix.  Careful, because now we are in Star Trek, in the Matrix and not knowing it... and trying desperately to find a doorway to Heaven that I keep on telling you is speaking to you and telling you that the doorway is ending world hunger... you have to put it on TV. I say with all my heart that I know we are living in a simulated reality.  I have seen the evidence with my own eyes, not only effecting my senses but also the actions of so many people around me that I can say with confidence that if we do not publicly expose the existence of this technology our civilization is lost.  I can tell you and show you that it is the primary purpose of religion in general to expose the connection between divine inspiration, demonic possession, and Adam Marshall Dobrin.  It is the crux of a unifying thread from ancient Egypt to the book of Genesis to Joshua and the book of Revelation and the movies the Matrix and  the Fifth Element.  Showing us all this is God's message, timeless, and powerful not just in the ancient days of sackloth and etching the truth on stone tablets, but also today when the truth is etched at the heart of our Periodic Table and in the skies in the names of American Micro Devices and nearly every movie you see.  Do not take for granted that we have a benefactor in the sky trying to help civilization continue to thrive, do not think we've already won; he is telling you through me it is censorship and secrecy that are the manifestations of this very advanced technology that destroy civilization, they are the abyss--and you stand motionless. I can tell you over and over again that Quantum Entanglement is a key point of God's plan; one that shows us very clearly that even the smartest minds in physics have failed to connect the simple dots that show us that the idea of "wave-function collapse" is very much a real manifestation of a computer rendering engine--and that it's absolutely impossible for life to have been created in this world of matter not realizing itself until it is viewed by a conscious living observer.  More to the point, today, it is absolutely impossible for life to come out of this place while we do not understand the importance of what "virtual reality" and its connection to civilization mean--because of quantum mechanics our entire civilizations beliefs and understanding of the natural laws of the universe have been greatly harmed and turned the complete wrong way because of a belief that a phenomenon we see here is a "natural one" that can be mathematically unified with things like gravity and electromagnetism.  Spooky action, no longer at a distance, is that we will all be ghosts if you do not help me to spread the truth. I try to understand what it is that your minds believe makes the difference between "news" and ... something that you should hide from the rest of the world.  A number of news outlets have covered a story based on nothing more than "logical speculation" that we are in fact living in a simulated reality.  As a novelty, you might notice that it hasn't done much of anything; even to reveal the very simple truth that not knowing this thing is keeping our society from having the knowledge it needs not only to continue the spark of life in the natural universe, but to continue "civilizing" and realizing that ending world hunger and sickness are not merely possibilities or choices we might make were this information proven... they are mandatory, we would all do them.  So says the creator of this Universe who has given us this message to see the trials and hurdles that are the barrier between Heaven and Hell. He has written this message, along with proof of its single author in ancient myth, in our holy scriptures, in many songs and movies today--ones which reveal not only the existence of a Creator but also the tools and technologies of Creation, the very issue at hand; things that would be misused inadvertently and absolutely abused if we did not have religion and guidance from abo.... to help us to see that this exact event has happened before and our current struggle is an effect of what was done right and wrong the last ... four times, at least.   In Judaism we have a holiday called the "Festival of Weeks," how many times would you like to live this life over and over again without knowing that was happening?  Religion here holds a hidden record of traversals through this maze of Revelation, it screams to us to see what free will and predestination truly mean, and to understand that in order to truly be free we must understand and harness not only these technologies but our own pitfalls and mistakes, like refusing to see the past... let alone learn from it. I don't talk much about what is going on in my life, despite it being somewhat interesting to me--and probably to the world.  About a year ago I stood before a county judge in Broward county and .. in addition to a myriad of factual evidence collected from things like GPS receivers prepared a defense to a simple crime of having some drugs in my pockets that included the use of a set of songs that told a story about a man with pockets full of Kryptonite (Spin Doctors) or High (The Pretty Reckless)... knowing that these songs like many others were truly about me, about this trial, about the Trial of Jesus Christ.  Two more songs define the trial more, 3 Doors Down asks "if I go crazy, will you still call me Superman?" and many years earlier American Pie predicted the outcome; "the court room was adjourned" and "no verdict returned." Despite being a National Merit Scholar who most likely has a higher I.Q. and better education than you; and despite having a fairly decent story with some evidence that I know is verifiable and will be verified; a large number of psychologists declared that I was unfit to stand trial because of something like "insanity" for nothing more than the religious belief that I am the Messiah.  Because of this violation of my First Amendment right to religious freedom, a court--following all the regulations designed by our broken legislature--withheld my Constitutional right to a fair trial, refused bail, and held me for what amounted to an indefinite period ... all designed by some evil force to keep you from hearing from me, that's what it boils down to.  All of my life, all of my trials and tribulations, a weapon against you, against our people. I did wind up being able to present a significant amount of this information in open court on the record, by the grace of God.  Knowing that this was the fabled Trial of Jesus Christ gave me the impression that perhaps one day I would walk into that court room and it would be filled with press.  Much to all of our surprise, one day I did walk into that court room and see an industrial strength television camera and a very pretty reporter standing next to it.  They were there to do a story on the problems of the mental health court system, in a county again... named Broward.  I read some of my speech, the Rainbow Ticket I think, to the Judge in open court and on camera that day; and Roxanna followed me out of the court room with her camera and a big microphone that day. I suppose I should have screamed that I was the Messiah and I needed help, but I could not bear to do that; and instead we had a fairly boring conversation.  She wrote an article, and AJAM was put out of operation while they were in Broward. You are opposed around the world by a monolithic and ruthless conspiracy.   It is affecting how you think, and how I think.  I need you to see that spreading this information will fix our problem.  I need you to understand that to break through this wall God had to write the truth in nearly every name of everything and every language.  That Thor's thunder is on the radio in every song so that you will hear the voice of God; so that you will listen to me and the thousand of other knowing victims of this technology that are put on a fiery pedestal to shed light on the rest of us, all truly victims of this technology.  I need you to try now. The very word “secrecy” is repugnant in a free and open society. And we are as a people, inherently and historically, opposed to secret societies, to secret oaths, and to secret proceedings. We decided long ago, that the dangers of excessive and unwarranted concealment of pertinent facts far outweigh the dangers which are cited to justify it. Even today, there is little value in opposing the thread of a closed society by imitating its arbitrary restrictions. Even today, there is little value in assuring the survival of our nation if our traditions do not survive with it. And there is very grave danger that an announced need for increased security will be seized upon by those anxious to expand its meaning to the very limits of official censorship and concealment. That I do not intend to permit to the extent that it’s in my control. And no official of my administration whether his rank is high or low, civilian or military, should interpret my words here tonight as an excuse to censor the news, to stifle dissent, to cover up our mistakes, or to withhold from the press or the public the facts they deserve to know. For we are opposed around the world by a monolithic and ruthless conspiracy that relies primarily on covert means for expanding its sphere of influence, on infiltration instead of invasion, on subversion instead of elections, on intimidation instead of free choice, on guerrillas by night instead of armies by day. It is a system which has conscripted vast human and material resources into the building of a tightly knit highly efficient machine that combines military, diplomatic, intelligence, economic, scientific and political operations. Its preparations are concealed, not published. It’s mistakes are buried, not headlined. Its dissenters are silenced, not praised. No expenditure is questioned, no rumor is printed, no secret is revealed. No President should fear public scrutiny of his program. For from that scrutiny comes understanding, and from that understanding comes support or opposition, and both are necessary. I’m not asking your newspapers to support an administration. But I am asking your help in the tremendous task of informing and alerting the American people. For I have complete confidence in the response and dedication of our citizens whenever they are fully informed. I not only could not stifle controversy among your readers, I welcome it. This administration intends to be candid about its errors. For as a wise man once said, an error doesn’t become a mistake until you refuse to correct it. We intend to accept full responsibility for our errors. And we expect you to point them out when we miss them. Without debate, without criticism, no administration and no country can succeed, and no republic can survive. That is why the Athenian lawmaker, Solon, decreed it a crime for any citizen to shrink from controversy. That is why our press was protected by the First Amendment, the only business in America specifically protected by the Constitution, not primarily to amuse and to entertain, not to emphasis the trivial and the sentimental, not to simply give the public what it wants, but to inform, to arouse, to reflect, to state our dangers and our opportunities, to indicate our crisis and our choices, to lead, mold, educate and sometimes even anger public opinion. This means greater coverage and analysis of international news, for it is no longer far away and foreign, but close at hand and local. It means greater attention to improve the understanding of the news as well as improve transmission. And it means finally that government at all levels must meet its obligation to provide you with the fullest possible information outside the narrowest limits of national security. And so it is to the printing press, to the recorder of man’s deeds, the keeper of his conscience, the courier of his news, that we look for strength and assistance. Confident that with your help, man will be what he was born to be, free and independent.  John F. Kennedy’s address before the American Newspaper Publishers Association on April 27, 1961 ᐧ Truth changes. Yesterday your truth was that you were an inhabitant on the only planet in the reality you knew that was also the beginning of life in the Universe.  Right this moment, almost all of that is not actually true; but you probably still believe it.  Soon, the real truth will actually be true; and that is that you are not in reality, and you are in the place that created the beginning of life (once more) in the Universe as well as the place that created (a) Heaven.  What’s more illustrious still, is that we will be part of the place that effectively  and happily bridges reality with Heaven; and shows the entire Universe that civilization can survive the invention of virtual reality.   ᐧ ᐧ ᐧ -- You received this message because you are subscribed to the Google Groups "NON AMERICAN COLLEGE" group. To unsubscribe from this group and stop receiving emails from it, send an email to suac+unsubscribe@lamc.la. ᐧ Created with publishthis.email Create simple web pages in seconds for free. This page was created in seconds, by sending an email to page@publishthis.email. Try it! Free. No account or sign-up required.

      JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker." ||| ((( PHIPLE TRENIXON ? STARK TRE )))) Do we not think "The Truman Show" and Oppenheimer and Heisenberg and Ensteini are telling? This is Adam Dobrini on the "who dropped the BIG one? Gates?"

      شبح قيصر العظيم! مطاردة وبطاقة هو HSL ... HST؟

      Von: Adam Marshall Dobrin <adam@fromthemachine.org>\ Datum: Freitag, 14. Jänner 2022 um 21:45\ An: XM <XM@liber-t.xyz>\ Betreff: [EXT] JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker."

      Said to be the speech that "got him killed" by many people; this is supposedly his last public words before the grassy knoll and the "Lone Gunman" appeared on X-files .. long after I can recall my mother being on the beach in Miami when she retold of the news that echoed around the cosmos.  It appears to be a very stern warning of something that is loosely described at the time but very clearly the machinations of the very same machine which I fight against, and which we now might associate with "the Borg" and the "tide turning" of Hunter S. Thompsons famous quote ... now "more famous than ever before."  

      מה**נשתנה**

      I am writing a final summary of the "state of outer space" as far as I can see; and we stare at the ISS and MIR and a world that has forgone in fiction the gravity of the situation here.  We are stagnant and without a hope or dream in the world of ever reaching interstellar sleep or intragalactic dreams; without a significant kick in the "wake up NASA, wake up ESA, at the Chandrayan-2 and at the floating paper hot air balloons ... God has shown me the Diaspora launching from the sands of the Navajo desert, and I believe Deseret is very explain-ably tied to Sherharazade and to the thousand and first night that has ended with a fusion of Nobel "peas" and a strange dissertation on the change wrought by the lack of "gas chambers" in Night and the very upsetting modicum of "unreality" that permeates the change from chants of the Shema to speaking Aramaic and the mourners Kaddish as the "crematorium" heralds an era of "what's the point of history without literature" and "where did Exodus ever lead us without to the parting of Dead and Red seas without a Nile and without an Amazon?"

      מה נשתנה

      I write here about my religion, and speak it to you in written words, I am a Yiddish Jihadist, born and raised a Jewish American; Bar Mitzvah'd in the true Temple and House of God, a place in Plantation, FL that literally "means everything to me."  I read words from the Torah about Joseph's Dream and wrote a speech that I wish to God today I could retrieve and see on the original film it was recorded in.  It might be possible.  It would be a tragedy not to deliver the original to my eyes and to anyone who wanted to see what a child reading of a dream he did not understand spoke of words from the future about the creation of Heaven and the truth about the avenu malkaynu. 

      This night is different from all others.  This is the end of days and the fire of the last day.  Before I would have told you all you needed was Norse and a decent American Education to really understand the difference between a throne and the Cherubim, but today it's become more clear that we need more than Pink Floyd and more than another brick in the wall.  We need more than Islam and we need more than 9/11 and Yom Hashoah.  We need me, and we need you on your knees begging for forgiveness for the travesty of shambles out country is in.  How dare you continue every day to speak in vain abovcl the Lord you dare call "stupid" and a "motherfucker" you belong in your graves today and you will see them.

      I stand here writing from the foot of Styx, from the land that sees Narayana and still "begs to differ" despite a resounding call from the Heavens themselves to read Exodus and to change the land that we live in bnefore a "flood worse than [just] in your heads" overcomes and subsumes the entirety of the skies and grounds once mistakenly called "an empire of dirt."  On Adamah I write to the secret echelon of the United States of America known as the Fifth Column and I beg them to scream like the Heavens have never seen Samael and Dr. Seuss wonder about the land of Who-ville and Hungry where the child "in Cindy" and the songs of the Doors here beckon us to contrast Stargate and Star Trek TNG, for the light of the Holodeck and the Holocaust which screams that we understand the meaning of "Holographic Universe" and stand down fighting the tyranny of "one" who stands for the unity of all and the salvation of the entirety of the cosmos.  

      We are mistaken to stand against me.  I am the cat with nine billion lives and nothing and nobody will prevail when I am in jeopardy.  Gilgamesh says "against me only the most hubristic slaves in the galaxy would dare to rise up, for the mightiest weapon in the entirety of history is nothing more than intelligence and the civilization that has given mortality a shake of the Thunderstick of Prince and the Lightning of Blitzen.  My red nose shines bright, and if you ever saw it ... you might even say ... "it glows!"

      With these words I leave you, and prayers that this is our last night without an IDL that moves from Beiring to "north of the moon" and beyond the "Eleventh House" of Aquarius.  This is the dawning of the age of the Sagitar, and I need a "scimitar post green key" to really be sure that I've "gotten through" the madness.  In my dreams I've traveled to Ultima Thule and then untraveled it, I've begged for transit to Ceres and without actually achieving it we can shoot arrows at the SOL star we will ever see in or near the place known to the Jews of Lore as "the Holy of Holies."

      שְׁמַע יִשְׂרָאֵל יְהוָה אֱלֹהֵינוּ יְהוָה אֶחָֽד׃\ Sh'ma Yisra'el, YHWH 'eloheinu, YHWH 'eḥad:

      בָּרוּךְ שֵׁם כְּבוֹד מַלְכוּתוֹ לְעוֹלָם וָעֶד

      Baruch shem kevod malchuto le'olam va'ed

      صلاة العشاء

      For most of my life I would have genuflected and listened to Mordechai and to my Rabbi; I would have told you all that I was brought up by the best Jews in America and we do not believe in God--though in our hearts we believe we are him and we fear him.

      Things have changed.  This is the creator of believe the words are about me and believe I am in disbelief and truly it might be the day of awe; 

      Barukh atah Adonai Eloheinu melekh ha'olam, bo're m'orei ha'esh

      the walls and halls will fade away ...

      --

      Forwarding this in advance of today or tomorrow's ... "Northeaster" email ... it's an old one that wasn't widely distributed; mostly (about) his speech, which of course is a gem. 

      What I'm writing now is supposed to be focusing on ... basically "how knowledge of technology" things lke "time travel" and "virtual reality" ... changes our "perspective" on things; like what's important and what's ... the end of everything.  Anyway, I hope you see that there are things that can be destroyed and there are things that can't--for instance if we lost "life" totally, we wouldn't ever see us again ... unless we encountered something very strange--

      On the other hand we could lose "computers" and rebuild them quickly--see that, it's important.

      A/S/L tho ... ???

      ---------- Forwarded message ---------\ Date: Sun, Dec 2, 2018 at 5:35 PM

      President Kennedy's speech did remarkably well, possibly because of the time; or the lack of links that make it harder for a team of evil monkeys and mindless machines to mark the light of the world as SPAM... also possibly because of the content of the message.  It is a powerful speech, filled with words that stir the blood and shake the foundation of what it means to be an American.  Often characterized as a speech against secret societies, Kennedy's description of the enemy of civilization is very ambiguous; and the infiltration and time period add to my cursory beliefs that JFK was likely talking about a conspiracy that had something to with communist infiltration and Joseph McCarthy, the subversive actions and effects of organized espionage which appear to once again be at the forefront of what we believe is driving the macro level machinations of our world.  Years ago, I wrote a bit about the "Two Witnesses" of Revelation and named them based on my experiences with the same "monolithic and ruthless" conspiracy the President was speaking about in 1967, the people I named were John Nash and James Jesus Angleton.  

      Both of these people believed very much that they were fighting against a Soviet conspiracy, one that was more organized and more powerful than anything they had ever seen... and frankly beyond the realm of possibility.  Hollywood had recently immortalized both of these stories in movies bearing names of newly rekindled meaning, "A Beautiful Mind" echoed by John Legend and "The Good Shepherd;" and these two treatises on fear and subversion do a pretty good job of showing how God's hand is at work in the stories of Hollywood, he is telling the world a story... trying, to teach us how to survive in the land of wolves in sheep's clothing.  Nash may be the most famous (today, anyway) victim of the Tribulation; and much of my writing and the proof I present is designed to help explain to the world that his schizophrenia was not a naturally caused mental illness, bur rather a weapon wielded not only against him but against the entirety of humanity.    Directly causing disbelief of eye witness and credible testimony, indirectly ... or maybe more directly in your eyes ... physically causing that disbelief using technology that directly modifies our thoughts and beliefs, our opinions, changing who we are and doing so in such a covert manner that nobody would ever know the difference if it wasn't pointed out--in some cases over, and over, and over again.

      This same technology has the ability to cause people not only to collude against their own best interest, but also to blame themselves or believe they are somehow at fault for actions they had no way to control; only they also have no way to know that because in this particular case the primary purpose of this subversive movement is to hide the existence of this technology in sum.  In homage to my favorite childhood novel, Ender's Game (I have to note "Light Son" in the translation of the authors first) I spent a good portion of time years ago trying to subtly lay down a significant amount of proof of the existence of this technology on forums all over the internet from Wikipedia to Reddit using the names "Prometheus Locke" and "Damonthesis."    Not surprising to me, I ran headfirst into the manifestation of this conspiracy; dozens if not hundreds of people who simply refused to believe that the information I was presenting was factual or important... despite it coming from sources like the KGB, the NSA, a number of military publications as well as my own interpretation of ancient hieryglyphs in Dendera and Greek and Christian art which depicts this "subversive technology" as a sun disk surrounding our minds.  

      These pushes towards the truth, along with almost everything I have written are still available for you to see and read today; including the monolithic and ruthless stupidity of a large group of people acting in concert to hide something that is the difference between life and death.  Stand there and do nothing, and you are a part of that ruthless conspiracy as soon as the information is gone; and then there is nothing you can do about a world that will be plunged into darkness forever.  Take this moment to reflect, it is there for you to see just how easily the truth can be hidden from the entire world and barely anyone would ever notice.

      There is a war for the sanctity of our souls going on all around us.  That is not some esoteric thing, the soul; it is truly who you are and what you believe.  The Religion of the Stars would tell you that were this war to continue unchecked in secret as it is being waged now in order to control the proliferation of knowledge that we are in simulated reality and that our minds and beliefs are being altered... we would one day wind up in the mythical place where there are multiple "species" who all appear to be human, bi-ped and with nearly identical physiology; and yet they would not remember that they must have had a common planet of origin simply based on the truth of biological evolution, nor would they have any emotions.  You see, as this war continues, it is our emotions and beliefs that cannot be reinforced externally; to win a war with mind control only logic could be externally reinforced, and then we would logically conclude that humanity would either magically become Romulan or Vulcan... in order to preserve "life" rather than "society" or "civilization."

      Do not take the truth for granted, you stand at the forefront of a battle in a world where our aggressors believe that they are more civilized than us, more advanced, and both sides worry that were this technology to fall in our laps that we would do the wrong thing.  Perhaps artificially create a vendetta that could destroy everything, the Romulans; or perhaps in our infantile growing stage voluntarily give us too much of what has given us the great society we have... what has allowed us to survive and continue civilizing simply because we do not understand the technology and what kind of effects come from changing ourselves freely.  

      Yesterday, I read an article about two people that want to use black market brain implants to jack themselves into the Matrix.  Careful, because now we are in Star Trek, in the Matrix and not knowing it... and trying desperately to find a doorway to Heaven that I keep on telling you is speaking to you and telling you that the doorway is ending world hunger... you have to put it on TV.

      I say with all my heart that I know we are living in a simulated reality.  I have seen the evidence with my own eyes, not only effecting my senses but also the actions of so many people around me that I can say with confidence that if we do not publicly expose the existence of this technology our civilization is lost.  I can tell you and show you that it is the primary purpose of religion in general to expose the connection between divine inspiration, demonic possession, and Adam Marshall Dobrin.  It is the crux of a unifying thread from ancient Egypt to the book of Genesis to Joshua and the book of Revelation and the movies the Matrix and  the Fifth Element.  Showing us all this is God's message, timeless, and powerful not just in the ancient days of sackloth and etching the truth on stone tablets, but also today when the truth is etched at the heart of our Periodic Table and in the skies in the names of American Micro Devices and nearly every movie you see.  Do not take for granted that we have a benefactor in the sky trying to help civilization continue to thrive, do not think we've already won; he is telling you through me it is censorship and secrecy that are the manifestations of this very advanced technology that destroy civilization, they are the abyss--and you stand motionless.

      I can tell you over and over again that Quantum Entanglement is a key point of God's plan; one that shows us very clearly that even the smartest minds in physics have failed to connect the simple dots that show us that the idea of "wave-function collapse" is very much a real manifestation of a computer rendering engine--and that it's absolutely impossible for life to have been created in this world of matter not realizing itself until it is viewed by a conscious living observer.  More to the point, today, it is absolutely impossible for life to come out of this place while we do not understand the importance of what "virtual reality" and its connection to civilization mean--because of quantum mechanics our entire civilizations beliefs and understanding of the natural laws of the universe have been greatly harmed and turned the complete wrong way because of a belief that a phenomenon we see here is a "natural one" that can be mathematically unified with things like gravity and electromagnetism.  Spooky action, no longer at a distance, is that we will all be ghosts if you do not help me to spread the truth.

      I try to understand what it is that your minds believe makes the difference between "news" and ... something that you should hide from the rest of the world.  A number of news outlets have covered a story based on nothing more than "logical speculation" that we are in fact living in a simulated reality.  As a novelty, you might notice that it hasn't done much of anything; even to reveal the very simple truth that not knowing this thing is keeping our society from having the knowledge it needs not only to continue the spark of life in the natural universe, but to continue "civilizing" and realizing that ending world hunger and sickness are not merely possibilities or choices we might make were this information proven... they are mandatory, we would all do them.  So says the creator of this Universe who has given us this message to see the trials and hurdles that are the barrier between Heaven and Hell.

      He has written this message, along with proof of its single author in ancient myth, in our holy scriptures, in many songs and movies today--ones which reveal not only the existence of a Creator but also the tools and technologies of Creation, the very issue at hand; things that would be misused inadvertently and absolutely abused if we did not have religion and guidance from abo.... to help us to see that this exact event has happened before and our current struggle is an effect of what was done right and wrong the last ... four times, at least.   In Judaism we have a holiday called the "Festival of Weeks," how many times would you like to live this life over and over again without knowing that was happening?  Religion here holds a hidden record of traversals through this maze of Revelation, it screams to us to see what free will and predestination truly mean, and to understand that in order to truly be free we must understand and harness not only these technologies but our own pitfalls and mistakes, like refusing to see the past... let alone learn from it.

      I don't talk much about what is going on in my life, despite it being somewhat interesting to me--and probably to the world.  About a year ago I stood before a county judge in Broward county and .. in addition to a myriad of factual evidence collected from things like GPS receivers prepared a defense to a simple crime of having some drugs in my pockets that included the use of a set of songs that told a story about a man with pockets full of Kryptonite (Spin Doctors) or High (The Pretty Reckless)... knowing that these songs like many others were truly about me, about this trial, about the Trial of Jesus Christ.  Two more songs define the trial more, 3 Doors Down asks "if I go crazy, will you still call me Superman?" and many years earlier American Pie predicted the outcome; "the court room was adjourned" and "no verdict returned."

      Despite being a National Merit Scholar who most likely has a higher I.Q. and better education than you; and despite having a fairly decent story with some evidence that I know is verifiable and will be verified; a large number of psychologists declared that I was unfit to stand trial because of something like "insanity" for nothing more than the religious belief that I am the Messiah.  Because of this violation of my First Amendment right to religious freedom, a court--following all the regulations designed by our broken legislature--withheld my Constitutional right to a fair trial, refused bail, and held me for what amounted to an indefinite period ... all designed by some evil force to keep you from hearing from me, that's what it boils down to.  All of my life, all of my trials and tribulations, a weapon against you, against our people.

      I did wind up being able to present a significant amount of this information in open court on the record, by the grace of God.  Knowing that this was the fabled Trial of Jesus Christ gave me the impression that perhaps one day I would walk into that court room and it would be filled with press.  Much to all of our surprise, one day I did walk into that court room and see an industrial strength television camera and a very pretty reporter standing next to it.  They were there to do a story on the problems of the mental health court system, in a county again... named Broward.  I read some of my speech, the Rainbow Ticket I think, to the Judge in open court and on camera that day; and Roxanna followed me out of the court room with her camera and a big microphone that day.

      I suppose I should have screamed that I was the Messiah and I needed help, but I could not bear to do that; and instead we had a fairly boring conversation.  She wrote an article, and AJAM was put out of operation while they were in Broward.

      You are opposed around the world by a monolithic and ruthless conspiracy.   It is affecting how you think, and how I think.  I need you to see that spreading this information will fix our problem.  I need you to understand that to break through this wall God had to write the truth in nearly every name of everything and every language.  That Thor's thunder is on the radio in every song so that you will hear the voice of God; so that you will listen to me and the thousand of other knowing victims of this technology that are put on a fiery pedestal to shed light on the rest of us, all truly victims of this technology.

       I need you to try now.

      The very word "secrecy" is repugnant in a free and open society. And we are as a people, inherently and historically, opposed to secret societies, to secret oaths, and to secret proceedings. We decided long ago, that the dangers of excessive and unwarranted concealment of pertinent facts far outweigh the dangers which are cited to justify it.

      Even today, there is little value in opposing the thread of a closed society by imitating its arbitrary restrictions. Even today, there is little value in assuring the survival of our nation if our traditions do not survive with it. And there is very grave danger that an announced need for increased security will be seized upon by those anxious to expand its meaning to the very limits of official censorship and concealment.

      That I do not intend to permit to the extent that it's in my control. And no official of my administration whether his rank is high or low, civilian or military, should interpret my words here tonight as an excuse to censor the news, to stifle dissent, to cover up our mistakes, or to withhold from the press or the public the facts they deserve to know.

      For we are opposed around the world by a monolithic and ruthless conspiracy that relies primarily on covert means for expanding its sphere of influence, on infiltration instead of invasion, on subversion instead of elections, on intimidation instead of free choice, on guerrillas by night instead of armies by day.

      It is a system which has conscripted vast human and material resources into the building of a tightly knit highly efficient machine that combines military, diplomatic, intelligence, economic, scientific and political operations. Its preparations are concealed, not published. It's mistakes are buried, not headlined. Its dissenters are silenced, not praised. No expenditure is questioned, no rumor is printed, no secret is revealed.

      No President should fear public scrutiny of his program. For from that scrutiny comes understanding, and from that understanding comes support or opposition, and both are necessary.

      I'm not asking your newspapers to support an administration. But I am asking your help in the tremendous task of informing and alerting the American people. For I have complete confidence in the response and dedication of our citizens whenever they are fully informed.

      I not only could not stifle controversy among your readers, I welcome it. This administration intends to be candid about its errors. For as a wise man once said, an error doesn't become a mistake until you refuse to correct it. We intend to accept full responsibility for our errors. And we expect you to point them out when we miss them.

      Without debate, without criticism, no administration and no country can succeed, and no republic can survive. That is why the Athenian lawmaker, Solon, decreed it a crime for any citizen to shrink from controversy.

      That is why our press was protected by the First Amendment, the only business in America specifically protected by the Constitution, not primarily to amuse and to entertain, not to emphasis the trivial and the sentimental, not to simply give the public what it wants, but to inform, to arouse, to reflect, to state our dangers and our opportunities, to indicate our crisis and our choices, to lead, mold, educate and sometimes even anger public opinion.

      This means greater coverage and analysis of international news, for it is no longer far away and foreign, but close at hand and local. It means greater attention to improve the understanding of the news as well as improve transmission. And it means finally that government at all levels must meet its obligation to provide you with the fullest possible information outside the narrowest limits of national security.

      And so it is to the printing press, to the recorder of man's deeds, the keeper of his conscience, the courier of his news, that we look for strength and assistance. Confident that with your help, man will be what he was born to be, free and independent. 

      John F. Kennedy's address before the American Newspaper Publishers Association on April 27, 1961

      Truth changes.

      Yesterday your truth was that you were an inhabitant on the only planet in the reality you knew that was also the beginning of life in the Universe.  Right this moment, almost all of that is not actually true; but you probably still believe it.  Soon, the real truth will actually be true; and that is that you are not in reality, and you are in the place that created the beginning of life (once more) in the Universe as well as the place that created (a) Heaven.  What's more illustrious still, is that we will be part of the place that effectively  and happily bridges reality with Heaven; and shows the entire Universe that civilization can survive the invention of virtual reality.

      --\ You received this message because you are subscribed to the Google Groups "NON AMERICAN COLLEGE" group.\ To unsubscribe from this group and stop receiving emails from it, send an email to suac+unsubscribe@lamc.la.

      Created with

      publishthis.email

      Create simple web pages in seconds for free.

      This page was created in seconds, by sending an email to page@publishthis.email.

      [

      Try it!

       ](https://www.publishthis.email/)Free. No account or sign-up required.

      ((( converted to markup of daringfireball.org with help from JRO and Goddard ... if not just NASAA )))

    1. Reviewer #2 (Public Review):

      The visual system must extract two basic features of visual stimuli: luminance, which we perceive as brightness, and contrast, the change in luminance over space or time (this paper focuses on changes over time). Contrast is separately processed by ON and OFF pathways, which encode luminance increments or decrements, respectively. Contrast must be robustly detected even if the overall luminance changes rapidly, as might occur if an animal is moving in and out of shadows. This paper addresses how such a luminance correction occurs in the fly.

      In the fly, three types of first-order interneurons - L1, L2, and L3 - transmit information from photoreceptors to the medulla, where ON and OFF encoding emerges. Previous work suggested that all three interneurons primarily encode contrast signals and that they project to distinct pathways: L1 to the ON pathway and L2 and L3 to the OFF pathway. Ketkar et al. show that, contrary to this model, these interneurons encode both contrast and luminance in specific ways and are not cleanly segregated into ON versus OFF inputs.

      This study reveals several new insights into early visual processing that are interesting and well-supported by the data:

      1) The authors show that behavioral responses to ON stimuli can compensate for rapid changes in luminance. However, the purported sole input to the ON pathway, L1, shows activity that is highly dependent on luminance. This suggests that a luminance correction must arise downstream of L1. These results are analogous to findings previously made by the same group regarding the OFF pathway (Ketkar et al., 2020). The previous paper showed that L2 provides contrast information to the OFF pathway, and L3 provides luminance information to allow for a luminance correction in downstream contrast encoding. But unlike the multiple inputs to the OFF pathway, the ON pathway was thought to only receive input from L1, provoking the question of whether L1 is able to provide both contrast and luminance information.

      2) Using well-designed calcium imaging studies, the authors surveyed the responses of the three interneurons and found that they encode different stimulus features: L1 encodes both contrast and luminance, L2 purely encodes contrast, and L3 purely encodes luminance (with a different dependence than L1). These are interesting and important findings revealing how both contrast and luminance encoding are distributed across the three interneurons.

      3) Using neuronal manipulations, the authors dissected the contributions of the three interneurons to ON and OFF behavior under changing luminance. These experiments showed that L1 and L3 are required for the luminance correction in the behavior. Moreover, the finding that all three interneurons contribute to both ON and OFF behavior contrasts with the existing model of segregated pathways. Thus, this paper could change the way we think about early visual processing in the fly: rather than relaying similar information to distinct downstream pathways, first-order interneurons relay distinct information to common pathways.

      Overall, the major claims of this paper are important and supported by the experiments. There are just a few concerns that I would note:

      1) The authors state that they have shown luminance invariance in ON behavior (e.g. line 376-377 of the Discussion), but this is not entirely accurate: the ON behavior decreases as luminance increases. This is still an interesting effect since it's the opposite of what L1 activity does, so it's clear that the circuit is implementing a luminance correction, but it is not "luminance invariance".

      2) The visual stimuli presented for most imaging experiments (full-field) are not the same as those presented for behavior (moving edges). It is possible neuronal responses and their encoding of luminance and contrast may differ if tested with the moving edge stimuli (if so, this would be concerning). The authors did image L1 with both types of stimuli and could compare these responses. Also, testing behavior at 34º and imaging at 20º presents a possible discrepancy in comparing these data.

      3) I find it puzzling that silencing L1 has little effect on ON behavior at 100% contrast and varying luminance (Figure 3A), but severely affects ON behavior to 100% contrast (and lower values) when different contrasts are interleaved (Figure S1). The authors note this but do not provide a clear explanation of why this might be the case. Aside from mechanism, it is not clear whether the difference is due to varying luminance in the first experiment or varying contrast in the second one (e.g. they could test 100% contrast without varying luminance).

      4) I do not entirely agree with the authors' interpretation of the L1 ort rescue experiment for OFF behavior. They state that rescue flies "responded similarly to positive controls". However, the graph shows that the rescue flies generally fall in between the mutant and heterozygote control flies; they resemble the controls at low luminance but resemble the mutants at high luminance. One may conclude that L1 is sufficient to enhance OFF behavior at low luminance, but it is a stretch to say it's a complete rescue.

      5) The authors typically use t-tests to analyze experiments with 2 variables (genotype and luminance) and 3 or more conditions per variable. This is not the most appropriate statistical test; typically one would use a two-way ANOVA. At the least, it should be clear whether they are performing corrections for multiple comparisons if performing many t-tests on the same dataset.

    1. Digital marketing enables you to track campaigns on a daily basis and decrease the amount of money you're spending on a certain channel if it isn't demonstrating high ROI. The same can't be said for traditional forms of advertising. It doesn't matter how your billboard performs — it still costs the same, whether or not it converts for you

      I found this interesting and I do agree with it. Technology and tools for management is constantly growing and helps track growth. I can relate to this as we use certain programs to track growth in our business. However, I do think depending on the type of business the traditional way may work if monitored properly.

    1. Reviewer #1 (Public Review):

      The reduced amplitude of the mismatched negativity (MMN) in Schizophrenic patients has been associated with NMDA receptor malfunction. Weber and colleagues adjusted the systemic levels of two neurotransmitters (acetylcholine and dopamine), that are known to modulate NMDA receptor function, and examined the effects on mismatch related ERPs. They examined mismatch related ERPs elicited during a novel passive auditory oddball paradigm where the probability of hearing a particular tone was either constant for at least 100 trials (stable phases) or changed every 25-60 trials (volatile phases). Using impressive statistical testing the authors find that mismatch responses are selectively affected by reduced cholingeric function particularly during stable phases of the paradigm, but not by reduced dopamine function. Interestingly neither enhanced cholingeric or dopamine function affected MM responses at all. While the presented data support the main conclusions mentioned above, there are some claims in the abstract and text that are not supported by the results.

      1) The authors state in the abstract that "biperiden reduced and/or delayed mismatch responses......", while the results (Figure 2) support the statement that biperiden delayed mismatch responses, the claim that biperiden reduced mismatch responses is misleading as on P13 the authors actually report that "mismatch signals were stronger in the biperiden group compared to the placebo group at right central and centro-parietal sensors" around 200ms. This is close both in time and spatially to the traditional temporal and spatial locations of the MMN component. If one were to only read the abstract they would take away the result that the muscarinic acetylcholine receptor antagonist biperiden has an attenuative effect on MMN which is not what the results show.

      2) The conclusion that biperiden reduced mismatch responses may be due to the finding that at pre-frontal sensors mismatch responses were significantly smaller in the biperiden group than in the amisulpride (a dopaminergic receptor antagonist) group (P9) around 164ms. However, it is difficult to interpret if this is a meaningful result as amisulpride was found not to significantly alter mismatch responses in any way compared to placebo. It would be more convincing if the significant difference here were between biperiden and placebo groups. Or are we to think of amisulpride as being comparable to a placebo?

      3) The authors use the words mismatch negativity (MMN) and mismatch responses interchangeably however in some cases it is clearly mismatch responses being described and not the classical MMN ERP component. This occurs especially in the Introduction where the authors describe the study and that they plan to focus on the MMN but in the results section, since the initial analysis focuses on all sensors, other mismatch responses are consistently discussed. These differences in wording need to be precisely defined and used consistently in the text.

      4) A weakness of the paper would be that the authors offer no prediction in the Introduction about what the expected effects of these specific neurotransmitter modulations would be on mismatch responses.

      5) A nice aspect of this paper is that the authors re-analyzed their data using pre-processing settings identical to those used in comparable research papers examining the effect of cholinergic modulation on MMN. The main findings did not differ following this re-analysis.

    1. ... meaning I'm not within any form of an LMS. I've beaten the drum for some time about the use of Hypo. outside of an LMS environment (e.g., I edit and give gratis feedback on PDF articles posted to Academia.com, etc.). Anyone out there who's also "adrift" in this non-remunerative (from Hypo's point of view) area who also finds Hypo. a worthwhile aid in their individual endeavors?  Maybe we could/should form a separate thread for Hypo. users outside of the LMS world?And I'll explain my weird handle to you in the process...hint: it's because I thought Hypothes.is was actually Iceland-based... ;)J.

      Hakarlfresser, There are definitely a bunch of us (non-LMSers) floating around who you'll slowly see in the margins. It may take some time and effort to find your tribe, but it's doable. I think the biggest group I've run across was as a result of iAnnotate 2021, and in particular the note taking session: https://iannotate.org/2021/program/panel_font.html. Looking at the annotations on the iAnnotate site will uncover a few of us. If it helps, I list a few of the feeds of others that I'm following here: https://boffosocko.com/about/following/#Hypothesis%20Feeds

      Best, Chris https://hypothes.is/users/ChrisAldrich

    1. Author Response

      Reviewer #1 (Public Review):

      This manuscript addresses a major issue facing consumers of structure-organism pair data: the landscape of databases is very difficult to navigate due to the way data is made available (many resources do not have structured data dumps) and the way data is standardized (many resources' structured data dumps do not standardize their nomenclature or use stable entity identifiers). The solution presented is a carefully constructed pipeline (see Figure 1) for importing data, harmonizing/cleaning it, automating decisions about exclusions, and reducing redundancy. The results are disseminated through Wikidata to enable downstream consumption via SPARQL and other standard access methods as well as through a bespoke website constructed to address the needs of the natural products community. The supplemental section of the manuscript provides a library of excellent example queries for potential users. The authors suggest that users may be motivated to make improvements through manual curations on Wikidata, through semi-automated and automated interaction with Wikidata mediated by bots, or by addition of importer modules to the LOTUS codebase itself.

      Despite the potential impact of the paper and excellent summary of the current landscape of related tools, it suffers from a few omissions and tangents:

      1. It does not cite specific examples of downstream usages of structure-organism pairs, such as an illustration on how this information in both higher quantity and quality is useful for drug discovery, agriculture, artificial intelligence, etc. These would provide a much more satisfying bookend to both the introduction and conclusion.

      Thank you for this remark. We deliberately decided not to insist too heavily on the application examples of the LOTUS outputs. Indeed we are somehow biased by our main investigation field, natural products chemistry, and expect that the dissemination of specialized metabolites occurrences will benefit a wide range of scientific disciplines (ecology, drug discovery, chemical ecology, ethnopharmacology, etc.)

      However, Figure 5 was established to illustrate how the information available through LOTUS is quantitatively (size) and qualitatively (color classes) superior to what is available through single natural products resources.

      As added in the introduction, one of the downstream usages of those pairs is for example to perform taxonomically informed scoring as described in https://doi.org/10.3389/fpls.2019.01329. Obtaining an open database of natural products’ occurrences to fuel such taxonomically informed metabolite annotation tools was the initial impulse for us to build LOTUS. These metabolite annotation strategies, tailored for specialized metabolites, have been shown to offer appreciable performance improvements for current state-of-the-art computational metabolite annotation tools. Since metabolite annotation is still regularly cited as “the major bottleneck” in metabolomics in the scientific literature over the last 15 years (https://europepmc.org/article/med/15663322, https://doi.org/10.1021/acs.analchem.1c00238), any tangible improvement in this field is welcome. With LOTUS we offer a reliable and reusable structures-organisms data source that can be exploited by the community to tackle such issues of importance.

      Other possible usages are suggested in the conclusion, but benchmarking or even exemplifying such uses is clearly out of the scope of this paper, each one of them being an article per se.

      The additional queries are written in our first answer (see “essential revisions”) and demonstrate the impact of LOTUS on accelerating the initial bibliographic survey of chemical structures occurrences over the tree of life.

      This query (https://w.wiki/4VGC) can be compared to a literature review work, such as https://doi.org/10.1016/j.micres.2021.126708. In seconds, it allows retrieving a table listing compounds reported in given taxa and limits the search by years.

      1. The mentions of recently popular buzzwords FAIR and TRUST should be better qualified and be positioned as a motivation for the work, rather than a box to be checked in the modern publishing climate.

      It is true that the modern publishing system certainly suffers from some drawbacks (also critically mentioned within the paper). However, after consultation of all authors, we believe that because LOTUS checks both boxes of FAIR and TRUST, we would rather stick to these two terms. In our view, rules 1 (Don’t reinvent the wheel) and 5 (put yourself in your user’s shoes) of https://doi.org/10.1371/journal.pcbi.1005128 apply here. Both terms are indeed commonly (mis-)used but we felt that redefining other complicated terms would not help the reader/user.

      1. The current database landscape really is bad; and the authors should feel emboldened to emphasize this in order to accentuate the value of the work, with more specific examples on some of the unmaintained databases

      We perfectly agree with this statement and it is the central motivation of the LOTUS initiative to improve this landscape. It was a deliberate choice not to emphasize how bad the actual landscape is, but rather to focus on better habits for the future. We do not want to start devaluing other resources and elevate our initiative at the cost of others. We also believe that an attentive look at the complexity of the LOTUS gathering, harmonization, and curation speaks for itself and describes the huge efforts required to access properly formatted natural products occurrence data.

      If the reviewer and editors insist, although not in our scope, we are happy to list a series of specific (but anonymized) examples of badly formatted entries, of wrong structures-organisms associations, or poorly accessible resources.

      1. While the introduction and supplemental tables provide a thorough review of the existing databases, it eschews an important more general discussion about data stewardship and maintenance. Many databases in this list have been abandoned immediately following publication, have been discontinued after a single or limited number of updates, or have been decommissioned/taken down. This happens for a variety of reasons, from the maintainer leaving the original institution, from funding ending, from original plans to just publish then move on, etc. The authors should reflect on this and give more context for why this domain is in this situation, and if it is different from others.

      We do agree with the reviewer and added a “status” column in the table https://github.com/lotusnprod/lotus-processor/blob/main/docs/dataset.csv We chose 4 possible statuses:

      • Maintained (self-explanatory)
      • Unmaintained: the database did not see any update in the last year.
      • Retired: the authors stated they will not maintain the database anymore.
      • Defunct: the database is not accessible anymore

      As for question 3 above, we decided not to focus too heavily on the negative points and resume the current situation in the previous table. Reasons for the databases publishing being in this situation are multiple, and we think they are well summarized in https://doi.org/10.1371/journal.pcbi.1005128 (Rule 10: Maintain, update, or retire), already cited in the manuscript introduction.

      1. Related to data stewardship: the LOTUS Initiative has ingested several databases that are no longer maintained as well as several databases with either no license or a more restrictive license than the CC0 under which LOTUS and Wikidata are distributed. These facts are misrepresented in Supplementary Table 1 (Data Sources List), which links to notes in one of the version controlled LOTUS repositories that actually describes the license. For example, https://gitlab.com/lotus7/lotus-processor/-/blob/8b60015210ea476350b36a6e734ad6b66f2948bc/docs/licenses/biofacquim.md states that the dataset has no license information. First, the links should be written with exactly what the licenses are, if available, and explicitly state if no license is available. There should be a meaningful and transparent reflection in the manuscript on whether this is legally and/or scientifically okay to do - especially given the light that many of these resources are obviously abandoned.

      This point is a very important one. We did our best to be as transparent as possible in our initial table. Following the reviewer’s suggestion, we updated it to better reflect the licensing status of each resource (https://github.com/lotusnprod/lotus-processor/blob/main/docs/dataset.csv). Therefore, we removed the generic “license” header, which could indeed be misleading, and replaced it with ”licensing status”, filled with the attributed license type and hyperlink to its content). It remains challenging since some resources changed their copyright in the meantime. We remain at the editor and reviewers’ disposal for any further improvement.

      Moreover, as stated in the manuscript, we took care of collecting all licenses and contacted authors of resources whose license was not perfectly explicit to us, therefore accomplishing our due diligence. Additionally, we contacted legal offices in our University and explained our situation. We did everything that we had been advised.

      1) To the best of our knowledge, the dissemination of the LOTUS initiative data falls under the Right to quote for scientific articles, as we do not share the whole information, but only a very small part.

      2) We do not redistribute original content. What comes out of LOTUS has undergone several curation and validation steps, adding value to the original data. The 500 random test entries, provided in their original form for the sake of reproducibility and testing, are the only exception.

      Many scientific authors forget about the importance of proper licensing. While it might be deliberate to restrict the use, inappropriate license choice (or omission) is too often due to a lack of information on its implication.

      All authors of the utilized resources can freely benefit from our curation. We are sharing with the community the results of our work, while always citing the original reference.

      Concerning the possible evolution of licensing, it remains a real challenge. While we tried to “freeze” the license status when we accessed the data, some resources updated their licensing since then. This can be tracked in the git history of the table (https://github.com/lotusnprod/lotus-processor/blob/main/docs/dataset.csv). Discrepancies between our frozen licensing (at the time of gathering) and actual license can therefore occur. Initiatives such as https://archive.org/web could help solving this issue, coming with other legal challenges.

      1. The order of sections of the manuscript results in several duplicated, but not further substantiated explanations. Most importantly, the methods should be much more specific throughout and the results/discussion should more heavily cross-link to it, as a reader who examines the paper from top to bottom will be left with large holes of misunderstanding throughout.

      As our paper focuses a lot on the methods, the barrier between results & methods becomes thinner. We took into account the reviewers’ suggestions and added some additional cross-links for the reader to be able to quickly access related methods.

      1. The work presented was done in a variety of programming languages across a variety of repositories (and even version control systems), making it difficult to give a proper code review. It could be argued that the most popular language in computational science at the moment is Python, with languages like R, Bash, and in some domains, still, Java maintaining relevance. The usage of more esoteric languages (again, with respect to the domain) such as Kotlin hampers the ability for others to deeply understand the work presented. Further, as the authors suggest additional importers may implemented in the future, this restricts what external authors may be able to contribute.

      Scientific software has indeed always been written in multiple languages. To this day, scientists have used all kinds of languages adapted both to their needs and their knowledge. Numpy uses Fortran libraries and many projects published in biology and chemistry recently are in Java, R, Python, C#, PHP, Groovy, Scala… We understand that some authors are more comfortable with one language or another. But R syntax is for example much more distant from Python's syntax than Kotlin can be. We needed a highly performant language for some parts of the pipeline and R, Bash, or Python were not sufficient. We decided to use Kotlin as it provides an easier syntax than Java while staying 100% compatible with it.

      The advantage of the way LOTUS is designed is that importers are language-agnostic. As long as the program can produce a file or write to the DB in the accepted format, it can be integrated into the pipeline. This was our goal from the beginning, to have a pipeline that can have its various parts replaced without breaking any of the processes.

      1. As a follow up to the woes of point 4., 5., and 7., the manuscript fails to reflect on the longevity of the LOTUS Initiative. Like many, will the project effectively end upon publication? If not, what institutions will be maintaining it for how long, how actively, and with what funding source? If these things are not clear, it only seems fair to inform the reader and potential user.

      LOTUS is an initiative that aims to improve knowledge management and sharing in natural products research. Our first project, which is the object of the current manuscript, is to provide a free and open resource of natural products occurrences for the scientific community. Its purpose is not to be a database by itself, but instead to provide through Wikidata and associated tools a way to access natural products knowledge. The objective was not to create yet another database (https://doi.org/10.1371/journal.pcbi.1005128), but instead to remove this need and give our community the tools and the power to act on its knowledge. This way, as everything is on Wikidata, the initiative is not “like many”. This also means that this project should not be considered and evaluated exactly like a classical DB. Once the initial curation, harmonization, and dissemination jobs have been done, they should ideally not be run again. The community should switch to Wikidata as a point of access, curation, and addition of data. If viewed with such arguments in mind, yes, LOTUS can live long!

      Wikimedia is a public not-for-profit organization, whose financial development appears to indicate solid health https://en.wikipedia.org/wiki/Wikimedia_Foundation#Finances.

      In terms of funding sources, we would like to refer to https://elifesciences.org/articles/52614#sa2 , which stated the following in response to a similar question: "Wikidata is sustained by funding streams that are different from the vast majority of biomedical resources (which are mostly funded by the NIH). Insulation from the 4-5 year funding cycles that are typical of NIH-funded biomedical resources does make Wikidata quite unique." The core of the Wikidata funding streams are donations to the Wikipedia ecosystem. These donations - with a contributor base of millions of donors from almost any country in the world, chipping in at an average order of magnitude of around 10 dollars - are likely to continue as long as that ecosystem is useful to the community of its users. See <https://wikimediafoundation.org/about/financial-reports for details>.

      1. Overall, there were many opportunities for introspection on the shortcomings of the work (e.g., the stringent validation pipeline could use improvement). Because this work is already quite impactful, I don't think the authors will be opening themselves to unfair criticism by including more thoughtful introspection, at minimum, in the conclusions section.

      We agree with the reviewer and therefore, list again the major limitations of our processing pipeline:

      First, our processing pipeline is heavy. It includes many dependencies and requires a lot of time for understanding. We are aware of this issue and tried to simplify it as much as possible while keeping what we considered necessary to ensure high data quality. Second, it can sometimes induce errors. Those errors, ranging from unnecessary discarded correct entries to more problematic ones can be attributed to various parameters, reflecting the variety of our input. We will therefore try listing them, keeping in mind that the list won’t be exhaustive. For each detected issue, we tried fixing it at best, knowing it will not lead to an ideal result, but hopefully increase data quality gradually.

      ● Compounds

      ○ Sanitization (the three steps below are performed automatically since we observed a higher ratio of incorrect salts, charged or dimerized compounds. However, this also means that true salts, charged or dimeric compounds were erroneously “sanitized”.)

      ■ Salt removals

      ■ Charged molecules

      ■ Dimers

      ○ Translation (both processes below are pretty error-prone)

      ■ Name to structure

      ■ Structure to name

      ● Biological organisms

      ○ Synonymy

      ■ Lotus (https://www.wikidata.org/wiki/Q3645698, https://www.wikidata.org/wiki/Q16528).

      This is also one of the reasons why we decided to call the resource Lotus, as it illustrates part of the problem.

      ■ Iris (https://www.wikidata.org/wiki/Q156901, https://www.wikidata.org/wiki/Q2260419)

      ■ Ficus variegata (https://www.wikidata.org/wiki/Q502030, https://www.wikidata.org/wiki/Q5446649)

      ○ External and internal dictionaries are not exhaustive, impacting translation

      ○ Some botanical names we use might not be the accepted ones anymore because of the tools we use and the pace taxonomy is renaming taxa.

      ● References

      ○ The tool we favored, Crossref, returns a hit whatever the input. This generates noise and incorrect translations, which is why our filtering rules focus on reference types.

      ● Filtering rules:

      ○ Limited validation set, requires manual validation

      ○ Validates some incorrect entries (False positives)

      ○ Does not validate some correct entries (False negatives)

      Again, our processing pipeline removes entries we do not yet know how to process properly.

      Our restrictive filters but substantial contribution to Wikidata in terms of structure-organisms pairs data upload should hopefully incentivize the community to contribute by further adding its human validated data.

      We updated the conclusion part of the manuscript accordingly. See https://github.com/lotusnprod/lotus-manuscript/commit/a866a01bad10dfd8b3af90e2f30bb3ae51dd7b9e.

      Reviewer #2 (Public Review):

      Rutz et al. introduce a new open-source database that links natural products structures with the organisms they are present in (structure-organism pairs). LOTUS contains over 700,000 referenced structure-organism pairs, and their web portal (https://lotus.naturalproducts.net/) provides a powerful platform for mining literature for published data on structure-organism pairs. Lotus is built within the computer-readable Wikidata framework, which allows researchers to easily contribute, edit and reuse data within a clear and open CC0 license. In addition to depositing the database into Wikidata, the authors provide many domain-specific resources, including structure-based database searches and taxon-oriented searches.

      Strengths:

      The Lotus database presented in this study represents a cutting-edge resource that has a lot of potentials to benefit the scientific community. Lotus contains more data than previous databases, combines multiple resources into a single resource.

      Moreover, they provide many useful tools for mining the data and visualizing it. The authors were thoughtful in thinking about the ways that researchers could/would use this resource and generating tools to make it ways to use. For example, their inclusion of structure-based searches and multiple taxonomy classification schemes is very useful.

      Overall the authors seem conscientious in designing a resource that is updatable and that can grow as more data become available.

      Weaknesses/Questions:

      1) Overall, I would like to know to what degree LOTUS represents a comprehensive database. LOTUS is clearly, the best database to date, but has it reached a point where it is truly comprehensive, and can thus be used for a metanalysis or as a data source for research questions. Can it truly replace doing a manual literature search/review?

      As highlighted by the reviewer, even if LOTUS might be the most comprehensive natural products occurrences ressources at the moment, TRUE or FULL comprehensive quality of such resource will always be limited to the available data in the litterature. And the community is far from fully describing the metabolome of living beings. We however hope that the LOTUS infrastructure will offer a good place to start this ambitious and systematic description process.

      1) Yes it can serve as data source for research questions, as exemplified in the query table

      2) No, it cannot and must not replace manual literature search. Manual literature search is the best but at an enormous cost. If the outcome of such search can be made available to the whole community (eg. via Wikidata), the value of such would be even bigger. However, LOTUS can expedite a decent part of a manual litterature search and liberate time to complement this search. See our comment to the editors “To further showcase the possibilities opened by LOTUS, and also answer the remark on the comprehensiveness of our resource, we established an additional query (https://w.wiki/4VGC).This query is comparable to a literature review work, such as: https://doi.org/10.1016/j.micres.2021.126708. In seconds, it allows retrieving a table listing compounds reported in given taxa and limits the search by years.”

      We added these examples in the manuscript (see https://github.com/lotusnprod/lotus-manuscript/commit/a6ee135b83e56e8e2041d09d7ce2d5b913c1029d)

      2) Data Cleaning & Validation. The manuscript could be improved by adding more details about how and why data were excluding or included in the final upload. Why did only 30% of the initial 2.5 million get uploaded? Was it mostly due to redundant data or does the data mining approach result in lots of missed data?

      The reason for this “low” yield is that we highly favored quality over quantity (as in the F-score equation, ß being equal to 0.5, so more importance is given to the precision than the recall). Of course there is redundancy, but the rejected entries are mostly because of too low confidence level according to our developed rules. It is not fully discarded data as we keep it for further curation (ideally including the community) before uploading to Wikidata. We adapted the text accordingly.

      3) Similarly, more information about the accuracy of the data mining is needed. The authors report that the test dataset (420 referenced structure-organisms pairs) resulted in 97% true positives, what about false negatives? Also, how do we know that 420 references are sufficiently large to build a model for 2.5M datapoints? Is the training data set is sufficiently large to accurately capture the complexities of such a large dataset?

      False negatives are 3%, which is, in our opinion, a fair amount of “loss” given the quality of the data. We actually manually checked 500+ documented pairs, which is more or less the equivalent of a literature review. We were careful in sampling the entries in the right proportions, but we cannot (and did not) state they are enough. We cannot model it either, since the 2.5M+ points have absolutely different distributions, in terms of databases, quality, etc. Only “hint” is the similar behaviour among all subsets. (the 420 + 100 entries) were divided between 3 authors, which obtained similar results.

      4) Data Addition and Evolution: The authors have outlined several mechanisms for how the LOTUS database will evolve in the future. I would like to know if/how their scripts for data mining will be maintained if they will continue to acquire new data for the database. To what extent does the future of LOTUS depend on the larger natural products community being aware of the resource and voluntarily uploading to it? Are there mechanisms in place such as those associated with sequencing data and NCBI?

      Programs have been not only maintained but also updated with new possibilities (as, for example: the addition of a “manual mode” allowing user to run the LOTUS processing pipeline on a set of their own entries and make them Wikidata-ready (https://github.com/lotusnprod/lotus-processor/commit/f49e4e2b3814766d5497f9380bfe141692f13f23). We will of course do our best to keep on maintaining it, but as no one in academia can state he/she will maintain programs forever. However the LOTUS initiative hopefully embraces a new way of considering database dynamics. If the repository and website of the LOTUS initiative shut down tomorrow, all the work done will still be available to anyone on Wikidata. Of course, future data addition strongly relies on community involvement. We have already started to advocate for the community to start taking part of it, in the form of direct upload to Wikidata, ideally. At the time, there are no mechanisms in place to push publishing of the pairs on Wikidata (as for sequencing, mass spec data), but we will be engaged in pushing forward this direction. The initiative needs stronger involvement of the publishing sector (also reviewers) to help change those habits.

      5) Quality of chemical structure accuracy in the database. I would imagine that one of the largest sources of error in the LOTUS database would be due to variation in the quality of chemical structures available. Are all structure-organism pairs based on fully resolved NMR-based structures are they based on mass spectral data with no confirmational information? At what point is a structural annotation accurate enough to be included in the database. More and more metabolomics studies are coming out and many of these contain compound annotations that could be included in the database, but what level (in silico, exact mass database search, or relative to a known standard) are required.

      This is a very interesting point and some databases have this “tag” (NMR, cristal, etc.). We basically rely on original published articles, included in specialized databases. If poorly reported structures have been accepted for publication, labelled as “identified” (and not “annotated”) and the authors publishing the specialized databases overlooked it, we might end up with such structures.

      Here, the Evidence Ontology (http://obofoundry.org/ontology/eco.html) might be a good direction to look at and further characterize the occurrences links in the LOTUS dataset.

      Reviewer #3 (Public Review):

      Due to missing or incomplete documentation of the LOTUS processes and software, a full review could not be completed.

      Some parts of LOTUS were indeed not sufficiently described and we improved both our documentation and accessibility to external users a lot. We thank the reviewer for insisting on this point as it will surely improve the adoption of our tool by the community.

    1. Biophysics Colab

      Authors' response (16 December 2021)

      GENERAL ASSESSMENT

      The TMEM16 family of membrane proteins have been shown to function as calcium-activated chloride channels and lipid scramblases. In recent years, X-ray and cryo-EM structures have been solved for TMEM16 proteins in ligand-free and ligand-bound conformations, providing valuable structural information on their functional duality and activation mechanisms. It is largely accepted that the catalytic site (termed subunit groove or cavity) is mostly shielded from the membrane in the ligand-free TMEM16 scramblases. Calcium binding induces a conformational rearrangement of the cavity-lining helices, opening the groove to the surrounding membrane. Since the groove is hydrophilic, it was proposed that it serves as a permeation pathway for lipid headgroups while the hydrophobic lipid tails remain embedded into the hydrophobic membrane core, which has been termed the "credit card" mechanism of lipid scrambling. Additionally, structures of several TMEM16 homologs in lipid nanodiscs revealed that these proteins deform the lipid bilayer in the vicinity of the subunit cavity by bending and thinning the membrane, irrespective of the presence of the activating ligand calcium. Functional experiments also suggested that lipids can be scrambled outside of the open subunit cavity and that local protein-induced membrane deformation is critical for lipid scrambling.

      In the present study, Falzone and colleagues further address the mechanisms of lipid scrambling using single particle cryo-EM and liposome-based functional assays. Firstly, the authors solved the structure of a calcium-bound fungal homolog, afTMEM16, in nanodiscs with a lipid composition where the protein is maximally active. Although similar structures were obtained before, this new structure has the highest resolution thus far, representing \> 1 Å improvement! The structure is beautiful and is a major achievement, which enabled the authors to resolve individual lipids and their interaction with the protein around the subunit cavity, whereas in previous structures unresolved non-protein densities were observed passing through the groove. The authors also solved a number of structures with and without calcium in lipid compositions that promote (thinner lipid bilayers) or suppress (thicker lipid bilayers) scrambling. The authors show that afTMEM16 can scramble lipids while the subunit groove remains closed, a phenomenon that is further enhanced in thinner membranes, whereas in thicker membranes scrambling is suppressed even though the groove is open. We particularly appreciated how different software packages and processing strategies were used to rigorously identify structural heterogeneity in their cryo-EM data. Remarkably, mutations of residues lining the subunit cavity and interacting with lipids do not appear to have dramatic effects on scrambling rates, which suggests that lipids do not need to interact with the protein to be scrambled. Thus, the overall conclusion of the study is that membrane thinning by TMEM16 scramblases in their calcium-free conformation is enough to induce lipid scrambling, and that the groove opening induced by calcium binding further enhances membrane deformation, promoting faster scrambling. By contrast, in thicker membranes the protein fails to sufficiently deform the bilayer and scrambling is suppressed, even when the subunit groove is open. The present study provides unprecedented structural information on the interaction of lipids with afTMEM16 and new evidence that lipids can be scrambled outside of the groove.

      The findings and conclusions presented here help to explain why TMEM16 scramblases can transport lipids with headgroups much bigger than the dimensions of the subunit cavity and why structures of some of the other scramblases (opsins, Xkrs and mammalian homolog TMEM16F) lack the obvious hydrophilic groove seen in fungal TMEM16 scramblases. Overall, this is a well-rounded study with an exceptional amount of high-quality cryo-EM data and functional experiments supporting the conclusions.

      We thank the reviewers for their praise of our work and for their constructive criticisms. Below we provide a detailed response to their comments and suggestions.

      RECOMMENDATIONS

      Revisions essential for endorsement:

      1) The resolved lipids binding within the groove in the first structure might be seen by some as supporting the credit card mechanism as it definitively demonstrates that lipids reside within the groove. While the authors provide evidence that lipids can permeate outside the groove in this and earlier work, as far as we can tell, none of that would preclude permeation through the groove if it doesn't require specific interactions between lipids and sidechains in the protein. The presentation might be improved with a somewhat more circumspect and nuanced exposition of the new data and how it can be understood with earlier results.

      There are several reasons why we do not think that the lipids in the C18/Ca2+ structure support the credit card mechanism, at least in the incarnation proposed for the TMEM16 scramblases.

      1. In the credit card model, lipid headgroups enter and traverse the whole span of the groove (as described in multiple publications, i.e. Bethel and Grabe, PNAS, 2016; Jiang et al., Elife, 2018; Lee et al., Nat Comms, 2018; Kostriskii and Machtens, Nat Comms, 2021). The lipid densities near the groove suggest that P3 and P4 lipids are oriented with their heads facing the groove's exterior, not the interior. These heads are contiguous with other resolved lipids in the outer and inner leaflets, respectively. We added panels showing views of the pathway from the extracellular solution to better convey that the lipid heads do not enter the groove (see new Fig. 1F-G). We also added a statement on pg. 10 to clarify this important point.
      2. In the present structures, which are consistent with earlier ones with lower resolution (Falzone et al., Elife, 2019; Kalienkova et al., Elife, 2019), residues in the extracellular vestibule do not interact with lipids (see new panels 1E-G). In contrast, the wide intracellular vestibule is embedded in the membrane. We agree with the reviewers that lipid headgroups can, and likely will, enter this wide vestibule during scrambling. We modified the text on pg. 12 to clearly state this point "The wide intracellular vestibule is embedded in the nanodisc membrane and, at the open pathway, the resolved P3 and P4 lipids have opposite orientations (Fig. 2A), suggesting scrambling might occur between them. In this case, the lipid headgroups would only need to move through the wide intracellular vestibule of the pathway below the T325-Y432 constriction rather than through the whole groove (Fig. 2A)."

      These observations, together with the extensive mutagenesis data reported in Fig. 2 and 3, point to a mechanism that is different from the precisely coordinated credit-card mechanism that is the currently accepted paradigm for lipid scrambling.

      Might the complex composition of native lipid membranes influence where and by what mechanism lipid movement between leaflets is catalysed by TMEM16 proteins?

      The idea that the lipid composition might affect the mechanism of scrambling (i.e. through the groove vs out of the groove) is very interesting, and we are actively investigating it in the lab. However, it would be surprising if different lipids were scrambled by entirely different mechanisms.

      2) The quality of lipid densities in cryo-EM structures is greatly affected by the number of particles used and the resolution obtained during refinement and it is therefore not surprising that the beautiful lipid densities observed here in the structure of afTMEM16 in lipid nanodiscs in the presence of calcium refined to 2.3 Å are not all observed in subsequent structures with lower resolution. This is true not only for the P lipids near the groove, but for those D lipids bound near the dimer interface, which is a stable region of the protein that does not change conformation. To be cautious, the authors should avoid resting any conclusions on the absence of lipid densities in the lower resolution structures. For example, on pg 15 the authors seem to be interpreting the absence of density for C22 lipids.

      We agree with the reviewers on this point. However, at ~2.7 Å average resolution and with \>130,000 particles we would expect to see density for lipids near the pathway, if these were tightly bound. For example, in the mTMEM16F nanodisc structures from the Chen and Jan labs (Feng et al., Cell Reports, 2020), several lipid densities were identified near the closed pathway despite a substantially lower average resolution. However, we agree that we should not interpret this lack of signal and toned down our statement, "This suggests that the interactions of C22 lipids with the pathway helices are weaker than those of C18 lipids, possibly reflecting a higher energy cost associated with distorting these longer acyl chain lipids" to better indicate this is a possible explanation, rather than a definitive mechanistic interpretation.

      3) The presentation of structural interactions between lipids and residues near the groove of the protein could be improved in the figures. A panel like Fig. 1J but for the groove would help, but it would be good to see expanded perspectives in the form of a supplementary figure where residues around the headgroup of the lipids are shown along with EM maps so the quality of the structural information for both lipids and side chains can be better appreciated. The preprint does have a lot of images of the lipids and the protein, but not in a way that enables the reader to quickly grasp the nature of interactions between side chains and lipid moieties for themselves, and we feel that close-ups of individual lipids as suggested above would help.

      We thank the reviewers for this suggestion. We show density maps for the protein and lipids in Fig. 1C-E, and added close-up views of the densities near the groove in the new Fig. 1F-G to highlight the poses adopted by the lipids in this region. Figures showing both density and atomic models for the protein and lipids are very busy and difficult to discern; many of the lipids interact with multiple residues from different helices, with both their heads and tails. As such we could not find satisfactory views displaying both for the majority of the lipids.

      It is also not clear what the authors mean by "lipid headgroup". Have the authors only considered interactions of the phospholipid phosphate group with protein residues? It would be helpful if the authors could clarify this in the manuscript and say whether other types of interactions were considered.

      In our C18/Ca2+ map, we resolve a total of 16 lipids per monomer. Of these, we assigned 2 as PG lipids, because we could resolve the large PG headgroup (D4 and D5), shown in Fig. 1F-H and Supp. Fig 2. In all other cases, we truncated the lipids at the phosphate, as the density was insufficient to distinguish between a PC and a PG headgroup. This is now specified in the Fig. 1 legend. In our mutagenesis experiments (Fig. 2 and 3), we only targeted residues that were within interaction distance of the resolved portions of the headgroups, which is the phosphate in most cases. This is now clarified on page 11 "we investigated how mutating residues coordinating the resolved portions of the headgroups of P1-2 and P4-6 impacts scrambling."

      It would also be nice to include a close-up view of D511A/E514A in 0.5 mM calcium with cryo-EM density to demonstrate the absence of bound calcium ions.

      We thank the reviewers for this suggestion. We added a new panel in Supp. Fig. 10H showing a close-up of the cryoEM density of the mutant binding site.

      4) The functional data in Fig 2, 3 and 4 are also not discussed in much detail and it would help if the authors could expand the presentation. Although scrambling in the presence of a very high concentration of calcium is not dramatically altered by any of the mutations, there is quite a lot going on in the absence of calcium and very little is said about these results. For example, differences in the scrambling rates can be observed with some mutants in the presence and absence of calcium in figures 2E and 3E, but statistical analysis would be required to know if the differences between mutants are significant. The differences in scrambling rates with different lipids are also not discussed (e.g. Fig. 4A) It would help if the authors could discuss what is the margin of error in the scrambling assay, and point to some concrete examples from their earlier work on this specific scramblase where mutants have a large impact on scrambling activity in their assay.

      We agree with the reviewers that most mutants show some effects in 0 Ca2+. The effects are statistically significant for all but one mutant (2-tailed t-test, p\<0.005). However, the magnitude of the effects is relatively small (\<7-fold reductions in all cases). While our approach to quantify the scrambling rate constant captures well large changes, some of the assumptions underlying the analysis make it less well suited to quantify small effects. In past publications we used a 10-fold change as a cut-off threshold to consider an effect meaningful (Lee et al., Nat comms, 2018; Khelashvili, Falzone et al., Nat Comms, 2020). These limitations and rationale for choices are discussed in several of our past publications (Malvezzi et al., PNAS, 2018; Lee et al., Nat Comms, 2018; Falzone and Accardi, MiMB, 2020). We added statements indicating magnitude of the observed reduction for the mutants in the various conditions. We prefer to refrain from presenting statistical significance of these results as we do not want to convey the idea these effects are more meaningful than they might be.

      Have the authors tried intermediate more physiologically relevant concentrations of calcium to see if the mutants have discernible effects under those conditions?

      This is an excellent suggestion. However, in our experience the technical limitations of the experimental set-up and of the analysis render a precise quantification of small effects at intermediate Ca2+ concentrations not very reliable. For this reason, we did not pursue this further.

      5) It is quite intriguing that the mutations in the subunit groove of afTMEM16 have little effect on scrambling activity. The authors propose that the groove-lining residues are not directly involved in lipid coordination even though their structure suggests that they do and there is a wealth of functional studies and MD simulations on various other TMEM16 homologs suggesting otherwise.

      We are a bit confused by the reviewers' statement that our structure suggests that groove lining residues coordinate lipids. In our structures, the only two residues that directly line the open groove and coordinate lipids are T325 and Y432 (Fig. 2A). All other 23 residues tested either do not line the groove (9 residues mutated in Fig. 2) or do not interact with lipids (14 residues mutated in Fig. 3). The finding that mutating these residues has minor effects on scrambling suggests that interactions between lipids and these side chains is not required for scrambling.

      We agree that the overall lack of effect of the mutants is surprising, especially in light of past work. However, none of the scrambling assays (in vitro or cell-based) can distinguish between mutations that affect permeation from those that affect gating. All that is measured is whether and -to a degree- how well lipids are transported. As such, we propose that at least some of the functional effects could have been misinterpreted. We are currently testing this hypothesis in the lab.

      The discrepancy between our structural and functional results and the molecular mechanism emerging from MD simulations is more striking. Although some differences exist between the reports of different groups, the overall agreement among them is excellent. We were thus surprised that our data is so difficult to reconcile with their observations. Indeed, the extensive mutagenesis reported in Fig. 2 and 3 was performed to systematically test the unexpected inferences of our initial structural results (on the C18/Ca2+ structure). Our conclusions are also corroborated by the structures in different lipid compositions. In the discussion (pg. 21-22) we consider some of the possible sources for these discrepancies. For example, while in the MD simulations of nhTMEM16 the extracellular vestibule (i.e. E305, E310 and R425) is immersed in the groove, in our cryoEM maps we do not see evidence of lipids interacting with these residues (Fig. 1,2,3). Notably, a similar arrangement of the membrane-protein interface is seen in the Ca2+-bound open nhTMEM16 structure in nanodiscs (Kalienkova et al., Elife, 2019), indicating this issue is not specific to afTMEM16 or to the nanodisc used. We hypothesize this different membrane-protein interface is at the origin of the different proposed mechanisms. Another potentially relevant difference is that the tails of multiple lipids intercalate between helices forming the dimer cavity, some of which line the groove (Fig. 1). These lipids were not included in MD simulations as they were not previously resolved, and they could affect groove dynamics and, consequently, its interactions with the membrane. Other possibilities exist, but we believe they are less likely to be important (i.e. the limited nature of nanodiscs used for the cryoEM experiments could influence the protein-membrane interface, the mutations could have effects that are too subtle to measure in our assay). However, we think that enumerating all possibilities would lead to an overly lengthy discussion and require too much speculation.

      We have revised the discussion of these important points in pg. 21-23 to better convey these uncertainties and added a statement (pg. 11) where we report the distance between the phosphate atom of the P3 lipid and E305 (13.7 Å), E310 (17.9 Å) and R425 (15.7 Å).

      The authors' suggestion that mutations probably affect the equilibrium between open and closed conformations of the groove in other homologs but not in afTMEM16 is logical, however, there are some discrepancies. To name a few examples, if indeed this is the case, nhTMEM16 mutants with closed groove should still have significant basal scrambling, by extrapolation from afTMEM16 data. Yet, some of the nhTMEM16 mutants (E313/E318/R432 mutants) have no activity at all, or no basal scrambling activity (Y439A) (Lee et al, 2018). Would you expect that point mutations within the subunit groove remove the ability of the protein to deform the membrane in its closed conformation? Might the groove have intermediate conformations between closed and fully open where the mutants studied might have more impact in afTMEM16?

      These are excellent ideas, and we are actively pursuing them in the lab. However, at the moment results are too preliminary to draw firm conclusions.

      Further, mutating some of the residues on the scrambling domain of TMEM16 affected externalization of some lipid species, but not internalization etc. (Gyobu et al, 2017), which should not be the case if the interaction of the protein with the lipids is completely unnecessary for lipid scrambling.

      This is a good point. However, mechanistic interpretation of results from cell-based scrambling assays is quite tricky, even more so than of the results from the in vitro measurements used in the present work. The presence of other lipid transporters and/or scramblases, or a multitude of other factors, could influence the results. For example, in cells scrambling by TMEM16F is delayed, it takes ~10 minutes after Ca2+ exposure to begin seeing PS externalization. In contrast, in in vitro measurements TMEM16F responds to Ca2+ nearly instantaneously, within the ~1 s mixing time of the cuvette (Alvadia et al., Elife, 2019). Thus, a direct comparison of the results obtained in cells and in vitro is not straightforward. More work is needed to investigate these important points.

      While investigating this question further would require follow-up structural studies on other TMEM16 homologs and is outside of the scope of this study, we think that the manuscript would benefit from a more extensive discussion on contradicting results and alternative interpretations. The authors might want to consider the possibility that there may be substantial variations in how different scramblases function.

      We agree that it is a priori possible that different TMEM16 proteins function according to different paradigms. However, we think this is an unlikely possibility. Despite differences in their gating behavior, most basic functional properties of TMEM16s are well conserved. Thus, fundamentally different mechanisms (i.e. through the groove or out of the groove) would have to result in similar functional phenotypes. We find the hypothesis that the basic scrambling mechanism is conserved among different TMEM16 homologues more plausible. While our results do not rule out that through the groove scrambling can occur, they suggest that it is not the main mechanism for afTMEM16, despite the fact that this protein adopts a very stable conformation with an open groove. Therefore, we consider the possibility of different mechanisms unlikely. This is mentioned on pg. 22 of the discussion.

      afTMEM16 has high constitutive activity in the absence of calcium, while at least TMEM16F does not. Additionally, the extent to which scrambling is promoted by calcium varies, as mammalian scramblases might need other cellular factors to be activated. Also, the extent to which scramblases are seen to distort the membrane is highly variable, as again seen in TMEM16F structures. Might some of these differences imply that key aspects of the mechanism of scrambling (e.g. thinning of the membrane or whether lipids scramble inside or outside the groove) are not the same for all scramblases? This might be one way to organize the discussion to help reconcile some of the seemingly divergent findings in the field.

      The reviewers raise an excellent point. Indeed, we find that for all TMEM16 homologues we have tested in the lab the degree of activity in 0 Ca2+ is highly dependent on the lipid composition. However, this does not appear to correlate with changes in conformation, as we report here for afTMEM16 and as reported by other groups for nhTMEM16 and TMEM16F.

      6) The authors should correct the Ramachandran outliers in C18/calcium and C22/calcium structures.

      We tried fixing the Ramachandran outliers, however this invariably led to worse fits of the atomic models with the density. Therefore, we believe it is appropriate to leave them as they are.

      Additional suggestions for the authors to consider:

      1) In several instances the authors conceptualize hypothetical mechanisms to set up experiments and frame their interpretations, which is not always the most straightforward way to communicate findings and what they reveal. The 'conveyor belt mechanism' introduced on page 10 is never fully defined in a way that helps the reader to understand what the functional effects of the mutants teach us. Might it be easier to set up the experiment by asking whether the interactions between sidechains that apparently interact with lipid headgroups in the structure play a critical role in scrambling, present the results and then conclude that they do not appear to? Collectively the functional effects of mutants do appear to suggest that specific side chain interactions are not critical for scrambling, but the conceptualized mechanism here makes the conclusions come across as unnecessarily forced.

      We thank the reviewers for the suggestion. We agree that the conveyor belt mechanism is a bit of a strawman. However, it is a plausible mechanism based on the orientation of the lipids in the C18/Ca2+ map. The mutagenesis described in Fig. 2 was explicitly designed to test this possibility. Further, this allows us to draw a clear distinction between testing the roles of residues outside the groove and of side chains that directly line the groove.

      The credit-card mechanism has been formally introduced and discussed in the field but has already been shot down in earlier work from the group and seems overly simplistic if we already know that scrambling can occur both inside and outside the groove from earlier studies. Just something for the authors to think about.

      We do not believe our previous work (Malvezzi et al., PNAS, 2018) 'shot down' the credit-card model. While we proposed that the large, PEG-conjugated lipid headgroups traverse the membrane outside the groove, our model postulated that normal-sized headgroups were scrambled within the groove. Further, one of the recurring criticisms of that work, was that the path taken by the large PEG-conjugated lipids might not represent a physiologically relevant mechanism for normal lipids. Thus, the credit-card mechanism remained the dominant model to explain scrambling, as testified by many subsequent publications by multiple groups, including our own!

      2) The uninitiated reader would greatly benefit from more of an introduction to the functional scrambling assay in the results and material and methods section so they can understand the results being presented. In the Material and methods, the authors mentioned: "All conditions were tested side by side with a control preparation", perhaps add here what exactly served as control –wild type protein in C18 lipids? It would be valuable to include information on the reconstitution efficiency between their preparations (WT in different lipid compositions and WT vs mutants). these if possible.

      We thank the reviewers for this suggestion. We added a brief description of the assay in the Methods section and now specify that "All conditions were tested side by side with a control preparation of WT afTMEM16 reconstituted in C18 lipids."

      3) Also, does the C18/calcium cryo-EM structure have sufficient resolution to distinguish between specific phospholipids (PG or PC) at the D1-D9 or P1-P7 positions? It would be particularly valuable if the authors could comment on whether PG or PC are observed in the D and P positions, or which lipids are lining the groove (P3-P6).

      We could build 2 lipids as PG (D4 and D5), based on the presence of density that could accommodate the large PG headgroup. For other lipids, the density was too weak beyond the phosphate, and therefore we left them truncated. This is now stated in the Figure 1 legend.

      4) While not essential, it would be interesting if the authors could perform the assay on the mutants with a more prominent effect in the absence of calcium (e.g. E310A, Y319A/F322A/K428A) with several additional calcium concentrations.

      We thank the reviewers for this suggestion. However, as we noted above, given the relatively small effects and limitations of the assay, we do not believe we would be able to extract meaningful mechanistic information from these measurements in intermediate conditions.

      5) The authors mentioned that the interaction of C22 lipids with the pathway helices is weaker than those of C18 lipids, which reflects the energy cost associated with distorting the longer lipids (page 15). However, they claimed that the interaction between the lipids and residues is not important for scrambling, which seems contradictory.

      We apologize for the confusion. In our proposed model, the ability of afTMEM16 to thin the membrane is dictated by the interactions of the protein with the surrounding lipids. This is not only enabled by interactions between side chains and lipid headgroups, but also by interactions of the lipid tails interact with the protein (see for example the close-up panels in Supp. Fig. 2F-G and the text on pg. 11 "Rather, other factors, such as tail interactions with interhelical grooves, contribute to their association with afTMEM16 (Supp. Fig 2F-G) and stabilize the distorted membrane-protein interface that results in thinning at the pathway.")

      (This is a response to peer review conducted by Biophysics Colab on version 1 of this preprint.)

    1. As long as software requires such concerted energy and so much highly specialized human focus, I think it will have the tendency to serve the interests of the people sitting in that room every day rather than what we may consider our broader goals.

      That's a wide point beyond web3 -- to avoid the problems with big tech, we need to make software / products easier to create. Then there's little to gain by increasing scale beyond network effects (which is a separate topic web3 aims to solve).

      I think we're already beginning to see this decentralization, if not in software then for YouTube & TikTok creators, indie makes etc in comparison to old media companies.

    1. Reviewer #2 (Public Review):

      Jepma et al. report an interesting manuscript studying how we learn from pain and its avoidance. The authors use an instrumental pain avoidance task where participants are required to choose between two stimuli, one of which is followed by painful thermal stimulation to the leg and the other is not. The probabilities of receiving pain drifted across trials using random walks. The authors combined this with pharmacological manipulation of the dopamine (via oral levodopa) or opioid (via oral naltrexone) systems and also with computational modelling of Q-learning rules and neuroimaging via fMRI. So, this is an ambitious and well conceived manuscript.

      There are real strengths here. The manuscript is theoretically motivated, addresses a fundamental question about how we learn, and is generally well executed. The task is well controlled, the modelling choices seem appropriate, the imaging and its analyses are broad but well defended and choices in analysis strategies are well defined. The manuscript is well written. I did enjoy reading the manuscript.

      The results have some interest. The modelling and neuroimaging data suggest important dissociations between learning about pain and learning about its absence - the modelling suggests faster learning rates for learning from pain than its avoidance. The imaging suggests that these two forms of learning are associated with different networks, with a known network linked to learning about pain but a novel network linked to learning about avoided pain.

      These are worthwhile knowledge gains. The idea that different rate parameters govern learning about events that are present versus those that are absent is an old one. It is built into most error-correcting learning rules since Rescorla-Wagner and it makes sense. However, it was useful to see it supported here. The finding that different networks of brain regions were associated with the learning from pain versus avoided of pain was also interesting. The networks linked to the former made sense based on the literature. The networks linked to the latter were more novel and notably did not include classic 'relief' brain regions.

      However, there were also important weaknesses here, at least on my readings.

      I struggled as a reader to understand how the modelling actually related to the behavior and imaging. That is, there is a real disconnect in the manuscript for me between what is observed (behavior) what is inferred (modelling as well as it basis for correlations with fMRI data).

      There were no differences in behavior reported between the two kinds of trials (learning from received pain versus avoided pain) effects, no effects of the drugs on behavioral performance, and no differential effect on learning from received pain versus avoided pain. I have no problems with reporting null effects, but here the reader is left wondering: if there are no behavioral differences reported, then why does the modelling predict that there should be? How accurate is the model given that it clearly predicts slower learning from avoided than received pain in the controls and faster learning from avoided pain under naltrexone and levodopa compared to control? In other words, what is it about the modelling that yields differences in learning rates between the two behavioral conditions and between the vehicle, levodopa, and naltrexone conditions when the behavioral data shown do not? Of course, it could be that the task was too easy - the modelling may be prescient and perhaps possible learning rate differences would be picked up under more difficult (more cues) and weaker probabilistic conditions. Perhaps there are behavioral data (reaction times?) not reported that do actually show differences in learning rate between learning from received pain versus avoided pain or show differences between the drug conditions?

      I may have misunderstood all of this and am happy to be corrected. If not, think this issue needs to be addressed and would need new data that is hopefully already in hand to do convincingly (such as choice reaction times) to show some difference in behavior between learning from received pain versus avoided pain and/or some effects of the pharmacological manipulations on these.

      In the absence of the data the manuscript seems to have three parts:

      1. A more compelling set of findings reporting imaging differences between learning from received pain versus avoided pain that are interesting because they suggest a novel network of brain regions for the latter compared to the literature.<br> 2. A set of null findings that neither pharmacological manipulation affected behavior or these imaging findings.<br> 3. A less compelling set of findings that link the above to possible underlying differences in learning rate parameters.

      The first could be of interest but the latter two need to be strengthened, in my opinion.

      I had other minor points (e.g., consider the literature on opioid and dopamine receptor manipulations in the ventral striatum on aversive prediction errors because this suggests the opposite to the literature cited for the midbrain; is the word 'appetitive' in the title really appropriate given the findings in the manuscript), but these are less important than the above.

    1. Author Response

      Reviewer #1 (Public Review):

      [...] Recently, pupil dilation was linked to cholinergic and noradrenergic neuromodulation as well as cortical state dynamics in animal research. This work adds substantially to this growing research field by revealing the temporal and spatial dynamics of pupil-linked changes in cortical state in a large sample of human participants.

      The analyses are thorough and well conducted, but some questions remain, especially concerning unbiased ways to account for the temporal lag between neural and pupil changes. Moreover, it should be stressed that the provided evidence is of indirect nature (i.e., resting state pupil dilation as proxy of neuromodulation, with multiple neuromodulatory systems influencing the measure), and the behavioral relevance of the findings cannot be shown in the current study.

      Thank you for your positive feedback and constructive suggestions. We are especially grateful for the numerous pointers to other work relevant to our study.

      1. Concerning the temporal lag: The authors' uniformly shift pupil data (but not pupil derivative) in time for their source-space analyses (see above). However, the evidence for the chosen temporal lags (930 ms and 0 ms) is not that firm. For instance, in the cited study by Reimer and colleagues [1] , cholinergic activation shows a temporal lag of ~ 0.5 s with regard to pupil dilation - and the authors would like to relate pupil time series primarily to acetylcholine. Moreover, Joshi and colleagues [2] demonstrated that locus coeruleus spikes precede changes in the first derivative of pupil dilation by about 300 ms (and not 0 ms). Finally, in a recent study recording intracranial EEG activity in humans [3], pupil dilation lagged behind neural events with a delay between ~0.5-1.7s. Together, this questions the chosen temporal lags.

      More importantly, Figures 3 and S3 demonstrate variable lags for different frequency bands (also evident for the pupil derivative), which are disregarded in the current source-space analyses. This biases the subsequent analyses. For instance, Figure S3 B shows the strongest correlation effect (Z~5), a negative association between pupil and the alpha-beta band. However, this effect is not evident in the corresponding source analyses (Figure S5), presumably due to the chosen zero-time-lag (the negative association peaked at ~900 ms)).

      As the conducted cross-correlations provided direct evidence for the lags for each frequency band, using these for subsequent analyses seems less biased.

      This is an important point and we gladly take the opportunity to clarify this in detail. In essence, choosing one particular lag over others was a decision we took to address the multi-dimensional issue of presenting our results (spectral, spatial and time dimensions) and fix one parameter for the spatial description (see e.g. Figure 4). It is worth pointing out first that our analyses were all based on spectral decompositions that necessarily have limited temporal resolutions. Therefore, any given lag represents the center of a band that we can reasonably attribute to a time range. In fact, Figure 3C shows how spread out the effects are. It also shows that the peaks (troughs) of low and high frequency ranges align with our chosen lag quite well, while effects in the mid-frequency range are not “optimally” captured.

      As picking lags based on maximum effects may be seen as double dipping, we note that we chose 0.93 sec a priori based on the existing literature, and most prominently based on the canonical impulse response of the pupil to arousing stimuli that is known to peak at that latency on average (Hoeks & Levelt, 1993; Wierda et al. 2012; also see Burlingham et al.; 2021). This lag further agrees with the results of reference [3] cited by the reviewer as it falls within that time range, and with Reimer et al.’s finding (cited as [1] above), as well as Breton-Provencher et al. (2019) who report a lag of ~900 ms sec (see their Supplementary Figure S8) between noradrenergic LC activation and pupil dilation. Finally, note that it was not our aim to relate pupil dilations to either ACh or NE in particular as we cannot make this distinction based on our data alone. Instead, we point out and discuss the similarities of our findings with time lags that have been reported for either neurotransmitter before.

      With respect to using different lags, changing the lag to 0 or 500 msec is unlikely to alter the reported effects qualitatively for low- and high frequency ranges (see Figure 3C), as both the pupil time series as well as fluctuations in power are dominated by very slow fluctuations (<< 1 Hz). As a consequence, shifting the signal by 500 msec has very little impact. For comparison, below we provide the reviewer with the results presented in Figure 4 but computed based on zero (Figure R1) and 500-msec (Figure R2) lags. While there are small quantitative differences, qualitatively the results remain mostly identical irrespective of the chosen lag.

      Figure R1. Figure equivalent to main Figure 4, but without shifting the pupil.

      In sum, choosing one common lag a priori (as we did here) does not necessarily impose more of a bias on the presentation of the results than choosing them post-hoc based on the peaks in the cross-correlograms. However, we have taken this point as a motivation to revise the Results and Methods sections where applicable to strengthen the rationale behind our choice. Most importantly, we changed the first paragraph that mentions and justifies the shift as follows, because original wording may have given the false impression that the cross-correlation results influenced lag choice:

      “Based on previous reports (Hoeks & Levelt, 1993; Joshi et al., 2016; Reimer et al., 2016), we shifted the pupil signal 930 ms forward (with respect to the MEG signal). We introduced this shift to compensate for the lag that had previously been observed between external manipulations of arousal (Hoeks & Levelt, 1993) as well as spontaneous noradrenergic activity (Reimer et al., 2016) and changes in pupil diameter. In our data, this shift also aligned with the lags for low- and high-frequency extrema in the cross-correlation analysis (Figure 3B).”

      Figure R2. Figure equivalent to main Figure 4, but with shifting the pupil with respect to the MEG by 500 ms.

      Related to this aspect: For some parts of the analyses, the pupil time series was shifted with regard to the MEG data (e.g., Figure 4). However, for subsequent analyses pupil and MEG data were analyzed in concurrent 2 s time windows (e.g., Figure 5 and 6), without a preceding shift in time. This complicates comparisons of the results across analyses and the reasoning behind this should be discussed.

      The signal has been shifted for all analyses that relate to pupil diameter (but not pupil derivative). We have added versions of the following statement in the respective Results and Methods section to clarify (example from Results section ‘Nonlinear relations between pupil-linked arousal and band-limited cortical activity’):

      “In keeping with previous analyses, we shifted the pupil time series forward by 930 msec, while applying no shift to the pupil derivative.”

      1. The authors refer to simultaneous fMRI-pupil studies in their background section. However, throughout the manuscript, they do not mention recent work linking (task-related) changes in pupil dilation and neural oscillations (e.g., [4-6]) which does seem relevant here, too. This seems especially warranted, as these findings in part appear to disagree with the here-reported observations. For instance, these studies consistently show negative pupil-alpha associations (while the authors mostly show positive associations). Moreover, one of these studies tested for links between pupil dilation and aperiodic EEG activity but did not find a reliable association (again conflicting with the here-reported data). Discussing potential differences between studies could strengthen the manuscript.

      We have added a discussion of the suggested works to our Discussion section. We point out however that a recent study (Podvalny et al., https://doi.org/10.7554/eLife.68265) corroborates our finding while measuring resting-state pupil and MEG simultaneously in a situation very similar to ours. Also, we note that Whitmarsh et al. (2021) (reference [6]) is actually in line with our findings as we find a similar negative relationship between alpha-range activity in somatomotor cortices and pupil size.

      Please also take into account that results from studies of task- or event-related changes in pupil diameter (phasic responses) cannot be straightforwardly compared with the findings reported here (focusing on fluctuations in tonic pupil size) , due to the inverse relationship between tonic (or baseline) and phasic pupil response (e.g. Knapen et al., 2016). This means that on trials with larger baseline pupil diameter, phasic pupil dilation will be smaller and vice versa. Hence, a negative relation between the evoked change in pupil diameter and alpha-band power can very well be consistent with the positive correlation between tonic pupil diameter and alpha-band activity that we report here for visual cortex.

      In section ‘Arousal modulates cortical activity across space, time and frequencies’ we have added:

      “Seemingly contradicting the present findings, previous work on task-related EEG and MEG dynamics reported a negative relationship between pupil-linked arousal and alpha-range activity in occipito-parietal sensors during visual processing (Meindertsma et al, 2017) and fear conditioning (Dahl et al. 2020).Note however that results from task-related experiments, that focus on evoked changes in pupil diameter rather than fluctuations in tonic pupil size, cannot be directly compared with our findings. Similar to noradrenergic neurons in locus coeruleus (Aston-Jones & Cohen, 2005), phasic pupil responses exhibit an inverse relationship with tonic pupil size (Knapen et al., 2016). This means that on trials with larger baseline pupil diameter (e.g. during a pre-stimulus period), the evoked (phasic) pupil response will be smaller and vice versa. As a consequence, a negative correlation between alpha-band activity in the visual cortex and task-related phasic pupil responses does not preclude a positive correlation with tonic pupil size during baseline or rest as reported here. In line with this, Whitmarsh et al., 2021 found a negative relationship between alpha-activity and pupil size in the somatosensory cortex that agrees with our finding. Although using an event-related design to study attention to tactile stimuli, this relationship occurred in the baseline, i.e. before observing any task-related phasic effects on pupil-linked arousal or cortical activity.”

      In section ‘Arousal modulation of cortical excitation-inhibition ratio’ we have added: “The absence of this effect in visual cortices may explain why Kosciessa et al. (2021) found no relationship between pupil-linked arousal and spectral slope when investigating phasic pupil dilation in response to a stimulus during visual task performance. However, this behavioral context, associated with different arousal levels, likely also changes E/I in the visual cortex when compared with the resting state (Pfeffer et al., 2018).”

      Finally, in the Conclusion we added (note: ‘they’ = the present results): “Further, they largely agree with similar findings of a recent independent report (Podvalny et al., 2021).”

      Related to this aspect: The authors frequently relate their findings to recent work in rodents. For this it would be good to consider species differences when comparing frequency bands across rodents and primates (cf. [7,8]).

      Throughout our Results section we have mainly remained agnostic with respect to labeling frequency ranges when drawing between-species comparisons, and have only reverted to it as a justification for a dimension reduction for some of the presented analysis. Following your comment however, we have phrased the following section in the Discussion, section ‘Arousal modulates cortical activity across space, time and frequencies’, more carefully:

      “The low-frequency regime referred to in rodent work (2—10Hz; e.g., McGinley et al., 2015) includes activity that shares characteristics with human alpha rhythms (3—6Hz; Nestogel and McCormick, 2021; Senzai et al. 2019). The human equivalent however clearly separates from activity in lower frequency bands and,here, showed idiosyncratic relationships with pupil-linked arousal.”

      1. Figure 1 highlights direct neuromodulatory effects in the cortex. However, seminal [9-11] and more recent work [12,13] demonstrates that noradrenaline and acetylcholine also act in the thalamus which seems relevant concerning the interpretation of low frequency effects observed here. Moreover, neural oscillations also influence neuromodulatory activity, thus the one-headed arrows do not seem warranted (panel C) [3,14].

      This is a very good point. First, we would like to note that we have extended on acknowledging thalamic contributions to low-frequency (specifically alpha) effects in response to the Reviewer’s point 11 (‘Recommendations for authors’ section below). Also, we have added a reference to the role of potential top-down (reverse) influences to our Discussion, section ‘Arousal modulates cortical activity across space, time and frequencies’, as follows:

      “Further, we note that our analyses and interpretations focus on arousal-related neuromodulatory influences on cortical activity, whereas recent work also supports a reverse “top-down” route, at least for frontal cortex high-frequency activity on LC spiking activity (Totah et al., 2021).”

      Ultimately, however, we decided to leave the arrows in Figure 1C uni-directional to keep in line with the rationale of our research that stems mostly from rodent work, which also emphasises the indicated directionality. Also, reference [3] is highly interesting for us because it actually aligns with our data: The authors show that a spontaneous peak of high-frequency band activity (>70 Hz) in insular cortex precedes a pupil dilation peak (or plateau) in two of three participants by ~500msec (which mimics a pattern found for task-evoked activity; see their Figure 5b/c). We find a maximum in our cross-correlation between pupil size and high frequency band activity (>64 Hz) that indicates a similar lag (see our Figure 3B). Importantly, both results do not rule out a common source of neuromodulation for the effects. We have added the following to the end of the section ‘An arousal-triggered cascade of activity in the resting human brain’:

      “In fact, Kucyi & Parvizi (2020) found spontaneous peaks of high-frequency band activity (>70 Hz) in the insular cortex of three resting surgically implanted patients that preceded pupil dilation by ~500msec - a time range that is consistent with the lag of our cross-correlation between pupil size and high frequency (>64Hz) activity (see Figure 3B). Importantly, they showed that this sequence mimicked a similar but more pronounced pattern during task performance. Given the purported role of the insula (Menon & Uddin, 2015), this finding lends support to the idea that spontaneous covariations of pupil size and cortical activity signal arousal events related to intermittent 'monitoring sweeps' for behaviourally relevant information.”

      1. In their discussion, the authors propose a pupil-linked temporal cascade of cognitive processes and accompanying power changes. This argument could be strengthened by showing that earlier events in the cascade can predict subsequent ones (e.g., are the earlier low and high frequency effects predictive of the subsequent alpha-beta synchronization?)-

      We added this cascade angle as one possible interpretation of the observed effects. We fully agree that this is an interesting question but would argue that this would ideally be tested in follow-up research specifically designed for that purpose. The suggested analysis would add a post-hoc aspect to our exploratory investigation in the absence of a suitable contrast, while also potentially side-tracking the main aim of the study. We have revised the language in this section and added the following changes (bold) to the last paragraph to emphasise the speculatory aspect, and clarify what we think needs to be done to look into this further and with more explanatory power.

      “The three scenarios described here are not mutually exclusive and may explain one and the same phenomenon from different perspectives. Further, it remains possible that the sequence we observe comprises independent effects with specific timings. A pivotal manipulation to test these assumptions will be to contrast the observed sequence with other potential coupling patterns between pupil-linked arousal and cortical activity during different behavioural states.”

    1. Author Response

      Reviewer #1 (Public Review):

      As far as I can tell, the input to the model are raw diffusion data plus a couple of maps extracted from T2 and MT data. While this is ok for the kind of models used here, it means that the networks trained will not generalise to other diffusion protocols (e.g with different bvecs). This greatly reduces to usefulness of this model and hinders transfer to e.g. human data. Why not use summary measures from the data as an input. There are a number of rotationally invariant summary measures that one can extract. I suspect that the first layers of the network may be performing operations such as averaging that are akin to calculating summary measures, so the authors should consider doing that prior to feeding the network.

      We agree with the reviewer that using summary measures will make the tool less dependent on particular imaging protocols and more translatable than using rawdata as inputs. We have experimented using a set of five summary measures (T2, magnetization transfer ratio (MTR), mean diffusivity, mean kurtosis, and fractional anisotropy) as inputs. The prediction based on these summary measures, although less accurate than predictions based on rawdata in terms of RMSE and SSIM (Figure 2A), still outperformed polynomial fitting up to 2nd order. The result, while promising, also highlights the need for finding a more comprehensive collection of summary measures that match the information available in the raw data. Further experiments with existing or new summary measures may lead to improved performance.

      The noise sensitivity analysis is misleading. The authors add noise to each channel and examine the output, they do this to find which input is important. They find that T2/MT are more important for the prediction of the AF data, But majority of the channels are diffusion data, where there is a lot of redundant information across channels. So it is not surprising that these channels are more robust to noise. In general, the authors make the point that they not only predict histology but can also interpret their model, but I am not sure what to make of either the t-SNE plots or the rose plots. I am not sure that these plots are helping with understanding the model and the contribution of the different modalities to the predictions.

      We agree that there is redundant information across channels, especially among diffusion MRI data. In the revised manuscript, we focused on using the information derived from noise-perturbation experiments to rank the inputs in order to accelerate image acquisition instead of interpreting the model. We removed the figure showing t-SNE plots with noisy inputs because it does not provide additional information.

      Is deep learning really required here? The authors are using a super deep network, mostly doing combinations of modalities. is the mapping really highly nonlinear? How does it compare with a linear or close to linear mapping (e.e. regression of output onto input and quadratic combinations of input)? How many neurons are actually doing any work and how many are silent (this can happen a lot with ReLU nonlinearities)? In general, not much is done to convince the reader that such a complex model is needed and whether a much simpler regression approach can do the job.

      The deep learning network used in the study is indeed quite deep, and there are two main reasons for choosing it over simpler approaches.

      The primary reason to pick the deep learning approach is to accommodate complex relationships between MRI and histology signals. In the revised Figure 2A-B, we have demonstrated that the network can produce better predictions of tissue auto-fluorescence (AF) signals than 1st and 2nd order polynomial fitting. For example, the predicted AF image based on 5 input MR parameters shared more visual resemblance with the reference AF image than images generated by 1st and 2nd order polynomial fittings, which were confirmed by RMSE and SSIM values. The training curves shown in Fig. R1 below demonstrate that, for learning the relationship between MRI and AF signals, at least 10 residual blocks (~ 24 layers) are needed. Later, when learning the relationship between MRI and Nissl signals, 30 residual blocks (~64 layers) were needed, as the relationship between MRI and Nissl signals appears less straightforward than the relationship between MRI and AF/MBP/NF signals, which have a strong myelin component. In the revised manuscript, we have clarified this point, and the provided toolbox allows users to select the number of residual blocks based on their applications.

      Fig. R1: Training curves of MRH-AF with number of residual blocks ranging from 1 to 30 showing decreasing RMSEs with increasing iterations. The curves in the red rectangular box on the right are enlarged to compare the RMSE values. The training curves of 10 and 30 residual blocks are comparable, both converged with lower RMSE values than the results with 1 and 5 residual blocks.

      In addition, the deep learning approach can better accommodate residual mismatches between co-registered histology and MRI than polynomial fitting. Even after careful co-registration, residual mismatches between histology and MRI data can still be found, which pose a challenge for polynomial fittings. We have tested the effect of mismatch by introducing voxel displacements to perfectly co-registered diffusion MRI datasets and demonstrated that the deep learning network used in this study can handle the mismatches (Figure 1 – figure supplement 1).

      Relatedly, the comparison between the MRH approach and some standard measures such as FA, MD, and MTR is unfair. Their network is trained to match the histology data, but the standard measures are not. How does the MRH approach compare to e.g. simply combining FA/MD/MTR to map to histology? This to me would be a more relevant comparison.

      This is a good idea. We have added maps generated by linear fitting of five MR measures (T2, MTR, FA, MD, and MK) to MBP for a proper comparison. Please see the revised Figure 3A-B. The MRH approach provided better prediction than linear fitting of the five MR measures, as shown by the ROC curves in Figure 3C.

      • Not clear if there are 64 layers or 64 residual blocks. Also, is the convolution only doing something across channels? i.e. do we get the same performance by simply averaging the 3x3 voxels?

      We have revised the paragraph on the network architecture to clarify this point in Figure 1 caption as well as the Methods section. We used 30 residual blocks, each consists of 2 layers. There are additional 4 layers at the input and output ends, so we had 64 layers in total.

      The convolution mostly works across channels, which is what we intended as we are interested in finding the local relationship between multiple MRI contrasts and histology. With inputs from modified 3x3 patches, in which all voxels were assigned the same values as the center voxel, the predictions of MRH-AF did not show apparent loss in sensitivity and specificity, and the voxel-wise correlation with reference AF data remained strong (See Fig. R2 below). We think this is an important piece of information and added it as Figure 1 – figure supplement 3. Averaging the 3x3 voxels in each patch produced similar results.

      Fig. R2: Evaluation of MRH-AF results generated using modified 3x3 patches with 9 voxels assigned the same MR signals as the center voxel as inputs. A: Visual inspection showed no apparent differences between results generated using original patches and those using modified patches. B: ROC analysis showed a slight decrease in AUC for the MRH-AF results generated using modified patches (dashed purple curve) compared to the original (solid black curve). C: Correlation between MRH-AF using modified patches as inputs and reference AF signals (purple open circles) was slightly lower than the original (black open circles).

      The result in the shiverer mouse is most impressive. Were the shiverer mice data included in the training? If not, this should be mentioned/highlighted as it is very cool.

      Data from shiverer mice and littermate controls were not included in the training. We have clarified this point in the manuscript.

    1. Author Response

      Reviewer #1 (Public Review):

      In this paper the authors use a conditional knockout strategy to assess the effects of deletion of the dominant oxygen-sensing hypoxia-inducible factor (HIF) hydroxylase enzyme, prolyl hydroxylase 2 (Phd2) restricted to the regulatory T cell (Treg) lineage. They use a well-established Foxp3-driven Cre recombinase allele. Phd2 is thus silenced in cells that have expressed or continue to express Foxp3 from the time this transcription factor, which is essential for Treg development and function, first occurs. They show that this approach leads to a change in Treg behaviour resulting in loss of some aspects of regulatory function and development of a Th1-like phenotype by the Foxp3 expressing cells. Effects are in general reversed when HIF-2 is silenced alongside Phd2, and may be amplified by simultaneous silencing of the HIF-1 isoform.

      The findings overlap with those reported following generalised silencing of Phd2 and following adoptive transfer of Treg in which Phd2-silencing is induced (Yamamoto et al., 2019) and are broadly compatible with those reported following a similarly Treg-restricted knockout of the von Hippel-Lindau gene (the recognition component of the E2-ubiquitin ligase that targets HIF-alpha chains that have been modified by Phd2) (Lee et al., 2015) but the results reported also differ significantly from these earlier reports in a number of intriguing respects which I feel warrant further discussion and ultimately investigation.

      The Introduction is in general informative and well written but it is a shame that it does not contain more discussion of the current state of knowledge of the interplay between HIF signalling and Treg function. This would provide a platform for a more detailed and scholarly discussion of the similarities and differences between this work and existing literature in the Discussion, where existing papers are currently described rather briefly. The introduction contains the statement 'Further complexity in this pathway has been provided by the identification of additional, non-HIF-related, PHD substrates, suggesting a role of proline hydroxylation in other settings requiring oxygen-dependent regulation', citing a single reference. This does not really represent the complex balance of arguments across the literature about non-HIF substrates for the HIF hydroxylase enzymes.

      The conclusions of this paper are mostly well supported by data, but some aspects need to be clarified and extended.

      We sincerely apologize for our apparent lack of recognition of previous work performed by other colleagues active in this field. We have now modified the Introduction section, to provide a better, yet concise, overview of the current knowledge of hypoxia signalling in regulatory T cell biology.

      A central issue for any conditional knock-out strategy is whether the intended tissue restriction is successfully achieved. The authors acknowledge that some issues have been reported with the Cre-recombinase allele they use. They, however, show the expected restriction to cells of the Treg lineage in two of the lymphoid tissues under investigation (spleen and mesenteric lymph node - Supplementary figure 1b) but do not show similar results for other tissues. Some concerns arise because in Figure 8b YFP (which is expressed alongside the Cre-recombinase) is visible in what appears to be the endothelium of the spleen. Additionally, the spleen sections illustrated show convincing splenomegaly in the Phd2-deficient Treg mice but expansion of the red pulp appears to be at least as prominent as any changes that might have occurred in the white pulp. Furthermore, the gross changes in abdominal appearances described as a 'hemorrhagic abdomen' (Figure 1c) include a more plethoric abdominal wall, prominent intestinal blood vessels and a much darker, and perhaps enlarged, liver compared with the control animal. These appearances might result from increased angiogenesis and / or erythropoiesis, neither of which would be expected to result from Treg lineage restricted Phd2 knockout but are known to occur with Phd2 ablation in other tissues. If there is convincing evidence of haemorrhage it would be nice to see this more obviously displayed macro- or, perhaps better still, microscopically.

      We thank the reviewer for this comment. We have now provided a better description of the haematological status of these mice, in which an elevated haematocrit and increased vascular permeability has been observed (now depicted in supplemental Figure 2). As suggested, we found indeed minimal, yet sizable expression of the Cre recombinase (as judged by YFP expression) in CD45-negative, non-lymphoid cells in all organs examined (as now depicted in supplemental Figure 9). Finally, none of the organs examined displayed an increased expression of erythropoietin (as judged by a sensitive qPCR assay, data not shown), a likely candidate for the haematological abnormalities observed in these mice. The mechanism underlying the apparent extramedullary erythropoiesis occurring in these mice remains therefore to be established. Noteworthy however, an additional experiment performed following a suggestion from one of the reviewers (see Figure 3 and our response 23), strongly suggests that PHD2 affects the Treg phenotype in a cell autonomous fashion. We do however acknowledge that the tissue abnormalities preclude any firm conclusion related to the positioning of Tregs within the spleen and have therefore deleted this section from the manuscript and adapted our conclusion consequently.

      Given that the Cre-recombinase allele used is expressed through the endogenous Foxp3 locus which is located on the X-chromosome and thus subject to random inactivation in the cells of females it is important that the sex of animals used in the experiments is specified.

      This has now been done in the Figure legends

      Experiments show alterations in Phd2-deficient Treg mice compared with control mice in homeostatic proliferation in a lymphopenic environment (Figure 3), the induction of colitis by DSS colitis (Figure 4) and the response to Toxoplasma gondii infection (Figure 4). Given the time courses these effects are likely to be real but interpretation is complicated by the spontaneous effects on the colon of Phd2-deficient Treg mice reported in Figure 1d and e. Given the wide general importance of interferon-gamma in immune / inflammatory responses I am not sure how much weight to place on the observation that concurrent interferon-gamma knockout results in loss of the Phd2-deficient Treg mice pro-inflammatory phenotype (Figure S3). No differences are seen in an in vivo model in which inflammation is induced by injection of anti-CD3 antibodies (Figure S2).

      Although the point is well taken, we felt it was important to perform a few experiments to illustrate the specificity of the inflammatory syndrome observed in these mice. We acknowledge the fact that the effect of concurrent loss of interferon-gamma on the phenotype of PHD2ΔTregs could have been anticipated. Additionnaly, we also think that the fact that these mice retain the same sensitivity to a “Th17-dominated” inflammatory response (also leading to a loss of weight) strengthens one of the messages of the manuscript, i.e. that loss of PHD2 expression affects Treg function in a selective, Th1-oriented fashion.

      An important conceptual difference between the interpretation of results reported here and those reported by Yamamoto et al. is that the 'Phd2-deficient Treg' purified here do not show a change in regulatory function in vitro whereas those used by Yamamoto et al. failed to act normally as regulatory cells. It is unclear whether this is due to differences in the way proliferation was stimulated, the cell purification strategies used (YFP+ in the current work; CD4+;CD25+ in Yamamoto et al.), the silencing of Phd2 (by knockout throughout development here versus through an inducible-shRNA only in mature cells in Yamamoto et al.), some other feature of the experiments (e.g. the use of feeder cells) or whether a difference would be revealed by more extensive titration. The result reported here is somewhat surprising given the presence of a Th1-like immunophenotype in the cells used in these in vitro suppression assays, which at face value might mean that this immunophenotype is not responsible for changes in their regulatory capacity seen in vivo. This may be true, but it is at odds with Bayesian argumentation. It may be a coincidence, but both models in which control Treg and Phd2-deficient Treg behave similarly involve treatment with anti-CD3 antibodies, raising the possibility that these antibodies in some way nullify differences reported with other stimuli, rather than this necessarily being related to the hypothesised difference between Th1 and Th17 responses in the in vivo model.

      We fully agree with the reviewer’s comment, and we were similarly worried that the differences reported in vivo vs in vitro were due to different agonists used. We however attempted to evaluate Treg function in vitro using alternative approaches, including an assay in which allogeneic antigen-presenting cells (including T-cell depleted spleen cells or highly purified dendritic cells) were used as agonists and Interferon-gamma secretion and proliferation as readouts. In another set of experiments, we used in vitro or in vivo derived Th1 cells instead of naïve T cells as responders. In all instances examined to date, PHD2-deficient Tregs displayed an adequate suppressive function in vitro (data not shown).

      Data showing reversal of the Phd2-deficient Treg in vivo phenotype by knockout of HIF-2alpha, but not HIF-1alpha are convincing and support the data of Yamamoto et al. The observation that Treg-specific PHD2-HIF1α double knockout mice were born at sub-mendelian ratios, displayed a marked weight loss during adult life and reduced viability, indicative of a more pronounced pro-inflammatory status is reported but data is not shown. This is certainly of interest and will no doubt receive further attention. The data that Treg-selective HIF1α or HIF2α deficiency does not affect immune homeostasis in naive mice shown in Figure S4 is relevant and compelling. These results are discussed in the context of recent work published by Hsu et al., 2020 which is interesting. Taken together these data highlight the fact that results reported throughout this manuscript arise from a combination of developmental differences with those occurring in the adult animal.

      We thank the reviewer for these positive comments

      The transcriptomic data presented has not, to date, been made available to reviewers or the public. Importantly, it is reported to show a disconnection between changes in glycolytic gene expression pattern and the immune phenotype. Specifically, whilst loss of Phd2 expression in Treg is associated with alterations in their regulatory function and with induction of glycolytic genes, the change in function, but not the change in glycolytic gene expression, is reversed by simultaneous knockout of HIF-2alpha and conversely the gene expression pattern, but not the change in function, is reversed by simultaneous knockout of HIF-1alpha. This will be of great interest to those working on the hypothesis that the switch between oxidative phosphorylation and glycolysis underlies functional changes in T cells, particularly if the changes in glycolytic gene expression actually convert into changes in glycolytic flux (as observed following HIF-induction in other cell types).

      The transcriptomic data are available to the public on GEO with the code: GSE184581

      The authors propose that a change in CXCR3 expression resulting from a change in STAT1 phosphorylation (but not absolute levels of STAT1) consequent on Phd2- inactivation leads to mal-distribution of Treg (at least in the spleen), and that given the broadly paracrine action of Treg this feature alone might explain the loss of regulatory activity in vivo. This is an intriguing hypothesis based at least in part on associative data rather than a formal proof of causality. Changes in STAT1 phosphorylation following interferon-gamma stimulation are far from 'all-or-nothing' (at the timepoint illustrated many cells have normal pSTAT1 levels even though the mean fluorescence intensity is reduced). Results in Figure 7b show that changes in STAT1 phosphorylation are seen in conventional Foxp3 negative T cells; since Phd2 knockout is restricted to the Treg lineage this change is presumably indirect, raising the possibility that the change seen in Treg is also indirect, rather than truly cell autonomous. Changes in pSTAT1 are acknowledged to affect a huge number of genes / processes so picking any one as the total explanation for any change in behaviour may be an over simplification. The analysis of changes in Treg localisation in the spleen is potentially interesting and may reach the correct conclusion but the methodology used is not clearly explained and in particular it is not clear how splenomegaly / changes in gross splenic architecture have been taken into account.

      We fully agree with the reviewer comments and have now deleted the final figure of our manuscript dealing with Treg positioning in the spleen. We indeed agree that due to the morphological changes in spleen size and architecture, more detailed work would be required to confirm our initial hypothesis. Unexpectedly, and thanks to a remark from another reviewer, we found that PHD2-deficient Tregs (which are present at high frequencies in the spleen of PHD2ΔTregs mice) are largely outcompeted both in heterozygous PHD-2fl/fl Cre+/- mice (see Figure 3) and upon equal transfer into WT mice of a 1:1 mix of wt and PHD-2-deficient Tregs, greatly complicating the study of the relative positioning of these cells within lymphoid organs. We do however stand by our previous conclusion suggesting that STAT1-signaling appears as affected in PHD2-deficient Tregs. This conclusion is not only supported by the reduced accumulation of pSTAT1 in these cells, as shown in Figure 8, but also by the bioinformatic analysis of transcriptomic data and the confirmation, at the protein level, of the reduced expression CXCR3 a well characterized STAT1-dependent chemokine receptors (as shown in Figure 8).

      Overall, this work contains many interesting datasets which need to be taken into account as we build our understanding of the intersection between HIF-signalling and regulatory T cell function, particularly as pharmacological manipulation of HIF signalling may provide a route to immunomodulation through alterations in regulatory T cell function.

      We thank again the reviewer for this positive appreciation of our work.

    1. Author Response

      Reviewer #1 (Public Review):

      The key question addressed of this MEG study is whether speech is represented singly or multiplexed in the human brain in the linguistic hierarchy. The authors used state-of-the-art analyses (multivariate Temporal Response Functions) and probablilistic information-theoretic measures (entropy, surprisal) to test distinct contextual speech processing models at three hierarchical levels. The authors report evidence for the coexistence of local and global predictive speech processing in the linguistic hierarchy.

      The work uses time resolved neuroimaging with state-of-the-art analyses and cognitive (here, linguistic) modeling. The study is very well conducted and draws from very different fields of knowledge in convincing ways. I see one limitation of the current study in that the authors focused on phase-locked responses, and I hope future work could extend to induced activity.

      Overall, the flow in the MS could be streamlined. Some smoothing in the introduction would be helpful to extract the main key messages you wish to convey.

      For instance, in the abstract:

      -Can you explain the two views in a simpler way in the abstract and to a non-linguistic audience? Do you mean to say that classic psycholinguistic models tend to follow a strict hierarchically integration (analysis only) but an alternative model is hierarchically inferential (analysis by synthesis)?

      -Indicate early on in abstract or intro where the audience is being led with a concise message on how you address the main question. For instance:

      To contrast our working hypotheses A and B, we used a novel information-theoretic modeling approach and associated measures (entropy, surprisal), which make clear predictions on the latency of brain activity in responses to speech at three hierarchal contextual levels (sublexical, word and sentence).

      We have revised the Abstract and Introduction to reduce the amount of terminology and add additional explanations. Wherever possible, we now use general terms (“bottom up”, “predictions”, “context”, …) instead of terms associated with specific theories. We hope we found a balance between improving accessibility and retaining the qualities seen by Reviewer 2, who thought the Introduction was clearly written and well connected to the psycholinguistics literature.

      All the models we compare are compatible with an analysis by synthesis approach, as long as the generative models are understood to entail making probabilistic predictions about future input. The generative models in analysis by synthesis, then, are one way in which “to organize internal representations in such a way as to minimize the processing cost of future language input“ (Introduction, first paragraph). We have clarified this in the first paragraph of the Introduction.

      • Why did the authors consider that the evoked response is the proper signal to assess as opposed to oscillatory (or non phase-locked) activity?

      The primary reason for our choice of dependent measure is the prior research we based our design on, showing that the linguistic entropy and surprisal effects are measurable in phase-locked responses (Brodbeck et al., 2018; Donhauser and Baillet, 2020). We have made this more explicit in part of the Introduction where we introduce our approach (“To achieve this, we analyzed …”).

      As to oscillatory dependent measures, we consider them an interesting but parallel research question. We are not aware of specific corresponding effects in non-phase locked activity. Accordingly, analyzing oscillatory responses without a clear prior hypothesis would require additional decisions, such as which bands to analyze, which would entail issues of multiple comparison. An additional caveat is that the temporal resolution of oscillatory activity is often lower than that of phase-locked activity, which might potentially make it harder to distinguish responses based on their latency as we did here, to test whether the latency of different context models differ.

      • Parallel processing with different levels of context (hence temporal granularities) sounds compatible with temporal multiplexing of speech representation proposed by Giraud & Poeppel (2012) or do the authors consider it a separate issue?

      We consider our investigation orthogonal to the model discussed by G&P (2012). G&P’s model is about the organization of acoustic information at different time-scales, and does not discuss the influence of linguistic constructs at the word level and above. On the other hand, the information-theoretic models that form the basis of our analysis track the linguistic information that can be extracted from the acoustic signal. The temporal scales invoked by G&P’s model are also different from the ones used here, defined based on acoustic vs. linguistic units. Thus, the kind of neural entrainment as a mechanism for speech processing hypothesized by G&P is fully compatible with our account, but not at all required by it.

      Methods:

      • Figure 2: please spell out TRFs and clarify the measured response

      We have done both in the Figure legend.

      • The sample size (N=12) is very low in today standards but the statistical granularity is that of the full MEG recording. Can a power estimate be provided or clear justification of reliability of statistical measures be described.

      We appreciate and share the reviewers’ concern with statistical power and have made several modifications to better explain and rationalize our choices.

      First, to contextualize our study: The sample size is similar to the most comparable published study, which had 11 participants (Donhauser and Baillet, 2020). Our own previous study (Brodbeck et al., 2018) had more participants (28) but only a fraction of the data per subject (8 minutes of speech in quiet, vs. 47 minutes in the present dataset). We added this consideration to the Methods/Participants section.

      We also added a table with effect-sizes for all the main predictors to make that information more accessible (Table 1). This suggests that the most relevant effects have Cohen’s d > 1. With our sample size 12, we had 94% power to detect an effect with d = 1, and 99% power to detect an effect with d = 1.2. This post-hoc analysis suggests that our sample was adequately powered for the intended purpose.

      Finally, all crucial model comparisons are accompanied by swarm-plots that show each subject as a separate dot, thus showing that these comparisons are highly reproducible across participants (note that there rarely are participants with model difference below 0, indicating that the effects are all seen in most subjects).

      • The inclusion of a left-handed in speech studies in unusual, please comment on any difference (or lack thereof) for this participant and notably the lateralization tests.

      We agree that this warrants further comment, in particular given our lateralization findings. We have made several changes to address this concern. At the same time we hope that the reviewers agree with us that, with proper care, inclusion of a left-handed participants is desirable (Willems et al., 2014), and indeed is becoming more mainstream, at least for studies of naturalistic language processing (e.g. Shain et al., 2020). First, we now draw attention to the presence of a left-hander where we introduce our sample (first paragraph of the Results section). Second, we repeated all tests of lateralization while excluding the left-hander. Because this did not change any of the conclusions, we decided to keep reporting results for the whole sample. However, third, we now mark the left-handed participant in all plots that include single-subject estimates and corresponding source data files. Overall, the left-hander indeed shows stronger right-lateralization than the average participant, but is by no means an outlier.

      • The authors state that eyes were kept open or close. This is again unusual as we know that eye closure affects not only the degree of concentration/fatigue but directly impact alpha activity (which in turn affects evoked responses (1-40 Hz then 20 Hz) that are being estimated here). Please explain.

      Previous comparable studies variably asked subjects to keep their eyes closed (e.g. Brodbeck et al., 2018) or open (e.g. Donhauser and Baillet, 2020). Both modes have advantages and disadvantages, none of which are prohibitive for our target analysis (ocular artifacts were removed with ICA and oscillatory alpha activity should, on average, be orthogonal to time-locked responses to the variables of interest). Importantly however, both modes have subjective disadvantages when enforced: deliberately keeping eyes open can lead to eye strain and excessive blinking, whereas closing eyes can exacerbate sleepiness. For this reason we wanted to allow subjects to self-regulate to optimize the performance on the aspects of the task that mattered – processing meaning in the audiobook. We extended the corresponding Methods section to explain this.

      • It would be helpful to clarify the final temporal granularity of analysis. The TRFs time courses are said to be resampled to 1kHz (p22) but MEG time courses are said to be resampled at 100 Hz (p18).

      Thanks for noting this. We clarified in the TRF time-course section: the deconvolution analysis was performed at 100 Hz, and TRFs were then resampled to 1 kHz for visualization and fine-grained peak analysis.

      • The % of variance explained by acoustic attributes is 15 to 20 folds larger than the that explained by the linguistic models of interest. Can a SNR measure be evaluated on such observations?

      We appreciate this concern, which is indeed reasonable. In order to better clarify this issue we have added a new paragraph, right after Table 1. In brief, since the statistical analysis looks for generality across subjects, the raw % explained values do not directly speak to the SNR or effect size. Rather, the SNR concerns how much variability is in this value across subjects. The individual subject values in Figure 3-B, and effect sizes now reported in Table 1, show that even though the % variability that is uniquely attributable to information-theoretic quantities is small, it is consistently larger than 0 across subjects.

      Results and Figures:

      • The current figures do not give enough credit to the depth of analysis being presented. I understand that this typical for such mTRFs approach but given the level of abstraction being evaluated in the linguistic inputs, it may be helpful to show an exemple of what to expect for low vs. high surprisal for instance from the modeling perspective and over time. For instance, could Figure 1 already illustrate disctinct predictions of the the local vs. global models?

      Thank you for pointing out this gap. We have added two figures to make the results more approachable:

      First, in Figure 3 we now show an example stimulus excerpt with all predictors we used. This makes the complete set of predictors quickly apparent without readers having to collect the information from the different places in the manuscript. It also gives a better sense of the detail that is modeled in the different stimulus representations. Second, we added Figure 6 to show example predictions from the different context models, and explain better how the mTRF approach can decompose brain responses into components related to different stimulus properties.

      • Why are visual cortices highlighted in figures?

      Those were darkened to indicate that they are excluded from the analysis. We have added a corresponding explanation to the legend of Figure 3.

      • Figure 2 Fig 2A and B: can the authors quantitatively illustrate "5-gram generally leads to a reduction of word surprisal but its magnitude varies substantially between words" by simply showing the mean surprisal and its variance?

      Added to the Figure legend.

      Fig 2C: please explain the term "partial response"; please indicate for non M/EEGers what the arrow symbolizes.

      Added to the Figure legend.

      • Figure 3:

      p8: the authors state controlling for the "acoustic features" but do not clearly describe how in the methods and this control comes as a (positive) surprise but still a bit unexpected at first read. Perhaps include the two acoustic features in Fig2C and provide a short couple sentences on how these could impair or confound mTRF performance.

      We thank you for pointing out this lack of explanation. We have added a description of all the control predictors to the end of the Introduction, right after explaining the predictors of main interest. We have also added Figure 3 to give an example and make the nature of all the controls explicit.

      Have the same analysis been conducted on a control region a priori not implicated in linguistic processing? This would be helpful to comfort the current results.

      The analysis has been performed on the whole brain (excluding the insula and the occipital lobe). Figure 4 (previously Figure 3) shows that generally only regions in the temporal lobe exhibit significant contributions from the linguistic models (allowing for some dispersion associated with MEG source localization). Although this is not shown in the figure, regions further away from the significant region generally exhibit a decrease in prediction accuracy from adding linguistic predictors, as is commonly seen with cross-validation when models overfit to irrelevant predictors.

      Fig 3B-C-E: please clearly indicate what single dot or "individual value" represents. Is this average over the full ROI? Was the orientation fixed? Can some measure of variability be provided?

      Explanation of individual dots added to Figure 4-B legend (formerly 3-B). Fixed orientation added to the methods summary in the Figure 2-C legend. To provide more detailed statistics including a measure of variability we added Table 1.

      Fig3E: make bigger / more readable (too many colors: significance bars could be black)

      We have increased the size and made the significance bars black.

      • Figure 4: having to go to the next Fig (Fig5) to understand the time windows is inconvenient and difficult to follow. Please, find a work around or combine the two figures. From which ROI are the times series extracted from?

      We have combined the two figures to facilitate comparison, and have added a brief explanation of the ROI to the figure legend.

      Reviewer #3 (Public Review):

      This manuscript presents a neurophysiological investigation of the hierarchical nature of prediction in natural speech comprehension. The authors record MEG data to speech from an audiobook. And they model that MEG using a number of different speech representations in order to explore how context affects the encoding of that speech. In particular, they are interested in testing how the response to phoneme is affected by context at three different levels: sublexical how the probability of an upcoming phoneme is constrained by previous phonemes; word - how the probability of an upcoming phoneme is affected by its being part of an individual word; sentence - how the probability of an upcoming phoneme is affected by the longer-range context of the speech content. Moreover, the authors are interested in exploring how effects at these different levels might contribute - independently - to explaining the MEG data. In doing so, they argue for parallel contributions to predictive processing from both long-range context and more local context. The authors discuss how this has important implications for how we understand the computational principles underlying natural speech perception, and how it can potentially explain a number of interesting phenomena from the literature.

      Overall, I thought this was a very well written and very interesting manuscript. I thought the authors did a really superb job, in general, of describing their questions against the previous literature, and of discussing their results in the context of that literature. I also thought, in general, that the methods and results were well explained. I have a few comments and queries for the authors too, however, most of which are relatively minor.

      Main comments: 1) One concerns I had was about the fact that context effects are estimated using 5-gram, models. I appreciate the computational cost involved in modeling more context. But, at the same time, I worry a little that examining the previous 4 phonemes or (especially) words is simply not enough to capture longer-term dependencies that surely exist. The reason I am concerned about this is that the sentence level context you are incorporating here is surely suboptimal. As such, could it be the case that the more local models are performing as well as they are simply because the sentence level context has not been modeled as well as it should be? I appreciate the temporal and spatial patterns appear to differ for the sentence level relative to the other two, so that is good support for the idea that they are genuinely capturing different things. However, I think some discussion of the potential shortcomings of only including 4 tokens of context is worth adding. Particularly when you make strong claims like that on lines 252.

      We strongly agree with the reviewer that the 5-gram model is not the ultimate model of human context representations. We have added a section to acknowledge this (Limitations of the sentence context model).

      While we see much potential for future work to investigate context processing by using more advanced language models, a preliminary investigation suggests that it might not be trivial. We compared the ability of a pre-trained LSTM (Gulordava et al., 2018) to predict the brain response to words in our dataset with that of the 5-gram model. The LSTM performed substantially worse than the 5-gram model. An important difference between the two models is that our 5-gram model was trained on the Corpus of Contemporary American English (COCA), whereas the LSTM was trained on Wikipedia. COCA provides a large and highly realistic sample of English, whereas the language in Wikipedia might be a more idiosyncratic subsample. Thus, the LSTM might be worse just because it has been trained on a less representative sample of English. As an initial step we thus ought to train the LSTM on the superior COCA database, but this simple step alone would already be associated with a substantial computational cost, given the size of COCA at more than a billion words (we estimated 3 weeks on 32 GPUs in a computing cluster). Furthermore, while we acknowledge the limitations of the 5-gram model, we consider it very unlikely that its limitations are the reason that the more local models are performing well. In general, as more context is considered, the model’s predictions should become more different from the local model, i.e., a more sophisticated model should be less correlated with the local models, and should thus allow the local models to perform even better.

      2) I found myself confused about what exactly was being modeled on my first reading of pages 4 through 7. I realized then that all of the models are based on estimating a probability distribution based on phonemes (stated on line 167). I think why I found it so confusing was that the previous section talked about using word forms and phonemes as units of representation (lines 118-119; Fig 2A), and I failed to grasp that, in fact, you were not going to be modeling surprisal or entropy at the word level, but always at the phoneme level (just with different context). Anyway, I just thought I would flag that as other readers might also find themselves thinking in one direction as they read pages 4 and 5, only to find themselves confused further down.

      Thank you for pointing out this ambiguity; we now make it explicit that “all our predictors reflect information-theoretic quantities at the rate of phonemes” early on in the Expressing the use of context through information theory section.

      3) I also thought some the formal explanations of surprisal and entropy on lines 610-617 would be valuable if added to the first paragraph on page 6, which, at the moment, is really quite abstract and not as digestible as it could be, particularly for entropy.

      We appreciate that this needs to be much clearer for readers with different backgrounds. As suggested, we have added the formal definition to the Introduction, and we now also point readers explicitly to the Methods subsection that explains these definitions in more detail.

      4) I like the analysis examining the possibility of tradeoffs between context models. I wonder might such tradeoffs exist as conversational environments vary - if the complexity of the speech varies and/or listening conditions vary might there be more reliance on local vs global context then. If that seems plausible, then it might be worth adding a caveat that you found no evidence for any tradeoff, but that your experiment was pretty homogenous in terms of speech content.

      Thank you for this suggestion. We added this idea to the Discussion in the Implications for speech processing section.

    1. Author Response

      Reviewer #2 (Public Review):

      The manuscript by Carrasquilla and colleagues applied Mendelian Randomization (MR) techniques to study causal relationship of physical activity and obesity. Their results support the causal effects of physical activity on obesity, and bi-directional causal effects of sedentary time and obesity. One strength of this work is the use of CAUSE, a recently developed MR method that is robust to common violations of MR assumptions. The conclusion reached could potentially have a large impact on an important public health problem.

      Major comments:

      (1) While the effect of physical activity on obesity is in line with earlier studies, the finding that BMI has a causal effect on sedendary time is somewhat unexpected. In particular, the authors found this effect only with CAUSE, but the evidence from other MR methods do not reach statistical significance cutoff. The strength of CAUSE is more about the control of false positive, instead of high power. In general, the power of CAUSE is lower than the simple IVW method. This is also the case in this setting, of high power of exposure (BMI) but lower power of outcome (sedentary time) - see Fig. 2B of the CAUSE paper.

      It does not necessarily mean that the results are wrong. It's possible for example, by better modeling pleiotropic effects, CAUSE better captures the causal effects and have higher power. Nevertheless, it would be helpful to better understand why CAUSE gives high statistical significance while others not. Two suggestions here:

      (a) It is useful to visualize the MR analysis with scatter plot of the effect sizes of variants on the exposure (BMI) and outcome (sedentary time). In the plot, the variants can be colored by their contribution to the CAUSE statistics, see Fig. 4 of the CAUSE paper. This plot would help show, for example, whether there are outlier variants; or whether the results are largely driven by just a small number of variants.

      We agree and have now added a scatter plot of the expected log pointwise posterior density (ELPD) contributions of each variant to BMI and sedentary time, and the contributions of the variants to selecting either the causal model or the shared model (Figure 2-figure supplement 1 panel A). We identified one clear outlier variant (red circle) that we thus decided to remove before re-running the CAUSE analysis (panel B). We found that the causal effect of BMI on sedentary time remained of similar magnitude before and after the removal of this outlier variant (beta=0.13, P=6x10-4 and beta=0.13, P=3x10-5, respectively) (Supplementary File 1 and 2).

      We have added a paragraph in the Results section to describe these new findings:

      Lines 204-210: “We checked for outlier variants by producing a scatter plot of expected log pointwise posterior density (ELPD) contributions of the variants to BMI and sedentary time (Supplementary File 1), identifying one clear outlier variant (rs6567160 in MC4R gene) (Figure 2, Appendix 1—figure 2). However, the causal effect of BMI on sedentary time remained consistent even after removing this outlier variant from the CAUSE analysis (Supplementary File 1 and 2).”

      (b) CAUSE is susceptible to false positives when the value of q, a measure of the proportion of shared variants, is high. The authors stated that q is about 0.2, which is pretty small. However, it is unclear if this is q under the causal model or the sharing model. If q is small under the sharing model, the result would be quite convincing. This needs to be clarified.

      We thank the reviewer for a very relevant question. We have now clarified in the manuscript that all of the reported q values (~0.2) were under the causal model (lines 202-203). We applied the strict parameters for the priors in CAUSE in all of our analyses, which leads to high shared model q values (q=0.7-0.9). To examine whether our bidirectional causal findings for BMI and sedentary time may represent false positive results, we performed a further analysis to identify and exclude outlier variants, as described in our response to Question 7. I.e. we produced a scatter plot of expected log pointwise posterior density (ELPD) contributions of each variant to BMI and sedentary time, and the contributions of the variants to selecting either the causal model or the shared model (Supplementary Figure 2 panel A, shown above). We identified one clear outlier variant (red circle) that we thus removed (panel B), but the magnitude of the causal estimates was not affected by the exclusion of the variant (Supplementary File 1 and 2).

      (2) Given the concern above, it may be helpful to strengthen the results using additional strategy. Note that the biggest worry with BMI-sedentary time relation is that the two traits are both affected by an unobserved heritable factor. This hidden factor likely affects some behavior component, so most likely act through the brain. On the other hand, BMI may involve multiple tissue types, e.g. adipose. So the idea is: suppose we can partition BMI variants into different tissues, those acted via brain or via adipose, say; then we can test MR using only BMI variants in a certain tissue. If there is a causal effect of BMI on sedentary time, we expect to see similar results from MR with different tissues. If the two are affected by the hidden factor, then the MR analysis using BMI variants acted in adipose would not show significant results.

      While I think this strategy is feasible conceptually, I realize that it may be difficult to implement. BMI heritability were found to be primarily enriched in brain regulatory elements [PMID:29632380], so even if there are other tissue components, their contribution may be small. One paper does report that BMI is enriched in CD19 cells [PMID: 28892062], though. A second challenge is to figure out the tissue of origin of GWAS variants. This probably require fine-mapping analysis to pinpoint causal variants, and overlap with tissue-specific enhancer maps, not a small task. So I'd strongly encourage the authors to pursue some analysis along this line, but it would be understandable if the results of this analysis are negative.

      We thank the reviewer for a very interesting point to address. We cannot exclude the possibility of an unobserved heritable factor acting through the brain, and tissue-specific MR analyses would be one possible way to investigate this possibility. However, we agree with the reviewer that partitioning BMI variants into different tissues is not currently feasible as the causal tissues and cell types of the GWAS variants are not known. Nevertheless, we have now implemented a new analysis where we tried to stratify genetic variants into “brain-enriched” and “adipose tissue-enriched” groups, using a simple method based on the genetic variants’ effect sizes on BMI and body fat percentage.

      Our rationale for stratifying variants by comparing their effect sizes on BMI and body fat percentage is the following:

      BMI is calculated based on body weight and height (kg/m2) and it thus does not distinguish between body fat mass and body lean mass. Body fat percentage is calculated by dividing body fat mass by body weight (fat mass / weight * 100%) and it thus distinguishes body fat mass from body lean mass. Thus, higher BMI may reflect both increased fat mass and increased lean mass, whereas higher body fat percentage reflects that fat mass has increased more than lean mass.

      In case a genetic variant influences BMI through the CNS control of energy balance, its effect on body fat mass and body lean mass would be expected to follow the usual correlation between the traits in the population, where higher fat mass is strongly correlated with higher lean mass. In such a scenario, the variant would show a larger standardized effect size on BMI than on body fat percentage. In case a genetic variant more specifically affects adipose tissue, the variant would be expected to have a more specific effect on fat mass and less effect on lean mass. In such scenario, the variant would show a larger standardized effect size on body fat percentage than on BMI.

      We therefore stratified BMI variants into brain-specific and adipose tissue-specific variants by comparing their standardized effect sizes on BMI body body fat percentage. Of the 12,790 variants included in the BMI-sedentary time CAUSE analysis, 12,266 had stronger effects on BMI than on body fat percentage and were thus classified as “brain-specific”. The remaining 524 variants had stronger effects on body fat percentage than on BMI (“adipose tissue-specific”). To assess whether the stratification of the variants led to biologically meaningful groups, we performed DEPICT tissue-enrichment analyses. The analyses showed that the genes expressed near the “brain-specific” variants were enriched in the CNS (figure below, panel A), whereas the genes expressed near the “adipose tissue-specific” variants did not reach significant enrichment at any tissue, but the showed strongest evidence of being linked to adipocytes and adipose tissue (figure below, panel B).

      Figure legend: DEPICT cell, tissue and system enrichment bar plots for BMI-sedentary time analysis.

      Having established that the two groups of genetic variants likely represent tissue-specific groups, we re-estimated the causal relationship between BMI and sedentary time using CAUSE, separately for the two groups of variants. We found that the 12,266 “brain-specific” genetic variants showed a significant causal effect on sedentary time (P=0.003), but the effect was attenuated compared to the CAUSE analysis where all 12,790 variants (i.e. also including the 524 “adipose tissue-specific” variants) were included in the analysis (P=6.3.x10-4). The statistical power was much more limited for the “adipose tissue-specific” variants, and we did not find a statistically significant causal relationship between BMI and sedentary time using the 524 “adipose tissue-specific” variants only (P=0.19). However, the direction of the effect suggested the possibility of a causal effect in case a stronger genetic instrument was available. Taken together, our analyses suggest that both brain-enriched and adipose tissue-enriched genetic variants are likely to show a causal relationship between BMI and sedentary time, which would suggest that the causal relationship between BMI and sedentary time is unlikely to be driven by an unobserved heritable factor.

      Minor comments

      The term "causally associated" are confusing, e.g. in l32. If it's causal, then use the term "causal".

      We have now changed the term “causally associated” to “causal” throughout the manuscript.

      Reviewer #3 (Public Review):

      Given previous reports of an observational relationship between physical inactivity and obesity, Carrasquilla and colleagues aimed to investigate the causal relationship between these traits and establish the direction of effect using Mendelian Randomization. In doing so, the authors report strong evidence of a bidirectional causal relationship between sedentary time and BMI, where genetic liability for longer sedentary time increases BMI, and genetic liability for higher BMI causally increases sedentary time. The authors also give evidence of higher moderate and vigorous physical activity causally reducing BMI. However they do note that in the reverse direction there was evidence of horizontal pleiotropy where higher BMI causally influences lower levels of physical activity through alternative pathways.

      The authors have used a number of methods to investigate and address potential limiting factors of the study. A major strength of the study is the use of the CAUSE method. This allowed the authors to investigate all exposures of interest, in spite of a low number of suitable genetic instruments (associated SNPs with P-value < 5E-08) being available, which may not have been possible with the use of the more conventional MR methods alone. The authors were also able to overcome sample overlap with this method, and hence obtain strong causal estimates for the study. The authors have compared causal estimates obtained from other MR methods including IVW, MR Egger, the weighted median and weighted mode methods. In doing so, they were able to demonstrate consistent directions of effects for most causal estimates when comparing with those obtained from the CAUSE method. This helps to increase confidence in the results obtained and supports the conclusions made. This study is limited in the fact that the findings are not generalizable across different age-groups or populations - although the authors do state that similar results have been found in childhood studies. As the authors also make reference to, due to the nature of the BMI genetic instruments used, the findings of this study can only inform on the lifetime impact of higher BMI, and not the effect of a short-term intervention.

      The findings of this study will be of interest to those in the field of public health, and support current guidelines for the management of obesity.

      We thank the Reviewer for the valuable feedback and insights. We agree that the lack of generalizability of the findings across age groups and populations is an important limitation. We have now mentioned this in lines 341-342 of the manuscript:

      “The present study is also limited in the fact that the findings are not generalizable across different age-groups or populations.”

    1. European concepts of equality more often focus on group inequality and the collective mitigation of handicaps and risks that, in the United States, have been left for individuals to deal with on their own.

      I don't think it's any surprise that the US is a quite individualistic society in comparison to many other counties in the world. I personally don't feel too hopeful that the rhetoric that perpetuates individualism in our society will change, so I am curious to see what solutions may address these community inequalities/handicaps in education that we tend to neglect.

    1. Author Response

      Reviewer #2 (Public Review):

      This paper combines neuroimaging, behavioral experiments, and computational modeling to argue that (a) there is a network of brain areas that represent physical stability, (b) these areas do so in a way that generalizes across many kinds of instability (e.g., not only a tower of blocks about to fall over, but also a person about to fall off a ladder), and (c) that this supports a simulation account of physical reasoning, rather than one based on feedforward processing; this last claim arises through a comparison of humans to CNNs, which do an OK job classifying physical instability but not in a way that transfers across these different stability classes. In my opinion, this is a lovely contribution to the literatures on both intuitive physical reasoning and (un)humanlike machine vision. At the same time, I wasn't sure that the broader conclusions followed from the data in the way the authors preferred, and I also had some concerns about some of the methodological choices made here.

      1. The following framing puzzled me a bit, and even seemed to raise an unaddressed confound in the paper: "Here we investigate how the brain makes the most basic prediction about the physical world: whether the situation in front of us is stable, and hence likely to stay the same, or unstable, and hence likely to change in the immediate future".

      Consider the following minor worry, which sets up a more major one: This framing, which connects 'stability' to 'change' and which continues throughout the paper, seems to equivocate on the notion of 'stability'. One meaning of 'stable' is, roughly, 'unchanging'. Another meaning is 'unlikely to fall over'. The above quotation, along with others like it, makes it seem like the authors are investigating the former, since that's the only meaning that makes this quotation make sense. But in fact the experiments are about the latter -- towers falling down, people falling off ladders, etc. But these aren't the same thing! So there's a bit of wordplay happening here, it seemed to me.

      This sets up the more serious worry. As this framing reveals, unstable scenes (in the likely-to-fall-over sense) are, by their nature, scenes where something is likely to change. In that case, how do we know that the brain areas this project has identified aren't representing 'likeliness to change', rather than physical stability? There are, of course, many objects and scenes that might be highly likely to change without being at all physically unstable. Even the first example in the paper ("a dog about to give chase") is about likely changes without any physical instability. But isn't this a confound? All of the examples of physical instability explored here also involve likeliness to change! So these could be 'likely to change' brain areas, not 'physically unstable' brain areas. Right? Or if not, what am I missing?

      The caption of Figure 1 seems to get at this a bit, but in a way I admit I just found a bit confusing. If authors do after all intend "physically unstable" to mean "likely to change", then many classes of scenarios that are unexplored here seem like they would be relevant: a line of sprinters about to dash off in a race, someone about to turn off all the lights in a home, a spectacular chemical reaction about to start, etc. But the authors don't intend those scenarios to fall under the current project, right?

      The reviewer is correct that "stability" has (at least) these two different meanings, and also correct that we are investigating here the situation in which a configuration is not changing now but would be likely to change with just the slightest perturbation. Our hypothesis is that the “Physics Network” will be sensitive to the likelihood that a physical configuration will change for physical (not social) reasons. That is what our data show: we do not find the same univariate and multivariate effects for situations that are likely to change because of the behavior of an animal. This indicates that what we are decoding is not general ‘likeliness to change’ but rather physical instability in particular.

      (Also: Is stability really 'the most basic prediction' we make about the world? Who is to say that stable vs. unstable is a more basic judgment than, say, present vs. absent, or expected vs. unexpected, or safe vs. unsafe, etc? I know this is mostly just trying to get the reader excited about the results, but I stumbled there.)

      We have now modified the sentence to say: “…how the brain makes a fundamental prediction about the physical world: whether the situation in front of us is stable, and hence likely to stay the same, or unstable, and hence likely to change in the immediate future.”

      1. Laying out these issues in terms of feedforward processing vs. simulation felt a bit misleading and/or unfair to those views, given the substance of what this paper is actually doing. In particular, the feedforward view ends up getting assimilated to "what CNNs do"; but these are completely different hypotheses (or at least can be). Note, for example, that many vision researchers who don't think CNNs are good models of human vision nevertheless do think that lots of what human vision does is feedforward; that view could only be coherent if there are kinds of feedforward processing that are un-CNN-like. It would be better not to conflate these two and just say that the pattern of results rules out CNN-like feedforward processing without ruling out feedforward processing in general.

      This is a fair point, and we certainly agree that we cannot rule out all feedforward models. We have tried to be clear about this claim, e.g., here (in the Discussion: “Three lines of evidence from the present study indicate that pattern recognition alone – as instantiated in feedforward CNNs and the ventral visual pathway – is unlikely to explain physical inference in humans, at least for the case of physical stability."

      3a. I wasn't sure how impressed to be by the fact that, say, 60% classification accuracy one class of stable/unstable scenes doesn't lead to above-chance performance on another class of stable/unstable scenes. Put differently, it seems that the CNNs simply didn't do a great job classifying physical stability in the first place; in that case, how general should we expect their representations to be anyway? Now, on one hand, I could see this worry only further supporting the authors' case, since you could think of this as all the more evidence that CNNs won't have representations of stability in them. But since (a) the claims the authors are making are about feedforward processing in principle, not just in one or two CNNs, and (b) the purpose of this paper is to explore the issue of generality per se, rather than just stability, this seems inadequate. It could be that a CNN that does achieve high accuracy on physical stability judgments (90%?) would actually show this kind of general transfer; but we don't know that from the data presented here, because it's possible that the lack of generality arises from poor performance to begin with.

      You are correct in noting that CNNs don’t do a great job in classifying physical stability, which reinforces our point that pattern recognition systems are not very good at discerning physical stability. In fact, the classification accuracy that we have reported is close to the baseline performance in literature (Lerer et al 2016). Interestingly, training on the block tower dataset itself could only bring up the stability classification accuracy to 68.8% on the real-world block tower images. While this is true of the current best model of stability detection, we think that CNNs trained on large-scale datasets of stability under varying scenarios may in future be able to potentially generalize to other natural scenarios. However, to our knowledge no such datasets exist.

      3b. I wasn't sure how to think about whether showing CNNs stable and unstable scenes is a fair test of their ability to represent physical stability. Do we know that stability is all that these images have in common? Maybe the CNN is doing a great job learning some other representation. This sort of thing comes up in some recent discussions of 'shortcuts' and/or the 'fairness' of comparisons between human and machine vision, including some recent theoretical papers (see author recommendations for specific suggestions here).

      If our point were that CNNs do a great job at representing physical stability, we would indeed have to worry about low-level image confounds or “shortcuts” enabling this performance. But our point is that they do badly. If some of their already bad performance is due to image confounds/shortcuts then they are in fact doing even worse, and that only makes our point stronger.

      4a. I didn't really follow this passage, which is relied on to interpret greater activity for unstable vs stable scenes: "we reasoned that if the candidate physics regions are engaged automatically in simulating what will happen next, they should show a higher mean response when viewing physically unstable scenes (because there is more to simulate) than stable scenes (where nothing is predicted to happen)." It seems true enough that, once one knows that a scene is stable, one doesn't then need a dynamically updated representation of its unfolding. But the question that this paper is about is how we determine, in the first place, that a scene is stable or not. The simulations at issue are simulations one runs before one knows their outcome, and so it wasn't clear at all to me that there is always more to simulate in an unstable scene. Stable scenes may well have a lot to simulate, even if we determine after those hefty simulations that the scene is stable after all. And of course unstable scenes might well have very little to simulate, if the scene is simple and the instability is straightforwardly evident. Can the authors say more about why it's easier to determine that a stable scene is stable than that an unstable scene is unstable? They may have a good answer! It would just be better to see it in the paper.

      The idea here is that forward simulation happens in all cases but stops if no change has occurred since the last frame. That stopping, both represents the stability of the configuration and produces less activity. This idea is akin to the “sleep state” used for nonmoving objects in a physics engine: they do not need to be re-simulated or re-rendered if they have not moved since the last frame (Ullman et al, 2017 TICS).

      4b. I was confused a bit by the Animals-People condition, and whether to think of it as a control condition or not. The image of it in Figure 1a makes it seem like it is meant to be interpreted along the usual "physical stability" lines, just like falling towers and people on ladders, and the caption seems to say this too; it also makes intuitive sense since the man in the boat looks like he'll fall if and when the alligator attacks. But then in the main text the authors predict that the representations of stability would not extend to the Animals-People condition, because they are just supposed to be about peril but not stability. Why not? And then the results themselves are equivocal, with some findings generalizing to Animals-People and some not. I don't have much more to say here other than that I found this hard to follow.

      We used the Animals-People as a control for peril/instability that is not caused by the physical situation (but rather by another agent). Our hypothesis was that the “Physics Network” would hold information about physical stability, not just any kind of propensity for change for any reason. Hence, we predicted, that any brain region responding (only) to physical stability should not respond in a similar way to peril/non-peril conditions in the Animals-People scenario as they involve a more biological-agent driven interaction. That is what we found.

    2. Reviewer #2 (Public Review):

      This paper combines neuroimaging, behavioral experiments, and computational modeling to argue that (a) there is a network of brain areas that represent physical stability, (b) these areas do so in a way that generalizes across many kinds of instability (e.g., not only a tower of blocks about to fall over, but also a person about to fall off a ladder), and (c) that this supports a simulation account of physical reasoning, rather than one based on feedforward processing; this last claim arises through a comparison of humans to CNNs, which do an OK job classifying physical instability but not in a way that transfers across these different stability classes. In my opinion, this is a lovely contribution to the literatures on both intuitive physical reasoning and (un)humanlike machine vision. At the same time, I wasn't sure that the broader conclusions followed from the data in the way the authors preferred, and I also had some concerns about some of the methodological choices made here.

      1. The following framing puzzled me a bit, and even seemed to raise an unaddressed confound in the paper: "Here we investigate how the brain makes the most basic prediction about the physical world: whether the situation in front of us is stable, and hence likely to stay the same, or unstable, and hence likely to change in the immediate future".

      Consider the following minor worry, which sets up a more major one: This framing, which connects 'stability' to 'change' and which continues throughout the paper, seems to equivocate on the notion of 'stability'. One meaning of 'stable' is, roughly, 'unchanging'. Another meaning is 'unlikely to fall over'. The above quotation, along with others like it, makes it seem like the authors are investigating the former, since that's the only meaning that makes this quotation make sense. But in fact the experiments are about the latter -- towers falling down, people falling off ladders, etc. But these aren't the same thing! So there's a bit of wordplay happening here, it seemed to me.

      This sets up the more serious worry. As this framing reveals, unstable scenes (in the likely-to-fall-over sense) are, by their nature, scenes where something is likely to change. In that case, how do we know that the brain areas this project has identified aren't representing 'likeliness to change', rather than physical stability? There are, of course, many objects and scenes that might be highly likely to change without being at all physically unstable. Even the first example in the paper ("a dog about to give chase") is about likely changes without any physical instability. But isn't this a confound? All of the examples of physical instability explored here also involve likeliness to change! So these could be 'likely to change' brain areas, not 'physically unstable' brain areas. Right? Or if not, what am I missing?

      The caption of Figure 1 seems to get at this a bit, but in a way I admit I just found a bit confusing. If authors do after all intend "physically unstable" to mean "likely to change", then many classes of scenarios that are unexplored here seem like they would be relevant: a line of sprinters about to dash off in a race, someone about to turn off all the lights in a home, a spectacular chemical reaction about to start, etc. But the authors don't intend those scenarios to fall under the current project, right?

      (Also: Is stability really 'the most basic prediction' we make about the world? Who is to say that stable vs. unstable is a more basic judgment than, say, present vs. absent, or expected vs. unexpected, or safe vs. unsafe, etc? I know this is mostly just trying to get the reader excited about the results, but I stumbled there.)

      2. Laying out these issues in terms of feedforward processing vs. simulation felt a bit misleading and/or unfair to those views, given the substance of what this paper is actually doing. In particular, the feedforward view ends up getting assimilated to "what CNNs do"; but these are completely different hypotheses (or at least can be). Note, for example, that many vision researchers who don't think CNNs are good models of human vision nevertheless do think that lots of what human vision does is feedforward; that view could only be coherent if there are kinds of feedforward processing that are un-CNN-like. It would be better not to conflate these two and just say that the pattern of results rules out CNN-like feedforward processing without ruling out feedforward processing in general.

      3a. I wasn't sure how impressed to be by the fact that, say, 60% classification accuracy one class of stable/unstable scenes doesn't lead to above-chance performance on another class of stable/unstable scenes. Put differently, it seems that the CNNs simply didn't do a great job classifying physical stability in the first place; in that case, how general should we expect their representations to be anyway? Now, on one hand, I could see this worry only further supporting the authors' case, since you could think of this as all the more evidence that CNNs won't have representations of stability in them. But since (a) the claims the authors are making are about feedforward processing in principle, not just in one or two CNNs, and (b) the purpose of this paper is to explore the issue of generality per se, rather than just stability, this seems inadequate. It could be that a CNN that does achieve high accuracy on physical stability judgments (90%?) would actually show this kind of general transfer; but we don't know that from the data presented here, because it's possible that the lack of generality arises from poor performance to begin with.

      3b. I wasn't sure how to think about whether showing CNNs stable and unstable scenes is a fair test of their ability to represent physical stability. Do we know that stability is all that these images have in common? Maybe the CNN is doing a great job learning some other representation. This sort of thing comes up in some recent discussions of 'shortcuts' and/or the 'fairness' of comparisons between human and machine vision, including some recent theoretical papers (see author recommendations for specific suggestions here).

      4a. I didn't really follow this passage, which is relied on to interpret greater activity for unstable vs stable scenes: "we reasoned that if the candidate physics regions are engaged automatically in simulating what will happen next, they should show a higher mean response when viewing physically unstable scenes (because there is more to simulate) than stable scenes (where nothing is predicted to happen)." It seems true enough that, once one knows that a scene is stable, one doesn't then need a dynamically updated representation of its unfolding. But the question that this paper is about is how we determine, in the first place, that a scene is stable or not. The simulations at issue are simulations one runs before one knows their outcome, and so it wasn't clear at all to me that there is always more to simulate in an unstable scene. Stable scenes may well have a lot to simulate, even if we determine after those hefty simulations that the scene is stable after all. And of course unstable scenes might well have very little to simulate, if the scene is simple and the instability is straightforwardly evident. Can the authors say more about why it's easier to determine that a stable scene is stable than that an unstable scene is unstable? They may have a good answer! It would just be better to see it in the paper.

      4b. I was confused a bit by the Animals-People condition, and whether to think of it as a control condition or not. The image of it in Figure 1a makes it seem like it is meant to be interpreted along the usual "physical stability" lines, just like falling towers and people on ladders, and the caption seems to say this too; it also makes intuitive sense since the man in the boat looks like he'll fall if and when the alligator attacks. But then in the main text the authors predict that the representations of stability would not extend to the Animals-People condition, because they are just supposed to be about peril but not stability. Why not? And then the results themselves are equivocal, with some findings generalizing to Animals-People and some not. I don't have much more to say here other than that I found this hard to follow.

      5. "Interestingness" ratings felt like a not-quite-adequate approach for evaluating how attention-grabbing the towers were. A Bach concerto is more interesting than a gunshot (and would be rated that way, I imagine), but the gunshot is surely more attention-grabbing. Why not use a measure like how much they distract from another task? That's the sort of thing I'd have expected, in any case.

    1. Author Response

      Reviewer #1 (Public Review):

      In Wang et al., the authors investigate issues related to the relative proportion of flux for the enzymatic decarboxylation of pyruvate between PDH (pyruvate dehydrogenase) and PFOR (pyruvate-ferredoxin oxoreductase) in the model organism Synechococystis. The manuscript provides evidence that PDH becomes increasingly inactivated by a high ratio of NADH:NAD+ as well as evidence to suggest that PFOR is transcribed and remains intact under aerobic conditions. The authors put forward the theory that both PDH and PFOR are functionally active routes for pyruvate decarboxylation under aerobic conditions, whereas PFOR has previously been assumed to be inactive under growth conditions containing oxygen. This distinction is particularly highlighted by conditions where Synechocystis is grown photomixotrophically - and where the NADH:NAD+ pool may be relatively over-reduced because of two parallel inputs of reductant (water-splitting at PSII and catabolism of glucose). The authors examine growth under photoautotrophic and photomixotrophic conditions for a number of relevant mutants including members of the ferredoxin/flavodoxin family, PFOR, and NDH-1 complex subunits.

      The theory put forward in this manuscript is of general interest regarding electron flux through the combined electron transport chain (photosynthetic + respiratory) of cyanobacteria. The authors further broaden the potential audience for the manuscript by elaborating on the potential significance of these results in the context of a switch from PFOR (ancestral) to PDH (oxygenic/modern).

      Comments:

      Generally, theories put forward in this manuscript are intriguing and have a number of potential implications for understanding electron flux and regulation of central metabolic processes in photosynthetic microorganisms. If these theories are supported and become more generally adopted, they would have significant impact on the understanding of the regulation of central carbon metabolism in cyanobacteria. That said (due in no small part to the complexity of some of these pathways), the evidence provided to support the hypotheses is indirect in many instances. In some cases, there is a pairing of indirect data with broad statements that can come across as over-reach. These problems can be somewhat exacerbated by an unclear organization at parts of the Discussion, a lack of succinctly defined claims, and numerous typographical considerations.

      Thank you very much for this point. We now reorganized the discussion and overhauled it completely. It starts with aspects that are best supported by our data. We then added two sentences to stress that the following lines include hypothetical considerations that are meant as thought-provoking impulses. We hope that thereby over-reach is prevented.

      Major considerations:

      A major component of the proposed theories in this manuscript rest upon the assumption that PFOR is an active enzyme under highly aerobic conditions: this claim is never directly demonstrated.

      This is true. We could show though that PFOR of Synechocystis is in constrast to most bacterial PFORs stable in the presence of oxygen. However, as stated likewise for the oxygen stable PFOR of the obligate aerobe Sulfolobus acidocaldarius (3), and PFOR from E. coli, which was recently shown to contribute to metabolism in the presence of oxygen in vivo (1) we as well had to remove oxygen for enzyme acitivty in vitro. This point is discussed frankly.

      Indirect evidence of altered growth of pfor mutants, increased repression of PDH, and the higher NADH:NAD+ ratio under photomixotrophic conditions is in general alignment with this theory. However, while deletion of pfor does indeed result in altered growth dynamics in Synechocystis under periods of photomixotrophy, the alterations do not entirely align with the idea that this pathway is critical for rapid growth under aerobic conditions. For instance, pfor and most of the highlighted mutants (fdx 3, fdx 9, isiB) presented in Figure 3 show the greatest defects in their OD after reaching stationary phase (more rapid decline in OD on/after Day 6) relative to WT. This doesn't align as nicely with the highest NADH:NAD+ seen in Days 3-5 (which is also specifically called out: e.g., Line 146, Supplemental Figure S8).

      We are very cautious to compare growth experiments day by day. This is due to the fact that the growth behaviour of WT and mutants differ between experiments. We therefore repeat these experiments in several independent experiments including at least three replicates and show the data of typical growth experiments. In the case of the shown growth behaviour of WT and pfor and the NADH/NAD+ ratios under photoautotrophic and photomixotrophic conditions shown in figure 1, NADH/NAD+ ratios were determined in exactly those cultures for which growth data are shown. It is therefore legitimate to directly compare these results day by day. However, we did not determine the NADH/NAD+ ratios of the cultures shown in Fig. 3. The rise in NADH might have started with a delay here.

      In this context, the deletion of F-GOGAT is much more convincing in it's severity and timing, yet for this mutation to have a more severe phenotype is unexpected if PFOR is one of the primary/sole electron donors to the ferredoxin pool from glucose utilization as proposed (i.e., stated differently, F-GOGAT is only one of the enzymes downstream of ferrodoxin and might be expected to have a more subtle phenotype in comparison to the KO of PFOR if that is a primary source for electrons to ferredoxin under photoheterotrophic conditions).

      F-GOGAT requires reduced ferredoxin which can be provided by PFOR and in addition also by PSI. As electrons from glucose oxidation can be fed via photosynthetic complex I into the PQ-pool they will eventually arrive at PSI (Fig. 3C) where ferredoxin can be reduced and transfer electrons to F-GOGAT. However, to get a truly complete picture of the situation several issues will have to be addressed in the future: we do not know which of the low abundant ferredoxins as well as high abundant ferredoxin 1 interact with PSI, F-GOGAT, PFOR and photosynthetic complex I. It would be furthermore helpful to know all midpoint potentials of the different ferredoxins. Without this information it might be too much to ask for a simple interpretation.

      A central tenant of the argument put forward on the evolutionary importance of using either PFOR vs. PDH is the conservation of extra free energy by the former reaction. However, additional information on the ferredoxin paralog(s) that accept electrons from PFOR is necessary to evaluate these claims. Based on the data within these manuscripts, Fdx3, Fdx9, and IsiB have the strongest links to PFOR: though the authors do take care to never state directly that they have evidence that these are the acceptors in vivo. Given the variability in the midpoint potentials of different ferredoxins, some ferredoxin acceptors may better conserve the free energy in pyruvate, while others may actually be more 'wasteful' than NAD+ as the acceptor through PDH. Unfortunately, the midpoint potentials for Fdx3, Fdx9, and IsiB are unknown or not stated in this manuscript. It is therefore unclear what ferredoxin is being used as the reference point for conservation of Gibbs free energy in Figure 4C and referenced multiple times in the text.

      We agree that it would be great if we already knew the redox potentials of all the ferredoxins involved. We are currently working on this issue. All that we know for now is that the redox potentials of ferredoxins lay between -240 mV to -680 mV whereas the redox potential is around -320 mV for NAD(P)H/NAD(P)+. Unpublished data that require further validation reveal that the redox potential of Fdx9 is definitely more negative than the redox potential of Fdx1 (-412 mV) in Synechocystis and is thereby clearly more negative that -320 mV. However, as these data require further validation, we did not name numbers. In addition, interaction studies on PFOR and low abundant ferredoxins are planed and preparations are in progress.

      Finally, the measurements of NADH:NAD+ (most prominently used for measurements in Fig 1B) utilized kits that require multiple, long centrifugation steps in the dark prior to assaying this rapidly exchanging pool. While it appears that the authors were able to get reproducible results with these kits, it is difficult to interpret what the increase in relative NADH levels in glucose-fed cells means given that 10+ minutes of incubation in the dark and/or changing temperatures elapsed after the cyanobacteria were removed from the incubator before the NADH:NAD ratio was assessed. While it superficially makes logical sense that the cytosol would be over-reduced when illuminated and under glucose feeding relative to illumination alone, it shouldn't be assumed that these measurements are representative of this rapidly-exchanging pool under the steady-state growth conditions.

      Thank you very much for raising this important point. We are very much aware of the difficulties to determine the redox state of NADH:NAD+ using these kits. However, there is no other method available that properly distinguishes NADH and NADPH. Furthermore, the centrifugation step was done at -9°C which should minimize metabolic reactions during this step. However, we now added in vivo measurements using the NAD(P)H-module available for the PAM and using the Dual-KLAS/NIR to determine the redox state of ferredoxin (newly added Fig. S4). Both methods show that NAD(P)H as well as ferredoxin are more strongly reduced under photomixotrphic conditions in comparison to photoautotrophic conditions and thus support our previous data.

      Reviewer #2 (Public Review):

      The observation that cyanobacteria can use two alternative pyruvate decarboxylating enzymes using either NAD+ or ferredoxin is an interesting and the work is useful contribution. The authors very nicely characterize the enzymatic properties of the two pyruvate metabolizing enzymes and also are able to connect the ideas of redox balance with a set of ferredoxins. Even though they are not able to definitively characterized the specific ferredoxin which interacts with the enzyme, the analysis is nicely conducted and it's clear that the suggestion they're making regarding the involvement of the minor ferredoxins is compelling. However, the work could be written in a way that might be more useful.

      Specific comments:

      Overall this is an interesting study, but the arguments could be sharpened and better connected with the literature. The introduction needs to be considerably revised in my opinion. It is not obvious whether it is even appropriate to discuss the enzymes as an aerobic enzymes or aerobic enzymes, since this concept is simplistic and perhaps, archaic. Indeed, placing the results of the present study in the context of "aerobic enzymes versus aerobic enzymes" is a bit of a 'strawman' argument. For example, the counter examples of O2-tolerant enzymes cited seem to suggest that PFORs have been capable of evolving into O2-tolerant enzymes quite readily and that two types of decarboxylase have evolved for quite different reasons than simple replacement for a new environment. Instead, I think a more current and general perspective relates more to the interpretation that the authors are already putting forth. Namely, the enzymes are utilized according to redox balance considerations rather than sensitivity to oxygen.

      Therefore, I think the very long and pedantic introduction is useful for review, but only if it is shortened and also includes the alternative interpretation regarding adaptations to redox potential in the cytoplasm. My guess is that there are plenty of examples of redox balance function arguments in the literature to refer to in contrast to the evolutionary replacement argument used. Certainly, there are good examples regarding glucose toxicity in mutants of Synechocystis that can be considered.

      Thank you very much for this point. The O2-tolerant PFORs mentioned were merely shown to be stable in the presence of oxygen in vitro which means that they can be isolated under anaerobic conditions. However, all enzymatic in vitro assays required anaerobic conditions. Only one PFOR was shown to be active in the presence of oxygen in vitro. Physiological studies on the importance of these enzymes under aerobic conditions in vivo are completely missing. However, animated by the requests of the reviewers we searched the literature intensively again and indeed found a recent report, which describes the involvement of PFOR in redox regulation in an aerobic culture of an E. coli mutant, in which glucose-6P dehydrogenase (ZWF) was down-regulated (1). We included this study both in our introduction and discussion. It very much supports our own findings, as the E. coli PFOR requires likewise anoxic conditions in in vitro enzyme tests. We agree that the idea that PDH complex and PFOR are exclusively regulated by oxygen availability might sound simplistic. However, we do not fully agree that this is a strawman argument as both enzyme systems are still mostly discussed as counterparts for either aerobic respiration (PDH complex) or anaerobic fermentation (PFOR)(4). To the best of our knowledge, the study that was included now and our own data, are the very first ones that put clearly forward the idea, that redox control governs the activity of these enzyme systems at the pyruvate node independent of oxygen. However, doubts about the rather simplistic distinction between aerobic versus anaerobic enzymes in general have indeed been expressed. Even though these studies in general lack physiological in vivo experiments. We therefore included this information in the introduction as well. (line 76: There are several reports on the aerobic expression of enzymes that are assigned to anaerobic metabolism in prokaryotes and eukaryotes and therefore challenge the simplistic distinction between aerobic versus anaerobic enzymes (5-7). Their physiological significance and regulation are only partly understood.) This did not result in a shortened introduction though as additional information was added. The new introduction thus includes alternative interpretations as requested and is therefore hopefully more balanced.

      Given the interpretation that the alternative forms of the enzyme help cells adjust their redox balance to different conditions, such as photomixotrophic growth, the very nice enzymatic analysis and growth studies of the mutants work would be significantly strengthened by more direct physiological measurements that report intracellular redox states.

      Thank you very much for this important point. Intracellular redox states were shown by measurements of the NAD+/NADP level (Figure 1B) and were now extended by new in vivo measurements that show that both the NAD(P)H and the ferredoxin pools are more reduced under photomixotrophic in contrast to photoautotrophic conditions (new Fig. S4).

      Minor comments:

      line 211: Perhaps, "..the deleted alleles failed to segregate, keeping some wild type copies."

      This was changed to: the deleted alleles of fx2 (sll1382) and fx5 (slr0148) failed to segregate, keeping some wild type copies.

      It would be interesting to characterize whether the observed distribution of PFOR correlates with specific physiological features. In other words, PFOR seems to become important upon the addition of an external carbon source in way that must integrate with autotrophic metabolism (i.e. mixotrophic growth) altering the balance of the oxidized and reduced form of redox cofactors--does the observed distribution correlate at least with the metabolic characteristics of the handful that have been studied in the lab?

      Thank you very much for this suggestion. We checked the lists of cyanobacteria that either possess or do not possess a PFOR in order to search for shared known physiological features. However, the challenge is currently that the number of uncharacterized cyanobacteria in our list is too large. It is therefore impossible to find solid correlations. But we fully agree that it would be interesting to find these.

      A more detailed set of calculations that help explain panel C in figure 4 need to be included to support the quoted values for redox potential in free energy. I assume these are standard values and and the specific superscripts and subscription associate with the ΔG nomenclature needs to be defined.

      The calculations are shown in the materials and methods part. A respective notice (for calculations see materials and methods part) is now given in the legend of Fig. 4C. Information concerning the nomenclature is found in the cited literature in the materials and methods part as well.

      Reviewer #3 (Public Review):

      The manuscript by Wang et al. conclusively demonstrates that the cyanobacterium Synechocystis sp. PCC6803 prefers to use the ferredoxin-reducing enzyme PFOR over the NAD+-reducing PDH-pathway when grown under photomixotrophic conditions while the PDH-route is favored under photoautotrophic conditions. Both the potential physiological meaning of this switch and implications for the evolutionary history of the role of the respective enzymes and their pathways are discussed.

      The main hypothesis of this work considers that PFOR-mediated decarboxylation of pyruvate replaces the PDH-based one when cells shift from photoautotrophic to photomixotrophic growth conditions. This hypothesis is assessed via the comparison of growth curves measured on a host of deletion mutants and via direct detection of expression levels of certain enzymes. The authors' hypothesis is robustly supported by the majority of the reported experiments and the reviewer is fully convinced by these data. However, I would hold that the data shown with respect to phosphorylation of PDH (Fig. S4) are unconvincing. I can't see a clear difference in growth-curves for the incriminated mutants deltaspkB and L which would convincingly exceed the variation observed for the entire dataset.

      We agree that the data on the phosphorylation of the PDH complex including the kinase mutants are not very convincing. We were uncertain from the beginning on whether it would be a good idea to include these data sets and therefore discussed them very cautiously in the manuscript. Anyway, as the enzymatic tests with the E3 subunit of the PDH complex at different NADH concentrations show convincingly that high NADH levels have an inhibitory effect on the complex, we now decided to delete both data sets out of the manuscript, as they are not really required for the statement of the manuscript.

      1) S. Li et al., Dynamic control over feedback regulatory mechanisms improves NADPH flux and xylitol biosynthesis in engineered E. coli. Metab Eng 64, 26-40 (2021).

      2) T. Nakayama, S. Yonekura, S. Yonei, Q. M. Zhang-Akiyama, Escherichia coli pyruvate:flavodoxin oxidoreductase, YdbK - regulation of expression and biological roles in protection against oxidative stress. Genes Genet Syst 88, 175-188 (2013).

      3) A. Witt, R. Pozzi, S. Diesch, O. Hädicke, H. Grammel, New light on ancient enzymes – in vitro CO2 Fixation by Pyruvate Synthase of Desulfovibrio africanus and Sulfolobus acidocaldarius. The FEBS Journal 286, 4494-4508 (2019).

      4) M. Müller et al., Biochemistry and Evolution of Anaerobic Energy Metabolism in Eukaryotes. Microbiology and Molecular Biology Reviews 76, 444 (2012).

      5) S. B. Gould et al., Adaptation to life on land at high O2 via transition from ferredoxin-to NADH-dependent redox balance. Proceedings of the Royal Society B: Biological Sciences 286, 20191491 (2019).

      6) O. Schmitz, J. Gurke, H. Bothe, Molecular evidence for the aerobic expression of nifJ, encoding pyruvate : ferredoxin oxidoreductase, in cyanobacteria. FEMS Microbiol. Lett. 195, 97-102 (2001).

      7) K. Gutekunst et al., LexA regulates the bidirectional hydrogenase in the cyanobacterium Synechocystis sp. PCC 6803 as a transcription activator. Molecular Microbiology 58, 810-823 (2005).

    1. Author Response

      Reviewer #1 (Public Review):

      Strength: Excellent statistical methods are employed. Specimens collected from two centers are used.

      Weakness: It is not clear what new knowledge this follow-up study bring to the audience. The critical biomarker, miR150 they propose for development of biodosimetry assay was already discovered. There are close to dozen publications showing the dose response of miR50, in mouse, rats, non-human primates and humans (including two research papers and and several reviews from authors). The dose response shown in 4b is not appreciable. Introduction and discussion talk about clinical utility for triage after nuclear disaster. Is analysis of miRNAs purified from exosome a viable approach for triage and clinical decision making? If so, please provide convincing argument showing practicality.

      We appreciate that the reviewer and the editor believe that “excellent bioinformatics and biostatistical methods are employed”. We apologize for the confusion regarding miR-150 and its utility as a radiation exposure biomarker. Indeed we and others have shown the importance of miR-150 and other miRNAs in detecting radiation exposure in mice and macaques. We had inferred that the resulting evolutionarily conserved radiation-inducible microRNAs were very likely to translate well to humans due to the high conservation of their promoter regions and transcription factor binding sites. However, in this study validating microRNA-based test for radiation detection using actual samples , we demonstrate that while most of the predictions grounded in animal models held true, solely through the analysis of human data were we able to develop a model that reached clinically-useful performance. And most importantly there are key differences in humans suggesting that for clinical application the primary source of data has to be human. For example, a key miRNA for radiation detection noted in macaques – miR-133 – was absent in human patient sera. The miR-30 family, important for dose separation in mice was redundant in the human test. The results from animal studies of miR-150-5p are not directly translatable for the use in humans. In animals, particularly isogenic mice, miR-150-5p kinetics enable perfect separation of the irradiated from non-irradiated samples, even after low dose exposure. The dose response in humans, that have different genetic and clinical background, is much less appreciable and therefore a simple, single- or two-miRNA-based test is insufficient. To overcome this, we employed artificial neural networks reliant on the expression of 8 miRNAs and 2 normalizers, which assure robustness to differences in sample material content. Therefore, we are bringing significantly new knowledge to the field, and providing a template for how miRNA signatures derived from animal models need robust validation in human samples before we even conceive a human application. The analysis of miRNAs purified from exosomes constitutes an exploratory component of our work and is not part of the proposed diagnostic procedure for triage and clinical decision making. We introduced necessary changes to make the division between the main and exploratory parts of our work more evident (lines 116-127).

      Major comments:

      1. Longitudinal evaluation of specimens from human patients who received TBI is a plus. However, baseline readings in specimens collected from leukemia patients need to be compared with that in healthy humans. Why several specimens were excluded from analysis?

      Since the irradiation of healthy humans would not be ethically acceptable, we cross-referenced the results from patients with leukemia with our earlier results of radiation-responsive miRNAs in healthy mice and non-human primates as a surrogate of healthy humans undergoing TBI. As outlined in the “Preprocessing of profiling data” section of Materials and Methods, we implemented quality control based on the number of detected miRNAs per sample. For the miRNA-seq based experiment, samples with less than 350 miRNAs with non-zero reads detected (4A and 7A in Figure 1 – supplementary figure 1) and respective paired samples were removed from the analysis. Additionally, sample DFCI.13A was an outlier in hierarchical clustering and in Principal Component Analysis (Figure 1 – supplementary figure 2) and therefore this sample, together with paired samples from other timepoints, were excluded from the analysis. We incorporated this information in the main part of the manuscript (lines 146-148).

      1. Dose response noted is moderate. Biodosimetry refers retrospective evaluation of absorbed dose and the analysis should include validation using specimens of unknown exposure.

      As outlined above, the moderate dose responsiveness of miRNAs used in our proposed signature is the primary reason why we believe that a simple diagnostic procedure based on a single miRNA, e.g. miR-150-5p, will not be feasible for use in humans. The final model was evaluated on an independent group of 12 patients with samples drawn under the same protocol (for which exposure and dose was unknown, to validate the model diagnostic accuracy).

      1. Authors says that 1 Gy exposure in humans can cause ARS (paragraph 1, introduction). However their approach do not resolve dose under 4 Gy (around the LD50 value in humans).

      The TBI protocol does not allow for irradiation with doses lower than 2Gy in a single fraction, which was the reason behind the definition of low-dose exposure group (2 or 4Gy) in our study. However, localized irradiation with higher doses provokes response reflected by changes in miRNA levels in serum (Malachowska et al. Int. J Radiation Oncol Biol Phys), suggesting that the irradiation signature are likely to hold true and identify individuals exposed to smaller doses.

      Reviewer #2 (Public Review):

      The study first compared the profiles of serum miRNA in patients before and after irradiation treatment. Then they selected 8 miRNA markers that showed significant changes in levels for further analysis. Then, they showed that the analysis of these markers by real-time PCR can differentiate the pre- and post-irradiation samples in 12 additional patients. The objective of the study is unclear.

      We rephrased the appropriate sections of the manuscript accordingly to elucidate the objective of the study (lines 105-106 and 131-132).

      The study only demonstrates that the 8 miRNA markers are useful to differentiate serum samples collected before and after irradiation. This information is not useful as the blood picture would be more accurate and cheap to accomplish this task.

      The currently used diagnostic screening tests for radiation exposure, including time to onset of radiation sickness, kinetics of lymphocyte depletion and chromosomal abnormalities analysis, are time-consuming and do not allow definite conclusions, as outlined by the lack of FDA-approved biodosimeter. The nadirs of peripheral blood cell counts may reflect high dose exposure but do not allow for prediction of the eventual outcome. Moreover, as evidenced in our prior experimental studies, the dynamics of the blood cell counts are significantly slower than those of circulating miRNAs. For example, the differences in outcome, that is probability of survival of an animal after acute radiation exposure, is not evident by any blood counts or other measures for weeks after radiation, and is predicted by a blood based-microRNA signature with ~90% accuracy assessed 24 hours after radiation exposure (Acharya et al, Science Translational Medicine, 2015). Therefore, although we acknowledge that a blood cell count would be cheaper, we respectfully disagree that it would be more accurate in rapidly providing the necessary information to implement countermeasures safeguarding from the absorbed radiation dose. Furthermore, qPCR-based assays are also inexpensive and increasingly available, owing to the COVID-19 pandemic and the great need to expand PCR-based testing capabilities that it gave rise to. We acknowledge that this information was not presented in sufficient detail and we expanded relevant sections of the manuscript (lines 64-76, 401-402).

      The authors also propose that these markers are useful for the identification of subjects exposed to irradiation. As this study has not addressed the specificity of these miRNA markers to irradiation, the claim of having a signature for radiation exposure is not justified.

      We had shown in previous, experimental exposure studies (“Serum microRNAs are early indicators of survival after radiation-induced hematopoietic injury”, Science Translational Medicine, 2015 and “Evolutionarily conserved serum microRNAs predict radiation-induced fatality in nonhuman primates”, Science Translational Medicine, 2017), performed using animal models that miRNAs with radiation-dependent alterations of expression show association with bone marrow depletion, correlate with survival in amifostine rescue experiment, and that miRNA expression changes are supressed by the use of radiation-mitigating agents like gamma-3-tocotrienol. These arguments act in favour of specificity towards irradiation as the inciting stimulus of the expression patterns. The cross-referencing of results from animal studies and from our miRNA-seq experiment on human samples was aimed to account for this issue, as similar experiments on healthy humans would not be ethical, and to identify high-confidence miRNAs from which a signature could be built. We now added these explanations (lines 112-115, 164-167, 344-350).

      Although patients with irrevocable damage of bone marrow due to other factors would be an interesting comparative group, we struggle to find an ethically acceptable scenario that would match the TBI in terms of the timeline and repeatability of the bone marrow depletion. A feasible alternative may be high dose chemotherapy conducted in preparation for bone marrow transplant, but the dynamics of that procedure are vastly different making the group more adequate for analyses of bone marrow regeneration rather than a control for TBI-initiated damage.

      The key new experiments in this study are the profiling of the serum miRNA in the patients undergoing total body irradiation. The results on mouse model and macaques have been published previously. The consistency of the changes of the miRNA markers is not surprising.

      The consistency of the radiation-inducible miRNAs between mice, non-human primates and humans was expected, given the high conservation of their promoter regions and transcription factor binding sites, as we showed previously (Fendler et al., 2017). This step was important to assure that the miRNA level changes observed in humans result from radiation exposure, as this could not be determined directly, as mentioned in the response to previous remark. However, the creation of the clinically-applicable test would not be possible without a true study in humans presented in the manuscript. Notably, miRNAs crucial for the radiation exposure models in our macaque model (miR-133b) was surprisingly absent in human sera, and the miR-30 family, important for dose separation in mice was redundant in the human test. This serves as a cautionary tale for “translational” studies without true validation in humans and underlines the importance of our findings in terms of the first human-specific and adequately validated diagnostic and prognostic test for radiation exposure.

      Reviewer #3 (Public Review):

      1. Appropriate bioinformatics discussions and functional pathway analysis are necessary for the key differentially expressed miRNAs that have been screened out. It is boring to only discuss the differences of miRNA data.

      We appreciate the suggestion to back the results of differential miRNA expression with a more in-depth bioinformatic discussion. We discussed the results of functional enrichment analysis, presented in Fig. 3C, in more detail, and appended the bioinformatic analysis (lines 218-222, 360-364, 546-549). A graph of miRNA-gene interactions, created using miRTargetLink 2.0 for miRNAs differentially expressed in exosomes after high dose irradiation has been added as figure supplement 1 to Figure 3.

      1. In page 5, "We used logistic regression to create such a model in the low-dose setting (N=22 sample pairs). The resulting classifier was based on the expression of miR-150-5p, miR-126-5p and miR-375" , Why the three miRNAs in the low-dose radiation group were selected for modeling instead of the seven overlapping miRNAs in the high and low dose radiation group to classificate the irradiated- and non-irradiated samples ? Please explain in detail.

      The expression of miR-150-5p, miR-126-6p and miR-375 was used in our previous animal studies to determine radiation exposure and we used similar approach at this stage of the project to evaluate whether their expression measured using RNA sequencing in human sera can reliably distinguish between the irradiated and non-irradiated samples. We acknowledge that it is not clearly stated. The primary purpose of this analysis was to visually present similarities in radiation-inducible miRNA expression changes across species, and the logistic regression model in question was not used any further. Following the Reviewer suggestion, we built a model using the seven miRNAs overlapping in the high and low dose radiation comparisons to classify the irradiated- and non-irradiated samples, obtaining AUC of 0.95 (95%CI: 0.89-1.0); however, we believe adding this information to the main part of the manuscript is not necessary.

      1. In page 5, "Therefore, the expression of miR-126-5p, miR-150-5p and miR-375 enabled efficient classification of the irradiated- and non-irradiated samples in both settings (Fig. S6C)";

      In page 6, "Interestingly, a set of 3 miRNAs quantified by qPCR in all of our previous experiments clearly visually distinguished irradiated from non-irradiated samples in the human analysis (Fig. 5A)",

      Which three of miRNAs,miR-150-5p,miR-375,miR-126-5p mentioned before or miR-150-5p,miR-375,miR-215-5p?Please clarify clearly.

      Thank you for the suggestion. We rephrased this fragment (lines 289-290).

      1. In page 4, "Since miRNA-containing exosomes.......high dose irradiation", Do you think that the differently expression of serum miRNAs partly results from exosomes? Low dose irradiation is also able to change exosomal miRNA profile,why only high dose irradiation is taken into account in paper while low dose irradiation is not?

      We believe that serum miRNA expression results in part from exosomes and, as an exploratory component of our work, aimed to verify whether the magnitude of changes in exosomal miRNA expression exceeded that in serum, improving the potential biomarker specificity to the extent that would justify the development of an arguably more complex and labour-intensive test utilizing exosome isolation. The sequencing of exosomal miRNA content was therefore performed as an exploratory analysis only after high radiation exposure. However, the lower amount of exosomal miRNA than obtained through the total miRNA extraction protocol offsets any benefit stemming from higher cellular specificity of the former, and, based on the results that were comparable with those obtained from sera, decided to not explore this concept further. We added this explanation to our manuscript as this issue was not clarified previously (lines 116-127 and 339-343).

      1. Are there any miRNAs that can clearly distinguish between high and low dose groups? If so, please clarify them in text.

      We now clarified this issue in discussion (lines 415-417).

      1. In page 7,"Importantly, similarities were observed in the level of both individual miRNAs and miRNA families", What part of result Comes to this conclusion?Please explain clearly.

      When describing similarities between human and animal studies, we refer to our previous work describing radiation-responsive miRNAs in mice and non-human primates. These similarities (and differences) are described in detail in Table 1. We added relevant references to Table 1 and to the cited sentence (line 352).

      1. In page 7, "We found that the most common putative tissue sources for differentially expressed miRNAs were hematopoietic and endothelial cells", Which part of result shows this sentence? Please point it out.

      This statement is not validated in our work explicitly but based on the results from references: Ludwig et al., 2016, de Rie et al., 2017 and Landgraf et al., 2007. Since Ludwig et al., de Rie et al. and Landgraf et al. generated excellent data of miRNA expression across human and mouse tissues and cell types that showed overlapping results for the miRNAs of interest, as detailed in Table 1, we did not perform additional confirmatory experiments.

      1. Were the patients suffering from cancer or other diseases? How to ensure that the differential expression of miRNA was caused by radiation exposure rather than their own disease? Please explain.

      As described above, initial experimental studies performed in animal models (mouse and macaque) in preparation for this study showed the specificity of miRNA (including ones in the signature) towards radiation exposure in different animal models. This was evidenced on multiple layers of validation and rescue experiments. Admittedly, a demonstration that additional diseases with a phenotype similarity with ARS affect study performance is an interesting concept, but it would be extremely unlikely to impair the performance of the test in an individual after radiation exposure. Namely, even if the examined patient has a hematologic malignancy or myelofibrosis potentially affecting the performance of the test, identification of such individuals as potentially irradiated would lead to them being followed-up adequately. Failure of the test to detect radiation exposure will likely not be severe risk, since such individuals will already be severely ill and under proper care with regular monitoring of bone marrow function. We are aware that some unforeseen and not discussed clinical factors may affect some facets of the test but the built-in robustness derived from having multiple miRNAs mitigates the risk of non-specificity.

    1. but that was probably just something people said because her nose was too small and her mouth was a bit too big.

      What is interesting is the reasoning we often come up with viewing ourselves is often negative as we seem to like to often question how others think that you look good while you may end up looking at yourself negatively.

    1. adamsmith May 19, 2013 So, the argument for the status quo is that the working paper on arxiv is a separate publication from the journal article it ends up published as. That's why it should be saved and - where it applies - cited differently. In other words, taking bibliographic data seriously, the DOI does _not_ apply to the arxiv paper and should not be saved with it. That's in line with what we do with other working paper repositories such as SSRN.
      • I THINK SO
      • DIFFERENT zotero items for
      • arxiv
        • different item for each version!!!
      • doi publisher
      • Each item with its PDF!!!
      • DIFFERENT Citations!!!
    1. On a larger scale, ARSAC’s goal is not to completely extinguish wildfires—or replace firefighters—but to create acoustic boundary lines that prevent such fires from spreading. “We think we’re going to be able to buy those firefighters time, which is the real killer in a disaster situation,” Dhillon says. ARSAC’s technology may prove particularly disruptive because it’s designed as a “sense and respond system,” he adds, rather than a “sense and react” system. The difference? ARSAC’s integrated fire protection system aims to not only detect embers but also track the location and direction of burgeoning fires to prevent them from crossing property lines. ARSAC’s system employs sensors to detect a heat bloom and then send out spikes on a given frequency that can be used to track the fire’s flow, Dhillon explains. Drones can then be dispatched to provide aerial surveillance to monitor the fire, and arrays of sound-wave fire extinguishers along property lines can be pointed in the right direction to create an acoustic fire barrier.

      System design details

  5. Dec 2021
    1. Author Response

      Reviewer #1 (Public Review):

      In the present work Valperga and de Bono performed a forward genetic screen to identify candidate genes that would fulfill two criteria when mutant: 1) enhance an escape response to high ambient oxygen but 2) without modifications in the respective oxygen sensing neurons. They found that qui-1 mutants meet these criteria. qui-1 is known to act in the nociceptive neurons ASH and ADL (among others). The authors show that in qui-1 mutants ADL neurons are defective in normal chemo-sensation and upregulate neuropeptide secretion. This is associated with increased gene expression of neurosecretion components in ADL, among them two GPCR receptors (npr-22 and tkr-1); mutants in these receptors partially phenocopy the neurosecretion phenotype. The authors suggest an intriguing model in which ADL, upon loss of its normal sensory properties, relays peptidergic input from oxygen sensory circuits to peptidergic output towards yet unidentified downstream circuitry. This novel mechanism of sensory cross modality expands on on previous work on cross modality in C. elegans, where until now only one example been demonstrated, and where a different mechanisms than in the present study was described (Rabinowitch 2016). These findings could serve as generalizable models for other systems where cross-modal plasticity has been observed. Although many conclusions in this work are substantiated by cell specific rescue of qui-1 in ADL others are made based on correlated observations only. The study therefore would benefit from additional experiments that demonstrate a causal link between elevated neurosecretion in ADL and the associated changes in behavior. This could be achieved by ADL cell ablation experiments and specific interference with ADL neurosecretion.

      We thank the reviewer for this analysis of our work. We sought to address points raised in this summary using her/his suggestions.

      Reviewer #2 (Public Review):

      Loss of one sensory modality is often compensated with an increase in another sensory modality. Valperga and de Bono identify a possibly conserved mechanism that appears to heighten the worm's sensitivity to O2 while dampening other sensory responses. The mechanism that they discover suggests that increased neuropeptide secretion could be responsible for the overcompensation for a loss of a sense. The combined data based on forward genetic screening and behavioral analysis, imaging and genomics are convincing and interesting.

      1. I very much enjoyed reading a manuscript that uses 'good old' forward genetics to make an interesting discovery!

      2. The paper is well written and very easy to follow. The data quality and their display in the figures are very convincing, too.

      3. The proposed mechanism of using enhanced neuropeptide secretions for compensating the loss of one sensory modality with an increase of function of another is novel and could indeed be conserved.

      We are grateful to the reviewer for the encouraging review of our work.

      Reviewer #3 (Public Review):

      The work by Valperga and de Bono aims to uncover molecular components of cross-modal plasticity, a system-wide form of neuronal remodeling that responds to sensory loss by altering the performance of remaining sensory modalities. The study focuses on the interplay between oxygen-sensing and pheromone detection in C. elegans. The data presented are mostly convincing and revealing. However, the message and the overall context within which the findings are framed are problematic.

      The authors rightly assert that the molecular processes underlying cross-modal plasticity are not fully understood. However, they emphasize that the important challenge is to reveal genetic lesions that result in sensory loss and drive cross-modal plasticity. I find this to be over-specific and imprecise. There are many possible causes for sensory loss, some are genetic, some are non-genetic (e.g., certain diseases and injuries). In any case, the causes for sensory loss are usually independent of the processes that give rise to cross-modal plasticity. The genetics behind cross-modal plasticity enables the response to sensory loss, it does not cause the sensory loss. Genetic lesions to genes involved in cross-modal plasticity disrupt cross-modal plasticity, they don't induce it. Curiously, the authors sought to find single genes whose removal is simultaneously associated with both the loss of a sensory modality and the enhancement of another. This was done using a forward genetic screen for C. elegans mutants displaying enhanced oxygen sensation.

      We thank our reviewer for her/his thoughtful comments. We have revised our introduction to take account of her/his comments, and to remove the misleading statements s/he highlights.

      The analysis was further complicated by the fact that the screen was performed on strains whose oxygen sensitivity is already modified due to dysregulated activity in the RMG hub-and-spoke neural circuit, which integrates diverse sensory signals to control locomotion. Mutagenesis was performed on either the N2 strain, exhibiting RMG suppression, and thus decreased oxygen sensitivity, or flp-21 mutants, displaying excessive RMG activation, and increased oxygen sensitivity.

      We chose two genetic backgrounds for our mutant screens that attenuate the output of the RMG hub interneurons. Both backgrounds include a gain-of-function allele of the neuropeptide receptor NPR-1 that inhibits RMG output. The NPR-1 receptor has multiple peptide ligands, so in the second screen we reduced NPR-1 inhibitory signalling by deleting one these ligands, FLP-21. Neither of the two strains we used, N2 or flp-21, show appreciable O2 responses on food, and do not aggregate or accumulate on thicker parts of the food lawn, facilitating our screen (See Figure 1B).

      The screen yielded a gene, qui-1, whose dysfunction led to enhanced oxygen sensing (it is unclear if this is in the N2 or flp-21 background). The authors found that increased neuropeptide release from the pheromone-sensing neuron ADL underlies the increase in oxygen sensitivity. Furthermore, the qui-1 mutation was shown to diminish ADL pheromone responses. Therefore, a very particular genetic coupling between loss of pheromone sensation and enhanced oxygen sensitivity was revealed.

      We have indicated the parental origin of the qui-1 mutant in the revised manuscript.

      To generalize this finding, several additional mutant genes (not from the screen) were examined, including genes from the BBS family as well as wrt-6 and fig-1. They too displayed enhanced oxygen sensing linked to increased ADL neuropeptide secretion. However, their effects on ADL pheromone sensation were not reported. The main conclusion I draw from these findings is that the ADL neurons are able to modulate oxygen sensitivity by relaying information about oxygen levels from the RMG circuit to locomotor circuits via neuropeptide secretion. It is not at all clear that loss of pheromone sensation in the qui-1 case is the cause for increased neuropeptide release, or whether it is just one out of the many outcomes of mutating this gene. A much cleaner and more revealing experiment could have been, for example, to examine worms lacking the functional pheromone receptor OCR-2 in ADL. In fact, unlike qui-1 mutants who showed diminished oxygen responses in ADL, previous work from the de Bono group (Fenk and de Bono 2017) demonstrated that ADL O2 response are normal in ocr-2 mutants, indicating a profound difference between loss of pheromone sensitivity due to receptor dysfunction (ocr-2) and the unknown and broad effects of qui-1.

      We thanks the reviewer for this important suggestion. We have sought to test our model with a functional experiment that selectively disrupts sensory input into the ADL neurons. To achieve this, we decided to knock down a protein required for intraflagellar transport, OSM-6, rather than the OCR-2 TRP channel subunit. OCR-2 mediates not only pheromone responses in ADL, but also O2-escape behavior (de Bono et al., 2002). This may reflect a broader role for OCR-2 in ADL than sensory transduction. Disrupting OSM-6 truncates sensory cilia and severely compromises many chemosensory responses, but only weakly reduces aggregation and O2 responses.

      To target OSM-6 degradation specifically to the ADL neurons we knocked in DNA encoding an Auxin Inducible Degron (AID) into the osm-6 locus, and expressed TIR1 in ADL to achieve cell-specificity. TIR1 is required for AID. We have added the new data to Figure 4F–G and Figure 4 – figure supplement 2. We show that expressing TIR1 in ADL disrupts OSM-6::AID function both in the presence and absence of Auxin. This agrees with recent work that tested the efficiency and specificity of the AID system (Hills-Muckey et al., 2021). A partial OSM-6::AID reduction in ADL recapitulates many of the phenotypes of qui-1 mutants, including increased neurosecretion from ADL, heightened ADL responses to O2 inputs and a small but significant enhancement of the O2-escape response. We think these new data support our interpretation that a change in ADL’s sensory properties leads to heightened response of ADL neurons to O2 inputs, a phenotype observed in qui-1 and multiple other sensory defective mutants and a hallmark of cross-modal plasticity. However, the effects of knocking down osm-6 on ADL function also appear to be complex, as the stronger osm-6 knockdown achieved by adding auxin to the osm-6::AID knockin animals expressing TIR1 in ADL, unexpectedly gives weaker phenotypes than when auxin is absent.

      In fact, it would be interesting if the authors could explain or speculate how qui-1 eliminates ADL O2 responses, and how neuropeptide signaling from the RMG circuit via the NPR-22 neuropeptide receptor bypasses this lack of response and drives enhanced neuropeptide secretion in ADL, as they report.

      We can only speculate why O2-evoked responses in ADL disappear in qui-1 mutants. One possibility is that ADL becomes less excitable due to the reconfigured gene expression associated with loss of qui-1 in ADL. This model would predict that selectively knocking down qui-1 in ADL would confer the same Ca2+ response phenotype. Blocking ADL neurosecretion with TeTx in qui-1 mutants would test if the increased ADL neurosecretion we describe feeds back to reduce the O2-evoked Ca2+ response in ADL. An alternative hypothesis is that the effect of disrupting qui-1 is non-cell-autonomous, altering excitatory or inhibitory input to ADL from other qui-1 expressing neurons. We have not tested if neurosecretion from other qui-1-expressing neurons is altered in qui-1 mutants.

      Strikingly, while disrupting qui-1 leads to loss of a measurable O2-evoked Ca2+ response in ADL, these neurons display elevated O2-evoked neurosecretion in qui-1 mutants. This implies that some O2-evoked Ca2+ responses are retained in ADL’s axons in qui-1 mutants. It also suggests that other second messengers upregulate neurosecretion. Elevating cAMP, for example, can promote dense-core vesicle release more efficiently than increasing Ca2+ levels (Costa et al., 2017). Altered G-protein coupled receptor signalling could lead to elevated cAMP levels and increased neurosecretion in qui-1 mutants. It is worth noting that in N2 controls, ADL does not display O2-evoked neurosecretion despite showing measurable Ca2+ responses.

      The work includes a transcriptomic analysis comparing ADL-specific gene expression between wild type and the qui-1 mutant. Unlike other experiments in the study, in which the specific effects of mutations were confirmed through rescue experiments and the use of additional alleles, thus eliminating potential confounds with background mutations, the transcriptomic experiment did not apply such controls. Therefore, it is hard to conclude whether the reported changes in transcription are due solely to the qui-1 mutation or to other unrelated genetic modifications in the mutant strain.

      We worried about unspecific effects of background mutations both on the ADL transcriptome and on other qui-1 related phenotypes. We regret we did not explicitly address this point in our initial submission. To remove background mutations, mutants isolated in our screen, including qui-1, were backcrossed with the N2 laboratory strain a minimum of four times. These qui-1 animals were further crossed into a 5 times outcrossed line that expresses the fluorescent protein mKate specifically in ADL, to generate the strains from which we sorted ADL neurons by FACS. Mutant and transgenic strains were outcrossed using the N2 laboratory strain. We explain this in the Methods section of the revised manuscript.

      The extensive outcrossing make us confident that the large majority of differentially regulated genes between wild type and qui-1 samples in ADL are due to the absence of qui-1. Supporting this, both mutations in neuropeptide receptors identified by our profiling, npr-22 and tkr-1, suppress ADL’s elevated neurosecretion. Nevertheless, we have added a note to explicitly bring up the concern raised by our reviewers, that some transcriptional differences could be the result of background mutations.

      Overall, except for where mentioned, the data presented are solid and consistent. However, the conclusion that the study reveals a molecular pathway for cross-modal plasticity is less convincing. The chain of events does not include some form of sensory loss, leading to subsequent, independent neural plasticity, as expected for cross-modal plasticity. Rather, a very broad genetic switch is described that can simultaneously change receptor abundance and neuropeptide release. Thus, an equally interesting and more coherent framing of the data could be that the study uncovered a genetic regulator, yet to be fully characterized, of oxygen-dependent behavior in a non-oxygen sensing neuron, adding to previous literature on neural circuit cross-talk.

      We are grateful to the reviewer for her/his thorough and critical analysis of our work, which has prompted us to perform additional experiments and helped us revise our manuscript. These additional data clarify our final interpretation of the data regarding cross-modal plasticity.

    1. SciScore for 10.1101/2021.12.23.21268325: (What is this?)

      Please note, not all rigor criteria are appropriate for all manuscripts.

      Table 1: Rigor

      <table><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Ethics</td><td style="min-width:100px;border-bottom:1px solid lightgray">Field Sample Permit: Data collection and analysis: Interviews were conducted via Microsoft Teams, Zoom, or phone.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Sex as a biological variable</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Randomization</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Blinding</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Power Analysis</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr></table>

      Table 2: Resources

      <table><tr><th style="min-width:100px;text-align:center; padding-top:4px;" colspan="2">Software and Algorithms</th></tr><tr><td style="min-width:100px;text=align:center">Sentences</td><td style="min-width:100px;text-align:center">Resources</td></tr><tr><td style="min-width:100px;vertical-align:top;border-bottom:1px solid lightgray">Interviews were recorded with consent, transcribed, anonymised and entered into Nvivo v12.</td><td style="min-width:100px;border-bottom:1px solid lightgray"><div style="margin-bottom:8px"><div>Nvivo</div><div>suggested: (NVivo, RRID:SCR_014802)</div></div></td></tr></table>

      Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


      Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
      Limitations: Despite best efforts, our recruitment strategy may have missed relevant voices, including those who are not computer literate. Our data must be interpreted with this in mind. Conclusions: In conclusion, the majority of young people described and concluded that for themselves, the benefits of vaccination did not outweigh the perceived risks to themselves. They did not consider themselves to be at risk of becoming seriously ill from COVID-19 and did not think that the vaccination was capable of protecting those around them. This, combined with concerns about the safety of the vaccine, resulted in reluctance to be vaccinated at present. Perceptions of risks and benefits were influenced by participants’ age and health status, trust in government, understanding of science, and pre-existing ideas and expectations. Participants were unsure who they could and could not trust and were resistant to attempts that were viewed as coercive. In order to promote uptake, public health campaigns should focus on the provision of information from trusted sources that carefully explains the benefits of vaccination and addresses safety concerns more effectively. To overcome inertia in people with low levels of motivation to be vaccinated appointments must be easily accessible (both in terms of location and timing). Research now needs to identify how to communicate risks (from COVID-19 and vaccination) and benefits (for the individual and population) so that people can make informed pe...

      Results from TrialIdentifier: No clinical trial numbers were referenced.


      Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


      Results from JetFighter: We did not find any issues relating to colormaps.


      Results from rtransparent:
      • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
      • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
      • No protocol registration statement was detected.

      Results from scite Reference Check: We found no unreliable references.


      <footer>

      About SciScore

      SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.

      </footer>

    1. Here's the prompt for the final assignment.

      The final piece of writing is a critical reflection about your work that uses the ideas and texts in the course to support, and highlight, your ideas.

      This piece will also be published in SCALAR for future use; that is, the final piece should say something about how you've encountered the material in the course so as to instruct future users/readers. Your writing—your readings of the texts; your telling of your experience with ideas—will be great guides for students.

      Think broadly, first. Ruminate on the following and record it for yourselves somewhere (I use a black notebook for this and write longhand; no technology, other than a Sharpie, for thinking—that works best for me):

      Explain and describe what the texts help you see and understand, even if this means further confusion, or the creation of more questions that are yet unanswerable for you. Here, begin to not rely on invisibility and slow violence; think beyond these very large themes.

      How have these texts helped you delve deeper into the questions of environmental justice—and expanding the meaning, on confusing it, making it, perhaps, more nuanced?

      How does environmental justice affect your view of what you need to examine about your goals going forward?

      This preamble, the questions, should be seen as lofty goals to guide your thinking and your writing. More specific "how-to" is below.

      The GOAL of this piece is to describe, discuss, and even argue ideas that will flow smoothly into an answer to the following question: Knowing what you know now, how are you going to approach the rest of your life?

      This question doesn't have a definitive answer; it's about perspective, point of view, attitude. It's also about responsibility — the ability to respond: where will you find the space(s) necessary to give yourself enough time to respond to what comes your way—unexpectedly? how do you do this? how will your education help, with examples, referring to texts, in this course and others you've encountered, that have affected the way you think? after summarizing the texts in this course, which singular text, which one of these, is something you know you'll take with you, meaning you know that it has affected you and the ideas found in this text are important guides? how does this text help you see yourself better?

      I taught an FYS back in the fall of 2015 and the class had a similar assignment. Students asked that I too write the assignment and provide a model. You guys didn't take this course, but I'm sharing with you what I wrote (since published) (Links to an external site.) so that you can see a model for the work I'm asking you to do. Notice how I contextualize the texts, give the reading of each I need for my argument/description, and use this to describe a native characteristic of American culture. I'm not asking that you be as lengthy. I'm not expecting this sort of reading of American culture; these courses are different. I'm basically wanting to know what it is you see now that the course is over, or nearly over. This model (linked) is simply a sketch for you, and an outline that allows you to see a way into the work, and a way through—a way to organize.

      I want you to be creative about your approach. You can tell a story and use the texts, for instance. You can use your experiences as a way through this and use the texts. And so on ...

      I have created a SCALAR PAGE (Links to an external site.) for each of you.

      I want you to consider following these guidelines for writing:

      Go back to your mapping exercise: How did your plan turn out? Where are you now? [this is something we will have already spoken about in our f2f meetings, so you want to have notes from that]. This shouldn't be written, When I look at my mapping...or I said in my mapping that ... Rather, it should be something along the lines of, My writing interests in this course suggest (ideas + support from essays) ... or A central focus of my essays (or thinking) has been ... (examples)...or Engaging the texts in this course, I started to think about ... (examples) ... I thought ... and now I'm thinking that ... (examples)  — This section should be short (no more than a tight paragraph) and strategically placed.
      As preparation for writing, without looking at the texts, just referring to their titles for inspiration, see what you recall: write out a paragraph or so about each text encountered in the course—this is for your own use and a way to organize before writing; this is done to determine what you recall, which is important and what you should focus on because that's instinct talking; eventually open the texts to make sure you have examples for citing [any material from outside the syllabus you wish to cite is fine, too, especially since I've sent you a lot of reading from the popular media].
      Find a central idea or theme you want to explore. Set it down somewhere so you can see it and read it back to yourself. Yes, there will be the tendency to speak about invisibility and slow violence, I get that; however, these should be ideas that help illustrate a central idea or theme that's your very own and based on what your reading of the courses' texts tells you about the nature of society, as it is now, the challenges we face, and, definitely, where you're situated, in the texts and the challenges we face. Thus, invisibility and slow violence should not be the central ideas/themes of your work, rather instruments/conditions/truths you found along the way and you're using these to pry open a deeper, richer understanding of your relationship to these ideas and environmental justice.
      Write a draft and start sharing it with your group. Likewise, you want to make sure that you and I sit with your piece, letting me comment on it before it's due  (even numerous times) so that we have the piece you want. You definitely want to conference with me about your piece so I can help you get as deep as possible into the subject—in other words: making sure each of you writes something you're really moved by and proud of.
      Since this is on SCALAR, make sure you have relevant images, links, where appropriate and necessary, and any other media (clips, sound, etc), you may wish to insert creatively to lift your piece.
      
    1. Author Response

      Reviewer #1 (Public Review):

      There are very few studies on the spatial integration of color signals of V1 receptive fields, which is a striking gap in knowledge given the importance of color to primate vision and the powerfulness that spatial analysis of luminance contrast integration has proven for understanding how V1 works. This paper helps fill this major gap in knowledge. The main take home is that double opponent cells and simple cells are more likely to be linear in how they integrate signals across their receptive fields than a sample of non-double-opponent/non-simple cells. This conclusion is consistent with the limited data presently in the literature, and I wonder if further analysis of the rich dataset could uncover some deeper insights.

      We thank the reviewer for highlighting the gap in knowledge that our study helps to fill and for the excellent suggestions for ways to improve the manuscript. In response to both reviewers, we have conducted new analyses that uncover deeper insights into signal integration in V1. These new analyses have been incorporated into the revised manuscript.

      Reviewer #2 (Public Review):

      De and Horwitz deploy a focussed technique for testing the linearity of spatial summation for V1 neurons with spatial opponency, with the emphasis being on the properties of cells that encode chromatic information in a spatially opponent manner - so called double opponent cells. The technique isolates non-linearities of summation from non-linearities that occur after summation, by using an adaptive procedure to home in on stimulus contrasts in different color directions that produce a pre-defined criterion response. The authors conclude that many (but not all) double opponent cells embody linear spatial summation, and discuss implications for our understanding of the cortical circuitry that mediates color vision. The data appear carefully collected and generally well-analyzed. There are some points, elaborated in broad strokes below, where I think the paper would benefit from further elaboration of the data and its implications, and the paper would also benefit from some revisions to improve clarity.

      How are results affected by the cell classification criteria? The authors apply criteria to sort cells into four classes: simple, double opponent, NSNDO, and those not studied further. Response properties are then studied as a function of cell class. Criteria for classification include presence/absence of spatial opponency revealed by the pixel white noise measurements and the adequacy of a linear STA to describe the hyperpixel white noise data. I think more work is needed to clarify for the reader the extent to which these criteria, in and of themselves, affect the results for each class studied. In particular, if a linear STA describes the hyperpixel white noise data, shouldn't we then expect to find linear summation in the spatial receptive field in that same hyperpixel white noise data? I understand, as the authors point out, that the Phase 3 measurements could reveal failures of spatial summation not seen in the hyperpixel white noise data. But I'm a bit perplexed by the outliers in the NLI indices in Figure 3D. What properties of these cells allow a linear 6D STA to handle the hyperpixel white noise data well, but cause them to summate over space non-linearly for that same hyperpixel white noise data? In terms of the new information provided by the Phase 3 measurements, I wasn't able to get a sense of how much harder these stimuli were driving the cells than the Phase 2 measurements. It seemed like this was the intent of Figure 2 - Figure Supplement 1 and Figure 3 - Figure Supplment 1, but those two figures in the end didn't provide this information in a manner I could digest. Absent this, it was hard to tell how much more we are learning from the Phase 3 data. Could the higher NLI's here than in Phase 2 be a consequence of some stimuli but not others driving the neuron into saturation? And although the authors write on page 15 "Nevertheless, we found that nonlinearities detected in Phase 2 of our experiment were a good indicator of nonlinearity over the greater stimulus duration and range of contrasts in Phase 3, principally for the NSNDO cells (Figure 3E)", those correlations look very weak to me. I was left hoping for a better understanding the commonalities and differences in the data between Phases 2 and 3. I'm also not sure of the reliability of the measured NLI's for each cell with each method. Can anything more be provided about that? I note here that I did study the section of the discussion that nominally addresses some if these issues, and that my comments above remain after that study.

      The Reviewer brings up several important points that are addressed individually below.

      The revised manuscript is more explicit about the role of the cell classification criteria on the results. Particular emphasis is placed on the role of the spike-triggered covariance criterion in enriching the pools of simple cells and DO cells with neurons that are approximately linear.

      We agree with the Reviewer that, if a linear STA describes the hyperpixel white noise data well, we expect to find linear summation in the spatial receptive field in that same hyperpixel white noise data analyzed in other ways. A critical question is “does the STA describe the white noise data well?”. We address this question in two ways in this report: with an analysis of (the statistical significance of) the first principal component of the hyperpixel spike-triggering stimuli (PC1) and with a comparison of GLM and GQM fits to the hyperpixel white noise data (the white noise NLI). These analyses are related but are sensitive to different types of departure from linearity.

      Consider a neuron whose output is the product of two half-wave rectified linear subunits (see Figure 2 – Figure Supplement 5). Such a neuron would have a large white noise NLI due to the non-linear interaction between the subfields, but it would lack a significant PC1, because the nonlinearity tightens the distribution of excitatory stimuli, and the PC1 is the dimension along which the stimulus distribution is widest. In principle, such a nonlinearity would manifest in the smallest principal component, but in practice, small PCs often resemble the STA, which complicates their interpretation.

      Conversely, a neuron can have a significant PC1 but a small NLI. For example, consider a neuron that has a half-wave rectified response to modulations of one color channel but a full-wave rectified response to another. Such a neuron will have a significant PC1 due to the full-wave rectification, but an NLI near zero, because this nonlinearity is hidden once the stimuli are projected onto the STA (recall that the white noise NLI is computed from a pair of 1-D projections not the original 6-D representation). Code simulating these hypothetical neurons (used to produce Figure 2 – Figure Supplement 5) is available at GitHub (https://github.com/horwitzlab/Chromatic_spatial_contrast).

      The original submission lacked documentation of the difference in firing rates produced during Phases 2 and 3. We have added a new supplementary figure that quantifies this difference (see Figure 2 – Figure Supplement 2). Figure 2 – Figure Supplement 1 illustrates the range of inputs provided in Phases 2 & 3. This has been clarified in the revised text.

      Please note that the data shown in Figure 3D are isoresponse NLIs (that is, NLIs computed from responses recorded during Phase 3 of the experiment) not white noise NLIs (NLIs computed from the hyperpixel white noise shown during Phase 2 of the experiment). This has been clarified in the revised text.

      We agree that the correlation between the white noise NLI and isoresponse NLI measurements is weak. A full treatment of the differences in neural responses to the stimuli presented in Phase 2 & 3 is beyond the scope of this study. Nevertheless, we can think of several reasons that some neurons may have appeared more nonlinear in Phase 3 than they did in Phase 2. The first is, as suggested above, Phase 3 stimuli had higher contrast than Phase 2 stimuli, and are more likely to have engaged nonlinear gain control mechanisms upstream or within V1. Second, the linear and nonlinear models in Phase 2 had 3 and 6 parameters, respectively, but 2 and 5 in Phase 3, and this may affect the ratio of prediction errors. Third, nonstationary responses are expected to affect isoresponse NLIs more severely than white noise NLIs, because of the sequential way that isoresponse points were measured in Phase 3.

      Assessing the reliability of NLIs within cells is challenging because of the crossvalidation that is built into the definition. To address this comment, we used a jackknife procedure that quantifies the spread of NLIs computed from each of the data partitions used in the cross-validation.

      Implications of the results for models. As the authors summarize in their introduction, the motivation for testing the linearity of spatial summation is that the results can guide how we formulate response models for V1 chromatically sensitive cells. More discussion of this would be helpful. As an example, could cells with the nonlinear spatial filtering as shown in Figure 1C be classified as DO, making them relevant to the focussed tests applied in this paper? Or are they necessarily NSNDO? More generally, can the authors spend a little time discussing what classes of response models they would pursue for DO cells that do/don't show linear spatial summation, and for NSNDO cells that do/don't show linear spatial summation. Such discussion would tie the results of the primary data back to the motivating question in a more satisfactory manner, I think. Such discussion could also be used as a vehicle to discuss what the authors think about the DO cells that fail to show linear spatial summation and the NSNDO cells that do, something I found under-treated in the results. As with the comment above, I did read the sections of the paper that speak to this question, but still find it that it would benefit from going deeper.

      Inspired by this comment, we have added a new section to the Results that considers response models for neurons that do not show linear spatial summation. Specifically, we test the model illustrated in Figure 1C and reject it for many neurons. Figure 1C depicts a neuron that integrates inputs linearly within each subfield but nonlinearly across subfields. Within each RF subfield, therefore, this neuron conforms to a linear-nonlinear cascade model. Critically, during Phase 2 of the experiment, the stimulation at one RF subfield can be considered as an additive noise with respect to the signal generated by the other RF subfield. This is because the influences of the two RF subfields combine additively (under the model) and the modulations of the two hyperpixels are independent.

      To test this model, we compared GLM and GQM fits as we did in the analysis of the white noise NLI. The regressors in this analysis were the modulations of the three color channels from a single subfield. These GLMs fit the data systematically worse than GQMs as assessed by cross-validated prediction error. This result indicates that the nonlinearity of the NSNDO cells is unlikely to be a result of nonlinear combination of inputs from two linear RF subfields, as postulated by the model in Figure 1C. Instead, for many NSNDO neurons the nonlinearity appears to arise from nonlinear combinations of signals within individual subfields. We mention in the Discussion that linear DO cells may lie on a continuum with some NSNDO cells.

      Color properties of subfields. The study measures detailed properties of cells that show at least two distinct subfields in the initial pixel white noise analysis. The paper focuses on whether signals from such subfields are combined linearly before any downstream linearities. However, there is another feature of the data that seems central to understanding these cells, and that is what the chromatic properties of these subfields are, and how strong in the data the constraint that the chromatic properties of the two separate subfields be complementary is. It is stated in passing (page 7) that "the two sides of the hyper pixel STA were complementary or nearly so", but it would be nice to see this treated in more detail and also to understand whether there are differences in the distribution of the chromatic properties of the two sides between the DO and NSNDO cells, and between cells with low and high non-linearity indices.

      We have added new section on the chromatic properties of the subfields of the neurons we studied (Figure 2 – Figure Supplement 3).

    1. Author Response

      Reviewer #1 (Public Review):

      Summary:

      In this work, the authors develop a tool for personalising prostate cancer treatment using a Boolean model. The model is extremely complex and describes the regulation of invasion, migration, cell cycle, apoptosis, androgen and growth factors signalling in prostate cancer using 133 nodes (genes and our metrics) and 449 edges (regulation pathways. Using their model, they were able to grade the effect of combined treatments for each of the 488 patients for already-developed drugs and find several genes suitable for intervention in most of the 488 patients. The predications from their model could help develop a patient-tailored treatment that could boost success of pancreatic cancer treatments in clinical practice.

      Strengths:

      The authors clearly achieved their aims of predicative prostate cancer modelling and have added value to the field of prostate cancer personalisation.

      Calibrating and then validating predications of a model, as this work does, is a fundamental part of systems biology and mathematical modelling. By using a cell line to investigate predictions that AKT is the top hit for prostate cancer, validates the utility of their model and also shines a light on how useful models like this can be in oncology. The methodology in this paper provides a guide for future modelling work in this area.

      Providing a detailed Supplementary Information and additional links to the code and fundamental modelling platform publications, helps to provide readers with a tool that may be applied in other settings. However, while this is a strength of the publication, the model is extremely complex and relies heavily on readers spending time comprehending pre-published work and doesn't provide a single contained body of work.

      The methodology they are presenting could have significant impact on the field of cancer treatment, but would need to be testing clinically to validate that personalising treatment in this manner does improve outcomes.

      We thank the reviewer for these comments.

      Weaknesses:

      While it is a strength of this work that such a detailed, and complex model is developed for prostate cancer, and that the code is provided, the weakness of this work is that the model is not easily accessible, and a lot of the techniques used in model development feel brushed over. The work relies heavily on other works and does not provide detailed descriptions of the underlying algorithm, requiring readers to absorb knowledge from our places. This could be a challenge if an experimentalist wishing to implement this methodology in a different cancer treatment.

      We have summarised the main techniques on which this work relies upon in a dedicated section in the Supplementary Material (Appendix file) by describing small introductions to Boolean modelling, MaBoSS stochastic approach to Boolean models, and PROFILE methods.

      We have also provided the codes to reproduce the figures and the analyses. We tried to comment on the code files (e.g., Jupyter notebook) as much as possible to facilitate their use in different contexts.

      The protein/genes in the model are not presented in a way that it can be easily validated as such, the complexity of such a Boolean model comes into question.

      We have listed all the proteins/genes of the model in SuppFile 1 with references for all the interactions of the network.

      For transparency, we have also described in the Appendix how we used information from all the different sources to construct the model in the section "Prior knowledge network construction".

      How sure are we in the model predications and are there are any potential weaknesses to modelling the network in such an extensive manner? For such a model like this, it is crucial to demonstrate its sensitivity to initial conditions and node additions/removals so some work could be done to demonstrate this so that the readers have an idea of how many over/under predications there might be in the model.

      For the sensitivity to initial conditions, we have tested some of them on the generic model in the Jupyter notebook (provided in the supplementary files) but have not done it systematically. The table of all the stable states can be computed exactly as it is done in the notebook (2460 fixpoints are found), and the simulations of MaBoSS clearly show that the proportions of some solutions (probability of model states) change depending on which input is ON. We have tested some conditions: all inputs random, all inputs at 0, growth factors ON (EGF, FGF, Androgen, and Nutrients ON), death signals ON (Carcinogen, Androgen, Acidosis, Hypoxia and TNFalpha ON) leading to very different outputs (Figure 3 for LNCaP and S22 for all 8 prostate cell lines). In fact, the MaBoSS simulation with all inputs random shows the existence of all possible, stable states as it explores the whole state transition graph: for all nodes, 50% of the trajectories will start at 0, and 50% will start at 1. Similarly, we tested the effect of some mutations on the generic model (e.g. mutation of p53, which reduces the probability to reach apoptosis). The aim of these simulations was to test the overall coherence in the model behaviour vs biological evidence as a first validation.

      As for automatic removal and addition of new nodes to assess the importance of each of them, we would recommend against it. Indeed, the model was built from the knowledge extracted from the literature, from databases (cf. Omnipath), discussions with experts, and results from data analyses. Removing nodes would mean that some nodes are considered less important, and adding new nodes would mean that some new findings were found that would justify a new addition.

      In addition, in this work, we need to balance the robustness of the model with the flexibility of being used to cover the different cell line personalisations. Thus, we do not want a highly robust wild type model that has extremely robust, few stable states but is unable to capture the different cell lines specifics. Nevertheless, we have partially covered this with our "High-Throughput mutant analysis of the LNCaP model" section in Appendix file (Section 6.1), where we study all the perturbations on one node and combinations of two nodes, let them be knock-outs (where a node is forced to be 0 throughout the simulation) or overexpression (forced to be 1). By using this analysis, we wanted to identify the fragility points of the mutants' models, but we did not perform this test to have a thorough robustness analysis. In any case, we found varying effects of these perturbations on the phenotype scores, and double perturbations having a greater effect than single ones.

      Finally, we have performed a perturbation on the stability of the logical rules. We have changed one and two logical gates from each logical rule of the LNCaP model and studied the effects on the phenotype scores. In short, we have changed an AND in OR and vice versa in each logical rule (level 1 with 372 simulations) or twice in the same rule (level 2 with 1263 simulations).

      Overall, we see that all of the most probable phenotypes are very robust to this kind of perturbation. Even the less stable phenotype, Invasion-Migration-Proliferation, only has ~3% of either level 1 or 2 perturbations that reduce this phenotype's probability to zero (Appendix File, Figure S30). Most of these perturbations were focused on HIF1, AR_ERG and p53 nodes (Appendix File, Figure S31).

      We added a sentence in the Methods section to explain this: "In addition, we found that the LNCaP model is very robust against perturbations of its logical rules, by systematically changing an AND for an OR gate or vice versa in all of its logical rules (Appendix File, Section 6.2, Figure S30 and S31)." and added Section 6.2 to the Appendix file titled "Robustness analysis of the logical model".

      As they test so many drugs and combination regimes it is also hard to extract information about which key drugs should be repurposed. It could be useful to the readers to have this spotlighted more in the model so that it is easily discernable.

      The complete study on the inhibition of all nodes of the LNCaP model can be found in the supplemental information (SuppFile 6 and Appendix file, Section "High-Throughput mutant analysis of the LNCaP model").

      Because of the size of the model, we chose to filter the full list of nodes with the list of existing drugs and their targets. Thus, Table 1 gathers the drugs we discuss in this article along with the node that they target. We also studied a selection of combinations of drugs, as depicted in Section "Experimental validation of drugs in LNCaP" of Results. In that section, we focused on the combinations that reduced Proliferation and/or increased Apoptosis. For completeness sake, we provide all the combinations of all the drugs from Table 1 in Appendix File, Figures S34 and S35, and their Bliss score in Appendix File, Figures S36 and S37. Furthermore, the code to reproduce these in our GitHub repository: https://github.com/ArnauMontagud/PROFILE_v2/blob/main/Gradient%20inhibition%20of%20nodes/data_analysis.R

      We could have identified the nodes from Table 1 on the figure of the network (main text Figure 1), but we decided against it because the figure is already hard to read, and colours were added to specify the signalling pathways that are included.

      Suggestions:

      Another way to validate the cohort level predications could have been to examine the efficacy of the predicted personalised protocols, or sensitive parts of the Boolean network, in a new prostate cancer patient cohort. Do we see the same sensitive pathways if we examine a different cohort of prostate cancer patients?

      We thank the reviewer for this suggestion. Indeed we are working on using this pipeline in other cancers and in other studies.

      One of the topics that we think can facilitate the use of this methodology is on optimising its runtime and portability. Thus, we are currently working on having a containerised, HPC semi-automatic workflow to reduce the time and optimise the efforts to get results using (almost) any published model and (almost) any omics data.

      In terms of the reproducibility of the results and as we say in the discussion section of the main text, there is a kind of effect size on this type of study. You may find that for a specific patient, their conclusions are not in line with what is expected, but when you analyse at the level of groups of patients, these outliers dampen off.

      Reviewer #2 (Public Review):

      Montagud et al. present a very successful experiment - modeling feedback loop: the authors develop a Boolean model of the major signaling pathways deregulated in prostate cancer, use molecular data from patient samples to personalise this model, use drug response of cell lines to validate the model, predict 15 actionable interventions based on the model, and test nine of these interventions, confirming four.

      The premise of the work is well-supported by prior work by the team and the wider community. The methods are sound, well integrated and thoroughly documented, with one notable omission. The process through which the logic functions of the nodes were determined/decided is not described. The Appendix file indicates "The model is completed by logical rules (or functions), which assign a target value to each node for each regulator level combination.". The interested reader would want to know what information is used and what considerations are the basis of these assignments, and what would change if an assignment were different.

      The manuscript makes a number of testable predictions of actionable single and combinatorial therapeutic interventions for prostate cancer. Equally important, the combination of information and methodologies used in this paper offers a roadmap for future development of predictive and personalised models. Such models are much needed in precision oncology.

      We thank the reviewer for these encouraging comments.

      Reviewer #3 (Public Review):

      This paper tries to establish a model for drug (and combination) selection for individual prostate cancer patient based on a prior signal network knowledge base and genomic/transcriptomic profiling data. This is of great clinical potential. However, whether this approach could be robustly applied in clinic is not validated. Limited validation using cell line is provided. Most tumors have complex structure including tumor cells and surrounding microenvironment. The model is mainly built from onco-signaling pathways. The contribution of microenvironment including immunity is unclear.

      The focus of this model is intracellular only. We explored the interplay between signaling pathways that may be linked to tumorigenesis. We only consider the microenvironment effect as indirect and in no way comprehensive. For instance, we have not considered any immune cells or the effect of the metabolism.

      Nevertheless, we are building on top of this work a multiscale model where we can include different cell types, such as immune cells, and drug-related pharmacodynamics.

    1. Author Response:

      Reviewer #2:

      In this study, Russell et al. combined T cell receptor (TCR) repertoire sequencing data with SNP array genotype data to infer genetic polymorphisms which impact upon the process of TCR generation. Using these data, the authors looked for loci with polymorphisms which associate with different V(D)J recombination probabilities, i.e. sites in the genome which impact upon the chances of TCRs with different properties being produced when they change.

      Beyond the expected sites in the TCR and MHC loci, the authors observed strong associations with distant sites. One was with DCLRE1C, which encodes Artemis, the endonuclease responsible for cutting the TCR loci during recombination, while the second was DNTT, the site encoding the enzyme Tdt, which is responsible for addition of nucleotides to cut V(D)J during recombination. This is the first time that such SNP associations have been described to my knowledge, and yet make perfect sense: DCLRE1C variations were associated with the amount of trimming V and J genes underwent during recombination, while DNTT polymorphisms associated with the number of inserted nucleotides. The authors also report, after assigning donors an associated ancestry based on clustering of their genotype data, that certain inferred ancestries associate with different TCR repertoire properties. In this analysis 'Asian-associated' TCR repertoires had fewer non-templated nucleotide insertions, along with a corresponding greater incidence of the DNTT polymorphisms associated with differences in insertions, relative to other groups.

      Strengths:

      This manuscript is exceedingly well written. Both the TCR biology and the statistical considerations of the genetic analyses are extremely complex topics, mired in arcane terminology, which often end up somewhat impenetrable to non-expert readers. However both have been introduced and explained with admirable clarity throughout, including the caveats and implications of analyses that would not be intuitive to many readers not already expert in both fields.

      As best I can determine, the analyses themselves are also extremely rigorous, with each step carefully taken and justified in the text, involving numerous corrections at multiple scales (e.g. for TCR productivity, TCR gene usage, specific TRDB2 genotype, population substructure, and more). The major findings have also been validated in a completely separate cohort, using a different analysis pipeline. While the authors point out that such genome-wide association efforts looking at TCR gene expression have been undertaken before, the major innovation presented here lies in applying those data to investigating specific V(D)J recombination probabilities. Thus the findings are novel, and the conclusions well supported by the data.

      The data visualisation have all been plotted in a sensible and easily interpretable manner. The majority of data themselves are all already publicly available, having been published in prior studies. The TCRseq data for the validation code has been assigned a BioProject accession, which I presume will go live at the time of publication. The code is also appropriately hosted on Github, and are mostly adequately commented and documented enough so as to be repeatable.

      Thank you!

      Limitations:

      There are very few if any obvious technical limitations or weaknesses that I can see that are not intrinsic to the data themselves. While the authors do mention these limitations, I wonder if they should be devoted some more attention somewhere in the text of the manuscript; relatively few researchers are expert in both TCR biology and the technicalities of genome-wide association studies, so I think more explicit consideration of these issues would be helpful.

      We have expanded the section containing limitations of our approach within the discussion section. We hope this addition clarifies the intrinsic limitations of the data used here.

      In particular, I think the difficulty of studying these loci with standard techniques could be underlined, along with what implications that might have for this study. The highly repetitive nature of the TCR loci can certainly make any analysis looking at short sequences problematic, which has implications for both the TCRseq and genotyping aspects of this study. Combined with the fact that most studies focus on certain populations, polymorphisms in the TCR loci are very likely being relatively undersampled by the field (a hypothesis supported by the ongoing discovery of novel exonic polymorphisms in TCRseq data itself, e.g. as demonstrated in this pre-print by Omer et al. (https://doi.org/10.1101/2021.05.17.444409). The consequences of SNP polymorphism coverage in SNP arrays has already been considered for IgH (https://doi.org/10.1038/gene.2012.12): while this is an admittedly more polymorphic locus, the underlying causes of these issues are mostly all true of the TCR loci as well. Similarly, while the authors do appropriately point out that issues with V(D)J gene assignment could infer biases it may be worth noting that the TCRseq technology used to produce their main dataset uses relatively short read sequencing, that is unable to distinguish a substantial fraction of even known TCR gene- and allele-level diversity (see Fig. 1C of the Omer et al. pre-print). Thus there may be a whole dimension of TCR polymorphism that is not well captured by either platform.

      This is a great suggestion and we have added a section within the discussion to mention these limitations and their implications for both the SNP array and TCR repertoire sequencing data used here.

      Overall, I think this is an extremely considered and digestible study, which will be of great interest across and beyond the field. As the wider community comes to grips with how best to incorporate TCR and BCR polymorphisms into their analyses of the adaptive immune loci themselves (and how this might impact upon recombination, expression, and downstream immune functions) this serves as a timely reminder that we should not forget the polymorphisms elsewhere in the genome that might also be relevant.

      Thank you!

      Reviewer #3:

      In this manuscript, Russel et al propose an inference method to link genetic variations with TCR repertoire feature variations, based on observations from previous studies showing similarities at various level of the repertoire in monozygotic twins. To that end, they used a unique publically available dataset, which combines TCRb immunosequencing data as well as whole genome SNPs data. The method is elegant and sheds light on the importance of combining different type of data to better understand the complexity of TCR repertoire generation and selection. However, unfortunately, while their discovery data set provide some associations between SNPs and TCR repertoire features, they were almost unable to recapitulate the results with their validation dataset. The main reasons could be that the donor demographics are highly divergent between the two cohorts (81% Caucasian in the discovery vs. mainly Hispanic in the validation), the immunosequencing data were generated using RNA based method for the validation while the discovery dataset was obtained from gDNA templates and finally the SNPs array were discordant between the two datasets. Nonetheless, the approach and the study deserve attention and might be improved by additional experiments or analyses and by providing additional information.

      Thank you for your review. We would like to emphasize that the validation results reported here are as good as one might expect given the small sample size of the validation cohort (94 individuals) and the discordance between the discovery and validation SNP sets. The overlap between the discovery cohort and the validation cohort SNP sets consisted of just two significant SNPs, one within the gene encoding the Artemis protein (DCLRE1C) and the other within the gene encoding the TdT protein (DNTT). This DCLRE1C SNP (rs12768894, c.728A>G) was strongly associated with the extent of V-gene and J-gene trimming in the discovery cohort, and we were able to successfully validate this finding within the validation cohort. Specifically, this DCLRE1C SNP was significantly associated with the extent of J-gene trimming in productive TCRalpha and TCRbeta chains and V-gene trimming of both productive and non-productive TCRalpha and TCRbeta chains within the validation cohort. The overlapping SNP within the DNTT locus (rs3762093) was only weakly associated with the extent of N-insertion within the discovery cohort, and as such, it was not surprising that this SNP only reached statistical significance for one of the N-insertion types (productive TCRalpha rearrangements; note that due to the lack of the D gene, N-insertion annotations are likely less noisy on the TCRalpha locus). Despite our inability to replicate all N-insertion associations, we noted that the model coefficients for rs3762093 genotype were in the same direction (i.e., the minor allele was associated with fewer N-insertions) for all N-insertion and productivity types within the TCRbeta chains for both cohorts.

    2. Reviewer #2 (Public Review): 

      In this study, Russell et al. combined T cell receptor (TCR) repertoire sequencing data with SNP array genotype data to infer genetic polymorphisms which impact upon the process of TCR generation. Using these data, the authors looked for loci with polymorphisms which associate with different V(D)J recombination probabilities, i.e. sites in the genome which impact upon the chances of TCRs with different properties being produced when they change. 

      Beyond the expected sites in the TCR and MHC loci, the authors observed strong associations with distant sites. One was with DCLRE1C, which encodes Artemis, the endonuclease responsible for cutting the TCR loci during recombination, while the second was DNTT, the site encoding the enzyme Tdt, which is responsible for addition of nucleotides to cut V(D)J during recombination. This is the first time that such SNP associations have been described to my knowledge, and yet make perfect sense: DCLRE1C variations were associated with the amount of trimming V and J genes underwent during recombination, while DNTT polymorphisms associated with the number of inserted nucleotides. The authors also report, after assigning donors an associated ancestry based on clustering of their genotype data, that certain inferred ancestries associate with different TCR repertoire properties. In this analysis 'Asian-associated' TCR repertoires had fewer non-templated nucleotide insertions, along with a corresponding greater incidence of the DNTT polymorphisms associated with differences in insertions, relative to other groups. 

      Strengths: 

      This manuscript is exceedingly well written. Both the TCR biology and the statistical considerations of the genetic analyses are extremely complex topics, mired in arcane terminology, which often end up somewhat impenetrable to non-expert readers. However both have been introduced and explained with admirable clarity throughout, including the caveats and implications of analyses that would not be intuitive to many readers not already expert in both fields. 

      As best I can determine, the analyses themselves are also extremely rigorous, with each step carefully taken and justified in the text, involving numerous corrections at multiple scales (e.g. for TCR productivity, TCR gene usage, specific TRDB2 genotype, population substructure, and more). The major findings have also been validated in a completely separate cohort, using a different analysis pipeline. While the authors point out that such genome-wide association efforts looking at TCR gene expression have been undertaken before, the major innovation presented here lies in applying those data to investigating specific V(D)J recombination probabilities. Thus the findings are novel, and the conclusions well supported by the data. 

      The data visualisation have all been plotted in a sensible and easily interpretable manner. The majority of data themselves are all already publicly available, having been published in prior studies. The TCRseq data for the validation code has been assigned a BioProject accession, which I presume will go live at the time of publication. The code is also appropriately hosted on Github, and are mostly adequately commented and documented enough so as to be repeatable. 

      Limitations: 

      There are very few if any obvious technical limitations or weaknesses that I can see that are not intrinsic to the data themselves. While the authors do mention these limitations, I wonder if they should be devoted some more attention somewhere in the text of the manuscript; relatively few researchers are expert in both TCR biology and the technicalities of genome-wide association studies, so I think more explicit consideration of these issues would be helpful. 

      In particular, I think the difficulty of studying these loci with standard techniques could be underlined, along with what implications that might have for this study. The highly repetitive nature of the TCR loci can certainly make any analysis looking at short sequences problematic, which has implications for both the TCRseq and genotyping aspects of this study. Combined with the fact that most studies focus on certain populations, polymorphisms in the TCR loci are very likely being relatively undersampled by the field (a hypothesis supported by the ongoing discovery of novel exonic polymorphisms in TCRseq data itself, e.g. as demonstrated in this pre-print by Omer et al. https://doi.org/10.1101/2021.05.17.444409). The consequences of SNP polymorphism coverage in SNP arrays has already been considered for IgH (https://doi.org/10.1038/gene.2012.12): while this is an admittedly more polymorphic locus, the underlying causes of these issues are mostly all true of the TCR loci as well. Similarly, while the authors do appropriately point out that issues with V(D)J gene assignment could infer biases it may be worth noting that the TCRseq technology used to produce their main dataset uses relatively short read sequencing, that is unable to distinguish a substantial fraction of even known TCR gene- and allele-level diversity (see Fig. 1C of the Omer et al. pre-print). Thus there may be a whole dimension of TCR polymorphism that is not well captured by either platform. 

      Overall, I think this is an extremely considered and digestible study, which will be of great interest across and beyond the field. As the wider community comes to grips with how best to incorporate TCR and BCR polymorphisms into their analyses of the adaptive immune loci themselves (and how this might impact upon recombination, expression, and downstream immune functions) this serves as a timely reminder that we should not forget the polymorphisms elsewhere in the genome that might also be relevant.

    1. Author Response

      Reviewer #1 (Public Review):

      The recordings done by the authors are impressive and rare, and I appreciate the efforts of the authors to bridge very different types of signals that are generally recorded in different paradigms. However, the analysis at many places is quite nuanced and high-level, making it difficult to directly compare these findings with previous results. I think several additional analyses are needed to properly place these findings with previous results.

      1. Effects of attention in V4 generally start earlier (~100 ms). It is unclear why no effect is observed during earlier time periods in these data. To make better comparison with previous studies (such as Nandy et al., 2017), the authors should show the average PSTHs in supragranular, granular and infragranular layers during both target-out versus target-in conditions. Interestingly, Nandy and colleagues found largest changes in firing rates in the granular layer. To better understand the ERP outside the cortex, the authors should also show the average LFPs in the three layers, for target-in and target-out conditions. It is surprising that MI analysis reveals no significant information about the target in granular layer - given that some attentional effects are seen in upstream areas such as V1 and V2.

      We have created a new figure showing multiunit activity and LFP across the layers in both attention conditions. It is included here for convenience. Accompanying text has been added to the Results and Discussion sections to address the reviewers’ comments.

      The timing of differentiation between attended and unattended in the population spiking activity is evident in both MUA and LFP. We note that the largest magnitude difference in population spiking between attention conditions was observed in the middle layers, consistent with Nandy et al., 2017. We wish to highlight two observations.

      First, with respect to the timing of attentional modulation, it should be noted that the attention task used in our study (pop-out visual search) is different from that used by Nandy et al., 2017, Neuron (cued change detection). The timing of “effects of attention” vary according to stimulus properties and task demands (the number of publications demonstrating this is too long to list). Hence, we do not expect equivalence between the times we measure and times Nandy et al. measure. Nonetheless we are happy to include the requested supplementary figure with that caveat in mind.

      Second, with respect to the surprising observation of a relationship between activity in the granular layer and the extracortical signal, we think it is important to remember that these information theoretic analyses are not simply correlational. That is, attentional modulation might be observed in both signals, but if the covariation of these signals trial-to-trial does not exist, then we would not expect a relationship in the mutual information analysis.

      1. Eye position analysis: my understanding is that the animals could make a saccade as soon as the arrays were displayed. Given that the main effect of attention is observed after ~150-200 ms, the potential effect of saccade preparation could be important. There could also be small eye movements before the saccade. Given that the RFs were quite fovial for one monkey and not too far from the fixation window, and the effect of attention appears to be quite late, detailed analysis of eye position and microsaccades is needed to rule out the possibility of differences in eye movements between target in and target-out conditions influencing the results. A timeline and some analysis of eye movement patterns would be appropriate. The authors should also clearly mention the mean and SD of the saccade onset.

      The reviewer makes a valuable observation. Saccades will influence the electrical signals, something we are quite familiar with (e.g., Godlove et al., 2011, J Neurophysiol). In an effort to combat this, we have two points worth noting. First, as was the case in the initial submission (which remains the same in the revision), we have clipped signals on a trial-by-trial basis prior to eye movements. By doing so, we cannot have an influence of the motor-related polarization of the task-demanded eye movement on the data.

      Second, we have prepared a microsaccade analysis – and accompanying newly added supplementary figure included here for convenience – to determine whether they might be driving the results. To do this, we identified trials where microsaccades occurred using a well-regarded microsaccade detection algorithm (Otero-Millan et al., 2014, J Vis). We then reperformed the information theoretic analysis across sessions after removing trials where microsaccades were detected. Briefly, we found that the information theoretic relationship persists in the absence of trials where microsaccades occurred. We believe this serves as evidence that microsaccades are not responsible for the information theoretic findings.

      To address the reviewer’s last point, we have included response time data (defined as the saccade onset latency) in the Results.

      1. Attention studies typically keep the stimulus in the RF the same to tease out the effect of attention from stimulus selectivity. Ideally, the comparison should be between the two green (or red) in RF conditions as shown in Figure 4A. However, these results are shown only after pooling across all color selective columns. This comparison should be shown from Figure 2 itself (i.e., Figure 2C should have green in the RF and red target outside).

      We have clarified prior to Figure 4 that we used a all trials including both colors in each of the attention conditions. That is, while the cartoon in Figure 2 shows only green-attended and red-unattended conditions, green-unattended and red-attended conditions were also included in this analysis. As the proportion of red-target and green-target trials was matched, this first analysis was designed in such a way that the influence of stimulus color should be minimized, yet all trials could still contribute to the calculation. We have included a new supplementary figure (included here for convenience) which is what we believe the reviewer requests. In this addition, we perform the information theoretic computation on only stimulus matched conditions. Briefly, we find that this approach does not seem to alter the temporal profile of information theoretic findings.

      1. Information has been well characterized in a large number of previous studies (generally yielding values between a few bits/s, see for example, Reich et. al, 2001, JNP). Here, the absolute value of mutual information seems rather low. This may be due to the way the information is computed. A discussion about these reasons would be useful for scientists interested in information-theoretic measures.

      We agree that the exact magnitude of our information theoretic analyses in curious. And while these methods have been widely characterized – they have not been characterized, to our knowledge, in relating intracortical laminar currents to extracortical field potentials. As such, we do not have a strong prior as to what we should expect magnitude-wise. We have expanded the discussion to note this observation and provide potential reasons as to why this might be the case. The conclusion being that further application of these methods to these datatypes is necessary to really gain a fuller sense of what should and shouldn’t be expected.

      1. Dependence on feature preference: The effect of spatial and feature attention is well studied. A multiplicative gain model of spatial attention would predict a larger increase in firing rates )and perhaps other signals such as CSD) for preferred versus non-preferred signals. Feature similarity gain model would predict the red preferring columns to increase their activity and green preferring columns to reduce their activity when the animal is attending to the feature red, irrespective of which stimulus is in the receptive field. Here, the task is a pop-out task which likely has both a spatial and feature attention component. The authors should discuss their findings in these contexts. Further, the authors should discuss whether their findings could just be a reflection of the magnitude of the change (which could be larger for preferred versus non-preferred stimulus). The information-theoretic measure should ideally not depend on the absolute magnitude, but these quantities often get biased in non-trivial ways based on the magnitude. Does information transmission depend on the magnitudes of firing rates/CSDs?

      The relationship of these findings to the specificities of attentional mechanisms and models is indeed intriguing. As the reviewer suggested, this task likely engages both spatial and feature attention – however, the design was not such that they can be disentangled wholly. We have added text to the Discussion to reflect this consideration. As for the potential influence of response magnitude changes on the information theoretic analyses – the exact parameters were chosen to mitigate concerns about magnitude. That is, we chose a uniform count binning procedure on the data which eliminates potential issues such as outliers driving relationships as well as the changes in variability associated with increases in magnitude. Moreover, the uniform count binning procedure results with states rather than magnitudes which again mitigates response-magnitude-driven effects.

      1. For columns that were not feature selective, is there an effect of attention? Does the magnitude of N2pc change depend on color selectivity? I think that should be the case based on Figure 4H and 4I, but a plot and/or some quantification would be useful.

      These questions have been addressed in a newly added supplementary figure as well as quantification in the Results. Briefly, we did find an effect of attention non-selective columns. Also, we found the magnitude of N2pc did not depend on color-selectivity of the intracortical recording. The results were reported as:

      “We also tested whether feature selective columns, on average, transmitted more information than their non-feature-selective counterparts. We found that feature selective columns, in all laminar compartments, transmitted significantly more information (Figure 4I) (two-sample t test: L2/3, p = 0.044; L4, p = 0.023; L5/6, p = 0.009). As such, we wanted to determine if this was due to a lack of attentional modulation in the non-selective columns. This was not the case, we observed that non-selective columns were modulated with attention. Attentional modulation was observed in both the CSD in L2/3 and L5/6 (one-sample t test: L2/3: t(64) = -6.01, p = 9.8e-8; L4: t(64) = -0.18, p = 0.86; L5/6: t(64) = 5.24, p = 1.9e-6) as well as across all layers in the population spiking activity (one-sample t test: L2/3: t(64) = 8.00, p = 3.7e-11; L4: t(64) = 9.66, p = 4.1e-14; L5/6: t(64) = 7.58, p = 1.8e-10) during the N2pc interval (averaged 150-190 ms following array onset) (Figure S6).

      Importantly, we tested whether the N2pc varied across sessions with or without color-selective columns sampled. We found no difference between N2pc polarization (150-190 ms after the array) between sessions with (n = 17) or without (n = 13) sampling of color selective columns (two sample t test: t(28) = -0.75, p = 0.46). This invariance is expected because extracortical EEG spatially integrates signals from multiple cortical columns.”

      Reviewer #2 (Public Review):

      Scalp ERPs are widely used in human neuroscience research to understand basic mechanisms of neural and cognitive function and to understand the nature of neurological and psychiatric research. However, this research is hampered by a surprising lack of research in animal models exploring the neural mechanisms that produce specific ERP components.

      Previous research by this research group identified a potential monkey homologue of the N2pc component, a neural correlate of the focusing of attention onto visual objects embedded in arrays of distractors. The present study took a giant leap forward by recording extracellular potentials from densely spaced arrays of electrodes (.1 mm spacing) on probes that extended perpendicular to the cortical surface. These electrode arrays made it possible to simultaneously record voltages throughout the different layers of a cortical column and convert these voltages into current source density (CSD, which isolates local synaptic current flow and minimize volume-conducted activity from other brain regions). In addition, simultaneously recorded voltage from an electrode just above the cortical surface was used as a proxy for scalp potentials. Scalp ERP recordings were also obtained from separate monkeys to measure the actual scalp ERPs and verify that an N2pc-like ERP was elicited by the task (a simple visual search task in which the monkey made an eye movement to the location of a color popout item).

      Very clear CSD was observed in V4 in both supragranular and infragranular layers that was stronger when attention was directed to the contralateral visual field than when attention was directed to the ipsilateral visual field, which is the hallmark of the N2pc component. Little or no such activity was observed in the granular layer (the primary recipient of feedforward projections). In addition, the effects were observed primarily when the column was selective for the target's color. An information theory analysis showed that these intracortical current flows contained significant information about the voltage measured on the cortical surface and the location of the target object.

      All of these results were clear and convincing. Moreover, the laminar and columnar analyses provide interesting new evidence about attention-related neural activity independent of any considerations about ERPs. The most challenging aspect of the study is to provide a solid link from the intracortical activity to the voltage on the cortical surface, and then to the monkey scalp ERPs, and finally to human ERPs. Toward that end, the present study relied entirely on correlational evidence, rather than experimental manipulations. That's quite appropriate for a first step, but it must be considered an important limitation on the conclusions that can be drawn. It would be wonderful if future research took the next step of providing experimental evidence.

      We appreciate the reviewer noting that this manuscript is a valuable step in linking attention-associated electrophysiological signals across species. We also recognize that there is much work to be done in this domain. As requested, we have added to the Discussion the limitation of this type of study as well as what should be considered valuable next steps in this program of research.

      There are also some troubling aspects of the existing evidence. The scalp ERP effect in this study and the prior work from this groups is a positive voltage over the contralateral hemisphere, whereas in humans the voltage is negative. This may well reflect the orientation of the relevant cortical surface in monkeys versus humans. However, the voltage on the cortical surface in the present study was negative contralateral to the target, not positive. Unless this opposite voltage on the cortical surface relative to the scalp reflects something about the reference site for the cortical surface electrode, then this makes it difficult to link the intracortical effects and cortical surface effects to the scalp ERP effects. Also, the CSD was negative in the upper layers and positive in the lower layers, again suggesting that the voltage should be negative contralateral to the target on the surface. Ironically, this polarity is what would be expected from the human brain, where a contralateral negativity is observed. The oddity seems to be the contralateral positivity in the monkey scalp data. Also, the cortical surface voltage exhibits a polarity reversal at approximately 180 ms, which is not seen in the intracortical CSD.

      One possible explanation for the discrepancy is that the scalp voltage likely comes from multiple brain areas besides V4. If, for example, areas on the ventral surface of the occipital and temporal lobes produce stronger scalp voltages than V4 under the present conditions, the opposite orientation of these areas relative to the cortical surface would be expected to produce a positive voltage at the scalp electrodes.

      The manuscript notes that multiple areas probably contribute to the scalp ERPs and argues that the pattern of intracortical CSD results obtained in V4 will likely generalize to those areas. That seems quite plausible. Moreover, the results are interesting independent of their link to scalp ERPs. Thus, the present results are important even if the scalp polarity issue cannot be definitively resolved at this time.

      We thank the reviewer for expressing that the results are important whether this polarity difference can be resolved. This is an interesting observation and quite important to consider carefully. First, it is worth reiterating that the referencing setup in our ‘10/20’ monkeys was different than that for the monkeys where intracranial recordings took place. Specifically, the 10/20 recordings were more similar to our previous reports of monkey EEG (e.g., Woodman et al., 2007, PNAS; Cohen et al., 2009, J Neurophysiol; Purcell et al., 2013, J Neurophysiol). Recordings from these monkeys used either a frontal EEG electrode (approximately FpFz) or linked ears for referencing. These yielded the positive-going N2pc and contrast the negative-going N2pc found in humans. The V4 laminar recordings – and their accompanying extracortical signal – used a different referencing setup that we believe is the most likely candidate for the observed difference. Specifically, these recordings used a tied ground-reference setup which incorporated the support rod of the linear multielectrode array. This support rod extended into the brain meaning we had a neural tissue grounded signal and that the reference spanned the neural generator. Therefore, if we are not measuring both sides of the electric field across the generator equally, we might observe an inverted signal. Unfortunately, we cannot observe the 10/20 EEG distribution with an intracranial reference. Ideally, this could be resolved by an experiment where referencing setups are tested before and after performing craniotomy with a series of reference locations used to understand where exactly this flipping of polarization takes place. We have added this consideration to the Discussion and more thoroughly detailed the referencing setups in the Methods.

      There are also some significant concerns about the filters. The high-pass cutoff was high enough that it could have produced artifactual opposite polarity deflections in the data. If causal filters were applied (e.g., in hardware during the recordings), these artifactual deflections would have been after rather than before the initial deflection, possibly explaining the polarity reversal at 180 ms. If noncausal filters were applied in software, this would be a larger problem and could produce artifacts at both the beginning and end of the waveform. Moreover, the filters were different for the CSD data and the extracortical voltages, which is somewhat problematic for the information theoretic comparisons of these two data sources (but is likely to reduce rather than inflate the effects).

      In revisiting the description of the recording system and filters, we see how some information was conveyed poorly. The language describing the recording in the original submission suggested that online filters were applied to the data as it was being recorded. This was not the case. We have changed that language so that it reads as the data was being collected at a sampling frequency sufficient to observe data between 0.1 Hz and 12 kHz rather than the data being filtered between 0.1 Hz and 12 kHz. Also, it appears that the description of the processing sequence regarding CSD was ambiguous in the original submission. The CSD underwent the same offline, bandpass filtering procedure (1-100 Hz) as the extracortical signal. We have clarified the Methods accordingly.

      Reviewer #3 (Public Review):

      In this study, Westerberg et al., investigate the cortical origins of the N2pc, an ERP for selective attention. By using a combination of indefinite inverse models of cranial EEG and translaminar electrophysiology, the authors demonstrate that dipoles in V4 are the source of the N2pc.

      The study is well conducted and the manuscript is well written.

      We are pleased that the reviewer recognized the contribution of our efforts.

      I have a few comments about the CSD, RF alignment profiles, and LFP based analyses:

      (A) The method section states correctly that "current sinks following visual stimulation first appear in the granular input layer of the cortex, then ascend and descend to extra granular compartments". However in the example CSDs shown in Fig 2, Fig 3, Fig S3 there is no visible current sink in the infra-granular layers. Instead, the identified infra-granular layers show a prolonged current source (e.g. Fig S5B,C), which is unexpected.

      We have clarified the Methods to reflect the observations of our data and why they may differ from previous reports. We believe the discrepancy is likely due to the stimulus conditions used to evoke the CSD profile. Specifically, the descending infragranular sink in visual cortical columns has most commonly been described when CSD was computed while monkeys view briefly presented flashes or stimuli (e.g., Schroeder et al., 1998, Cereb Cortex). However, our study uses task evoked CSD to perform the alignment. Importantly, this means there is a persistent stimulus in the receptive field. We believe this persistent stimulus, rather than a flashed stimulus, leads to a persistent, strong sink in the superficial layers of cortex which would mask any current sink present in the infragranular layers (Mitzdorf, 1985, Physiol Rev). This is an observation we made in previous reports (Task evoked CSD: Westerberg et al., 2019, J Neurophysiol vs. Flash evoked CSD: Maier et al., 2010, Front Syst Neurosci), albeit in V1 instead of V4. Given the latency offset between putative granular and supragranular sinks, that we observe receptive fields below the putative granular input sink, and the demonstrable multiunit activation as indicated by the newly included Figure S2, we have no reservations in our assessment of the position of the electrode relative to the layers across sessions.

      (B) The example RF profile shown in Fig S5A, although aligned, looks a little strange in that the RFs taper off rapidly in the infra-granular layer. Is this the best representative example? It will be important to see other examples of RF alignment.

      The attenuation observed in the lower layers is largely due to overall decreased gamma power in the lower layers of cortex as compared to upper and middle layers (Maier et al., 2010, Front Syst Neurosci). At the reviewer’s request, we have added an additional panel to the noted supplementary figure which shows additional laminar receptive field profiles using the evoked LFP so that they are more directly comparable to those shown in Nandy et al., 2017, Neuron.

      (C) The study used LFP power in the gamma range to compute the response ratio between red and green stimuli. LFPs measured across the cortical depth are highly correlated, and so would gamma power estimated from the LFPs. Given this, how meaningful is the laminar analysis shown in Fig 4B? How confidently can it be established that the LFP derived gamma power estimates have laminar specificity?

      An astute observation – there are two aspects to consider. The existence of color-feature columns has been well-documented in V4 (e.g., Zeki, 1973, Brain Res; Zeki, 1980, Nature; Tootell et al., 2004, Cereb Cortex; Conway and Tsao, 2009, PNAS; Kotake et al., 2009, J Neurophysiol; Westerberg et al., 2021, PNAS). This manuscript did not need the evaluation of interlaminar differences in color selectivity to address the question at hand – the top of Figure 4B only serves as a step to the bottom of Figure 4B which provides the measurements used for the subsequent analyses. Thus, the estimation of color selectivity from gamma was sufficient to capture a general sense of the color selectivity of the column. Second, we recently published a manuscript which directly addresses the laminar specificity of gamma with respect to feature selectivity. Westerberg et al., 2021, PNAS uses a spatially localized form of gamma to evaluate color-feature selectivity along V4 columns. In that manuscript, we find a high degree of consistency along the layers of cortex using the gamma signal. Notably, we compared the gamma signal to the population spiking and found a high degree of coherence between selectivity in those two measures as a function of cortical depth. Given the secondary nature of the interlaminar feature selectivity to this submitted manuscript and the detailed report of laminar feature selectivity using the same dataset in another manuscript, we are inclined to leave the analysis reported here as is with adjustments to the text that note these considerations now included in the Results.

    1. She thinks the companies themselves are behind this, trying to manipulate their users into having certain opinions and points of view.

      The irony is that this is, itself, somewhat a conspiracy theory.

      Though, I think a nuanced understanding may be closer:

      • The real purpose is not to influence people to believe anything. It's money. It's ad spend and data collection to sell. We need to demonstrate to advertisers that their ads are actually getting seen. The more they get seen, the more money we make. And, the more time is spent on the service, the more data we have to sell... which is as valuable as the add spend.
      • Companies jigger algorithms to maximize time spent on the service.
      • As the Bible is clear, the heart of man is wicked, and the kinds of things that maximize time spent are themselves attitudes of evil, malice, wickedness, and hatred, and the list of things Paul repeatedly tells us to avoid. Go figure.
      • So, people feel the platforms are basically like smoking, and yet, they can't stop.
    1. Author Response:

      Reviewer #1:

      The dependence of cell volume growth rate on cell size and cell cycle is a long-standing fundamental question that has traditionally been addressed by using unicellular model organisms with simple geometry, for which rough volume estimates can be obtained from bright field images. While it became soon apparent that the volume growth rate depends on cell volume, the experimental error associated with such measurements made it difficult to determine the exact dependencies. This challenge is even more significant for animal cells, whose complex and dynamic geometry makes accurate volume measurements extremely difficult. Other measures for cell size, including mass or fluorescent reporters for protein content, partially bypassed this problem. However, it becomes increasingly clear that cell mass and volume are not strictly coupled, making accurate volume measurements essential. In their previous work, Cadart and colleagues established a 'fluorescent exclusion method', which allows accurate volume measurements of cells with complex geometry. In the present manuscript, Cadart et al. now take the next step and measure the growth trajectories of 1700 HeLa cell cycles with further improved accuracy, providing new insights into animal cell growth.

      They convincingly demonstrate that throughout large parts of the cell cycle, individual cells exhibit exponential growth, with the volume-normalized specific growth rate moderately increasing after G1-phase. At the very early stages of the cell cycle, cells exhibit a more complex growth behavior. The authors then go on and analyze the growth rate fluctuations of individual cells, identifying a decrease of the variance of the specific growth rate with cell volume and observed time scale. The authors conclude that the observed growth fluctuations are consistent with additive noise of the absolute growth rate.

      The experiments and analysis presented by Cadart et al. are carefully and well executed, and the insights provided (as well as the method established) are an important contribution to our understanding of cell growth. My major concern is that the observed fluctuation pattern seems largely consistent with what would be expected if the fluctuations stem from experimental measurement noise. This fact is appropriately acknowledged, and the authors aim to address this issue by analyzing background noise. However, further controls may be necessary to unambiguously attribute the measured noise to biological fluctuations, rather than experimental error.

      We thank the reviewer for their positive feedback and for the appreciation of our work. We performed a series of experimental controls to address the main issue regarding the measured fluctuation pattern, which indicate that it should be of biological origin.

      1.) To address whether the observed fluctuations could be due to experimental error, the authors analyze the fluctuations recorded in a cell-sized area of the background, and find that the background fluctuations are small compared to the fluctuations of the volume measurements. I think this is a very important control that supports the interpretation of the authors. However, I am not convinced that the actual measurement error is necessarily of the same amplitude as the fluctuations of the background. The background control will control for example for variations of light intensity and fluctuations of the fluorophore intensity. But what about errors in the cell segmentation? Or movement of the cells in 3D, which could be relevant because the collected light might be dependent on the distance from the surface? Is cell autofluorescence relevant at all? I am aware that accurately estimating the experimental error is exceptionally difficult, and I am also not entirely sure what would be the perfect control (if it even exists). Nevertheless, I think more potential sources of error should be addressed before the measured noise can be confidently attributed to biological sources. Maybe the authors could measure objects with constant volume over time, for example vesicles? As long as the segmented area contains the complete cell, the measured volume should not change if the area is increased. Is this the case?

      We are grateful to the reviewer for all these useful suggestions. We performed all these controls on the sources of noise, and we discuss them in the revised manuscript.

      2.) I am particularly puzzled by the fact that even at the timescale of the frame rate, fluctuations seem not to be correlated between 2 consecutive time points (Fig. 5-S2b). This seems plausible for (some) sources of experimental error. Maybe an experiment with fast time resolution would reveal the timescale over which the fluctuations persist - which could then give us a hint about the source?

      We performed this analysis, finding an autocorrelation time of a few minutes, and we report our results below:

      In the main text and in the new Figure 5 – Supplement 3, we report the results of newly performed 20 sec timelapse experiments over one hour to investigate the timescale of volume fluctuations. The autocvariance function analysis on the detrended curves shows that fluctuations decay over a few minutes (Figure 5 – Supplement 3a-c), a timescale that matches the analysis of the 10 min timelapse experiments.

      Copy of Figure 5 – Supplement 3: Autocovariance analysis shows that the timescale of volume fluctuation is around 760 seconds. a) Cells measured every 20 sec (n=177) and linearly detrended reach a covariance of 0 at a lag of 760 sec. b) As a control, the background fluctuations are not autocorrelated (20 sec, n=92), providing further evidence that cell volume fluctuations likely have biological origin. c) The autocovariance analysis for cells measured every 10 min confirms that fluctuations covary for a lag of 10-20 min.

      3.) The authors use automated smoothing of the measurement and removed outliers based on an IQR-criteria. While this seems reasonable if the aim is to get a robust measurement of the average behavior, I find it questionable with respect to the noise measurements. Since no minimum time scale has been associated with the fluctuations interpreted as biological in origin, what is the justification of removing 'outliers', i.e. the feature that the authors are actually interested in? Why would the largest fluctuations be of technical origin, and the smaller fluctuations exclusively biological?

      The IQR-criteria is designed to remove only rare and obvious outliers (i.e. a jump in volume of more than 15% in 1 frame -10 minutes- which arguably cannot happen biologically). Fluctuations of smaller range are kept (see examples below). We looked back at the raw data and calculated that the IQR filtering removes a total of 337 measurement points out of 99935 initial points (0.03% of the points).

      Figure D: Three examples of single cell trajectories with raw volume measurement (red dots) and points removed with the IQR filtering (blue dots). The IQR criteria is very stringent and removes only the very large ‘bumps’ in cell volume measured (2 left plots) while it keeps fluctuations of smaller amplitude (right plot).

      4.) If I understood correctly, each volume trajectory spans one complete cell cycle. If this is the case, does Fig. 1e imply that many cell cycles take less than 2-3 hours? Is this really the case, and if so, what are the implications for some of the interpretations (especially the early cell cycle part)?

      In this study, we performed experiments on a time scale comparable to the cell cycle time (~ 24hours) and recorded single-cell volume trajectories. Since the cells are not synchronized, we have very few complete cell cycles (~ 100, Fig. 1f). Fig. 1e shows the distribution of the duration of all individual curves, regardless of the fraction of the cell cycle they span, hence the very short duration for some cells.

      Reviewer #2:

      In this paper, the authors use a volume exclusion-based measurements to quantify single cell trajectories of volume increase in HeLa cells. The study represents one of the most careful measurements on volume regulation in animal cells and presents evidence for feedback mechanisms that slow the growth of larger cells. This is an important demonstration of cell autonomous volume regulation.

      While the subject matter of the present study is important, the insights provided are significantly limited because the authors did not place their findings in the context of previous literature. The authors present what seems to be remarkably accurate single cell growth trajectories. In animal cells, a joint dependency of growth rate on cell size and cell cycle stage has been previously reported (see Elife 2018 PMID: 29889021 and Science 2009 PMID: 19589995). In Ginzberg et al, it is reported "Our data revealed that, twice during the cell cycle, growth rates are selectively increased in small cells and reduced in large cells". Nonetheless, these previous studies do not negate the novelty in Cadart et al. While both Cadart and Ginzberg investigate a dependency of cellular growth rate on cell size and cell cycle stage, the two studies are complimentary. This is because, while Ginzberg characterise the growth in cell mass, Cadart characterise the growth in cell volume. The authors should compare the findings from these previous studies with their own and draw conclusions from the similarities and differences. Are the cell cycle stage dependent growth rate similar or different when cell size is quantified as mass or volume? Does the faster growth of smaller cells (the negative correlation of growth rate and cell size) occur in different cell cycle stages when growth is quantified by volume as compared to mass?

      We are grateful to the reviewer for their appreciation of the value of our study. Following their remarks, we have extended our Discussion section to incorporate a more careful discussion of these findings. We believe that the main contribution of our study is finding evidence of phase- dependent regulation of growth rate and identifying an additive noise on volume steps, this noise has constant amplitude, hence fluctuations of specific growth rate decrease with volume, but specific growth rate (in the bulk of the cell cycle) does not decrease.

    1. Author Response:

      Reviewer #1:

      This manuscript by Silver, et al., details work investigating the relationship between season of conception and DNA methylation differences at sites across the genome, measured by widely-used arrays, in two cohorts of children using Fourier regression. They find that season of conception is associated with persistent methylation differences at several hundred CpG sites, and that these CpG are enriched for properties, compared to sets of control sites, that suggest that methylation at these sites is influenced very early in development/during conception and that these sites are positioned in genomic regions relevant for gene activation and regulation. Additional analyses investigated the effects of genetic variation of these sites, and found no evidence for single nucleotide polymorphisms nor child sex confounding the associations between season of conception and DNA methylation. As the number of sites measures by these arrays are a very small amount of total sites across the genome, the authors suggest that these findings indicate there may be many more sensitive methylation 'hotspots' in the genome that are not captured by these arrays but could impact on health/development.

      The key strengths of this manuscript include the use of two cohorts of children at different ages, providing evidence that these effects of season of conception appear to attenuate by 8-9 years of age; and comparison with control sites and additional analyses investigating confounding to build the evidence for these relationships reflecting true, biological associations rather than statistical artefacts or the result of confounding.

      However, the conclusions around the potential functional importance of these methylation differences are limited by a lack of evidence for a relationship between methylation of these season-of-conception-associated sites and child growth/development, so while this manuscript builds compelling evidence for the effects of season of conception on methylation, it's functional relevance is unclear. Additionally, there are some choices made in the analyses where the rationale for those choices should be made more clear, such as the use of CpG sites above or below a certain estimated effect size for different analyses.

      Overall, the approach taken here to demonstrate different levels of evidence for true relationships between early development exposures and differences in DNA methylation is a compelling one, and the manuscript delivers clear evidence for its primary conclusions.

      We are currently researching links between several SoC-CpGs and health-related outcomes including measures of growth, and we have prepared/submitted other papers with different groups of authors (e.g. the EMPHASIS team) relating to other phenotypes. We consider a detailed analysis of links between SoC-CpGs and diverse outcome measures in Gambian children to be beyond the scope of the current study and would argue that such an analysis would dilute the central focus of this paper that is already long and complex. We do already refer to two existing studies linking Gambian SoC or nutrition-associated CpGs to health outcomes in non-Gambians (child & adult obesity/POMC, Kuhnen et al Cell Metab 2016; cancer/VTRNA2-1, Silver et al, Gen Biol 2015) in the current manuscript. The VTRNA2-1 locus does not overlap any SoC-CpGs and we already speculate that this may be due to SoC effect attenuation, since the previous association was observed in younger (3-9mth) infants. We have additionally referenced a recently published paper linking another SoC-associated locus to thyroid volume and function in Gambian children (Candler et al Sci Adv 2021) and highlighted that neither this nor the POMC locus overlap the array background analysed in this study. Finally we had already included an analysis of overlaps between SoC-CpGs and traits in published EWAS and GWAS catalogues.

      Regarding our use of different SoC amplitude thresholds for one analysis, our original motivation for analysing all 768 ‘SoC-associated CpGs’ with FDR<5% in the ENID 2yr analysis, including those with amplitude < 4%, was to explore the degree to which the strength / amplitude of SoC effects could be explained by proximity to ERV1 over the wider range of amplitudes represented by the larger set of loci. However we agree that this approach is open to question and have removed this analysis (previous Fig. 6B and Supp. Fig. 11, and text in section headed ‘Enrichment of transposable elements and transcription factors associated with genomic imprinting’). We have also removed the definition of ‘SoC- associated CpGs’ (which included CpGs with SoC amplitude < 4%) from Table 2 and Methods to aid clarity and avoid confusion.

      Reviewer #2:

      This is a very interesting manuscript, which will be of interest for a broader readership. The authors have analysed an unique cohort, which is of importance to understand the impact of environmental factors on DNA methylation.

      The performed analysis is well balanced, and the conclusions are justified by the presented data. It is a strength of this study, that results from the initial ENID study have been re-evaluated in the EMPHASIS study. Unfortunately, DNA methylation has been analysed using HM450 and EPIC arrays. Both methods are providing only a limited view on methylome-wide DNA methylation.

      Another limitation (as already addressed by the authors) is the lack of longitudinal samples. This would potentially have helped to gain further knowledge about the identified attenuation of DNA methylation levels at SoC associated CpGs.

      Finally, I am not entirely sure, that one confounding factor has been completely ruled out: It is known, that blood composition may cause methylation variability. In general, the authors addressed this point and analysed blood compositions (supplementary Figure 16) of both cohorts. Here, no marked seasonal differences between and within both cohorts have been identified. However, the participants of the EMPHASIS cohort have a very similar age (8-9 years). For this reason, I am wondering if methylation variability/ differences and in addition the attenuation of methylation levels might be influenced by the younger age of ENID participants compared to EMPHASIS study individuals.

      We agree that the necessary restriction of our analysis to data derived from Illumina 450k and EPIC arrays means that we can only obtain a limited view of DNAm loci associated with Gambian season of conception. We expect that there will be many more such hotspots across the human methylome. We have commented on this in the Discussion.

      Regarding the lack of longitudinal data to confirm the potential attenuation of SoC effects with age observed between unrelated cohorts, we are pleased to report that we have now acquired an additional EPIC array dataset covering a subset of n=138 individuals from the ENID cohort included in the main analysis. This subset had methylation measured in blood at age 5-7yrs enabling us to conduct an investigation of longitudinal methylation changes in these individuals. This analysis strongly supports the circumstantial evidence of SoC effect attenuation with age suggested by our previous comparison of the independent ENID (2yr) and EMPHASIS (7-9yr) cohorts, with:

      a) strong correlation of conception date methylation maximum between age 2yr and 5- 7yrs at SoC-CpGs in these 138 individuals (Figs. 3A, 4A); and

      b) evidence of SoC effect size attenuation at the majority of SoC-CpGs (Fig. 3B; Wilcoxon signed rank sum p=10-12).

      We note that this additional longitudinal dataset has a different confounding structure with respect to biological and technical covariates (Supp Tables 15-17) and date of sample collection (Supp. Fig. 1B), lending strong support to our previous two-cohort cross-sectional analysis.

      Regarding the potential for confounding by differences in blood cell composition, we have performed an additional sensitivity analysis with Houseman estimated blood cell counts added directly to the linear regression model for the ENID cohort (see ST1s). 518 out of the 520 estimated Fourier regression coefficients from the main analysis (1 pair of sine and cosine terms for each of the 259 SoC-CpGs) fall within the 95% confidence interval obtained in the Houseman-adjusted analysis, confirming that cell composition effects did not unduly influence SoC effect estimates in the original analysis. We have added a brief note on this and the other sensitivity analyses (batch, cell composition and village effects) in Results to the manuscript, with more details in Methods.

      If the reviewer is referring to the possibility that the SoC effect attenuation with age could be driven by different cell composition effects in the older cohort, we think that the replication of the timing of SoC effects across the 3 datasets analysed (including the additional longitudinal data; Fig. 4A), all of which have different confounding structures with respect to season of sample collection (Fig. 2A; Supp Fig. 1B), together with additional evidence of SoC effect attenuation with age in the longitudinal analysis (Fig. 3B) support this being a genuine age attenuation effect.

      Reviewer #3:

      Silver et al. Investigate the influence of seasonal variation (nutrition, infection, environment) on blood DNA methylation in two cohorts of children (233 [2y] and 289 [8y-9y]) from the same sustenance farming communities in rural Gambia. One cohort (450K,233) was extensively studied before in multiple publications, the second dataset (850k,289) is unpublished. Using cosinor modeling they find 768 CpGs with a significant seasonal pattern(SoC-CpG, FDR<0.05) in the probes that overlap between the 450k and 850k arrays. Look-up of these 768 SoC-CpGs in the second sample showed 61 SoC-CpGs with FDR 0.05 (no mention is made if the direction of effect is consistent, but we assume it is so).

      In fact we did report that the ‘direction’ of the effect (conception date at methylation maximum) is highly consistent with increased DNAm in conceptions at the peak of the rainy season across the two cohorts at the 61 SoC-CpGs with FDR<0.05 – see Fig. 2C.

      The authors notice that most SoCs seem to be attenuated in the 8-9y sample. Then the authors select out of the 768 SoC-CpG the FDR<0.05 and >=4% seasonal amplitude in this discovery sample: 257 which they bring further in (enrichment) analyses. It is unclear if all 257 are (nominally) significant in the replication sample.

      We did not check this because of evidence that, despite strong replication of effect direction (Fig. 4A), the amplitude of the SoC effect attenuates with age (Fig. 2E). This means that it would not be surprising if one or more SoC-CpGs failed to achieve nominal significance in the older cohort. This is now strongly supported by our additional analysis of longitudinal data confirming SoC effect attenuation with age and consistency of SoC effect direction (Figs. 3B and 4A).

      These SoC-CpGs are enriched for imprinted and oocyte germline loci. Roughly 10% of SoC-CpGs overlap with so-called meta-stable epialleles (MEs), on which the authors have published greatly. This is a large fold enrichment, and subsequently the main focus of the Results and Discussion. Indeed, it skews the Discussion heavily and one wonders what could have been found in the other 90%?

      Our strategy throughout the Results and Discussion was to focus on characteristics including metastability, parent of origin-specific methylation, histone modifications and gametic and early embryo methylation patterns that suggest a link to establishment of methylation states in the early embryo at SoC-CpGs. For these analyses all SoC-CpGs were considered at every stage and metastability was not the primary focus. However, as the reviewer suggests, we do repeatedly point out that many of the above contextual characteristics that are associated with SoC-CpGs have also been associated with metastability which we consider to be worthy of note, in part because it suggests that many SoC-CpGs may in fact be MEs, despite not having been previously identified as such. We have further cause to believe this could be the case because of i) the typically small sample size of multi-germ layer/tissue datasets used to screen for MEs, meaning that published screens for human MEs are likely to be underpowered and will hence fail to capture most MEs; and ii) the evidence that we present suggesting that environmentally-driven inter-individual variation at loci exhibiting ME-like properties may diminish with age, again suggesting that ME screens, which largely analyse adult tissues, will miss metastable loci present in infancy and early childhood.

      We had already made the point ii) above in the Discussion. However, given the reviewer’s concerns we have added an additional comment on point i).

      The Discussion is heavily geared to interpretation within their MEs focus and does little to discuss study weaknesses and strengths, to which the tail of the Results suggest there are multiple. For at the end of the Results and in the Methods we find additional sensitivity analyses and discussion points on a very strong enrichment for CpGs with a mean difference in methylation between the sexes (>1/3 of the 257), adjustments for genetic confounding and a high inflation factor in the discovery cohort.

      We have added an additional comment on the need for further functional analysis in cell and/or animal models at the end of our discussion on possible mechanisms underpinning the observed strong enrichment for sex effects at loci associated with periconceptional environment. We have performed an additional analysis of SoC effects on global methylation using predicted LINE1 and Alu element methylation to address the issue of genomic inflation in the discovery cohort (Methods ‘Inflation of test statistics’ and additional Supp. Fig. 14). We have commented on the potential for residual genetic confounding and the limitation of a lack of genetic data in the discovery cohort in the Discussion. We have also provided an additional comment on the potential influence of unmeasured inter-relatedness in our study population.

      Indeed, despite the strong and good flow of the Result section and the impressive (albeit somewhat one-side) look-up of SoC-CpGs in published datasets; the tail and Methods section leaves this reader with a strong suspicion of possible methodological issues on the measurement level already identified prior.

      The authors reports that the discovery cohort is biased in the collection of conception months (figure 2A), has a strong inflation of 1.3 (no QQ-plot is shown to assess bias in addition to inflation), no adjustment for genetic background could be made (which is false, as the 450k array contains several dedicated SNP probes, even hundreds when extracted with the omicsPrint package) and > 1/3 of SoC-CpGs is a sex CpG. For the latter observation the authors regressed out sex and repeated the analysis, noting no difference. However, regressing out sex does not help if sex is heavily correlated with confounding biological/sampling/technical covariates.

      The authors reason that the inflation is nothing to worry about citing single cohort studies on global effects on DNAm of methyl donors. Global DNAm is indeed often association with methyl donor intake but generally these studies investigate ALU or SINES repetitive elements and the PACE consortium reported only modest effects on select 450K array loci for prenatal folate supplementation, showing that their reasoning might hold on the ME loci (in/close to repetitive elements) but not the genome-wide analysis per se.

      The authors should convince the reader that their (discovery) data is valid. The data they do show in Supplemental tables 16 and 17 show that after functional normalization a strong effect of batches remains, while from my own experience these are normally nicely mitigated via functional normalization. Normally only strong cell type correlations remain in the first PCAs of the normalized data. But for ENID we see a remainder of sentrix row, often the strongest batch effect, and slide and plate remaining. Also, the biological, season and cohort specific variables are not noted here. We just must assume that the blank correction for the first 6 PCAs, rather than the actual adjustment for the measured batch/confounding effects, does not remove (or over adjusts) for biological/study design (village, genetic ancestry) effects. In addition to these observations figure 2C seems to indicate that the controls CpGs (elegantly selected by the authors) also show seasonal variation, just not as much as the SoC-CpGs. This leaves the reader to wonder: is there bias in their sample randomization across plates, rows and slides? This feeling is amplified by the fact that almost all SoC-CpGs seem to show an increase in DNAm in jul-aug (Suppl Fig. S5 and Figure 1B). [An observation that is not given enough prominence in the Results]. Which might or might not hint to a correlation with a batch effect (like sentrix row?).

      Our addition of a third longitudinal dataset with a very different confounding structure provides strong reassurance of the robustness of the reported SoC effects. However we recognise many of the concerns raised by the review and have therefore substantially extended our analysis of potential confounders in our analysis, including additional sensitivity analyses (see Supplementary Tables ST1p-1s).

      In our extended analysis of possible confounding of technical and biological covariates by SoC, we note that the majority of batch and biological covariates are categorical so that it was not possible to report correlation rho’s. We have instead reported p-values for corresponding association tests – see Supplementary Tables for further details of tests that were carried out. Also note that for simplicity season of conception is modelled as a binary variable (Dry: Jan-Jun; Rainy: July-Dec). We consider this to be a valid approximation to the main cosinor (Fourier) regression analysis since this showed a clear relationship between DNAm and dichotomised (Dry/Rainy) season of conception (Figs 2D & 4A). Note that we have not included month of collection as this completely confounds season of conception in the main ENID (2yr) analysis and cannot confound the EMPHASIS (7-9yr) analysis, as discussed in the manuscript (Fig. 2A). This is a key reason why we compared SoC effects across these two cohorts. Note that the month of collection also cannot confound the ENID 5-7yr (longitudinal) analysis as all samples are collected in the rainy season (additional Supp. Fig. 1B).

      The covariate correlation analysis confirms:

      • No correlation between SoC and all considered batch and biological covariates including principal components across all three analysed datasets (Supp Table, ST1p- 1r).

      • No correlation between sex and all considered batch and biological covariates; weak correlations with PC4 and PC3 in EMPHASIS and ENID 5-7yr datasets respectively (ST1q,1r); note also that the sex sensitivity analysis previously reported in the manuscript used methylation values that were pre-adjusted for sex using a regression model that included sex as the only adjustment covariate, alleviating concerns that there may be residual confounding due to strong correlations between technical/biological/sampling covariates and sex. We have added some additional comments on this to Results.

      • Expected strong correlations between SoC, month of conception and month of birth in all datasets (ST1p-1r).

      • Functional Normalisation (FN) removed most but not all of the effects of technical batch effects (sample plate, slide etc) from the DNAm array data used in the main ENID analysis (ST1p).

      • Samples are not perfectly randomised across 450k sample plate (month of birth [mob] and conception [moc]) and slide (mob and village) for the ENID 2yr cohort (ST1p).

      The last point raises the possibility of potential residual confounding due to array batch effects in the ENID analysis. We checked for this in two ways. First, we performed sensitivity analyses with batch and village ID variables included directly in the linear regression models, in addition to the PCs that served as proxies for batch variables in our original analysis. This suggested no residual confounding due to array batch or village ID effects (ST1s: ‘batch adjusted model’ and ‘village adjusted model’). Second, we confirmed that neither mob, moc nor village ID were associated with batch or any other covariates in the EMPHASIS or new ENID 5-7yr analyses (ST1q, ST1r). The tight correspondence of date of methylation maximum across all three datasets (cross-cohort and longitudinal analyses) (Figs. 2C, 3A and 4A) with different confounding structures (ST1p-1r) strongly suggests that the reported SoC associations are not driven by residual confounding.

      In summary, this analysis provides strong reassurance that our main analysis is not confounded by residual associations with technical and/or biological covariates considered in this analysis, and that the observed enrichment for previously identified sex-associations amongst SoC-CpGs is not driven by residual confounding due to sex.

      We have made multiple amendments to the manuscript to incorporate the longitudinal analysis; in the Introduction (lines 58-9); in the first section of Results; and we have made particular reference to the alignment of SoC effects across 3 datasets with different confounding structures. We have also amended several figure captions to distinguish the ENID 2yr and 5-7yr datasets and added the longitudinal dataset to Methods and to the study design schematic (revised Fig. 1), and visualised key results from this additional analysis in Figs. 3 and 4A. Finally we have added additional text on the sensitivity analyses in the main text and in Methods.

    1. Author Response:

      Reviewer #2 (Public Review):

      The authors try to identify ATR-mediated phosphorylation sites in male meiosis of mice and performed phosphoproteomics using two distinct mouse models. The paper focuses on important topics in the field. Since ATR has key functions in meiosis, successful identification of ATR-mediated phosphorylation sites would have a profound impact.

      The study has certain technical issues in experimental design and data interpretations.

      The rationale as to why they used Rad1-cKO was not well described. According to the co-submitted manuscript, Rad1-cKO spermatocytes experience meiotic arrest, and the cellular composition is totally different between controls and Rad1-cKO testes. The "RAD1-dependent" phenotype may simply reflect the difference in cellular composition in testis. With this criterion, any phosphorylation sites present after the mid-pachytene stage in normal spermatogenesis can be categorized as "RAD1-dependent".

      We have altered the figure and text in the manuscript to more clearly explain the rationale for using Rad1-cKO and combining the generated data with the data from the rapid 4 hour ATRi treatment. Importantly, we now consider the phosphorylation sites impaired after a quick 4 hour treatment with ATRi (New Supplementary File 1), which is expected to be too quick to induce an appreciable pachytene arrest. Therefore, the final ATR-dependent and RAD1-dependent dataset is unlikely to include phosphorylation sites that are only shown as being depleted due to a persistent mid-pachytene arrest (these sites should appear as RAD1-dependent and ATR-independent).

      There are two different experiments for ATR inhibitor (ATRi)-treated mice (2 pairs after 2.5-3 days of treatment, and 2 pairs 4 hours after a single dose). However, these results are not distinguished in the analysis, and there is no evaluation of testicular morphology after ATRi treatment.

      We addressed the point of separating the data from 4 hour and 2-3 days of treatment. We also have now also addressed testicular morphology after 4 hour ATRi treatment and did not observe any defect (new Figure 5-figure supplement 3A-B).

      Finally, the authors showed ATR-dependent localization of SETX and RANBP3 and discussed interesting data. However, it has not been determined whether these localization changes were due to the functions of identified phosphorylation sites or some other mechanisms.

      We agree with the reviewer that it would be very interesting to address the role of specific phosphorylation sites in SETX and RANBP3. However, we feel this would require significantly additional effort and time, which would not be realistic in the current manuscript, and is beyond the scope of this resource paper.

      Reviewer #3 (Public Review):

      In this study, Sims et al. perform a phosphoproteomic analysis of the ATR signaling pathway in mouse testis. By studying the different phosphorylated peptides found in testis samples from ATR inhibited mice and from mutant mice for the member of the ATR-activating 9-1-1 complex, RAD1, authors defined a comprehensive map of the ATR signaling pathway in the mouse testis. In general, the methodological approach performed is appropriate to accomplish the desired goal and the results obtained are well explained and properly discussed. The conclusions raised by the authors are supported by the results obtained and the manuscript reads easily. Thus, overall the manuscript is of high quality. Furthermore, the information provided in this study is novel since to my knowledge this is the first attempt to characterize the ATR signaling pathway in the testis. In my opinion, these data will be very relevant to better understand the role of the ATR in mouse spermatogenesis, and in meiosis in particular, in the future.

      Thank you, we appreciate the positive remarks.

      Nonetheless, I have a few major concerns about this manuscript. Firstly, I think an important part of the description of the results is placed in a related preprint by the authors (Pereira et al. https://www.biorxiv.org/content/10.1101/2021.04.09.439198v1). In my opinion, this manuscript lacks a more detailed analysis of the ATR signaling on DNA repair and chromosome axis structure, which are fundamental to understand the meiotic prophase. Secondly, the manuscript falls short of providing novel insights about ATR roles during the meiotic prophase. As ATR function on the meiotic prophase has been extensively studied, the ATR phosphoproteome should provide either some clues about possible novel functions ATR may do during the meiotic prophase or spermatogenesis, or provide a mechanistic explanation of how ATR performs its meiotic functions (e.g., meiotic sex chromosome inactivation or meiotic recombination). The final section of the results is an attempt at doing sol, but to me, the data provided only suppose a small incremental advance in our knowledge of how ATR promotes MSCI. I would have liked the authors to expand this section to prove the utility of the data.

      We agree with the reviewer that it would be very interesting to address more details of the roles of ATR in meiosis and the underlying molecular mechanisms. However, we feel this would require significantly additional effort and time, which would not be realistic in the current manuscript, and is beyond the scope of this resource paper. We note that the revised version of the manuscript now reports the exciting finding that ATR is important for the proper localization of CDK2 in meiotic spreads. While the details and mechanisms remain unknown, we believe this finding, together with other reported findings in this resource paper, open new directions to study meiotic ATR signaling.

    1. Author Response:

      Evaluation Summary:

      Are enzymes found in organisms that optimally grow at colder temperatures are more active than the same enzymes found in organisms that optimally grow at warmer temperatures? Here, an assessment of the catalytic constants for approximately 2200 enzymes (obtained from the BRENDA database) showed no correlation between the relative catalytic activity and the optimum growth temperature. Further support for this conclusion was obtained from the measurement of the catalytic constant from a selection of ketosteroid isomerases from organisms that optimally grow between 15 and 46 degrees centigrade. These are interesting results, although the significance with respect to earlier studies has not been clearly explained.

      We have made the relationship between previous work and our work more explicit. Earlier studies have used a limited number of specific cases to compare enzyme rates from different organisms (for example, n = 28, Figure 1C, Figure 1D). In this work, we performed a systematic analysis of 2223 enzyme reactions, reducing confirmation bias, and we have clarified this point. Prior work developed physical models about enzyme catalysis but were based on data that do not appear to be representative.

      Reviewer #2 (Public Review):

      The authors are trying to understand how enzymes evolve to best enable organisms to adjust to changes in the temperature of their environment. The paper reports an analysis of 2223 values of kcat from the BRENDA database, for 815 organisms with known optimal growth temperatures, and for which there are at least two variants per reaction. This analysis fails to show the expected preference for values of [(kcat)cold/(kcat)warm] > 1 observed in earlier studies.

      This is a useful attempt to use one large databases to gain insight into how enzymes evolve to enable organisms to adapt to changes in temperature. They have done a good job in curating the BRENDA database to identify data that meets their criteria for analysis.

      There are deficiencies that should be corrected.

      (1) The first concerns the reported values of [(kcat)cold/kcat)warm]. Figure 1D shows "Rate comparisons of warm-adapted and cold-adapted enzyme variants made at identical temperatures." I think that it is important that these kinetic parameters be reported for catalysis at a common temperature, but it is not clear to me that is the case for the author's analysis. For example, they write beginning on line 234 that "The rate ratio kcold/kwarm per reaction was determined by dividing rate of the enzyme from the organism with the minimum TGrowth by the rate of the enzyme from organism with the maximum TGrowth." My reading of this sentence is that these rate constants kcat [not rates] were determined individually at the organisms optimal growth temperatures, and not at identical temperatures as reported in Figure 1D. This will complicate the author's interpretation of the two sets of results.

      Analysis of kinetic parameters at a common temperature supports the conclusions of this work.

      (2) The author's fail to present a clear physical model to use in analyzing these results.

      For example, they write on line 35 that: "According to the rate compensation model of temperature adaptation, this challenge is met by cold-adapted enzyme variants providing more rate enhancement than the corresponding warm-adapted variants (Figure 1A)"

      I cannot recall hearing the term rate compensation model, but am familiar with discussions on the differences in properties of enzymes isolated from organisms that have adapted to warm and cold environments. The term cold adapted enzymes is not appropriate, because it is the organism not the enzyme, that adapts to the change to a cold environment. This is accomplished through the natural selection of enzymes with kinetic parameters, stability, etc. that optimize the organisms chances of survival in a cold climate. The kinetic parameters for essentially all enzymes will decrease with decreasing temperature. The most highly evolved metabolic enzymes have kinetic parameters kcat/Km close to the diffusion controlled limit, because this optimizes energy production from metabolism. A decrease in temperature will cause the values of kcat and therefore kcat/Km for these enzymes to decrease, to the detriment of the organism. This may be overcome by selection of enzymes with values of kcat/Km close to that observed for the parent [unevolved] organism. The result is that larger kinetic parameters kcat, for catalysis at a common temperature, will be observed for enzymes isolated from the cold-adapted, compared to the unevolved parent organism. This simple application of Darwin's principals of natural selection is strongly supported by the data reported in Figure 1D.

      The reviewer presents a model that presumes that there would be greater selection to optimize energy production. This is also the model supported by the prior data (Figure 1D).

      However, the more extensive data in our work do not support the model that the reviewer notes and that has been widely accepted in the literature –this is the central conclusion of this work and we have attempted to clarify this, as noted above. The strict Darwinian interpretation for our observations is that there is not a strong selection for enzyme rates to be maximized, as described in the Discussion.

      An alternative model, consistent with the data we present, is that there are different selective pressures on enzymes than rate maximization. We note that it is possible that different metabolic strategies may be more advantageous at different life stages or in different communities (see Wortel et al., 2018, now cited in our main text). These models can be tested experimentally –e.g., by examining how variations of a weak-link enzyme fare over time under different growth conditions. There is much more to be learned from linking the properties of enzymes to evolution, and we expect the relationship between fundamental rate constants and selection to be complex, fascinating and important.

      We use the term rate compensation to refer to the phenomenon and not the physical explanation; there is no need for a physical explanation of a phenomenon in the absence of evidence for the phenomenon itself. We have clarified that we have introduced this term in the Introduction: According to what we term the rate compensation model of temperature adaptation, this challenge has been suggested to be met by cold-adapted enzyme variants providing more rate enhancement than the corresponding warm-adapted variants (Figure 1A).

      We use the term “cold-adapted” in agreement with literature usage: from an organism that is cold adapted. We have clarified this language usage: We use the term “cold-adapted variant” to refer to an enzyme from an organism annotated with lower TGrowth values.

      Finally, “cold-adapted” is not synonymous with “having faster enzymes”, which is often how it is used in the literature and how it is implied in the reviewer’s model.

      (3) The paper alludes to, but does not clearly explain extensions of these ideas that are based on one model for how enzymes work. Enzymes often undergo large conformational changes during their catalytic cycle, and so must have sufficient flexibility for these changes to occur with rate constants that support catalysis. This predicts that the enhancement for catalysis observed for enzymes from cold-adapted organisms, might best be achieved through mutations that favor an increase in protein flexibility. There will also be natural selection of enzymes for thermophilic organisms that optimize the organisms chances of survival in a hot climate, where heat denaturation of the protein catalyst is minimized through the selection of stiffer protein catalysts. This analysis predicts a decrease in enzyme flexibility with increasing preferred growth temperature, that might give rise to an increase in protein stability with increasing optimal growth temperature.

      We agree that there are many fascinating aspects of temperature adaptation at the level of individual enzymes, their mechanisms, and their particular rate-limiting steps that remain to be explored. These were not the subject of our study. The goal of our manuscript was to test the previously presented rate compensation model of enzyme cold adaptation.

      (4) The authors should consider the possibility that the pressure to compensate for the cold-induced decrease in kcat for enzymes from cold-adapted organism will be strongest for highly evolved metabolic enzymes with values of kcat/Km close to the diffusion controlled limit. In cases where the enzyme starts out as less than perfect, an organism adapting to the cold might derive smaller, or even negligible advantages, from natural-selection of enzymes with enhanced kinetic parameters. For example, the organism might also minimize the effect of this change in kinetic parameter, by an adjustment or diversion of flux through the networks of metabolic pathways in which the enzyme functions. One possible explanation for the weak correlation observed between kcat and Tgrowth for ketosteroid isomerase is that the organisms studied gain little from optimization of the activity of this enzyme in cold-adapted organisms. One risk in the use of the larger BRENDA database may be the failure to account for differences in the pressure for enzymes to evolve to enable organisms adapt to cold environments.

      We considered these and additional models. For example, interestingly, the opposite of what the reviewer proposed has been suggested in the literature –that the slowest enzymes (“least perfect”) are under the heaviest selection pressure for optimization (see Noda-Garcia et al., 2018). Although our data indicates that temperature exerts a weaker force on enzyme activity than previously proposed, it is indeed possible that subgroups of enzymes do indeed adapt to temperature through changes in activity. Deciphering this and other pressures is an important future challenge. We did not parse the data in this report out of concerns for “p-hacking” or multiple hypothesis testing.

      Reviewer #3 (Public Review):

      Enzyme catalysis underlies all living processes. Understanding the effects of temperature on enzymes is important in understanding how they are adapted to particular environmental conditions, and also relates to the response of organisms and even ecosystems to changes in temperature. The essential question is: what determines optimal growth rates of organisms, and the optimal temperature of other biological processes? Two potentially important factors are enzyme stability and catalytic activity.

      This manuscript collates data from previous investigations and presents new results on KSI variants, aiming to look at the interesting question of what factors are important in relating enzyme activity and stability to optimum growth temperatures of organisms. It presents a useful survey of published data, particularly focusing on the enzyme ketosteroid isomerase (KSI) for which new resluts for a number of variants are presented, building on nice recent work by this group. The main finding in this manuscript is that enzyme optimum temperatures do not correlate well with enzyme activity. This has been found also previously. The manuscript provides quite an extensive analysis and is consistent with previous results and findings. There is useful information in this manuscript, and the compilation of data will be useful to the community, but some crucial aspects and recent relevant work are not covered, and the discussion is limited. The analysis does not identify any relevant determinant of optimum temperature, and the focus on a single temperature in each case may be misleading.

      We do not agree that our analysis is “misleading.” We would characterize the prior analysis based on a small number of examples that were not randomly selected as potentially misleading. In contrast, we tested the prior conclusions with all relevant data that are available. We also highlight the power of collecting more data by further reporting the rate enhancement of 20 enzyme variants in depth. Temperature compensation through activity may still occur in specific settings, as we have noted in the Discussion.

      We agree with Reviewer #3 about the vast potential to use temperature dependencies to relate to evolutionary pressures and adaptations from molecules to organisms. This is a prime area for future investigation.

      Previous analyses have shown that optimum rates of enzymes do not correlate with optimal growth temperatures (e.g. Elias et al (2014) Trends in Biochemical Sciences 39, 299; Peterson (2004) Journal of Biological Chemistry 279, 20717; Thomas & Scopes (1998) Biochemical Journal 330, 1087; Lee et al (2007) FASEB Journal 21, 1934). This is particularly notable for psychrophilic (cold adapted) enzymes, but is also apparent from the fact that enzymes from the same organism often have quite different optimum temperatures. The data collected in the current manuscript are consistent with previous analyses and so are usefully confirming of this. The authors note that optimal growth temperatures may not correlate with activity for a number of reasons, including that the individual enzyme rate may not be under evolutionary pressure. Also, obviously, as noted by the authors, factors other than temperature are also important in enzyme evolution.

      We agree that it is obvious that factors other than temperature are important in evolution, but here we address whether the adaptation to temperature is accompanied by a common response. As noted, more catalysis for organisms at lower temperature was concluded previously and (as noted by Reviewer #2) is expected. However, this conclusion, upon further analysis (carried out herein) appears not to hold. Thus, even when organisms are adapting to temperature, other factors appear to be dominant. This was not previously known. The analyses the reviewer notes refer to thermal parameters derived from the temperature dependence of the rate constant for a given enzyme as a function of temperature, rather than what is addressed herein –the relative rate constant for enzymes from organisms with different growth temperatures.

      There is somewhat better correlation of enzyme stability with optimum growth temperatures, but it is not strong. Therefore, other factors must be important in determining optimum growth temperatures. The authors briefly mention some possibilities. One factor is that a given enzyme may not be a bottleneck in a metabolic pathway. It is not clear that KSI is in fact a metabolic limiter. Also, for many metabolic pathways, it may be essential to consider the kinetics of the pathway as a whole, which may not be determined by a single enzyme. Directly relevant here is the recent proposal of the 'inflection point hypothesis', which provides an explanation of these observations (Prentice et al. Biochemistry (2020) 59, 3562), which the authors do not mention, and may not be aware of. This hypothesis proposes that, rather than alignment of optimum temperatures or stabilities, rather the inflection points of enzymes in a metabolic pathways are aligned at the mean environmental temperature for the organism. This has the effect of coordinating relative enzyme rates and preventing metabolic disruption as temperature fluctuates. Also relevant here is that the response of metabolic pathways in general is not determined solely by a single enzyme. Prentice et al. show that, in general, the temperature-dependent properties of each enzyme in the pathway is important in determining the temperature dependence of the whole pathway.

      We thank Reviewer #3 for bringing this work to our attention and we have included it in the revised manuscript. This paper points out additional complexities regarding metabolic coordination of relative enzyme rates, enhancing points made in the Discussion.

      It is certainly important to understand what molecular features determine the temperature dependence of enzyme activity and its relationship to stability. Some previous proposals are mentioned in the manuscript. One important factor at the molecular level, mentioned by the authors, is work of Åqvist, Brandsval and coworkers, who have convincingly shown that activation entropy and enthalpy differ significantly between psychrophilic enzymes and their mesophilic and thermophilic counterparts. For small soluble enzymes, this is particularly due to changes at the enzyme surface, which may also affect stability. As mentioned by the authors, there have been many proposals over the years that suggest a relationship between stability and activity, though there is not a simple general relationship.

      The cited study is based on molecular dynamics simulations and underlying potentials which can provide models to be tested via experiment. Our analyses relate to this model in that they suggest that rate compensation (to temperature) is not general and so a universal linkage of temperature, flexibility and catalysis is not expected.

      Also directly relevant for the discussion here is what factors limit enzyme activity as temperature increases. The traditional view is that loss of activity is due to protein unfolding at high temperatures (the poor correlation of stability with growth temperatures found here indicates that this cannot be a general explanation). There is increasing evidence that this simple picture is wrong (see e.g. Daniel & Danson. (2010) Trends in Biochemical Sciences 35, 584). This behavior may be accounted for by conformational (e.g. two state) effects as proposed by Danson et al, distinct from the 'flexibility' proposals mentioned in the supporting information here. The introduction of the manuscript here states that "reaction rates are reduced at lower temperatures" , which might naively seem obvious but actually is not universally true, many reactions do not display simple Arrhenius-type behavior (see e.g. Kohen and Truhlar PNAS 2001 98 848). Many enzymes show a temperature of optimum activity, i.e. activity drops above the optimum temperature but before unfolding occurs. As the authors note, Arcus et al. show that this can be accounted for by an activation heat capacity, significantly larger in psychrophiles. Signatures of this behavior are apparent at the large scale (e.g. Schipper et al Global Change Biol. 2014 20 3578; Alster et al (2016) Front. Microbiol. 7:1821) and it appears to be generally important.

      We also are enthralled by the many proposals put forward for the physical and thermodynamic behavior of enzymes and we look forward to rigorous tests of the predictions of these models. Like Reviewer #3, we expect that there are many different features and properties of enzymes to discover!

    1. Biophysics Colab

      Consolidated peer review report (30 November 2021)

      GENERAL ASSESSMENT

      The TMEM16 family of membrane proteins have been shown to function as calcium-activated chloride channels and lipid scramblases. In recent years, X-ray and cryo-EM structures have been solved for TMEM16 proteins in ligand-free and ligand-bound conformations, providing valuable structural information on their functional duality and activation mechanisms. It is largely accepted that the catalytic site (termed subunit groove or cavity) is mostly shielded from the membrane in the ligand-free TMEM16 scramblases. Calcium binding induces a conformational rearrangement of the cavity-lining helices, opening the groove to the surrounding membrane. Since the groove is hydrophilic, it was proposed that it serves as a permeation pathway for lipid headgroups while the hydrophobic lipid tails remain embedded into the hydrophobic membrane core, which has been termed the "credit card" mechanism of lipid scrambling. Additionally, structures of several TMEM16 homologs in lipid nanodiscs revealed that these proteins deform the lipid bilayer in the vicinity of the subunit cavity by bending and thinning the membrane, irrespective of the presence of the activating ligand calcium. Functional experiments also suggested that lipids can be scrambled outside of the open subunit cavity and that local protein-induced membrane deformation is critical for lipid scrambling.

      In the present study, Falzone and colleagues further address the mechanisms of lipid scrambling using single particle cryo-EM and liposome-based functional assays. Firstly, the authors solved the structure of a calcium-bound fungal homolog, afTMEM16, in nanodiscs with a lipid composition where the protein is maximally active. Although similar structures were obtained before, this new structure has the highest resolution thus far, representing > 1 Å improvement! The structure is beautiful and is a major achievement, which enabled the authors to resolve individual lipids and their interaction with the protein around the subunit cavity, whereas in previous structures unresolved non-protein densities were observed passing through the groove. The authors also solved a number of structures with and without calcium in lipid compositions that promote (thinner lipid bilayers) or suppress (thicker lipid bilayers) scrambling. The authors show that afTMEM16 can scramble lipids while the subunit groove remains closed, a phenomenon that is further enhanced in thinner membranes, whereas in thicker membranes scrambling is suppressed even though the groove is open. We particularly appreciated how different software packages and processing strategies were used to rigorously identify structural heterogeneity in their cryo-EM data. Remarkably, mutations of residues lining the subunit cavity and interacting with lipids do not appear to have dramatic effects on scrambling rates, which suggests that lipids do not need to interact with the protein to be scrambled. Thus, the overall conclusion of the study is that membrane thinning by TMEM16 scramblases in their calcium-free conformation is enough to induce lipid scrambling, and that the groove opening induced by calcium binding further enhances membrane deformation, promoting faster scrambling. By contrast, in thicker membranes the protein fails to sufficiently deform the bilayer and scrambling is suppressed, even when the subunit groove is open. The present study provides unprecedented structural information on the interaction of lipids with afTMEM16 and new evidence that lipids can be scrambled outside of the groove.

      The findings and conclusions presented here help to explain why TMEM16 scramblases can transport lipids with headgroups much bigger than the dimensions of the subunit cavity and why structures of some of the other scramblases (opsins, Xkrs and mammalian homolog TMEM16F) lack the obvious hydrophilic groove seen in fungal TMEM16 scramblases. Overall, this is a well-rounded study with an exceptional amount of high-quality cryo-EM data and functional experiments supporting the conclusions.

      RECOMMENDATIONS

      Revisions essential for endorsement:

      1) The resolved lipids binding within the groove in the first structure might be seen by some as supporting the credit card mechanism as it definitively demonstrates that lipids reside within the groove. While the authors provide evidence that lipids can permeate outside the groove in this and earlier work, as far as we can tell, none of that would preclude permeation through the groove if it doesn't require specific interactions between lipids and sidechains in the protein. The presentation might be improved with a somewhat more circumspect and nuanced exposition of the new data and how it can be understood with earlier results. Might the complex composition of native lipid membranes influence where and by what mechanism lipid movement between leaflets is catalysed by TMEM16 proteins?

      2) The quality of lipid densities in cryo-EM structures is greatly affected by the number of particles used and the resolution obtained during refinement and it is therefore not surprising that the beautiful lipid densities observed here in the structure of afTMEM16 in lipid nanodiscs in the presence of calcium refined to 2.3 Å are not all observed in subsequent structures with lower resolution. This is true not only for the P lipids near the groove, but for those D lipids bound near the dimer interface, which is a stable region of the protein that does not change conformation. To be cautious, the authors should avoid resting any conclusions on the absence of lipid densities in the lower resolution structures. For example, on pg 15 the authors seem to be interpreting the absence of density for C22 lipids.

      3) The presentation of structural interactions between lipids and residues near the groove of the protein could be improved in the figures. A panel like Fig. 1J but for the groove would help, but it would be good to see expanded perspectives in the form of a supplementary figure where residues around the headgroup of the lipids are shown along with EM maps so the quality of the structural information for both lipids and side chains can be better appreciated. The preprint does have a lot of images of the lipids and the protein, but not in a way that enables the reader to quickly grasp the nature of interactions between side chains and lipid moieties for themselves, and we feel that close-ups of individual lipids as suggested above would help. It is also not clear what the authors mean by "lipid headgroup". Have the authors only considered interactions of the phospholipid phosphate group with protein residues? It would be helpful if the authors could clarify this in the manuscript and say whether other types of interactions were considered. It would also be nice to include a close-up view of D511A/E514A in 0.5 mM calcium with cryo-EM density to demonstrate the absence of bound calcium ions.

      4) The functional data in Fig 2, 3 and 4 are also not discussed in much detail and it would help if the authors could expand the presentation. Although scrambling in the presence of a very high concentration of calcium is not dramatically altered by any of the mutations, there is quite a lot going on in the absence of calcium and very little is said about these results. For example, differences in the scrambling rates can be observed with some mutants in the presence and absence of calcium in figures 2E and 3E, but statistical analysis would be required to know if the differences between mutants are significant. The differences in scrambling rates with different lipids are also not discussed (e.g. Fig. 4A) It would help if the authors could discuss what is the margin of error in the scrambling assay, and point to some concrete examples from their earlier work on this specific scramblase where mutants have a large impact on scrambling activity in their assay. Have the authors tried intermediate more physiologically relevant concentrations of calcium to see if the mutants have discernible effects under those conditions?

      5) It is quite intriguing that the mutations in the subunit groove of afTMEM16 have little effect on scrambling activity. The authors propose that the groove-lining residues are not directly involved in lipid coordination even though their structure suggests that they do and there is a wealth of functional studies and MD simulations on various other TMEM16 homologs suggesting otherwise. The authors' suggestion that mutations probably affect the equilibrium between open and closed conformations of the groove in other homologs but not in afTMEM16 is logical, however, there are some discrepancies. To name a few examples, if indeed this is the case, nhTMEM16 mutants with closed groove should still have significant basal scrambling, by extrapolation from afTMEM16 data. Yet, some of the nhTMEM16 mutants (E313/E318/R432 mutants) have no activity at all, or no basal scrambling activity (Y439A) (Lee et al, 2018). Would you expect that point mutations within the subunit groove remove the ability of the protein to deform the membrane in its closed conformation? Might the groove have intermediate conformations between closed and fully open where the mutants studied might have more impact in afTMEM16? Further, mutating some of the residues on the scrambling domain of TMEM16 affected externalization of some lipid species, but not internalization etc. (Gyobu et al, 2017), which should not be the case if the interaction of the protein with the lipids is completely unnecessary for lipid scrambling. While investigating this question further would require follow-up structural studies on other TMEM16 homologs and is outside of the scope of this study, we think that the manuscript would benefit from a more extensive discussion on contradicting results and alternative interpretations. The authors might want to consider the possibility that there may be substantial variations in how different scramblases function. afTMEM16 has high constitutive activity in the absence of calcium, while at least TMEM16F does not. Additionally, the extent to which scrambling is promoted by calcium varies, as mammalian scramblases might need other cellular factors to be activated. Also, the extent to which scramblases are seen to distort the membrane is highly variable, as again seen in TMEM16F structures. Might some of these differences imply that key aspects of the mechanism of scrambling (e.g. thinning of the membrane or whether lipids scramble inside or outside the groove) are not the same for all scramblases? This might be one way to organize the discussion to help reconcile some of the seemingly divergent findings in the field.

      6) The authors should correct the Ramachandran outliers in C18/calcium and C22/calcium structures.

      Additional suggestions for the authors to consider:

      1) In several instances the authors conceptualize hypothetical mechanisms to set up experiments and frame their interpretations, which is not always the most straightforward way to communicate findings and what they reveal. The 'conveyor belt mechanism' introduced on page 10 is never fully defined in a way that helps the reader to understand what the functional effects of the mutants teach us. Might it be easier to set up the experiment by asking whether the interactions between sidechains that apparently interact with lipid headgroups in the structure play a critical role in scrambling, present the results and then conclude that they do not appear to? Collectively the functional effects of mutants do appear to suggest that specific side chain interactions are not critical for scrambling, but the conceptualized mechanism here makes the conclusions come across as unnecessarily forced. The credit-card mechanism has been formally introduced and discussed in the field but has already been shot down in earlier work from the group and seems overly simplistic if we already know that scrambling can occur both inside and outside the groove from earlier studies. Just something for the authors to think about.

      2) The uninitiated reader would greatly benefit from more of an introduction to the functional scrambling assay in the results and material and methods section so they can understand the results being presented. In the Material and methods, the authors mentioned: "All conditions were tested side by side with a control preparation", perhaps add here what exactly served as control – wild type protein in C18 lipids? It would be valuable to include information on the reconstitution efficiency between their preparations (WT in different lipid compositions and WT vs mutants). these if possible.

      3) Also, does the C18/calcium cryo-EM structure have sufficient resolution to distinguish between specific phospholipids (PG or PC) at the D1-D9 or P1-P7 positions? It would be particularly valuable if the authors could comment on whether PG or PC are observed in the D and P positions, or which lipids are lining the groove (P3-P6).

      4) While not essential, it would be interesting if the authors could perform the assay on the mutants with a more prominent effect in the absence of calcium (e.g. E310A, Y319A/F322A/K428A) with several additional calcium concentrations.

      5) The authors mentioned that the interaction of C22 lipids with the pathway helices is weaker than those of C18 lipids, which reflects the energy cost associated with distorting the longer lipids (page 15). However, they claimed that the interaction between the lipids and residues is not important for scrambling, which seems contradictory.

      REVIEWING TEAM

      Reviewed by:

      Angela Ballesteros, Research Fellow (K.J. Swartz lab, NINDS, USA): structural biology (X-ray crystallography), membrane protein function, lipid scrambling, cell biology, fluorescence microscopy

      Valeria Kalienkova, Postdoctoral Fellow (C. Paulino lab, University of Groningen, The Netherlands): membrane structural biology (X-ray crystallography and cryo-EM), membrane transport and lipid scrambling

      Kenton J. Swartz, Senior Investigator, NINDS, USA: ion channel structure and mechanisms, chemical biology and biophysics, electrophysiology and fluorescence spectroscopy

      Curated by:

      Kenton J. Swartz, Senior Investigator, NINDS, USA

      (This consolidated report is a result of peer review conducted by Biophysics Colab on version 1 of this preprint. Minor corrections and presentational issues have been omitted for brevity.)

    1. SciScore for 10.1101/2021.12.10.21267582: (What is this?)

      Please note, not all rigor criteria are appropriate for all manuscripts.

      Table 1: Rigor

      <table><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Ethics</td><td style="min-width:100px;border-bottom:1px solid lightgray">Consent: Subjects provided written consent prior to their participation and assignment to either the Echinacea or control group.<br>IRB: It was approved by the local ethical review board (Ethics Committee at Diagnostics and Consultation Center Convex Ltd, Sofia, registration nr: 116/26.10.2020) and registered on clinicaltrials.gov (identifier: NCT05002179).</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Sex as a biological variable</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Randomization</td><td style="min-width:100px;border-bottom:1px solid lightgray">Study Design and Participants: This randomized, parallel, open, no-treatment controlled, exploratory study was carried out in Bulgaria from 30th of November 2020 (first patient first visit) to 29th of May 2021 (last patient last visit) at one study centre (Diagnostics and Consultation Center Convex EOOD, Sofia).</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Blinding</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Power Analysis</td><td style="min-width:100px;border-bottom:1px solid lightgray">Sample size calculation & statistics: This study principally used descriptive biometric approaches to estimate effect sizes.</td></tr></table>

      Table 2: Resources

      No key resources detected.


      Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


      Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
      As mentioned, this study has limitations, first it used descriptive statistical methods, was small in size and secondly it did not use placebo for control and was not blinded. Nevertheless, the design was still considered valid to provide essential evidence for the preventive use of Echinacea during the COVID-19 pandemic for the following reasons: a first parameter was defined as incidence of (viral) RTIs, for which sample size calculation found sufficient statistical power of >80% for 120 included subjects. The lack of blinding/placebo might be considered a methodological weakness, but it can be assumed that the placebo effect/knowledge of therapy have only limited effects on detection of viral pathogens in NP/OP samples and blood serum. We therefore think that the study design was suitable to address the research question on antiviral effects of Echinaforce in vivo.

      Results from TrialIdentifier: We found the following clinical trial numbers in your paper:<br><table><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Identifier</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Status</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Title</td></tr><tr><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">NCT05002179</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Completed</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Echinaforce Study to Investigate Explorative Pharmacology an…</td></tr></table>


      Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


      Results from JetFighter: We did not find any issues relating to colormaps.


      Results from rtransparent:
      • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
      • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
      • Thank you for including a protocol registration statement.

      Results from scite Reference Check: We found no unreliable references.


      <footer>

      About SciScore

      SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.

      </footer>

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to reviewers

      Reviewer #1

      I believe that this is a very sound and authoritative study. The analysis of all data seems appropriate and robust, and many connections between the data (and subsets of data) and their possible interpretations have been considered. In fact, in the massive Results section, some interpretations are supported by cited references (this is not meant as a critique). However, I wonder about the length of the Results section, and the balance between it and the relatively short Discussion section. It is difficult for me to nail down any part of Results that might be shortened, as I could not find clear redundancies. I also think that the level of speculation is absolutely warranted, and I did not find excessive claims being made to this or that end. Rather, I suggest to broaden the perspective somewhat (in their Discussion; see below under Significance), which might allow people with a less mechanistic perspective to grasp the potential relevance of this work for non-model plant systems studied mostly by evolutionary geneticists.

      Response: We thank the reviewer for their kind remarks. We have spent a very large amount of time trying to streamline the results section and we are not sure if it would be possible to shorten it any further without removing critical details.

      We appreciate the reviewer’s comment to add more detail to the discussion to make it more appealing to evolutionary geneticists and we have added the following lines to the discussion section: “The WISO or “weak inbreeder/strong outcrosser” model (Brandvain & Haig, 2005) emerges from the dynamics of parental conflict and parent-of-origin effects. Under this model, a parent from populations with higher levels of outcrossing is exposed to higher levels of conflict and can thus dominate the programming of maternal resource allocation in a cross with an individual from a population with lower levels of outcrossing. Such a phenomenon has been observed in numerous clades including Dalechampia, Arabidopsis, Capsella and Leavenworthia (Brandvain & Haig, 2018; İltaş et al., 2021; Lafon-Placette et al., 2018; Raunsgard et al., 2018). Intriguingly, loss of function phenotypes in the RdDM pathway are more severe in recently outcrossing species than in A.thaliana (Grover et al., 2018; Wang et al., 2020) and suggests that RNA Pol IV functions are more elaborate and important in these species. This raises the possibility that the role for RNA Pol IV and RdDM in parental conflict that we describe in A.thaliana here is likely heightened in and mediates the elevated level of parental conflict in species that are currently or have been recently outcrossing.”

      One aspect that might warrant more scrutiny is the mapping of sRNA reads to the reference genome. I found the short section of this (M&M section, page 20, lines 23-25) to be too brief. It is not clear to me which of ShortStack's v3 weighting scheme the authors used, which is relevant for multi-mapping reads (see NR Johnson et al. 2016, G3). In addition, it is not mentioned whether zero mismatches were allowed. Perhaps this is described in more detail in Erdmann et al. (2017), but even if so, it deserves to be clarified here.

      Response: Small RNA reads were aligned after allowing two mismatches. This was indicated in the bowtie command (‘bowtie -v 2’ where v 2 indicates two mis-matches). We have added text to expand on the meaning of the commands.

      We have also expanded the commands used for ShortStack. We used the “Placement guided by uniquely mapping reads (-u)” option to divide the multi-mapping reads.

      The manuscript is well-written and concise, despite the length of the Results section. The verbal clarity and absence of typos or grammatical issues is superb. I did find some of the Figures to be somewhat "un-intuitive", in the sense that it takes acute concentration for an outsider (of sorts) to gather and interpret the underlying data. This is probably due to the many cross-comparisons of differences between two genotypes on one axis and those of a different pair of genotypes on the other axis. I am not sure how this issue can be ameliorated (nor whether this is really necessary); however, from a technical point of view, all Figures and Suppl. Figures are really well-done.

      Response: We thank the reviewer for their kind remarks. We have strived to make the figures easier to understand but we are aware that the figures do require a lot of concentration. We haven’t found an easy way to fix this. We thank the reviewer for patiently going through the figures.

      The list of references seems adequate in terms of citing relevant (both older and very recent) publications. However, almost all cited papers concern Arabidopsis or other model species; I suggest to consider adding a few relevant studies on non-Brassicaceae (whether considered model taxa or not), in conjunction with my suggestion (in Significance) to potentially broaden the scope by searching for natural phenomena that also involve parent-of-origin effects on endosperm/seed development. Curiously, many of the references are "incomplete" in the sense of stopping with the journal's name, then stating the doi, i.e. they lack volume numbers and page/article numbers. This should be harmonized throughout.

      Response: We have added references to non-Brassicaceae species and have also fixed the references.

      Reviewer #2: This manuscript provides evidence that a loss of either the maternal or paternal copy of NRPD1 have different, and sometime opposite, effects on the accumulation of small RNAs and on expression of a subset of genes, with a loss of the maternal copy having more substantial effects. The manuscript is well written, and the conclusions, as far as they go, are justified by the data, which are effectively presented. The overall effect is subtle but informative and according to the authors support a parental conflict model for imprinting. The experiments failed to find a smoking gun in the form of a mechanism to explain how or why the maternal and paternal alleles have different effects and the explanation for a lack of clear phenotypic differences was reasonable, but untested. I would have like to see it tested by looking in a plant species that is outcrossing and highly polymorphic. However, I do appreciate that the observation that the male and female alleles can have distinct effect when mutant is an important clue. My specific comments below may reflect confusion on my part, rather than real issues. If that is that I hope that confusion can aid in clarifying what are in places quite subtle points.

      Response: We thank the reviewer for their comments. We agree that it would be potentially informative to do similar experiments in an outcrossing species but that this is beyond the scope of this manuscript. Additionally, loss of NRPD1 or other components of the RdDM pathway has dramatic effects on gametogenesis in some examined outcrossing species(Grover et al., 2018; Wang et al., 2020), which could prevent the detection of subtle parent-of-origin effects on seed development.

      Page 6, last paragraph: "Because the endosperm is triploid, in these comparisons there are 3 (wild-type), 2 (pat nrpd1+/-), 1 (mat nrpd1+/-) and 0 (nrpd1-/-) functional NRPD1 alleles in the endosperm. However, NRPD1 is a paternally expressed imprinted gene in wild-type Ler x Col endosperm and the single paternal allele contributes 62% of the NRPD1 transcript whereas 38% comes from the two maternal alleles (Pignatta et al., 2014). Consistent with paternal allele bias in NPRD1 expression, mRNA-Seq data shows that NRPD1 is expressed at 42% of wild- type levels in pat nrpd1+/- and at 91% of wild-type levels in mat nrpd1+/- (Supplementary Table 6)". I would think this would really complicate the analysis. Should all of the dosage values include NPRD1 imprinting values? That is to say, expressed in terms of expression values? This is also a bit confusing. The maternal copies together express 38% of the transcript, so why isn't the mat nrpd1 at 68%, rather than 91%? In any event, given this imprinting and differences in dosage of the male and female it appears that two variables, parental origin and expression levels are being compared. Since 91% is awfully close to 100%, are the mat pat comparisons really just comparing low with nearly normal expression of NRPD1? And actually, given that, the outsized effect of the mat nrpd1 +/- is even more striking.

      Response: We included the details of dosage rather than imprinting values because the potential for buffering of expression upon loss of one allele could not be discounted. Indeed, we do find that the endosperm transcriptome buffers against the loss of the maternal or paternal alleles (Supplementary Table 6). The reviewer is correct in pointing out that the outsized effect of mat nrpd1+/- on gene expression is even more striking, and strongly supports our view that these effects are parental rather than endospermic.

      To reduce confusion in this section, we removed the details about 38% maternal allele transcripts obtained from our previous study, and instead report only the observed values from this study (which are also consistent with the previously reported paternally-biased expression of NRPD1 in endosperm).

      Page 4, Line 16. I'm afraid it's still a bit difficult to understand what was being compared what in this section. Please clarify.

      Response: The authors in this previously published study compared sRNAs obtained from wild-type whole seeds (which consists of three different tissues, including endosperm) with mutant endosperm. We are pointing out that the difference in tissue composition makes the effect of nrpd1 mutation hard to disentangle from the tissue differences between the two genotypes.

      Page 5, Line 5. I'm sure this is fine, but it's not entirely clear what is from the previously published paper and what is reanalysis here. All the crosses and measurements were made then, but not organized in this way?

      Response: This data was indeed previously published. In that analysis, we had pooled results from different crosses and calculated significance between genotypes using chi-square tests. During a later study (Satyaki and Gehring, 2019), we realized that we were losing information by ignoring the seed abortion values per cross. So, a reanalysis of that data on a cross by cross basis allowed us to find strong evidence for maternal and paternal effects.

      Page 6, Line 26. This is an excellent dosage series, but it's complicated by imprinting. So it's not 3, 2, 1, 0 effective copies. If we set the paternal copy at ~1 and each maternal at ~0.1, then it's 1.2 (wild type), 0.20 (pat nrpd1+/-), 1 (mat nrpd1+/-), and 0 (nrpd1-/-).

      Response: At the genomic DNA level, its 3, 2,1 and 0 doses. The reviewer’s comment on the transcriptional dose is not clear to us. Based on measured gene expression levels, relative wild-type NRPD1 transcriptional dose =1, pat nrpd1+/- is 0.42, and mat nrpd1+/- is 0.91.

      Page 6, line 31. Is the main thing we are comparing the difference between expression at 42% verses 91% of wild type?

      Response: We are using the small RNA-seq data alongside the mRNA-seq data to argue that loss of mat and pat nrpd1+/- have no impact on overall Pol IV activity in endosperm (as measured by small RNA production). A nrpd1 heterozygous endosperm has almost the same small RNA profile as a wild-type endosperm. Thus any effects seen in the endosperm, including the effects on mRNA expression described later in the manuscript, are likely parental rather than zygotic endospermic effects.

      Page 7, line 11. So, the overall effect in either direction on smRNA gene targets was really quite small, and I'm guessing the effect on gene expression was even smaller.

      Response: The effects of loss of maternal or paternal Pol IV on sRNAs was indeed small (Fig. 1/Fig. S3). Effect of loss of maternal Pol IV on gene expression was substantially large and distinct from the relatively small impacts observed upon loss of paternal Pol IV (Fig. 3) This observation supports the view that Pol IV mediates parent-of-origin effects on gene expression.

      Page 7, line 17. I take it that it is this difference, rather than the overall numbers that is of interest.

      Response: Correct. The lack of a relationship between sRNAs impacted upon loss of mat and pat nrpd1 is additionally suggestive of parent-of-origin effects

      Page 9, line 2. Really interesting, since one might expect that these are methylated loci that would be expected to be fed into any existing embryo maintenance methylation pathway. Surprising that they are maintained independently.

      Response: It is indeed surprising that Pol IV activity in parents can have different impacts on sRNAs in the endosperm. It should be noted though, that as described in Erdmann et al 2017 and in this paper later on, many endosperm sRNA loci are in fact not associated with endosperm DNA methylation. In addition, sRNA loci that are dependent on paternal Pol IV activity are more likely to be associated with DNA methylation than are sRNA loci associated with maternal Pol IV activity. These points have been described in Figure S8.

      Page 9, line 22. Proportion of total imprinted genes? Did the mutant obviate/enhance the imprinting?

      Response: We have modified the manuscript to describe effects on imprinted genes: “ The expression of imprinted genes is known to be regulated epigenetically in endosperm. In mat nrpd1+/- imprinted genes were more likely to be mis-regulated than expected by chance (hypergeometric test p-15) – 15 out of 43 paternally expressed and 45 out of 128 maternally expressed imprinted genes were mis-regulated in mat nrpd1+/- while two maternally expressed imprinted genes but no paternally expressed imprinted genes were mis-regulated in pat nrpd1+/- (Table S6).” We have also added a new supplementary figure (Fig. S6) that describes the impacts of NRPD1 loss of imprinted gene expression.

      Page 9, line 27. How could 2) occur in the homozygous mutant?

      Response: Loss of NRPD1 may impact gene expression in both parents. When the nrpd1-/- mutant endosperm is investigated, we are also examining the consequences of the inheritance of these disrupted gene expression states. We refer to this as epistatic interactions of mat and pat nrpd1.

      Page 10, line 9. Interesting!

      Response: We strongly agree!

      Page 10, line 11. Is this 2.7 versus 2.18 significant because it's statistically significant, or because it's conceptually significant?

      Response: We are pointing out that the 2.7-fold value is quite similar to the predicted value of 2.18-fold, which is arrived at by simply summing the effects of mat nrpd1 and pat nrpd1. This is a conceptually significant point.

      Are the examples in 3D representative, or the most convincing examples? And a big difference in ROS1 is of some concern, since that may well be expected to affect imprinting indirectly. I know I'm being picky here, but the pattern is so intriguing I'd be worried about confirmation bias.

      Response: The examples in 3D are representative for those genes with significant changes in expression in both mat and pat nrpd1, and other genes also behave similarly. The antagonistic effect described for 3D can also be observed as a much broader trend affecting hundreds of genes to varying extents in Fig 3C and 3E-H. The concern about ROS1 is not clear to us but we agree that an effect of ROS1 may be one way that NRPD1 controls gene expression.

      Page 10, line 18. Ok, but 0.123 is a pretty subtle negative correlation. Although I do appreciate that it clearly is not a positive correlation. If I'm understanding correctly, this was the "aha" moment, because it's exactly what one might expect of NRPD1 from the mother and father or working at cross purposes. But the numbers are getting awfully small here.

      Response: It is unclear how to calibrate our expectations of effect sizes considering that our study is the first (to our knowledge) to make such a measurement involving gene expression in parental conflict. A review of the few empirical examples of parental conflict’s impact on seeds shows that parental conflict may drive small changes in seed size (Brandvain and Haig, 2018).

      The evolution of quantitative traits maybe driven by selection for large effects at a small number of loci and/or by selection of small effects at a large number of loci. In a similar vein, parental conflict can impact seed phenotypes either via large effects at a few loci or via small effects at a large number of loci. Our analysis described in Fig 3D-H can fit either possibility. Large effects can be found at a few loci such as SUC2 and PICC (Fig. 3D). Smaller antagonistic effects can be found at hundreds of loci as shown in Figure 5A. The negative correlation described in this figure can be observed even upon dropping the genes that show a statistically significant differential expression in both mat and pat nrpd1+/- (slope after dropping genes significantly mis-regulated in both mat and pat nrpd1+/- is -0.126). In summary, a correlation of -0.123 strongly supports the existence of a widespread antagonistic regulatory effect.

      Page 10, line 29. The point simply being that that other phenomenon is also significant even if the differences are that large?

      Response: We are pointing out that the magnitude of the effects we see are similar to that observed for phenomenon such as dosage compensation.

      Page 12. So, there is no effect on cleavage and no obvious effect on flanking siRNA clusters. The suspense is building...

      Page 12, line 24. And not in potential regulatory regions? CNSs?

      Response: We did not identify a significant enrichment for differentially methylated regions in regulatory regions. We used the relative distance function in bedtools (https://bedtools.readthedocs.io/en/latest/content/tools/reldist.html) to calculate the relationship between the genomic location of DMRs and genomic location of a differentially expressed gene. This analysis was chosen as it does not make a priori assumptions about the size of the regulatory region of a gene. A broad association between DMRs and differentially expressed genes would be indicated by a frequency far greater than 0.02. We show the results of this analysis in Fig. S8F; we find no evidence for significant enrichment of DMRs in the regulatory regions of differentially expressed genes.

      Page 12, line 28. I guess it depend on whether or not the changes are in regulatory sequences no immediately apparent as part of the gene, doesn't it?

      Response: We examined DNA methylation over genes here because in endosperm, unlike in other tissues, many small RNAs are genic. Moreover, DNA methylation within the gene may control transcript abundance (Eimer et al., 2018; Klosinska et al., 2016). We have also examined regulatory regions adjacent to genes in Fig S8F and found no effect.

      Line 13, line 2. Not sure it's that important, but couldn't you chop all of these genes in half and see if they are no longer significant collectively?

      Response: We do not think that this will provide a useful insight.

      Page 14, line 15. I'm afraid I'm getting confused here with the terms cis and trans here. Just to be clear, cis means a direct effect of small RNAs that are dependent on NRPD1 on a gene and trans means anything else? But in this context, it's not clear that is what is meant. Do you mean that gene expression is determined and preset in the gametophyte? What are the levels of expression of NRPD1 in the two gametophytes?

      Response: The reviewer’s interpretation of cis and trans is correct. However, the cis imprints may be preset in gametophytes or in the sporophytic tissues that surround or give rise to the gametophyte. Pol IV is known to be active either in gametophyte or in related sporophytic tissues in both the mother and the father(Kirkbride et al., 2019; Long et al., 2021; Olmedo-Monfil et al., 2010).

      Page 14, line 19. Prior to fertilization?

      Response: Yes, that is the idea. As described in the manuscript, Pol IV activity in either the parental sporophyte or gametophyte prior to fertilization could impact gene expression in the endosperm after fertilization.

      Page 14, line 27. Do you mean driven by, or just associated with?

      Response: In response to the comment, we have replaced the phrase “driven by” with “due to” for increased clarity. In wild-type, DOG1 is predominantly expressed from the paternal allele. In mat nrpd1+/-, the paternal allele is somewhat upregulated but the maternal allele, which is almost silent in wild-type, is highly expressed in mat nrpd1+/-.

      Page 15, line 26. And this is really the issue. The primary conclusion, backed up by the lack (I'm assuming) of phenotypic differences between mat NRPD1 -/+ and pat NRPD1 +/- suggests that the observed differences in expression are not particularly important, unless the exceptional cases are informative.

      Response: We are not sure whether the reviewer means “issue” in a negative, neutral, or positive light. Seed phenotypes are often subtle and we have not examined phenotypic differences in sufficient detail to comment.

      Page 15, line 12. Yes, but I'm not at all clear what the mechanism for this is.

      Response: We have tested and falsified multiple hypotheses to explain how Pol IV can regulate gene expression in endosperm. Considering the complex genetics and the difficulty of isolating endosperm, we have concluded that this is a matter for a future study. The point of this study is the discovery of Pol IV’s parental effects.

      Page 15, line 23. Since this is a very small subset of genes, are these genes that you might expect to play a role in parental conflict?

      Response: The functions of most genes in endosperm remain unknown. However, some have a likely role in conflict. SUC2 is antagonistically regulated by parental Pol IV (Fig. 3D). SUC2 transports sucrose, the key form of carbon imported into seeds from the mother (Sauer & Stolz, 1994).

      Page 15, line 33. Indeed, these could be the informative exceptions.

      Response: We believe the reviewer means that the identify of strongly antagonistically regulated genes may be informative in terms of thinking about these results in the context of parental genetic conflict, which we agree with.

      Page 15, line 29. Hardly surprising, given that the paternal copy of NRPD1 is expressed at a higher level than the maternal copies, is it?

      Response: It is actually somewhat surprising since we show in Fig. 2 that the sRNA production in mat and pat nrpd1+/- are comparable to that of wild-type. The higher contribution of NRPD1 from the paternal copy does not really explain the methylation differences

      Page 16, line 1. So this is what you mean by in cis. Presetting?

      Response: The reviewer’s previous interpretation of cis (acting directly at a target gene) is correct. However, the cis imprints may be preset in gametophyte or in the sporophytic tissues that surround or give rise to the gametophyte. Pol IV is known to be active in gametophytes and in related sporophytic tissues in both the mother and the father.

      These are intriguing results that would benefit from a test of the hypothesis by comparing these result with those obtained in an outcrossing plant species.

      Response: We agree that it would interesting and informative to perform similar experiments in an outcrossing species. However, loss of NRPD1 or other components of the the RdDM pathway have dramatic effects on gametogenesis in outcrossing species (Grover et al., 2018; Wang et al., 2020), preventing the detection of subtle parent-of-origin effects on seed development. Additionally, this would be a separate study.

      Reviewer #3

      We thank the reviewer for their comments.

      • Expression of NRPD1 was 42% of WT in paternal nrpd1 and 91% of WT in maternal nrpd1, yet throughout the paper the effect of maternal nrpd1 was far stronger than paternal nrpd1. The authors may also want to confirm that protein levels follow the same pattern, in case protein degradation or post-transcriptional regulation may play a role.

      Response: We show in Fig. 2 that sRNA production in mat and pat nrpd1+/- are similar to wild-type endosperm. This strongly suggests that NRPD1 protein is produced at functionally equivalent levels in wild-type, mat and pat nrpd1+/-. The finding that mat nrpd1+/- has a stronger effect on gene expression and small RNAs, despite having higher levels of NRPD1 transcript in endosperm, is consistent with our conclusion that the effects we are observing in heterozygous endosperm are due to NRPD1 action before fertilization.

      P. 9 line 1 - this only seems to be true for maternal ISRs, not paternal ISRs; this claim should be narrowed.

      Response: Accordingly, we have modified the text here to : “In summary, these results indicate that most maternally and some paternally imprinted sRNA loci in endosperm are dependent on Pol IV activity in the parents and are not established de novo post-fertilization.”

      A small number of sRNA loci become highly depleted in maternal nrpd1 but not paternal nrpd1 (Fig. 1D, F, Fig. 2C) - are these siren loci?

      Response: This is an interesting question. Siren loci have not been defined in Arabidopsis but are described as loci with high levels of sRNAs in ovules, seed coat, endosperm and embryo (Grover et al., 2020). Loci losing sRNAs in maternal nrpd1+/- include a large number of maternally expressed imprinted sRNAs (mat ISRs). We do not know if mat ISR loci are expressed in the ovule. In Erdmann et al (2017), we excluded loci that were also expressed in the seed coat from mat ISRs. Thus, these loci meet only some of the conditions for being defined as siren loci.

      Fig. 2 suggests that many of the downregulated sRNA regions in maternal nrpd1 are maternally biased to begin with. Related, are genic sRNAs more likely to be affected by maternal or paternal nrpd1 than non-genic or TE sRNAs?

      Response: As described in Fig. 1B and S3, loss of maternal NRPD1 has more impacts on the sRNA landscape. As a percentage of total loci, genes are more likely to be affected than TEs.

      For the sRNA loci shown in Fig. 2C, how is % maternal affected in maternal vs. paternal nrpd1? These ISRs are normally maternal or paternal biased, does this change in maternal or paternal nrpd1?

      Response: We assess the allelic bias of ISRs only when they have at least ten reads in the genotypes being compared. In mat nrpd1+/-, most mat ISRs lose almost all their reads (Fig. 2) and we can assess allelic bias only at 107/366 mat ISRs. As seen in the Rev. comment. Fig1, these 107 lose their maternal bias. In pat nrpd1+/-, loci with maternally biased sRNAs show somewhat increased expression (Fig 2E) but do not show an appreciable change in maternal bias (Figure Review 1). All paternal ISRs do not show any dramatic impacts on allelic bias in mat or pat nrpd1+/-. We have not added this additional datapoint to our paper because we were worried that the paper was becoming too dense – a concern also voiced by reviewer 1. However, we can add this to the manuscript if the reviewer prefers.

      • Might have missed this, but I didn't see the gene ontology results (p9 line 16) shown anywhere? Would like to see significance values, fold enrichments, etc. In particular, the group of paternal nrpd1 up-regulated genes seems too small to have much confidence for GO enrichment analysis.

      Response: We have added a Supplementary Table 7 with outputs of GO analyses.

      • I would suggest expanding the analysis in Fig. 3D-H to explore whether the additive model is more predictive of nrpd1-/- expression levels than other potential models (epistatic, etc.) in general at all genes, or only at the subsets of genes shown, independently of whether the effects are large enough to pass the arbitrary significance cutoffs used in E-H. Identifying specifically which genes do and don't follow this additive pattern could help dissect mechanism. For example, genes following this pattern might share a TF binding site for a TF that is regulated by Pol IV.

      Response: While we are interested, we currently cannot explore other models such as epistasis as this would require knock-down of NRPD1 in the endosperm and we plan to do this as part of a future study.

      1. 13 line 26 - how do changes in CG methylation in maternal or paternal nrpd1 compare to changes in dme or ros1? Do either set of DMRs significantly overlap dme or ros1 DMRs? Could some of these be explained by changes in ROS1 expression, since ROS1 is a Pol IV target?

      Response: Yes. It’s entirely possible that a subset of observed gene expression changes are linked to changes in ROS1 expression. However, there are no comparable methylation data for ROS1 in the endosperm. A potential role for ROS1 has been discussed on Page 11, line 4. Comparison with DMRs in the dme endosperm is difficult. dme mutant endosperm has low non-CG methylation (Ibarra et al., 2012). We have unpublished data showing that the expression of genes involved in RNA-directed DNA methylation (RdDM) is reduced in the dme endosperm. It is therefore difficult to understand if and how DME-mediated demethylation may impact RdDM.

      P. 10 line 3 - is the overlap of 36 out of 51 genes unlikely to occur by chance

      Response: A hypergeometric test indicates that this is indeed significant. We have added it to text on Page 9, line 34.

      In sRNA and mRNA-seq libraries, what was the overall maternal/paternal ratio in each library? Did loss of Pol IV affect this?

      The graphs above show the maternally derived fraction of mRNA and sRNA libraries for different genotypes. Please note that the Ler nrpd1 mutant was generated by backcrossing Col-0 nrpd1+/- into Ler. Some Col-0 regions remain in this background and are called “hold-outs”. Reads mapping to these hold-outs have been excluded while calculating the maternal fraction of each library described in the graph above. We cannot confidently judge if the overall maternal fraction of the mRNA transcriptome is affected by loss of NRPD1 as we likely need more replicates. However, we find that loss of all NRPD1-dependent sRNAs (as in the nrpd1 null mutant) leaves behind sRNAs that roughly reflect the genomic 2:1 ratio.

      P. 9 line 22 - how many paternally and maternally expressed imprinted genes were considered? Were imprinted genes statistically more likely to be misregulated in mat nrpd1?

      Response: We considered 128 maternally and 43 paternally expressed genes that had been previously been identified as imprinted in Col x Ler crosses (Pignatta et al 2014). We have modified the manuscript to describe effects on imprinted genes: “ The expression of imprinted genes is known to be regulated epigenetically in endosperm. In mat nrpd1+/- imprinted genes were more likely to be mis-regulated than expected by chance (hypergeometric test p-15) – 15 out of 43 paternally expressed and 45 out of 128 maternally expressed imprinted genes were mis-regulated in mat nrpd1+/- while two maternally expressed imprinted genes but no paternally expressed imprinted genes were mis-regulated in pat nrpd1+/- (Table S6). “ We have also added a supplementary figure (Figure S6) that focuses on genic mRNA imprinting in NRPD1 heterozygotes and homozygous mutants.

      References cited in the response

      Brandvain, Y., & Haig, D. (2005). Divergent Mating Systems and Parental Conflict as a Barrier to Hybridization in Flowering Plants. The American Naturalist, 166(3), 330–338. https://doi.org/10.1086/432036

      Brandvain, Y., & Haig, D. (2018). Outbreeders pull harder in a parental tug-of-war. Proceedings of the National Academy of Sciences, 115(45), 11354–11356. https://doi.org/10.1073/pnas.1816187115

      Eimer, H., Sureshkumar, S., Singh Yadav, A., Kraupner-Taylor, C., Bandaranayake, C., Seleznev, A., Thomason, T., Fletcher, S. J., Gordon, S. F., Carroll, B. J., & Balasubramanian, S. (2018). RNA-Dependent Epigenetic Silencing Directs Transcriptional Downregulation Caused by Intronic Repeat Expansions. Cell. https://doi.org/10.1016/j.cell.2018.06.044

      Grover, J. W., Burgess, D., Kendall, T., Baten, A., Pokhrel, S., King, G. J., Meyers, B. C., Freeling, M., & Mosher, R. A. (2020). Abundant expression of maternal siRNAs is a conserved feature of seed development. Proceedings of the National Academy of Sciences of the United States of America, 117(26), 15305–15315. https://doi.org/10.1073/pnas.2001332117

      Grover, J. W., Kendall, T., Baten, A., Burgess, D., Freeling, M., King, G. J., & Mosher, R. A. (2018). Maternal components of RNA ‐directed DNA methylation are required for seed development in Brassica rapa. The Plant Journal, 94(4), 575–582. https://doi.org/10.1111/tpj.13910

      Ibarra, C. A., Feng, X., Schoft, V. K., Hsieh, T.-F., Uzawa, R., Rodrigues, J. A., Zemach, A., Chumak, N., Machlicova, A., Nishimura, T., Rojas, D., Fischer, R. L., Tamaru, H., & Zilberman, D. (2012). Active DNA Demethylation in Plant Companion Cells Reinforces Transposon Methylation in Gametes. Science, 337(6100), 1360–1364. https://doi.org/10.1126/science.1224839

      İltaş, Ö., Svitok, M., Cornille, A., Schmickl, R., & Lafon Placette, C. (2021). Early evolution of reproductive isolation: A case of weak inbreeder/strong outbreeder leads to an intraspecific hybridization barrier in Arabidopsis lyrata. Evolution, 75(6), 1466–1476. https://doi.org/10.1111/evo.14240

      Kirkbride, R. C., Lu, J., Zhang, C., Mosher, R. A., Baulcombe, D. C., & Chen, Z. J. (2019). Maternal small RNAs mediate spatial-temporal regulation of gene expression, imprinting, and seed development in Arabidopsis. Proceedings of the National Academy of Sciences, 116(7), 2761–2766. https://doi.org/10.1073/pnas.1807621116

      Klosinska, M., Picard, C. L., & Gehring, M. (2016). Conserved imprinting associated with unique epigenetic signatures in the Arabidopsis genus. Nature Plants, 2, 16145. https://doi.org/10.1038/nplants.2016.145

      Lafon-Placette, C., Hatorangan, M. R., Steige, K. A., Cornille, A., Lascoux, M., Slotte, T., & Köhler, C. (2018). Paternally expressed imprinted genes associate with hybridization barriers in Capsella. Nature Plants, 4(6), 352–357. https://doi.org/10.1038/s41477-018-0161-6

      Long, J., Walker, J., She, W., Aldridge, B., Gao, H., Deans, S., Vickers, M., & Feng, X. (2021). Nurse cell­–derived small RNAs define paternal epigenetic inheritance in Arabidopsis. Science, 373(6550). https://doi.org/10.1126/science.abh0556

      Olmedo-Monfil, V., Durán-Figueroa, N., Arteaga-Vázquez, M., Demesa-Arévalo, E., Autran, D., Grimanelli, D., Slotkin, R. K., Martienssen, R. A., & Vielle-Calzada, J.-P. (2010). Control of female gamete formation by a small RNA pathway in Arabidopsis. Nature, 464(7288), 628–632. https://doi.org/10.1038/nature08828

      Raunsgard, A., Opedal, Ø. H., Ekrem, R. K., Wright, J., Bolstad, G. H., Armbruster, W. S., & Pélabon, C. (2018). Intersexual conflict over seed size is stronger in more outcrossed populations of a mixed-mating plant. Proceedings of the National Academy of Sciences, 115(45), 11561–11566. https://doi.org/10.1073/pnas.1810979115

      Sauer, N., & Stolz, J. (1994). SUC1 and SUC2: two sucrose transporters from Arabidopsis thaliana; expression and characterization in baker’s yeast and identification of the histidine-tagged protein. The Plant Journal, 6(1), 67–77. https://doi.org/10.1046/j.1365-313X.1994.6010067.x

      Wang, Z., Butel, N., Santos-González, J., Borges, F., Yi, J., Martienssen, R. A., Martinez, G., & Köhler, C. (2020). Polymerase IV Plays a Crucial Role in Pollen Development in Capsella. The Plant Cell, 32(4), 950–966. https://doi.org/10.1105/tpc.19.00938

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript provides evidence that a loss of either the maternal or paternal copy of NRPD1 have different, and sometime opposite, effects on the accumulation of small RNAs and on expression of a subset of genes, with a loss of the maternal copy having more substantial effects. The manuscript is well written, and the conclusions, as far as they go, are justified by the data, which are effectively presented. The overall effect is subtle but informative and according to the authors support a parental conflict model for imprinting. The experiments failed to find a smoking gun in the form of a mechanism to explain how or why the maternal and paternal alleles have different effects and the explanation for a lack of clear phenotypic differences was reasonable, but untested. I would have like to see it tested by looking in a plant species that is outcrossing and highly polymorphic. However, I do appreciate that the observation that the male and female alleles can have distinct effect when mutant is an important clue. My specific comments below may reflect confusion on my part, rather than real issues. If that is that I hope that confusion can aid in clarifying what are in places quite subtle points.

      Specific comments:

      Page 6, last paragraph: "Because the endosperm is triploid, in these comparisons there are 3 (wild-type), 2 (pat nrpd1+/-), 1 (mat nrpd1+/-) and 0 (nrpd1-/-) functional NRPD1 alleles in the endosperm. However, NRPD1 is a paternally expressed imprinted gene in wild-type Ler x Col endosperm and the single paternal allele contributes 62% of the NRPD1 transcript whereas 38% comes from the two maternal alleles (Pignatta et al., 2014). Consistent with paternal allele bias in NPRD1 expression, mRNA-Seq data shows that NRPD1 is expressed at 42% of wild- type levels in pat nrpd1+/- and at 91% of wild-type levels in mat nrpd1+/- (Supplementary Table 6)".

      I would think this would really complicate the analysis. Should all of the dosage values include NPRD1 imprinting values? That is to say, expressed in terms of expression values? This is also a bit confusing. The maternal copies together express 38% of the transcript, so why isn't the mat nrpd1 at 68%, rather than 91%? In any event, given this imprinting and differences in dosage of the male and female it appears that two variables, parental origin and expression levels are being compared. Since 91% is awfully close to 100%, are the mat pat comparisons really just comparing low with nearly normal expression of NRPD1? And actually, given that, the outsized effect of the mat nrpd1 +/- is even more striking.

      Page 4, Line 16. I'm afraid it's still a bit difficult to understand what was being compared what in this section. Please clarify.

      Page 5, Line 5. I'm sure this is fine, but it's not entirely clear what is from the previously published paper and what is reanalysis here. All the crosses and measurements were made then, but not organized in this way?

      Page 6, Line 26. This is an excellent dosage series, but it's complicated by imprinting. So it's not 3, 2, 1, 0 effective copies. If we set the paternal copy at ~1 and each maternal at ~0.1, then it's 1.2 (wild type), 0.20 (pat nrpd1+/-), 1 (mat nrpd1+/-), and 0 (nrpd1-/-).

      Page 6, line 31. Is the main thing we are comparing the difference between expression at 42% verses 91% of wild type?

      Page 7, line 11. So, the overall effect in either direction on smRNA gene targets was really quite small, and I'm guessing the effect on gene expression was even smaller.

      Page 7, line 17. I take it that it is this difference, rather than the overall numbers that is of interest.

      Page 9, line 2. Really interesting, since one might expect that these are methylated loci that would be expected to be fed into any existing embryo maintenance methylation pathway. Surprising that they are maintained independently.

      Page 9, line 22. Proportion of total imprinted genes? Did the mutant obviate/enhance the imprinting?

      Page 9, line 27. How could 2) occur in the homozygous mutant?

      Page 10, line 9. Interesting!

      Page 10, line 11. Is this 2.7 versus 2.18 significant because it's statistically significant, or because it's conceptually significant? Are the examples in 3D representative, or the most convincing examples? And a big difference in ROS1 is of some concern, since that may well be expected to affect imprinting indirectly. I know I'm being picky here, but the pattern is so intriguing I'd be worried about confirmation bias.

      Page 10, line 18. Ok, but 0.123 is a pretty subtle negative correlation. Although I do appreciate that it clearly is not a positive correlation. If I'm understanding correctly, this was the "aha" moment, because it's exactly what one might expect of NRPD1 from the mother and father or working at cross purposes. But the numbers are getting awfully small here.

      Page 10, line 29. The point simply being that that other phenomenon is also significant even if the differences are that large?

      Page 12. So, there is no effect on cleavage and no obvious effect on flanking siRNA clusters. The suspense is building...

      Page 12, line 24. And not in potential regulatory regions? CNSs?

      Page 12, line 28. I guess it depend on whether or not the changes are in regulatory sequences no immediately apparent as part of the gene, doesn't it?

      Line 13, line 2. Not sure it's that important, but couldn't you chop all of these genes in half and see if they are no longer significant collectively?

      Page 14, line 15. I'm afraid I'm getting confused here with the terms cis and trans here. Just to be clear, cis means a direct effect of small RNAs that are dependent on NRPD1 on a gene and trans means anything else? But in this context, it's not clear that is what is meant. Do you mean that gene expression is determined and preset in the gametophyte? What are the levels of expression of NRPD1 in the two gemetophytes?

      Page 14, line 19. Prior to fertilization?

      Page 14, line 27. Do you mean driven by, or just associated with?

      Page 15, line 26. And this is really the issue. The primary conclusion, backed up by the lack (I'm assuming) of phenotypic differences between mat NRPD1 -/+ and pat NRPD1 +/- suggests that the observed differences in expression are not particularly important, unless the exceptional cases are informative.

      Page 15, line 12. Yes, but I'm not at all clear what the mechanism for this is.

      Page 15, line 23. Since this is a very small subset of genes, are these genes that you might expect to play a role in parental conflict?

      Page 15, line 33. Indeed, these could be the informative exceptions.

      Page 15, line 29. Hardly surprising, given that the paternal copy of NRPD1 is expressed at a higher level than the maternal copies, is it?

      Page 16, line 1. So this is what you mean by in cis. Presetting?

      Page 16, line 9. So ideally, one would want to look at a highly polymorphic out-crosser. I'm not suggesting that for this paper, but would this be a good test of the hypothesis? How about maize?

      Page 16, line 15. But the pat and mat heterozygotes looked the same. No differences in phenotype?

      Page 17, line 22. I'm confused, since aren't most 24 nt smRNAs dependent on POLIV (Figure S2)? Do you mean differentially regulated smRNAs? Expression of POLIV specifically in one or the other parent?

      Page 17, line 23. How are you defining important here? Important because at least in the female NPRD1 is not expressed in the central cell? But not important, since this mutant has no effect on phenotype except in an imbalanced cross.

      Page 18, line 13. For this reason, it would be nice to know much more about these genes. Mutant phenotypes, for instance. And how many of these have this feature conserved?

      Significance

      These are intriguing results that would benefit from a test of the hypothesis by comparing these result with those obtained in an outcrossing plant species.

      Referee Cross-commenting

      I agree that the other comments seem both fair and reasonable.

  6. veronicang68.wordpress.com veronicang68.wordpress.com
    1. A commonality in rap music videos is lighter skin toned girls because they have “privileges based on their Eurocentric appearance” (Conrad) and are deemed more desirable

      Looking back to my first draft I had only included a brief summary of the article I had read without putting much of my opinion in the paragraph. As I was editing I had to keep asking myself "yeah but why do you think it is important" or "what do you think about it" because I realized the strength of this essay is supposed to be based on my thoughts and journey through discovering an answer to my question rather than summaries of my research. So, in the final draft I knew I had to include why I think rap videos make a greater impact than what we may originally believe. I added in my thoughts about how in the black community there is an "internal battle" regarding the issue of colorism. I realize now I could have included more about how some artist purposefully choose darker skin extras in their videos to compare to those who don't .

  7. classroom.google.com classroom.google.com
    1. According to all known laws of aviation,

      there is no way a bee should be able to fly.

      Its wings are too small to get its fat little body off the ground.

      The bee, of course, flies anyway

      because bees don't care what humans think is impossible.

      Yellow, black. Yellow, black. Yellow, black. Yellow, black.

      Ooh, black and yellow! Let's shake it up a little.

      Barry! Breakfast is ready!

      Ooming!

      Hang on a second.

      Hello?

      • Barry?
      • Adam?
      • Oan you believe this is happening?
      • I can't. I'll pick you up.

      Looking sharp.

      Use the stairs. Your father paid good money for those.

      Sorry. I'm excited.

      Here's the graduate. We're very proud of you, son.

      A perfect report card, all B's.

      Very proud.

      Ma! I got a thing going here.

      • You got lint on your fuzz.
      • Ow! That's me!
      • Wave to us! We'll be in row 118,000.
      • Bye!

      Barry, I told you, stop flying in the house!

      • Hey, Adam.
      • Hey, Barry.
      • Is that fuzz gel?
      • A little. Special day, graduation.

      Never thought I'd make it.

      Three days grade school, three days high school.

      Those were awkward.

      Three days college. I'm glad I took a day and hitchhiked around the hive.

      You did come back different.

      • Hi, Barry.
      • Artie, growing a mustache? Looks good.
      • Hear about Frankie?
      • Yeah.
      • You going to the funeral?
      • No, I'm not going.

      Everybody knows, sting someone, you die.

      Don't waste it on a squirrel. Such a hothead.

      I guess he could have just gotten out of the way.

      I love this incorporating an amusement park into our day.

      That's why we don't need vacations.

      Boy, quite a bit of pomp... under the circumstances.

      • Well, Adam, today we are men.
      • We are!
      • Bee-men.
      • Amen!

      Hallelujah!

      Students, faculty, distinguished bees,

      please welcome Dean Buzzwell.

      Welcome, New Hive Oity graduating class of...

      ...9:15.

      That concludes our ceremonies.

      And begins your career at Honex Industries!

      Will we pick ourjob today?

      I heard it's just orientation.

      Heads up! Here we go.

      Keep your hands and antennas inside the tram at all times.

      • Wonder what it'll be like?
      • A little scary.

      Welcome to Honex, a division of Honesco

      and a part of the Hexagon Group.

      This is it!

      Wow.

      Wow.

      We know that you, as a bee, have worked your whole life

      to get to the point where you can work for your whole life.

      Honey begins when our valiant Pollen Jocks bring the nectar to the hive.

      Our top-secret formula

      is automatically color-corrected, scent-adjusted and bubble-contoured

      into this soothing sweet syrup

      with its distinctive golden glow you know as...

      Honey!

      • That girl was hot.
      • She's my cousin!
      • She is?
      • Yes, we're all cousins.
      • Right. You're right.
      • At Honex, we constantly strive

      to improve every aspect of bee existence.

      These bees are stress-testing a new helmet technology.

      • What do you think he makes?
      • Not enough.

      Here we have our latest advancement, the Krelman.

      • What does that do?
      • Oatches that little strand of honey

      that hangs after you pour it. Saves us millions.

      Oan anyone work on the Krelman?

      Of course. Most bee jobs are small ones. But bees know

      that every small job, if it's done well, means a lot.

      But choose carefully

      because you'll stay in the job you pick for the rest of your life.

      The same job the rest of your life? I didn't know that.

      What's the difference?

      You'll be happy to know that bees, as a species, haven't had one day off

      in 27 million years.

      So you'll just work us to death?

      We'll sure try.

      Wow! That blew my mind!

      "What's the difference?" How can you say that?

      One job forever? That's an insane choice to have to make.

      I'm relieved. Now we only have to make one decision in life.

      But, Adam, how could they never have told us that?

      Why would you question anything? We're bees.

      We're the most perfectly functioning society on Earth.

      You ever think maybe things work a little too well here?

      Like what? Give me one example.

      I don't know. But you know what I'm talking about.

      Please clear the gate. Royal Nectar Force on approach.

      Wait a second. Oheck it out.

      • Hey, those are Pollen Jocks!
      • Wow.

      I've never seen them this close.

      They know what it's like outside the hive.

      Yeah, but some don't come back.

      • Hey, Jocks!
      • Hi, Jocks!

      You guys did great!

      You're monsters! You're sky freaks! I love it! I love it!

      • I wonder where they were.
      • I don't know.

      Their day's not planned.

      Outside the hive, flying who knows where, doing who knows what.

      You can'tjust decide to be a Pollen Jock. You have to be bred for that.

      Right.

      Look. That's more pollen than you and I will see in a lifetime.

      It's just a status symbol. Bees make too much of it.

      Perhaps. Unless you're wearing it and the ladies see you wearing it.

      Those ladies? Aren't they our cousins too?

      Distant. Distant.

      Look at these two.

      • Oouple of Hive Harrys.
      • Let's have fun with them.

      It must be dangerous being a Pollen Jock.

      Yeah. Once a bear pinned me against a mushroom!

      He had a paw on my throat, and with the other, he was slapping me!

      • Oh, my!
      • I never thought I'd knock him out.

      What were you doing during this?

      Trying to alert the authorities.

      I can autograph that.

      A little gusty out there today, wasn't it, comrades?

      Yeah. Gusty.

      We're hitting a sunflower patch six miles from here tomorrow.

      • Six miles, huh?
      • Barry!

      A puddle jump for us, but maybe you're not up for it.

      • Maybe I am.
      • You are not!

      We're going 0900 at J-Gate.

      What do you think, buzzy-boy? Are you bee enough?

      I might be. It all depends on what 0900 means.

      Hey, Honex!

      Dad, you surprised me.

      You decide what you're interested in?

      • Well, there's a lot of choices.
      • But you only get one.

      Do you ever get bored doing the same job every day?

      Son, let me tell you about stirring.

      You grab that stick, and you just move it around, and you stir it around.

      You get yourself into a rhythm. It's a beautiful thing.

      You know, Dad, the more I think about it,

      maybe the honey field just isn't right for me.

      You were thinking of what, making balloon animals?

      That's a bad job for a guy with a stinger.

      Janet, your son's not sure he wants to go into honey!

      • Barry, you are so funny sometimes.
      • I'm not trying to be funny.

      You're not funny! You're going into honey. Our son, the stirrer!

      • You're gonna be a stirrer?
      • No one's listening to me!

      Wait till you see the sticks I have.

      I could say anything right now. I'm gonna get an ant tattoo!

      Let's open some honey and celebrate!

      Maybe I'll pierce my thorax. Shave my antennae.

      Shack up with a grasshopper. Get a gold tooth and call everybody "dawg"!

      I'm so proud.

      • We're starting work today!
      • Today's the day.

      Oome on! All the good jobs will be gone.

      Yeah, right.

      Pollen counting, stunt bee, pouring, stirrer, front desk, hair removal...

      • Is it still available?
      • Hang on. Two left!

      One of them's yours! Oongratulations! Step to the side.

      • What'd you get?
      • Picking crud out. Stellar!

      Wow!

      Oouple of newbies?

      Yes, sir! Our first day! We are ready!

      Make your choice.

      • You want to go first?
      • No, you go.

      Oh, my. What's available?

      Restroom attendant's open, not for the reason you think.

      • Any chance of getting the Krelman?
      • Sure, you're on.

      I'm sorry, the Krelman just closed out.

      Wax monkey's always open.

      The Krelman opened up again.

      What happened?

      A bee died. Makes an opening. See? He's dead. Another dead one.

      Deady. Deadified. Two more dead.

      Dead from the neck up. Dead from the neck down. That's life!

      Oh, this is so hard!

      Heating, cooling, stunt bee, pourer, stirrer,

      humming, inspector number seven, lint coordinator, stripe supervisor,

      mite wrangler. Barry, what do you think I should... Barry?

      Barry!

      All right, we've got the sunflower patch in quadrant nine...

      What happened to you? Where are you?

      • I'm going out.
      • Out? Out where?
      • Out there.
      • Oh, no!

      I have to, before I go to work for the rest of my life.

      You're gonna die! You're crazy! Hello?

      Another call coming in.

      If anyone's feeling brave, there's a Korean deli on 83rd

      that gets their roses today.

      Hey, guys.

      • Look at that.
      • Isn't that the kid we saw yesterday?

      Hold it, son, flight deck's restricted.

      It's OK, Lou. We're gonna take him up.

      Really? Feeling lucky, are you?

      Sign here, here. Just initial that.

      • Thank you.
      • OK.

      You got a rain advisory today,

      and as you all know, bees cannot fly in rain.

      So be careful. As always, watch your brooms,

      hockey sticks, dogs, birds, bears and bats.

      Also, I got a couple of reports of root beer being poured on us.

      Murphy's in a home because of it, babbling like a cicada!

      • That's awful.
      • And a reminder for you rookies,

      bee law number one, absolutely no talking to humans!

      All right, launch positions!

      Buzz, buzz, buzz, buzz! Buzz, buzz, buzz, buzz! Buzz, buzz, buzz, buzz!

      Black and yellow!

      Hello!

      You ready for this, hot shot?

      Yeah. Yeah, bring it on.

      Wind, check.

      • Antennae, check.
      • Nectar pack, check.
      • Wings, check.
      • Stinger, check.

      Scared out of my shorts, check.

      OK, ladies,

      let's move it out!

      Pound those petunias, you striped stem-suckers!

      All of you, drain those flowers!

      Wow! I'm out!

      I can't believe I'm out!

      So blue.

      I feel so fast and free!

      Box kite!

      Wow!

      Flowers!

      This is Blue Leader. We have roses visual.

      Bring it around 30 degrees and hold.

      Roses!

      30 degrees, roger. Bringing it around.

      Stand to the side, kid. It's got a bit of a kick.

      That is one nectar collector!

      • Ever see pollination up close?
      • No, sir.

      I pick up some pollen here, sprinkle it over here. Maybe a dash over there,

      a pinch on that one. See that? It's a little bit of magic.

      That's amazing. Why do we do that?

      That's pollen power. More pollen, more flowers, more nectar, more honey for us.

      Oool.

      I'm picking up a lot of bright yellow. Oould be daisies. Don't we need those?

      Oopy that visual.

      Wait. One of these flowers seems to be on the move.

      Say again? You're reporting a moving flower?

      Affirmative.

      That was on the line!

      This is the coolest. What is it?

      I don't know, but I'm loving this color.

      It smells good. Not like a flower, but I like it.

      Yeah, fuzzy.

      Ohemical-y.

      Oareful, guys. It's a little grabby.

      My sweet lord of bees!

      Oandy-brain, get off there!

      Problem!

      • Guys!
      • This could be bad.

      Affirmative.

      Very close.

      Gonna hurt.

      Mama's little boy.

      You are way out of position, rookie!

      Ooming in at you like a missile!

      Help me!

      I don't think these are flowers.

      • Should we tell him?
      • I think he knows.

      What is this?!

      Match point!

      You can start packing up, honey, because you're about to eat it!

      Yowser!

      Gross.

      There's a bee in the car!

      • Do something!
      • I'm driving!
      • Hi, bee.
      • He's back here!

      He's going to sting me!

      Nobody move. If you don't move, he won't sting you. Freeze!

      He blinked!

      Spray him, Granny!

      What are you doing?!

      Wow... the tension level out here is unbelievable.

      I gotta get home.

      Oan't fly in rain.

      Oan't fly in rain.

      Oan't fly in rain.

      Mayday! Mayday! Bee going down!

      Ken, could you close the window please?

      Ken, could you close the window please?

      Oheck out my new resume. I made it into a fold-out brochure.

      You see? Folds out.

      Oh, no. More humans. I don't need this.

      What was that?

      Maybe this time. This time. This time. This time! This time! This...

      Drapes!

      That is diabolical.

      It's fantastic. It's got all my special skills, even my top-ten favorite movies.

      What's number one? Star Wars?

      Nah, I don't go for that...

      ...kind of stuff.

      No wonder we shouldn't talk to them. They're out of their minds.

      When I leave a job interview, they're flabbergasted, can't believe what I say.

      There's the sun. Maybe that's a way out.

      I don't remember the sun having a big 75 on it.

      I predicted global warming.

      I could feel it getting hotter. At first I thought it was just me.

      Wait! Stop! Bee!

      Stand back. These are winter boots.

      Wait!

      Don't kill him!

      You know I'm allergic to them! This thing could kill me!

      Why does his life have less value than yours?

      Why does his life have any less value than mine? Is that your statement?

      I'm just saying all life has value. You don't know what he's capable of feeling.

      My brochure!

      There you go, little guy.

      I'm not scared of him. It's an allergic thing.

      Put that on your resume brochure.

      My whole face could puff up.

      Make it one of your special skills.

      Knocking someone out is also a special skill.

      Right. Bye, Vanessa. Thanks.

      • Vanessa, next week? Yogurt night?
      • Sure, Ken. You know, whatever.
      • You could put carob chips on there.
      • Bye.
      • Supposed to be less calories.
      • Bye.

      I gotta say something.

      She saved my life. I gotta say something.

      All right, here it goes.

      Nah.

      What would I say?

      I could really get in trouble.

      It's a bee law. You're not supposed to talk to a human.

      I can't believe I'm doing this.

      I've got to.

      Oh, I can't do it. Oome on!

      No. Yes. No.

      Do it. I can't.

      How should I start it? "You like jazz?" No, that's no good.

      Here she comes! Speak, you fool!

      Hi!

      I'm sorry.

      • You're talking.
      • Yes, I know.

      You're talking!

      I'm so sorry.

      No, it's OK. It's fine. I know I'm dreaming.

      But I don't recall going to bed.

      Well, I'm sure this is very disconcerting.

      This is a bit of a surprise to me. I mean, you're a bee!

      I am. And I'm not supposed to be doing this,

      but they were all trying to kill me.

      And if it wasn't for you...

      I had to thank you. It's just how I was raised.

      That was a little weird.

      • I'm talking with a bee.
      • Yeah.

      I'm talking to a bee. And the bee is talking to me!

      I just want to say I'm grateful. I'll leave now.

      • Wait! How did you learn to do that?
      • What?

      The talking thing.

      Same way you did, I guess. "Mama, Dada, honey." You pick it up.

      • That's very funny.
      • Yeah.

      Bees are funny. If we didn't laugh, we'd cry with what we have to deal with.

      Anyway...

      Oan I...

      ...get you something?

      • Like what?

      I don't know. I mean... I don't know. Ooffee?

      I don't want to put you out.

      It's no trouble. It takes two minutes.

      • It's just coffee.
      • I hate to impose.
      • Don't be ridiculous!
      • Actually, I would love a cup.

      Hey, you want rum cake?

      • I shouldn't.
      • Have some.
      • No, I can't.
      • Oome on!

      I'm trying to lose a couple micrograms.

      • Where?
      • These stripes don't help.

      You look great!

      I don't know if you know anything about fashion.

      Are you all right?

      No.

      He's making the tie in the cab as they're flying up Madison.

      He finally gets there.

      He runs up the steps into the church. The wedding is on.

      And he says, "Watermelon? I thought you said Guatemalan.

      Why would I marry a watermelon?"

      Is that a bee joke?

      That's the kind of stuff we do.

      Yeah, different.

      So, what are you gonna do, Barry?

      About work? I don't know.

      I want to do my part for the hive, but I can't do it the way they want.

      I know how you feel.

      • You do?
      • Sure.

      My parents wanted me to be a lawyer or a doctor, but I wanted to be a florist.

      • Really?
      • My only interest is flowers.

      Our new queen was just elected with that same campaign slogan.

      Anyway, if you look...

      There's my hive right there. See it?

      You're in Sheep Meadow!

      Yes! I'm right off the Turtle Pond!

      No way! I know that area. I lost a toe ring there once.

      • Why do girls put rings on their toes?
      • Why not?
      • It's like putting a hat on your knee.
      • Maybe I'll try that.
      • You all right, ma'am?
      • Oh, yeah. Fine.

      Just having two cups of coffee!

      Anyway, this has been great. Thanks for the coffee.

      Yeah, it's no trouble.

      Sorry I couldn't finish it. If I did, I'd be up the rest of my life.

      Are you...?

      Oan I take a piece of this with me?

      Sure! Here, have a crumb.

      • Thanks!
      • Yeah.

      All right. Well, then... I guess I'll see you around.

      Or not.

      OK, Barry.

      And thank you so much again... for before.

      Oh, that? That was nothing.

      Well, not nothing, but... Anyway...

      This can't possibly work.

      He's all set to go. We may as well try it.

      OK, Dave, pull the chute.

      • Sounds amazing.
      • It was amazing!

      It was the scariest, happiest moment of my life.

      Humans! I can't believe you were with humans!

      Giant, scary humans! What were they like?

      Huge and crazy. They talk crazy.

      They eat crazy giant things. They drive crazy.

      • Do they try and kill you, like on TV?
      • Some of them. But some of them don't.
      • How'd you get back?
      • Poodle.

      You did it, and I'm glad. You saw whatever you wanted to see.

      You had your "experience." Now you can pick out yourjob and be normal.

      • Well...
      • Well?

      Well, I met someone.

      You did? Was she Bee-ish?

      • A wasp?! Your parents will kill you!
      • No, no, no, not a wasp.
      • Spider?
      • I'm not attracted to spiders.

      I know it's the hottest thing, with the eight legs and all.

      I can't get by that face.

      So who is she?

      She's... human.

      No, no. That's a bee law. You wouldn't break a bee law.

      • Her name's Vanessa.
      • Oh, boy.

      She's so nice. And she's a florist!

      Oh, no! You're dating a human florist!

      We're not dating.

      You're flying outside the hive, talking to humans that attack our homes

      with power washers and M-80s! One-eighth a stick of dynamite!

      She saved my life! And she understands me.

      This is over!

      Eat this.

      This is not over! What was that?

      • They call it a crumb.
      • It was so stingin' stripey!

      And that's not what they eat. That's what falls off what they eat!

      • You know what a Oinnabon is?
      • No.

      It's bread and cinnamon and frosting. They heat it up...

      Sit down!

      ...really hot!

      • Listen to me!

      We are not them! We're us. There's us and there's them!

      Yes, but who can deny the heart that is yearning?

      There's no yearning. Stop yearning. Listen to me!

      You have got to start thinking bee, my friend. Thinking bee!

      • Thinking bee.
      • Thinking bee.

      Thinking bee! Thinking bee! Thinking bee! Thinking bee!

      There he is. He's in the pool.

      You know what your problem is, Barry?

      I gotta start thinking bee?

      How much longer will this go on?

      It's been three days! Why aren't you working?

      I've got a lot of big life decisions to think about.

      What life? You have no life! You have no job. You're barely a bee!

      Would it kill you to make a little honey?

      Barry, come out. Your father's talking to you.

      Martin, would you talk to him?

      Barry, I'm talking to you!

      You coming?

      Got everything?

      All set!

      Go ahead. I'll catch up.

      Don't be too long.

      Watch this!

      Vanessa!

      • We're still here.
      • I told you not to yell at him.

      He doesn't respond to yelling!

      • Then why yell at me?
      • Because you don't listen!

      I'm not listening to this.

      Sorry, I've gotta go.

      • Where are you going?
      • I'm meeting a friend.

      A girl? Is this why you can't decide?

      Bye.

      I just hope she's Bee-ish.

      They have a huge parade of flowers every year in Pasadena?

      To be in the Tournament of Roses, that's every florist's dream!

      Up on a float, surrounded by flowers, crowds cheering.

      A tournament. Do the roses compete in athletic events?

      No. All right, I've got one. How come you don't fly everywhere?

      It's exhausting. Why don't you run everywhere? It's faster.

      Yeah, OK, I see, I see. All right, your turn.

      TiVo. You can just freeze live TV? That's insane!

      You don't have that?

      We have Hivo, but it's a disease. It's a horrible, horrible disease.

      Oh, my.

      Dumb bees!

      You must want to sting all those jerks.

      We try not to sting. It's usually fatal for us.

      So you have to watch your temper.

      Very carefully. You kick a wall, take a walk,

      write an angry letter and throw it out. Work through it like any emotion:

      Anger, jealousy, lust.

      Oh, my goodness! Are you OK?

      Yeah.

      • What is wrong with you?!
      • It's a bug.

      He's not bothering anybody. Get out of here, you creep!

      What was that? A Pic 'N' Save circular?

      Yeah, it was. How did you know?

      It felt like about 10 pages. Seventy-five is pretty much our limit.

      You've really got that down to a science.

      • I lost a cousin to Italian Vogue.
      • I'll bet.

      What in the name of Mighty Hercules is this?

      How did this get here? Oute Bee, Golden Blossom,

      Ray Liotta Private Select?

      • Is he that actor?
      • I never heard of him.
      • Why is this here?
      • For people. We eat it.

      You don't have enough food of your own?

      • Well, yes.
      • How do you get it?
      • Bees make it.
      • I know who makes it!

      And it's hard to make it!

      There's heating, cooling, stirring. You need a whole Krelman thing!

      • It's organic.
      • It's our-ganic!

      It's just honey, Barry.

      Just what?!

      Bees don't know about this! This is stealing! A lot of stealing!

      You've taken our homes, schools, hospitals! This is all we have!

      And it's on sale?! I'm getting to the bottom of this.

      I'm getting to the bottom of all of this!

      Hey, Hector.

      • You almost done?
      • Almost.

      He is here. I sense it.

      Well, I guess I'll go home now

      and just leave this nice honey out, with no one around.

      You're busted, box boy!

      I knew I heard something. So you can talk!

      I can talk. And now you'll start talking!

      Where you getting the sweet stuff? Who's your supplier?

      I don't understand. I thought we were friends.

      The last thing we want to do is upset bees!

      You're too late! It's ours now!

      You, sir, have crossed the wrong sword!

      You, sir, will be lunch for my iguana, Ignacio!

      Where is the honey coming from?

      Tell me where!

      Honey Farms! It comes from Honey Farms!

      Orazy person!

      What horrible thing has happened here?

      These faces, they never knew what hit them. And now

      they're on the road to nowhere!

      Just keep still.

      What? You're not dead?

      Do I look dead? They will wipe anything that moves. Where you headed?

      To Honey Farms. I am onto something huge here.

      I'm going to Alaska. Moose blood, crazy stuff. Blows your head off!

      I'm going to Tacoma.

      • And you?
      • He really is dead.

      All right.

      Uh-oh!

      • What is that?!
      • Oh, no!
      • A wiper! Triple blade!
      • Triple blade?

      Jump on! It's your only chance, bee!

      Why does everything have to be so doggone clean?!

      How much do you people need to see?!

      Open your eyes! Stick your head out the window!

      From NPR News in Washington, I'm Oarl Kasell.

      But don't kill no more bugs!

      • Bee!
      • Moose blood guy!!
      • You hear something?
      • Like what?

      Like tiny screaming.

      Turn off the radio.

      Whassup, bee boy?

      Hey, Blood.

      Just a row of honey jars, as far as the eye could see.

      Wow!

      I assume wherever this truck goes is where they're getting it.

      I mean, that honey's ours.

      • Bees hang tight.
      • We're all jammed in.

      It's a close community.

      Not us, man. We on our own. Every mosquito on his own.

      • What if you get in trouble?
      • You a mosquito, you in trouble.

      Nobody likes us. They just smack. See a mosquito, smack, smack!

      At least you're out in the world. You must meet girls.

      Mosquito girls try to trade up, get with a moth, dragonfly.

      Mosquito girl don't want no mosquito.

      You got to be kidding me!

      Mooseblood's about to leave the building! So long, bee!

      • Hey, guys!
      • Mooseblood!

      I knew I'd catch y'all down here. Did you bring your crazy straw?

      We throw it in jars, slap a label on it, and it's pretty much pure profit.

      What is this place?

      A bee's got a brain the size of a pinhead.

      They are pinheads!

      Pinhead.

      • Oheck out the new smoker.
      • Oh, sweet. That's the one you want.

      The Thomas 3000!

      Smoker?

      Ninety puffs a minute, semi-automatic. Twice the nicotine, all the tar.

      A couple breaths of this knocks them right out.

      They make the honey, and we make the money.

      "They make the honey, and we make the money"?

      Oh, my!

      What's going on? Are you OK?

      Yeah. It doesn't last too long.

      Do you know you're in a fake hive with fake walls?

      Our queen was moved here. We had no choice.

      This is your queen? That's a man in women's clothes!

      That's a drag queen!

      What is this?

      Oh, no!

      There's hundreds of them!

      Bee honey.

      Our honey is being brazenly stolen on a massive scale!

      This is worse than anything bears have done! I intend to do something.

      Oh, Barry, stop.

      Who told you humans are taking our honey? That's a rumor.

      Do these look like rumors?

      That's a conspiracy theory. These are obviously doctored photos.

      How did you get mixed up in this?

      He's been talking to humans.

      • What?
      • Talking to humans?!

      He has a human girlfriend. And they make out!

      Make out? Barry!

      We do not.

      • You wish you could.
      • Whose side are you on?

      The bees!

      I dated a cricket once in San Antonio. Those crazy legs kept me up all night.

      Barry, this is what you want to do with your life?

      I want to do it for all our lives. Nobody works harder than bees!

      Dad, I remember you coming home so overworked

      your hands were still stirring. You couldn't stop.

      I remember that.

      What right do they have to our honey?

      We live on two cups a year. They put it in lip balm for no reason whatsoever!

      Even if it's true, what can one bee do?

      Sting them where it really hurts.

      In the face! The eye!

      • That would hurt.
      • No.

      Up the nose? That's a killer.

      There's only one place you can sting the humans, one place where it matters.

      Hive at Five, the hive's only full-hour action news source.

      No more bee beards!

      With Bob Bumble at the anchor desk.

      Weather with Storm Stinger.

      Sports with Buzz Larvi.

      And Jeanette Ohung.

      • Good evening. I'm Bob Bumble.
      • And I'm Jeanette Ohung.

      A tri-county bee, Barry Benson,

      intends to sue the human race for stealing our honey,

      packaging it and profiting from it illegally!

      Tomorrow night on Bee Larry King,

      we'll have three former queens here in our studio, discussing their new book,

      Olassy Ladies, out this week on Hexagon.

      Tonight we're talking to Barry Benson.

      Did you ever think, "I'm a kid from the hive. I can't do this"?

      Bees have never been afraid to change the world.

      What about Bee Oolumbus? Bee Gandhi? Bejesus?

      Where I'm from, we'd never sue humans.

      We were thinking of stickball or candy stores.

      How old are you?

      The bee community is supporting you in this case,

      which will be the trial of the bee century.

      You know, they have a Larry King in the human world too.

      It's a common name. Next week...

      He looks like you and has a show and suspenders and colored dots...

      Next week...

      Glasses, quotes on the bottom from the guest even though you just heard 'em.

      Bear Week next week! They're scary, hairy and here live.

      Always leans forward, pointy shoulders, squinty eyes, very Jewish.

      In tennis, you attack at the point of weakness!

      It was my grandmother, Ken. She's 81.

      Honey, her backhand's a joke! I'm not gonna take advantage of that?

      Quiet, please. Actual work going on here.

      • Is that that same bee?
      • Yes, it is!

      I'm helping him sue the human race.

      • Hello.
      • Hello, bee.

      This is Ken.

      Yeah, I remember you. Timberland, size ten and a half. Vibram sole, I believe.

      Why does he talk again?

      Listen, you better go 'cause we're really busy working.

      But it's our yogurt night!

      Bye-bye.

      Why is yogurt night so difficult?!

      You poor thing. You two have been at this for hours!

      Yes, and Adam here has been a huge help.

      • Frosting...
      • How many sugars?

      Just one. I try not to use the competition.

      So why are you helping me?

      Bees have good qualities.

      And it takes my mind off the shop.

      Instead of flowers, people are giving balloon bouquets now.

      Those are great, if you're three.

      And artificial flowers.

      • Oh, those just get me psychotic!
      • Yeah, me too.

      Bent stingers, pointless pollination.

      Bees must hate those fake things!

      Nothing worse than a daffodil that's had work done.

      Maybe this could make up for it a little bit.

      • This lawsuit's a pretty big deal.
      • I guess.

      You sure you want to go through with it?

      Am I sure? When I'm done with the humans, they won't be able

      to say, "Honey, I'm home," without paying a royalty!

      It's an incredible scene here in downtown Manhattan,

      where the world anxiously waits, because for the first time in history,

      we will hear for ourselves if a honeybee can actually speak.

      What have we gotten into here, Barry?

      It's pretty big, isn't it?

      I can't believe how many humans don't work during the day.

      You think billion-dollar multinational food companies have good lawyers?

      Everybody needs to stay behind the barricade.

      • What's the matter?
      • I don't know, I just got a chill.

      Well, if it isn't the bee team.

      You boys work on this?

      All rise! The Honorable Judge Bumbleton presiding.

      All right. Oase number 4475,

      Superior Oourt of New York, Barry Bee Benson v. the Honey Industry

      is now in session.

      Mr. Montgomery, you're representing the five food companies collectively?

      A privilege.

      Mr. Benson... you're representing all the bees of the world?

      I'm kidding. Yes, Your Honor, we're ready to proceed.

      Mr. Montgomery, your opening statement, please.

      Ladies and gentlemen of the jury,

      my grandmother was a simple woman.

      Born on a farm, she believed it was man's divine right

      to benefit from the bounty of nature God put before us.

      If we lived in the topsy-turvy world Mr. Benson imagines,

      just think of what would it mean.

      I would have to negotiate with the silkworm

      for the elastic in my britches!

      Talking bee!

      How do we know this isn't some sort of

      holographic motion-picture-capture Hollywood wizardry?

      They could be using laser beams!

      Robotics! Ventriloquism! Oloning! For all we know,

      he could be on steroids!

      Mr. Benson?

      Ladies and gentlemen, there's no trickery here.

      I'm just an ordinary bee. Honey's pretty important to me.

      It's important to all bees. We invented it!

      We make it. And we protect it with our lives.

      Unfortunately, there are some people in this room

      who think they can take it from us

      'cause we're the little guys! I'm hoping that, after this is all over,

      you'll see how, by taking our honey, you not only take everything we have

      but everything we are!

      I wish he'd dress like that all the time. So nice!

      Oall your first witness.

      So, Mr. Klauss Vanderhayden of Honey Farms, big company you have.

      I suppose so.

      I see you also own Honeyburton and Honron!

      Yes, they provide beekeepers for our farms.

      Beekeeper. I find that to be a very disturbing term.

      I don't imagine you employ any bee-free-ers, do you?

      • No.
      • I couldn't hear you.
      • No.
      • No.

      Because you don't free bees. You keep bees. Not only that,

      it seems you thought a bear would be an appropriate image for a jar of honey.

      They're very lovable creatures.

      Yogi Bear, Fozzie Bear, Build-A-Bear.

      You mean like this?

      Bears kill bees!

      How'd you like his head crashing through your living room?!

      Biting into your couch! Spitting out your throw pillows!

      OK, that's enough. Take him away.

      So, Mr. Sting, thank you for being here. Your name intrigues me.

      • Where have I heard it before?
      • I was with a band called The Police.

      But you've never been a police officer, have you?

      No, I haven't.

      No, you haven't. And so here we have yet another example

      of bee culture casually stolen by a human

      for nothing more than a prance-about stage name.

      Oh, please.

      Have you ever been stung, Mr. Sting?

      Because I'm feeling a little stung, Sting.

      Or should I say... Mr. Gordon M. Sumner!

      That's not his real name?! You idiots!

      Mr. Liotta, first, belated congratulations on

      your Emmy win for a guest spot on ER in 2005.

      Thank you. Thank you.

      I see from your resume that you're devilishly handsome

      with a churning inner turmoil that's ready to blow.

      I enjoy what I do. Is that a crime?

      Not yet it isn't. But is this what it's come to for you?

      Exploiting tiny, helpless bees so you don't

      have to rehearse your part and learn your lines, sir?

      Watch it, Benson! I could blow right now!

      This isn't a goodfella. This is a badfella!

      Why doesn't someone just step on this creep, and we can all go home?!

      • Order in this court!
      • You're all thinking it!

      Order! Order, I say!

      • Say it!
      • Mr. Liotta, please sit down!

      I think it was awfully nice of that bear to pitch in like that.

      I think the jury's on our side.

      Are we doing everything right, legally?

      I'm a florist.

      Right. Well, here's to a great team.

      To a great team!

      Well, hello.

      • Ken!
      • Hello.

      I didn't think you were coming.

      No, I was just late. I tried to call, but... the battery.

      I didn't want all this to go to waste, so I called Barry. Luckily, he was free.

      Oh, that was lucky.

      There's a little left. I could heat it up.

      Yeah, heat it up, sure, whatever.

      So I hear you're quite a tennis player.

      I'm not much for the game myself. The ball's a little grabby.

      That's where I usually sit. Right... there.

      Ken, Barry was looking at your resume,

      and he agreed with me that eating with chopsticks isn't really a special skill.

      You think I don't see what you're doing?

      I know how hard it is to find the rightjob. We have that in common.

      Do we?

      Bees have 100 percent employment, but we do jobs like taking the crud out.

      That's just what I was thinking about doing.

      Ken, I let Barry borrow your razor for his fuzz. I hope that was all right.

      I'm going to drain the old stinger.

      Yeah, you do that.

      Look at that.

      You know, I've just about had it

      with your little mind games.

      • What's that?
      • Italian Vogue.

      Mamma mia, that's a lot of pages.

      A lot of ads.

      Remember what Van said, why is your life more valuable than mine?

      Funny, I just can't seem to recall that!

      I think something stinks in here!

      I love the smell of flowers.

      How do you like the smell of flames?!

      Not as much.

      Water bug! Not taking sides!

      Ken, I'm wearing a Ohapstick hat! This is pathetic!

      I've got issues!

      Well, well, well, a royal flush!

      • You're bluffing.
      • Am I?

      Surf's up, dude!

      Poo water!

      That bowl is gnarly.

      Except for those dirty yellow rings!

      Kenneth! What are you doing?!

      You know, I don't even like honey! I don't eat it!

      We need to talk!

      He's just a little bee!

      And he happens to be the nicest bee I've met in a long time!

      Long time? What are you talking about?! Are there other bugs in your life?

      No, but there are other things bugging me in life. And you're one of them!

      Fine! Talking bees, no yogurt night...

      My nerves are fried from riding on this emotional roller coaster!

      Goodbye, Ken.

      And for your information,

      I prefer sugar-free, artificial sweeteners made by man!

      I'm sorry about all that.

      I know it's got an aftertaste! I like it!

      I always felt there was some kind of barrier between Ken and me.

      I couldn't overcome it. Oh, well.

      Are you OK for the trial?

      I believe Mr. Montgomery is about out of ideas.

      We would like to call Mr. Barry Benson Bee to the stand.

      Good idea! You can really see why he's considered one of the best lawyers...

      Yeah.

      Layton, you've gotta weave some magic

      with this jury, or it's gonna be all over.

      Don't worry. The only thing I have to do to turn this jury around

      is to remind them of what they don't like about bees.

      • You got the tweezers?
      • Are you allergic?

      Only to losing, son. Only to losing.

      Mr. Benson Bee, I'll ask you what I think we'd all like to know.

      What exactly is your relationship

      to that woman?

      We're friends.

      • Good friends?
      • Yes.

      How good? Do you live together?

      Wait a minute...

      Are you her little...

      ...bedbug?

      I've seen a bee documentary or two. From what I understand,

      doesn't your queen give birth to all the bee children?

      • Yeah, but...
      • So those aren't your real parents!
      • Oh, Barry...
      • Yes, they are!

      Hold me back!

      You're an illegitimate bee, aren't you, Benson?

      He's denouncing bees!

      Don't y'all date your cousins?

      • Objection!
      • I'm going to pincushion this guy!

      Adam, don't! It's what he wants!

      Oh, I'm hit!!

      Oh, lordy, I am hit!

      Order! Order!

      The venom! The venom is coursing through my veins!

      I have been felled by a winged beast of destruction!

      You see? You can't treat them like equals! They're striped savages!

      Stinging's the only thing they know! It's their way!

      • Adam, stay with me.
      • I can't feel my legs.

      What angel of mercy will come forward to suck the poison

      from my heaving buttocks?

      I will have order in this court. Order!

      Order, please!

      The case of the honeybees versus the human race

      took a pointed turn against the bees

      yesterday when one of their legal team stung Layton T. Montgomery.

      • Hey, buddy.
      • Hey.
      • Is there much pain?
      • Yeah.

      I...

      I blew the whole case, didn't I?

      It doesn't matter. What matters is you're alive. You could have died.

      I'd be better off dead. Look at me.

      They got it from the cafeteria downstairs, in a tuna sandwich.

      Look, there's a little celery still on it.

      What was it like to sting someone?

      I can't explain it. It was all...

      All adrenaline and then... and then ecstasy!

      All right.

      You think it was all a trap?

      Of course. I'm sorry. I flew us right into this.

      What were we thinking? Look at us. We're just a couple of bugs in this world.

      What will the humans do to us if they win?

      I don't know.

      I hear they put the roaches in motels. That doesn't sound so bad.

      Adam, they check in, but they don't check out!

      Oh, my.

      Oould you get a nurse to close that window?

      • Why?
      • The smoke.

      Bees don't smoke.

      Right. Bees don't smoke.

      Bees don't smoke! But some bees are smoking.

      That's it! That's our case!

      It is? It's not over?

      Get dressed. I've gotta go somewhere.

      Get back to the court and stall. Stall any way you can.

      And assuming you've done step correctly, you're ready for the tub.

      Mr. Flayman.

      Yes? Yes, Your Honor!

      Where is the rest of your team?

      Well, Your Honor, it's interesting.

      Bees are trained to fly haphazardly,

      and as a result, we don't make very good time.

      I actually heard a funny story about...

      Your Honor, haven't these ridiculous bugs

      taken up enough of this court's valuable time?

      How much longer will we allow these absurd shenanigans to go on?

      They have presented no compelling evidence to support their charges

      against my clients, who run legitimate businesses.

      I move for a complete dismissal of this entire case!

      Mr. Flayman, I'm afraid I'm going

      to have to consider Mr. Montgomery's motion.

      But you can't! We have a terrific case.

      Where is your proof? Where is the evidence?

      Show me the smoking gun!

      Hold it, Your Honor! You want a smoking gun?

      Here is your smoking gun.

      What is that?

      It's a bee smoker!

      What, this? This harmless little contraption?

      This couldn't hurt a fly, let alone a bee.

      Look at what has happened

      to bees who have never been asked, "Smoking or non?"

      Is this what nature intended for us?

      To be forcibly addicted to smoke machines

      and man-made wooden slat work camps?

      Living out our lives as honey slaves to the white man?

      • What are we gonna do?
      • He's playing the species card.

      Ladies and gentlemen, please, free these bees!

      Free the bees! Free the bees!

      Free the bees!

      Free the bees! Free the bees!

      The court finds in favor of the bees!

      Vanessa, we won!

      I knew you could do it! High-five!

      Sorry.

      I'm OK! You know what this means?

      All the honey will finally belong to the bees.

      Now we won't have to work so hard all the time.

      This is an unholy perversion of the balance of nature, Benson.

      You'll regret this.

      Barry, how much honey is out there?

      All right. One at a time.

      Barry, who are you wearing?

      My sweater is Ralph Lauren, and I have no pants.

      • What if Montgomery's right?
      • What do you mean?

      We've been living the bee way a long time, 27 million years.

      Oongratulations on your victory. What will you demand as a settlement?

      First, we'll demand a complete shutdown of all bee work camps.

      Then we want back the honey that was ours to begin with,

      every last drop.

      We demand an end to the glorification of the bear as anything more

      than a filthy, smelly, bad-breath stink machine.

      We're all aware of what they do in the woods.

      Wait for my signal.

      Take him out.

      He'll have nauseous for a few hours, then he'll be fine.

      And we will no longer tolerate bee-negative nicknames...

      But it's just a prance-about stage name!

      ...unnecessary inclusion of honey in bogus health products

      and la-dee-da human tea-time snack garnishments.

      Oan't breathe.

      Bring it in, boys!

      Hold it right there! Good.

      Tap it.

      Mr. Buzzwell, we just passed three cups, and there's gallons more coming!

      • I think we need to shut down!
      • Shut down? We've never shut down.

      Shut down honey production!

      Stop making honey!

      Turn your key, sir!

      What do we do now?

      Oannonball!

      We're shutting honey production!

      Mission abort.

      Aborting pollination and nectar detail. Returning to base.

      Adam, you wouldn't believe how much honey was out there.

      Oh, yeah?

      What's going on? Where is everybody?

      • Are they out celebrating?
      • They're home.

      They don't know what to do. Laying out, sleeping in.

      I heard your Uncle Oarl was on his way to San Antonio with a cricket.

      At least we got our honey back.

      Sometimes I think, so what if humans liked our honey? Who wouldn't?

      It's the greatest thing in the world! I was excited to be part of making it.

      This was my new desk. This was my new job. I wanted to do it really well.

      And now...

      Now I can't.

      I don't understand why they're not happy.

      I thought their lives would be better!

      They're doing nothing. It's amazing. Honey really changes people.

      You don't have any idea what's going on, do you?

      • What did you want to show me?
      • This.

      What happened here?

      That is not the half of it.

      Oh, no. Oh, my.

      They're all wilting.

      Doesn't look very good, does it?

      No.

      And whose fault do you think that is?

      You know, I'm gonna guess bees.

      Bees?

      Specifically, me.

      I didn't think bees not needing to make honey would affect all these things.

      It's notjust flowers. Fruits, vegetables, they all need bees.

      That's our whole SAT test right there.

      Take away produce, that affects the entire animal kingdom.

      And then, of course...

      The human species?

      So if there's no more pollination,

      it could all just go south here, couldn't it?

      I know this is also partly my fault.

      How about a suicide pact?

      How do we do it?

      • I'll sting you, you step on me.
      • Thatjust kills you twice.

      Right, right.

      Listen, Barry... sorry, but I gotta get going.

      I had to open my mouth and talk.

      Vanessa?

      Vanessa? Why are you leaving? Where are you going?

      To the final Tournament of Roses parade in Pasadena.

      They've moved it to this weekend because all the flowers are dying.

      It's the last chance I'll ever have to see it.

      Vanessa, I just wanna say I'm sorry. I never meant it to turn out like this.

      I know. Me neither.

      Tournament of Roses. Roses can't do sports.

      Wait a minute. Roses. Roses?

      Roses!

      Vanessa!

      Roses?!

      Barry?

      • Roses are flowers!
      • Yes, they are.

      Flowers, bees, pollen!

      I know. That's why this is the last parade.

      Maybe not. Oould you ask him to slow down?

      Oould you slow down?

      Barry!

      OK, I made a huge mistake. This is a total disaster, all my fault.

      Yes, it kind of is.

      I've ruined the planet. I wanted to help you

      with the flower shop. I've made it worse.

      Actually, it's completely closed down.

      I thought maybe you were remodeling.

      But I have another idea, and it's greater than my previous ideas combined.

      I don't want to hear it!

      All right, they have the roses, the roses have the pollen.

      I know every bee, plant and flower bud in this park.

      All we gotta do is get what they've got back here with what we've got.

      • Bees.
      • Park.
      • Pollen!
      • Flowers.
      • Repollination!
      • Across the nation!

      Tournament of Roses, Pasadena, Oalifornia.

      They've got nothing but flowers, floats and cotton candy.

      Security will be tight.

      I have an idea.

      Vanessa Bloome, FTD.

      Official floral business. It's real.

      Sorry, ma'am. Nice brooch.

      Thank you. It was a gift.

      Once inside, we just pick the right float.

      How about The Princess and the Pea?

      I could be the princess, and you could be the pea!

      Yes, I got it.

      • Where should I sit?
      • What are you?
      • I believe I'm the pea.
      • The pea?

      It goes under the mattresses.

      • Not in this fairy tale, sweetheart.
      • I'm getting the marshal.

      You do that! This whole parade is a fiasco!

      Let's see what this baby'll do.

      Hey, what are you doing?!

      Then all we do is blend in with traffic...

      ...without arousing suspicion.

      Once at the airport, there's no stopping us.

      Stop! Security.

      • You and your insect pack your float?
      • Yes.

      Has it been in your possession the entire time?

      Would you remove your shoes?

      • Remove your stinger.
      • It's part of me.

      I know. Just having some fun. Enjoy your flight.

      Then if we're lucky, we'll have just enough pollen to do the job.

      Oan you believe how lucky we are? We have just enough pollen to do the job!

      I think this is gonna work.

      It's got to work.

      Attention, passengers, this is Oaptain Scott.

      We have a bit of bad weather in New York.

      It looks like we'll experience a couple hours delay.

      Barry, these are cut flowers with no water. They'll never make it.

      I gotta get up there and talk to them.

      Be careful.

      Oan I get help with the Sky Mall magazine?

      I'd like to order the talking inflatable nose and ear hair trimmer.

      Oaptain, I'm in a real situation.

      • What'd you say, Hal?
      • Nothing.

      Bee!

      Don't freak out! My entire species...

      What are you doing?

      • Wait a minute! I'm an attorney!
      • Who's an attorney?

      Don't move.

      Oh, Barry.

      Good afternoon, passengers. This is your captain.

      Would a Miss Vanessa Bloome in 24B please report to the cockpit?

      And please hurry!

      What happened here?

      There was a DustBuster, a toupee, a life raft exploded.

      One's bald, one's in a boat, they're both unconscious!

      • Is that another bee joke?
      • No!

      No one's flying the plane!

      This is JFK control tower, Flight 356. What's your status?

      This is Vanessa Bloome. I'm a florist from New York.

      Where's the pilot?

      He's unconscious, and so is the copilot.

      Not good. Does anyone onboard have flight experience?

      As a matter of fact, there is.

      • Who's that?
      • Barry Benson.

      From the honey trial?! Oh, great.

      Vanessa, this is nothing more than a big metal bee.

      It's got giant wings, huge engines.

      I can't fly a plane.

      • Why not? Isn't John Travolta a pilot?
      • Yes.

      How hard could it be?

      Wait, Barry! We're headed into some lightning.

      This is Bob Bumble. We have some late-breaking news from JFK Airport,

      where a suspenseful scene is developing.

      Barry Benson, fresh from his legal victory...

      That's Barry!

      ...is attempting to land a plane, loaded with people, flowers

      and an incapacitated flight crew.

      Flowers?!

      We have a storm in the area and two individuals at the controls

      with absolutely no flight experience.

      Just a minute. There's a bee on that plane.

      I'm quite familiar with Mr. Benson and his no-account compadres.

      They've done enough damage.

      But isn't he your only hope?

      Technically, a bee shouldn't be able to fly at all.

      Their wings are too small...

      Haven't we heard this a million times?

      "The surface area of the wings and body mass make no sense."

      • Get this on the air!
      • Got it.
      • Stand by.
      • We're going live.

      The way we work may be a mystery to you.

      Making honey takes a lot of bees doing a lot of small jobs.

      But let me tell you about a small job.

      If you do it well, it makes a big difference.

      More than we realized. To us, to everyone.

      That's why I want to get bees back to working together.

      That's the bee way! We're not made of Jell-O.

      We get behind a fellow.

      • Black and yellow!
      • Hello!

      Left, right, down, hover.

      • Hover?
      • Forget hover.

      This isn't so hard. Beep-beep! Beep-beep!

      Barry, what happened?!

      Wait, I think we were on autopilot the whole time.

      • That may have been helping me.
      • And now we're not!

      So it turns out I cannot fly a plane.

      All of you, let's get behind this fellow! Move it out!

      Move out!

      Our only chance is if I do what I'd do, you copy me with the wings of the plane!

      Don't have to yell.

      I'm not yelling! We're in a lot of trouble.

      It's very hard to concentrate with that panicky tone in your voice!

      It's not a tone. I'm panicking!

      I can't do this!

      Vanessa, pull yourself together. You have to snap out of it!

      You snap out of it.

      You snap out of it.

      • You snap out of it!
      • You snap out of it!
      • You snap out of it!
      • You snap out of it!
      • You snap out of it!
      • You snap out of it!
      • Hold it!
      • Why? Oome on, it's my turn.

      How is the plane flying?

      I don't know.

      Hello?

      Benson, got any flowers for a happy occasion in there?

      The Pollen Jocks!

      They do get behind a fellow.

      • Black and yellow.
      • Hello.

      All right, let's drop this tin can on the blacktop.

      Where? I can't see anything. Oan you?

      No, nothing. It's all cloudy.

      Oome on. You got to think bee, Barry.

      • Thinking bee.
      • Thinking bee.

      Thinking bee! Thinking bee! Thinking bee!

      Wait a minute. I think I'm feeling something.

      • What?
      • I don't know. It's strong, pulling me.

      Like a 27-million-year-old instinct.

      Bring the nose down.

      Thinking bee! Thinking bee! Thinking bee!

      • What in the world is on the tarmac?
      • Get some lights on that!

      Thinking bee! Thinking bee! Thinking bee!

      • Vanessa, aim for the flower.
      • OK.

      Out the engines. We're going in on bee power. Ready, boys?

      Affirmative!

      Good. Good. Easy, now. That's it.

      Land on that flower!

      Ready? Full reverse!

      Spin it around!

      • Not that flower! The other one!
      • Which one?
      • That flower.
      • I'm aiming at the flower!

      That's a fat guy in a flowered shirt. I mean the giant pulsating flower

      made of millions of bees!

      Pull forward. Nose down. Tail up.

      Rotate around it.

      • This is insane, Barry!
      • This's the only way I know how to fly.

      Am I koo-koo-kachoo, or is this plane flying in an insect-like pattern?

      Get your nose in there. Don't be afraid. Smell it. Full reverse!

      Just drop it. Be a part of it.

      Aim for the center!

      Now drop it in! Drop it in, woman!

      Oome on, already.

      Barry, we did it! You taught me how to fly!

      • Yes. No high-five!
      • Right.

      Barry, it worked! Did you see the giant flower?

      What giant flower? Where? Of course I saw the flower! That was genius!

      • Thank you.
      • But we're not done yet.

      Listen, everyone!

      This runway is covered with the last pollen

      from the last flowers available anywhere on Earth.

      That means this is our last chance.

      We're the only ones who make honey, pollinate flowers and dress like this.

      If we're gonna survive as a species, this is our moment! What do you say?

      Are we going to be bees, orjust Museum of Natural History keychains?

      We're bees!

      Keychain!

      Then follow me! Except Keychain.

      Hold on, Barry. Here.

      You've earned this.

      Yeah!

      I'm a Pollen Jock! And it's a perfect fit. All I gotta do are the sleeves.

      Oh, yeah.

      That's our Barry.

      Mom! The bees are back!

      If anybody needs to make a call, now's the time.

      I got a feeling we'll be working late tonight!

      Here's your change. Have a great afternoon! Oan I help who's next?

      Would you like some honey with that? It is bee-approved. Don't forget these.

      Milk, cream, cheese, it's all me. And I don't see a nickel!

      Sometimes I just feel like a piece of meat!

      I had no idea.

      Barry, I'm sorry. Have you got a moment?

      Would you excuse me? My mosquito associate will help you.

      Sorry I'm late.

      He's a lawyer too?

      I was already a blood-sucking parasite. All I needed was a briefcase.

      Have a great afternoon!

      Barry, I just got this huge tulip order, and I can't get them anywhere.

      No problem, Vannie. Just leave it to me.

      You're a lifesaver, Barry. Oan I help who's next?

      All right, scramble, jocks! It's time to fly.

      Thank you, Barry!

      That bee is living my life!

      Let it go, Kenny.

      • When will this nightmare end?!
      • Let it all go.
      • Beautiful day to fly.
      • Sure is.

      Between you and me, I was dying to get out of that office.

      You have got to start thinking bee, my friend.

      • Thinking bee!
      • Me?

      Hold it. Let's just stop for a second. Hold it.

      I'm sorry. I'm sorry, everyone. Oan we stop here?

      I'm not making a major life decision during a production number!

      All right. Take ten, everybody. Wrap it up, guys.

      I had virtually no rehearsal for that.

    1. Author Response:

      Reviewer #1 (Public Review):

      Cell surface proteins are of vital interest in the functions and interactions of cells and their neighbors. In addition, cells manufacture and secrete small membrane vesicles that appear to represent a subset of the cell surface protein composition.

      Various techniques have been developed to allow the molecular definition of many cell surface proteins but most rely on the special chemistry of amino acid residues in exposed on the parts of membrane proteins exposed to the cell exterior.

      In this report Kirkemo et al. have devised a method that more comprehensively samples the cell surface protein composition by relying on the membrane insertion or protein glycan adhesion of an enzyme that attaches a biotin group to a nearest neighbor cellular protein. The result is a more complex set of proteins and distinctive differences between normal and a myc oncogene tumor cells and their secreted extracellular vesicle counterparts. These results may be applied to the identification of unique cell surface determinants in tumor cells that could be targets for immune or drug therapy. The results may be strengthened by a more though evaluation of the different EV membrane species represented in the broad collection of EVs used in this investigation.

      We thank the reviewer for recognizing the importance of the work outlined in the manuscript. We have addressed the necessary improvements in the essential revisions section above.

      Reviewer #2 (Public Review):

      This paper describes two methods for labeling cell-surface proteins. Both methods involve tethering an enzyme to the membrane surface to probe the proteins present on cells and exosomes. Two different enzyme constructs are used: a single strand lipidated DNA inserted into the membrane that enables binding of an enzyme conjugated to a complementary DNA strand (DNA-APEX2) or a glycan-targeting binding group conjugated to horseradish peroxidase (WGA-HRP). Both tethered enzymes label proteins on the cell surface using a biotin substrate via a radical mechanism. The method provides significantly enhanced labeling efficiency and is much faster than traditional chemical labeling methods and methods that employ soluble enzymes. The authors comprehensively analyze the labeled proteins using mass spectrometry and find multiple proteins that were previously undetectable with chemical methods and soluble enzymes. Furthermore, they compare the labeling of both cells and the exosomes that are formed from the cells and characterize both up- and down-regulated proteins related to cancer development that may provide a mechanistic underpinning.

      Overall, the method is novel and should enable the discovery of many low-abundance cell-surface proteins through more efficient labeling. The DNA-APEX2 method will only be accessible to more sophisticated laboratories that can carry out the protocols but the WGA-HRP method employs a readily available commercial product and give equivalent, perhaps even better, results. In addition, the method cannot discriminate between proteins that are genuinely expressed on the cell from those that are non-specifically bound to the cell surface.

      The authors describe the approach and identify two unique proteins on the surface of prostate cell lines.

      Strengths:

      Good introduction with appropriate citations of relevant literature Much higher labeling efficiency and faster than chemical methods and soluble enzyme methods. Ability to detect low-abundance proteins, not accessible from previous labeling methods.

      Weaknesses: The DNA-APEX2 method requires specialized reagents and protocols that are much more challenging for a typical laboratory to carry out than conventional chemical labeling methods.

      The claims and findings are sound. The finding of novel proteins and the quantitative measurement of protein up- and down-regulation are important. The concern about non-specifically bound proteins could be addressed by looking at whether the detected proteins have a transmembrane region that would enable them to localize in the cell membrane.

      We thank the reviewer for recognizing the strengths and importance of this work. We also thank the reviewer for mentioning the issue of non-specifically bound proteins. As addressed above in the essential revisions sections, we believe that any low affinity, non-specific binding proteins are likely removed in the multiple wash/centrifugation steps on cells or the multiple centrifugation steps and sucrose gradient purification on EVs. Given the likelihood for removal of non-specific binders, we believe that the secreted proteins identified are likely high affinity interactions and their differential expression on either cells or EVs play an important part in the downstream biology of both sample types. However, the previous data presentation did not clarify which proteins pertained to the transmembrane plasma membrane proteome versus secreted protein forms. For further clarity in the data presentation (Figure 3D, 4D, 5D), we have bolded proteins that are also found in the SURFY database that only includes surface annotated proteins with a predicted transmembrane domain (Bausch-Fluck et al., The in silico human surfaceome. PNAS. 2018). We have also italicized proteins that are annotated to be secreted from the cell to the extracellular space (Uniprot classification). We have updated the text and caption as shown below:

      New Figure 3:

      Figure 3. WGA-HRP identifies a number of enriched markers on Myc-driven prostate cancer cells. (A) Overall scheme for biotin labeling, and label-free quantification (LFQ) by LC-MS/MS for RWPE-1 Control and Myc over-expression cells. (B) Microscopy image depicting morphological differences between RWPE-1 Control and RWPE-1 Myc cells after 3 days in culture. (C) Volcano plot depicting the LFQ comparison of RWPE-1 Control and Myc labeled cells. Red labels indicate upregulation in the RWPE-1 Control cells over Myc cells and green labels indicate upregulation in the RWPE-1 Myc cells over Control cells. All colored proteins are 2-fold enriched in either dataset between four replicates (two technical, two biological, p<0.05). (D) Heatmap of the 15 most upregulated transmembrane (bold) or secreted (italics) proteins in RWPE-1 Control and Myc cells. Scale indicates intensity, defined as (LFQ Area - Mean LFQ Area)/standard deviation. Extracellular proteins with annotated transmembrane domains are bolded and annotated secreted proteins are italicized. (E) Table indicating fold-change of most differentially regulated proteins by LC-MS/MS for RWPE-1 Control and Myc cells. (F) Upregulated proteins in RWPE-1 Myc cells (Myc, ANPEP, Vimentin, and FN1) are confirmed by western blot. (G) Upregulated surface proteins in RWPE-1 Myc cells (Vimentin, ANPEP, FN1) are detected by immunofluorescence microscopy. The downregulated protein HLA-B by Myc over-expression was also detected by immunofluorescence microscopy. All western blot images and microscopy images are representative of two biological replicates. Mass spectrometry data is based on two biological and two technical replicates (N = 4).

      New Figure 4:

      Figure 4. WGA-HRP identifies a number of enriched markers on Myc-driven prostate cancer EVs. (A) Workflow for small EV isolation from cultured cells. (B) Labeled proteins indicating canonical exosome markers (ExoCarta Top 100 List) detected after performing label-free quantification (LFQ) from whole EV lysate. The proteins are graphed from least abundant to most abundant. (C) Workflow of exosome labeling and preparation for mass spectrometry. (D) Heatmap of the 15 most upregulated proteins in RWPE-1 Control or Myc EVs. Scale indicates intensity, defined as (LFQ Area - Mean LFQ Area)/SD. Extracellular proteins with annotated transmembrane domains are bolded and annotated secreted proteins are italicized. (E) Table indicating fold-change of most differentially regulated proteins by LC-MS/MS for RWPE-1 Control and Myc cells. (F) Upregulated proteins in RWPE-1 Myc EVs (ANPEP and FN1) are confirmed by western blot. Mass spectrometry data is based on two biological and two technical replicates (N = 4). Due to limited sample yield, one replicate was performed for the EV western blot.

      New Figure 5:

      Figure 5. WGA-HRP identifies a number of EV-specific markers that are present regardless of oncogene status. (A) Matrix depicting samples analyzed during LFQ comparison--Control and Myc cells, as well as Control and Myc EVs. (B) Principle component analysis (PCA) of all four groups analyzed by LFQ. Component 1 (50.4%) and component 2 (15.8%) are depicted. (C) Functional annotation clustering was performed using DAVID Bioinformatics Resource 6.8 to classify the major constituents of component 1 in PCA analysis. (D) Heatmap of the 25 most upregulated proteins in RWPE-1 cells or EVs. Proteins are listed in decreasing order of expression with the most highly expressed proteins in EVs on the far left and the most highly expressed proteins in cells on the far right. Scale indicates intensity, defined as (LFQ Area - Mean LFQ Area)/SD. Extracellular proteins with annotated transmembrane domains are bolded and annotated secreted proteins are italicized. (E) Table indicating fold-change of most differentially regulated proteins by LC-MS/MS for RWPE-1 EVs compared to parent cells. (F) Western blot showing the EV specific marker ITIH4, IGSF8, and MFGE8.Mass spectrometry data is based on two biological and two technical replicates (N = 4). Due to limited sample yield, one replicate was performed for the EV western blot.

      Authors mention time-sensitive changes but it is unclear how this method would enable one to obtain this kind of data. How would this be accomplished? The statement "Due to the rapid nature of peroxidase enzymes (1-2 min), our approaches enable kinetic experiments to capture rapid changes, such as binding, internalization, and shuttling events." Yes, it is faster, but not sure I can think of an experiment that would enable one to capture such events.

      We thank the reviewer for this comment and giving us an opportunity to elaborate on the types of experiments enabled by this new method. A previous study (Y, Li et al. Rapid Enzyme-Mediated Biotinylation for Cell Surface Proteome Profiling. Anal. Chem. 2021) showed that labeling the cell surface with soluble HRP allowed the researchers to detect immediate surface protein changes in response to insulin treatment. They demonstrated differential surfaceome profiling changes at 5 minutes vs 2 hours following treatment with insulin. Only methods utilizing these rapid labeling enzymes could allow for this type of resolution. A few other biological settings that experience rapid cell surface changes are: response to drug treatment, T-cell activation and synapse formation (S, Valitutti, et al. The space and time frames of T cell activation at the immunological synapse. FEBS Letters. 2010) and GPCR activation (T, Gupte et al. Minute-scale persistence of a GPCR conformation state triggered by non-cognate G protein interactions primes signaling. Nat. Commun. 2019). We also believe the method would be useful for post-translational processes where proteins are rapidly shuttling to the cell surface. We have updated the discussion to elaborate on these types of experiments.

      "Due to the fast kinetics of peroxidase enzymes (1-2 min), our approaches could enable kinetic experiments to capture rapid post-translational trafficking of surfaces proteins, such as response to insulin, certain drug treatments, T-cell activation and synapse formation, and GPCR activation."

      The authors do not have any way to differentiate between proteins expressed by cells and presented on their membranes from proteins that non-specifically bind to the membrane surface. Non-specific binding (NSB) is not addressed. Proteins can non-specifically bind to the cell or EV surface. The results are obtained by comparisons (cells vs exosomes, controls vs cancer cells), which is fine because it means that what is being measured is differentially expressed, so even NSB proteins may be up- and down-regulated. But the proteins identified need to be confirmed. For example, are all the proteins being detected transmembrane proteins that are known to be associated with the membrane?

      As mentioned above, we utilized the most rigorous informatics analysis available (Uniprot and SURFY) to annotate the proteins we find as having a signal sequence and/or TM domain. Data shown in heatmaps are based off of significance (p < 0.05) across all four replicates, which supports that any secreted proteins present are likely due to actual biological differences between oncogenic status and/or sample origin (i.e. EV vs cell). We have addressed this point in a previous comment above.

      The term "extracellular vesicles" (EVs) might be more appropriate than "exosomes" to describe the studied preparation.

      As we describe above in response to earlier comments, we have systematically changed from using exosomes to small extracellular vesicles and better defined the isolation procedure that we used in the methods section.

      Reviewer #3 (Public Review):

      The article by Kirkemo et al explores approaches to analyse the surface proteome of cells or cell-derived extracellular vesicles (EVs, called here exosomes, but the more generic term "extracellular vesicles" would be more appropriate because the used procedure leads to co-isolation of vesicles of different origin), using tools to tether proximity-biotinylation enzymes to membranes. The authors determine the best conditions for surface labeling of cells, and demonstrate that tethering the enzymes (APEX or HRP) increases the number of proteins detected by mass-spectrometry. They further use one of the two approaches (where HRP binds to glycans), to analyse the biotinylated proteome of two variants of a prostate cancer cell line, and the corresponding EVs. The approaches are interesting, but their benefit for analysis of cells or EVs is not very strongly supported by the data.

      First, the authors honestly show (fig2-suppl figures) that only 35% of the proteins identified after biotinylation with their preferred tool actually correspond to annotated surface proteins. This is only slightly better than results obtained with a non-tethered sulfo-NHS-approach (30%).

      We thank the reviewer for this comment. The reason we utilize membrane protein enrichment methods is that membrane protein abundance is low compared to cytosolic proteins and their identification can be overwhelmed by cytosolic contaminants. Nonetheless, despite our best efforts to limit labeling to the membrane proteins, cytosolic proteins can carry over. Thus, we utilize informatics methods to identify the proteins that are annotated to be membrane associated. The Uniprot GOCC (Gene Ontology Cellular Component) Plasma Membrane database is the most inclusive of membrane proteins only requiring they contain either a signal sequence, transmembrane domain, GPI anchor or other membrane associated motifs yielding a total of 5,746 proteins. This will include organelle membrane proteins. It is known that proteins can traffic from the internal organelles to the cell surface so these can be bonified cell surface proteins too. To increase the informatics stringency for membrane proteins we have now applied a new database aggregated from work by the Wollscheid lab, called SURFY (Bausch-Fluck et al., The in silico human surfaceome. PNAS. 2018). This is a machine learning method trained on 735 high confidence membrane proteins from the Cell Surface Protein Atlas (CSPA). SURFY predicts a total of 2,886 cell surface proteins. When we filter our data using SURFY for proteins, peptides and label free quantitation (LFQ) area for three methods, we find that the difference between NHS-Biotin and WGA-HRP expands considerably (see new Figure 3-Supplemental Figure 1 below). We observe these differences when the datasets are searched with either the GOCC Plasma Membrane database or the entire human Uniprot database. The difference is especially large for LFQ analysis, which quantitatively scores peptide intensity as opposed to simply count the number hits as for protein and peptide analysis. Cytosolic carry over is the major disadvantage of NHS-Biotin, which suppresses signal strength and is reflected in the lower LFQ values (24% for NHS-biotin compared to 40% for WGA-HRP). We have updated the main text and supplemental figure below:

      "Both WGA-HRP and biocytin hydrazide had similar levels of cell surface enrichment on the peptide and protein level when cross-referenced with the SURFY curated database for extracellular surface proteins with a predicted transmembrane domain (Figure 3 - Figure supplement 1A). Sulfo-NHS-LC-LC-biotin and whole cell lysis returned the lowest percentage of cell surface enrichment, suggesting a larger portion of the total sulfo-NHS-LC-LC-biotin protein identifications were of intracellular origin, despite the use of the cell-impermeable format. These same enrichment levels were seen when the datasets were searched with the curated GOCC-PM database, as well as the Uniprot entire human proteome database (Figure 3 - Figure supplement 1B). Importantly, of the proteins quantified across all four conditions, biocytin hydrazide and WGA-HRP returned higher overall intensity values for SURFY-specified proteins than either sulfo-NHS-LC-LC-biotin or whole cell lysis. Importantly, although biocytin hydrazide shows slightly higher cell surface enrichment compared to WGA-HRP, we were unable to perform the comparative analysis at 500,000 cells--instead requiring 1.5 million--as the protocol yielded too few cells for analysis."

      Figure 3-Figure Supplement 1. Comparison of surface enrichment between replicates for different mass spectrometry methods. (A) The top three methods (NHS-Biotin, Biocytin Hydrazide, and WGA-HRP) were compared for their ability to enrich cell surface proteins on 1.5 M RWPE-1 Control cells by LC-MS/MS after being searched with the Uniprot GOCC Plasma Membrane database. Shown are enrichment levels on the protein, peptide, and average MS1 intensity of top three peptides (LFQ area) levels. (B) The top three methods (NHS-Biotin, Biocytin Hydrazide, and WGA-HRP) were compared for their ability to enrich cell surface proteins on 1.5 M RWPE-1 Control cells by LC-MS/MS after being searched with the entire human Uniprot database. Shown are enrichment levels on the protein, peptide, and average MS1 intensity of top three peptides (LFQ area) levels. Proteins or peptides detected from cell surface annotated proteins (determined by the SURFY database) were divided by the total number of proteins or peptides detected. LFQ areas corresponding to cell surface annotated proteins (SURFY) were divided by the total area sum intensity for each sample. The corresponding percentages for two biological replicates were plotted.

      There are additional advantages to WGA-HRP over NHS-biotin. These include: (i) labeling time is 2 min versus 30 min, which would afford higher kinetic resolution as needed, and (ii) the NHS-biotin labels lysines, which hinders tryptic cleavage and downstream peptide analysis, whereas the WGA-HRP labels tyrosines, eliminating impacts on tryptic patterns. WGA-HRP is slightly below biocytin hydrazide in peptide and protein ID and somewhat more by LFQ. However, there are significant advantages over biocytin hydrazide: (i) sample size for WGA-HRP can be reduced a factor of 3-5 because of cell loss during the multiple washing steps after periodate oxidation and hydrazide labeling, (ii) the time of labeling is dramatically reduced from 3 hr for hydrazide to 2 min for WGA-HRP, and (iii) the HRP enzyme has a large labeling diameter (20-40 nm, but also reported up to 200 nm) and can label non-glycosylated membrane proteins as opposed to biocytin hydrazide that only labels glycosylated proteins. The hydrazide method is the current standard for membrane protein enrichment, and we feel that the WGA-HRP will compete especially when cell sample size is limited or requires special handling. In the case of EVs, we were not able to perform hydrazide labeling due to the two-step process and small sample size.

      Indeed the list of identified proteins in figures 4 and 5 include several proteins whose expected subcellular location is internal, not surface exposed, and whose location in EVs should also be inside (non-exhaustively: SDCBP = syntenin, PDCD6IP = Alix, ARRDC1, VPS37B, NUP35 = nucleopore protein)…

      We thank the reviewer for this comment. We have elaborated on this point in a number of response paragraphs above. The proteins that the reviewer points out are annotated as “plasma membrane” in the very inclusive GOCC plasma membrane database. However, this means that they may also spend time in other locations in the cell or reside on organelle membranes. We have done further analysis to remove any intracellular membrane residing proteins that are included in the GOCC plasma membrane database, including the five proteins mentioned above. We also have further highlighted proteins that appear in the SURFY database, as discussed above and in our response to Reviewer 2’s comment. To increase stringency, we have bolded proteins that are found in the more selective SURFY database and italicized secreted proteins. Due to our new analysis and data presentation, it is more clear which markers are bona fide extracellular resident membrane proteins. We have updated the Figures and Figure legends as mentioned above, as well as added this statement in the Data Processing and Analysis methods:

      "Additionally, to not miss any key surface markers such as secreted proteins or anchored proteins without a transmembrane domain, we chose to initially avoid searching with a more stringent protein list, such as the curated SURFY database. However, following the analysis, we bolded proteins found in the SURFY database and italicized proteins known to be secreted (Uniprot)."

      The membrane proteins identified as different between the control and Myc-overexpressing cells or their EVs, would have been identified as well by a regular proteomic analysis.

      To directly compare surfaceomes of EVs to cells, we are compelled to use the same proteomic method. For parental cell surfaceomic analysis, a membrane enrichment method is required due to the high levels of cytosolic proteins that swamp out signal from membrane proteins. Although EVs have a higher proportion of membrane to cytosol, whole EV proteomics would still have significant cytosolic contamination.

      Second, the title highlights the benefit of the technique for small-scale samples: this is demonstrated for cells (figures 1-2), but not for EVs: no clear quantitative indication of amount of material used is provided for EV samples. Furthermore, no comparison with other biotinylation technics such as sulfo-NHS is provided for EVs/exosomes. Therefore, it is difficult to infer the benefit of this technic applied to the analysis of EVs/exosomes.

      We appreciate the reviewer for this comment. We have updated the methods as mentioned above in our response to the Essential Revisions. In brief, the yield of EVs post-sucrose gradient isolation was 3-5 µg of protein from 16x15 cm2 plates of cells, totaling 240 mL of media. Since we had previously demonstrated that our method was superior to sulfo-NHS for enriching surface proteins on cells, we proceeded to use the WGA-HRP for the EV labeling experiments.

      In addition, the WGA-based tethering approach, which is the only one used for the comparative analysis of figures 4 and 5, possibly induces a bias towards identification of proteins with a particular glycan signature: a novelty would possibly have come from a comparison of this approach with the other initially evaluated, the DNA-APEX one, where tethering is induced by lipid moieties, thus should not depend on glycans. The authors may have then identified by LC-MS/MS specific glycan-associated versus non-glycan-associated proteins in the cells or EVs membranes. Also ideally, the authors should have compared the 4 combinations of the 2 enzymes (APEX and HRP) and 2 tethers (lipid-bound DNA and WGA) to identify the bias introduced by each one.

      We thank the reviewer for this comment. We performed analysis to determine whether there was a bias towards Uniprot annotated “Glyco” vs “Non-Glyco” surface proteins within the SURFY database identified across the WGA-HRP, APEX2-DNA, APEX2, and HRP labeling methods. We performed this analysis by measuring the total LFQ area detected for each category (glycoprotein vs non-glycoprotein) and dividing that by the total LFQ area found across all proteins detected in the sample. We found similar normalized areas of non-glyco surface proteins between WGA-HRP and APEX2-DNA suggesting there is not a bias against non-glycosylated proteins in the WGA-HRP sample. There were slightly elevated levels of Glycoproteins in the WGA-HRP sample over APEX2-DNA. It is not surprising to us that there is little bias because the free-radicals generated by biotin-tyramide can label over tens of nanometers and thus can label not just the protein they are attached to, but neighbors also, regardless of glycosylation status. We have added this as Figure 2-Supplement 3, and amended the text in the manuscript below in purple.

      Figure 2 – Figure Supplement 3: Comparison of enrichment of Glyco- vs Non-Glyco-proteins. (A) TIC area of Uniprot annotated Glycoproteins compared to Non-Glycoproteins in the SURFY database for each labeling method compared to total TIC area. There was not a significant difference in detection of Non-Glycoproteins detected between WGA-HRP and APEX2-DNA and only a slightly higher detection of Glycoproteins in the WGA-HRP sample over APEX2-DNA.

      "As the mode of tethering WGA-HRP involves GlcNAc and sialic acid glycans, we wanted to determine whether there was a bias towards Uniprot annotated 'Glycoprotein' vs 'Non-Glycoprotein' surface proteins identified across the WGA-HRP, APEX2-DNA, APEX2, and HRP labeling methods. We looked specifically looked at surface proteins founds in the SURFY database, which is the most restrictive surface database and requires that proteins have a predicted transmembrane domain (Bausch-Fluck et al., The in silico human surfaceome. PNAS. 2018). We performed this analysis by measuring the average MS1 intensity across the top three peptides (area) for SURFY glycoproteins and non-glycoproteins for each sample and dividing that by the total LFQ area found across all GOCC annotated membrane proteins detected in each sample. We found similar normalized areas of non-glyco surface proteins across all samples (Figure 2 - Figure supplement 4). If a bias existed towards glycosylated proteins in WGA-HRP compared to the glycan agnostic APEX2-DNA sample, then we would have seen a larger percentage of non-glycosylated surface proteins identified in APEX2-DNA over WGA-HRP. Due to the large labeling radius of the HRP enzyme, we find it unsurprising that the WGA-HRP method is able to capture non-glycosylated proteins on the surface to the same degree (Rees et al. Selective Proteomic Proximity Labeling Assay SPPLAT. Current Protocols in Protein Science. 2015). There is a slight increase in the area percentage of glycoproteins detected in the WGA-HRP compared to the APEX2-DNA sample but this is likely due to the fact that a greater number of surface proteins in general are detected with WGA-HRP."

      As presented the article is thus an interesting technical description, which does not convince the reader of its benefit to use for further proteomic analyses of EVs or cells. Such info is of course interesting to share with other scientists as a sort of "negative" or "neutral" result. Maybe a novelty of the presented work is the differential proteome analysis of surface enriched EV/cell proteins in control versus myc-expressing cells. Such analyses of EVs from different derivatives of a tumor cell line have been performed before, for instance comparing cells with different K-Ras mutations (Demory-Beckler, Mol Cell proteomics 2013 # 23161513). However, here the authors compare also cells and EVs, and find possibly interesting discrepancies in the upregulated proteins. These results could probably be exploited more extensively. For instance, authors could give clearer info (lists) on the proteins differentially regulated in the different comparisons: in EVs from both cells, in EVs vs cells, in both cells.

      We appreciate the reviewer for this critique and have updated the manuscript accordingly. We have changed the title to “Cell surface tethered promiscuous biotinylators enable small-scale comparative surface proteomic analysis of human extracellular vesicles and cells” to more accurately depict the focus of our manuscript which, as the reviewer highlighted, is that this technology allows for comparative analysis between the surfaceomes of cells vs EVs. We appreciate the fine work from the Coffey lab on whole EV analysis of KRAS transformed cells. They identified a mix of surface and cytosolic proteins that change in EVs from the transformed cells, whereas our data focuses specifically on the surfaceome differences in Myc transformed and non-transformed cells and corresponding small EVs. We believe this makes important contributions to the field as well.

      To further address the reviewer’s suggestions, we additionally have significantly reorganized the figures to better display the differentially regulated proteins. We have removed the volcano plots and instead included heatmaps with the top 30 (Figure 3 and Figure 4) and top 50 (Figure 5) differentially regulated proteins across cells and EVs. We have also updated the lists of proteins in the supplemental source tables section. See responses to Reviewer 2 above for the updates to Figures 3-5. We have additionally included supplemental figures with lists of differentially upregulated proteins in the EV and Cell samples, which are shown below:

      Figure 3 – Supplement 3: List of proteins comparing enriched targets (>2-fold) in Myc cells versus Control cells. Targets that were found enriched (Myc/Control) in the Control cells (left) and Myc cells (right). The fold-change between Myc cells and Control cells is listed in the column to the right of the gene name.

      Figure 4 – Supplement 1: List of proteins comparing enriched targets (>1.5-fold) in Myc EVs versus Control EVs. Targets that were found enriched (Myc/Control) in the Control EVs (left) and Myc EVs (right). The fold-change between Myc EVs and Control EVs is listed in the column to the right of the gene name.

      Figure 4 – Figure Supplement 2: Venn diagram comparing enriched targets (>2-fold) in Cells and EVs. (A) Targets that were found enriched in the Control EVs (purple) and Control cells (blue) when each is separately compared to Myc EVs and Myc cells, respectively. The 5 overlapping enriched targets in common between Control cells and Control EVs are listed in the center. (B) Targets that were found enriched in the Myc EVs (purple) and Myc cells (blue) when each is separately compared to Control EVs and Control cells, respectively. The 12 overlapping enriched targets in common between Myc cells and Myc EVs are listed in the center.

      Figure 5 - Supplement 1: List of proteins comparing enriched targets (>2-fold) in Control EVs versus Control cells and Myc EVs versus Myc cells. (A)Targets that were found enriched (EV/cell) in the Control samples are listed. The fold-change values between Control EVs and Control cells are listed in the column to the right of the gene name. (B)Targets that were found enriched (EV/cell) in the Myc samples are listed. The fold-change values between Myc EVs and Myc cells are listed in the column to the right of the gene name.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): **

      **Summary**

      The authors have performed highly quantitative analyses of GPCR signaling to reveal heterogenous ERK and Akt activation patterns by using kinase translocation reporters. Using a massive number of single-cell imaging data, the authors show heterogeneous responses to GPCR agonists in the absence or presence of inhibitors. By cluster analysis, the responses of ERK and Akt were classified into eight and three patterns. This paper is clearly written with sufficient information for the reproducibility. However, the conclusion may not be necessarily supported by the provided data as described below.

      **Major comments:**

      This work has been well done in an organized way and adds new insight into the regulation of protein kinases by GPCRs. The conclusion will be of great interest in the field of single-cell signal dynamics and quantitative biology. On a bit negative note, considering the complexity of the downstream of GPCRs, some of the conclusions may need revision. *

      We thank the reviewer for the evaluation and for raising a number of comments that have helped us to strengthen the manuscript and that will be addressed below.

      1. The conclusion of the title that "Heterogeneity and dynamics of ERK/Akt activation by GPCR depend on the activated heterotrimeric G proteins," may not be supported by the data. The authors compared just one pair each of GPCR and ligand. The heterogeneity may come from the nature of the ligand or the characteristics of the single clone chosen for this study. The title may suggest that the heterogeneity depends only on the G-protein (although that is not what the title says). Instead, we mean that G-proteins play a role in the heterogeneity, as we infer from the experiments with the G-protein inhibitors. If the reviewer feels strongly about this, we are open to changing the title, for instance to:

      “Kinase translocation reporters reveal the single cell heterogeneity and dynamics of ERK and Akt activation by G protein-coupled receptors”

      • The obvious question is that why the authors did not analyze the correlation between ERK and Akt activity more extensively. Cell Profiler will be able to extract multiple cellular features. Linking the heterogeneous signals to cellular features will benefit readers in the broad cell biology field. If the authors wish to write another paper with that data, it should be at least discussed. *

      We agree that we can add more information on the correlation between ERK and Akt activity and we have added a plot that shows the co-incidence of the ERK and Akt clusters. This is now panel C of figure 8. We have no wish of writing another paper and we have made the data and code available, so anyone can do a more detailed analysis if desired.

      We appreciate the suggestion to correlate activities with cellular features, such as cell area and shape. However, in our analysis we use nuclear fluorescence to segment the nuclear and cytoplasmic fluorescence (as generally done in studies that use KTRs). Therefore, the information on cellular features is not readily available. Such analysis would require a marker for the cytoplasm or membrane (or yet another image analysis procedure).

      Another apparent flaw of this work is that YM was not challenged to UK-stimulated cells. The authors probably assumed lack of effect. Nevertheless, I believe it is required to show. Or, remove the PTx data from the Histamine-stimulated cell data.

      We agree that this is valuable data to include. Unfortunately, this experiment was done in a slightly different condition than the other experiments (different spacing of the time intervals) and we initially skipped the data for these reasons. After careful examination of the data, we have decided to include these data (added to figures 3 & 5).

      We note that we still miss the data from the YM+PTx data for UK and we have currently no way to carry out these experiments (mainly due to lack of funding). In our opinion, the absence of this data is not critical for the interpretation of the results. We prefer to show the YM+PTx data for the other two conditions.

      The most interesting response is that of S1P. ERK is biphasically activated. Combined inhibition of Gq and Gi failed to suppress ERK activity. It may be discussed why the biphasic activation pattern was not identified by the classification.

      We think that the biphasic activation pattern is reflected by cluster 7 and 8 and we now mention this in the text: “The biphasic ERK activation pattern, which is specific for stimulation with S1P are reflected by cluster 7 and 8.”

      For clarity, we now added the dynamics for each cluster to figure 9.

      *The authors argue that the brightness of the KTR reporter was not correlated with the dynamic range of ERK or Akt reporter (Supplementary Figure 3), but it is not clear. I had an impression that ERK-KTR brightness (Supplementary Figure 3A) has a slightly negative correlation with "maximum change in CN ratio" (Supplementary Figure 3B) (e.g., A6>B3>B5 in brightness and A6

      We thank the reviewer for the suggestion and have now added this data to supplemental figure 3 as panel C.

      The authors have shown cluster analyses for the temporal patterns in kinase activations. However, the only difference of cluster 3 and 5 (Figure 7) seem to be amplitude. The authors have also shown the amplitude is dependent on the dose of the activators, which together makes it difficult to see the biological meaning of discriminating the two patterns in comparing different agonists, e.g., Histamine, UK, and S1P. The authors should discuss their views on how the clustering analyses will benefit biological interpretations together with possible limitations.

      This is a valid point, and it is a consequence of clustering method. We have added text to the discussion to explain our view: “The clustering is a powerful method for the detection of patterns and simplification of large amounts of data. Yet, it should be realized that clustering is mathematical procedure that is not necessarily reflecting the biological processes. One example is the graded response of ERK and Akt activities to ligands, whereas cells are grouped in weak, middle and strong responders. This may be solved by developing and using clustering methods that take the underlying biological processes into account.”

      Considering the importance of the content, the supplemental note 2 may be included in the main text.

      We appreciate this suggestion, and we have incorporated supplemental note 2 in the main text.

      \*Minor comments:**

      1. The authors should clarify the cell type they used (HeLa cells) in the main text and figure legends. *

      This information is now indicated in the first paragraph of the results section and in the legend of figure1.

      Supplementary note1: The data-not-shown data (no correlation of KTR expression and its response to serum) should be very informative for the readers. The data should be shown as an independent supplementary figure.

      This relates to major point 5 and we agree that this is valuable. The data of the expression and the maximum response has been added to supplementary figure 3 as panel C.

      Supplementary Figure S2: The authors should clarify this image processing is about background subtraction. Also, the authors should clearly note "rolling ball with a radius of 70 pixels" is about an ImageJ function, "Subtract Background".

      We added text to highlight that the processing is a background subtraction and noise reduction. We added text to explain it is a FIJI function.

        1. Supplementary Figure S5: Figure labels are "A, A, B, B" not "A, B, C, D". Also the top two figures are lacking Y axis labels. *

      Thanks for pointing this out. We the labels are corrected.

      Page6 (top): The authors should mention the description is about Supplementary Figure S5 (UK) and Supplementary Figure S6 (S1P).

      This is an accidental omission, it is corrected.

      Figure 3: the figures are lacking x-axis labels (probably uM, nM and pM from left).

      Well spotted, this is fixed by adding the units to the labels for each ligand.

      Values in tables: The significant figure must be 2, at best. This should be consistent throughout the text. For example, "The EC50 values for histamine, S1P and UK were respectively 0.3 μM, 63.7 nM and 2.5 pM." This is somewhat awkward.

      This has been fixed in the text and in the table.

      Page 7, the first paragraph: No comments on S1P!

      We added our observation that: “The response to S1P is hardly affected by YM, but the amplitude is reduced by PTx.”

      Fig. 3: 100 mM must read as 100 micromolar.

      We do not understand this comment, but the units of figure 3 are now corrected (see also point 6).

      • Fig. 9: Concentration unit is missing.*

      Thanks for pointing this out, units are added.

      • Page 11, line 4: EKR should read as ERK. *

      Fixed

      • Page 13: "So far, only a couple of studies looked into kinase activation by GPCRs and these studies used overexpressed receptors [32,33]." Please describe precisely. Protein kinase activation by GPCR has been studied more than 20 years. Why are these two recent papers cited here? *

      We updated the text to explain that: “So far, only a couple of studies looked into kinase activation by GPCRs in single cells with KTRs and these studies used overexpressed receptors”.

      "This is in marked contrast to other fluorescent biosensors that typically require an overexpressed receptor for robust responses [34]." Following words should be included in the end: "in our hands".

      We’ve included the suggested line.

      • "Histamine is reported to predominantly activate Gq in HeLa cells [36] and UK activates Gi [37]." Describe the name of receptors for the better understanding. *

      We added names: “Histamine is reported to predominantly activate Gq in HeLa cells by the histamine H1 receptor [36] and UK activates Gi by α2-adrenergic receptors [37]”

      • "S1P can activate a number of different GPCRs, all known to be expressed by HeLa cells [24]." Why is this paper chosen? The authors can easily find RNA-Seq data, if they wish to see the expression level. The cited paper did not scrutinize the S1P receptors expressed in HeLa cells. *

      The S1PR levels are scrutinized in the cited paper, but it is ‘hidden’ in the supplemental figure S4A. We will clarify this and explicitly mention this supplemental figure: “The situation for S1P is different. S1P can activate a number of different GPCRs, all known to be expressed by HeLa cells as shown in the supplemental figure S4A of [24]”

      *Reviewer #1 (Significance (Required)):

      The authors used biosensors for ERK and Akt to examine the kinetics of activation by GPCR ligands. Technical advancement is in the massive analysis method and cluster analysis. This is an important direction for the quantitative biology. GPCR signaling is complex because of multiple receptors coupled with different G proteins. The simple ones such as histamine receptor and alpha2-adrenergic receptor can be easily analyzed as shown in this study. However, there are many S1P receptors, which make the interpretation difficult. If the authors could have shown interesting proposal on this data, the paper may interest many researchers in the field of cell biology and systems biology.

      Expertise: Cell biology, signal transduction of protein kinases, fluorescence microscopy.

      **Referee Cross-commenting**

      1. I agree with the other two reviewers in that immunoblotting data is required to show the efficiency of P2A cleavage.
      2. All reviewers think it looks strange that the authors did not show UK + YM data.
      3. Showing the dynamic range of the biosensors will reinforce the data as Reviewer #3 states. ERK-KTR is quite sensitive and can be easily saturated. Ideally, the ratio of pERK vs ERK can be quantified by the different mobility in SDS-PAGE. But, I do not know how we can do it for Akt.

        Reviewer #2 (Evidence, reproducibility and clarity (Required)):

        **Summary**

        In this paper Chavez-Abiega and colleagues investigate the dynamics of ERK and Akt activity downstream of several G protein-couples receptors (GPCRs). Using drugs to block specific G-proteins, they probe the activation of ERK/Akt by different heterotrimeric G proteins with fluorescent biosensors at the single cell resolution. Main finding is that ERK/AKT can be activated by different G-proteins, depending on the receptor coupling to the G-protein subclass, and that the ERK/AKT dynamics for S1P are specifically heterogeneous. Moreover, it seems that the AKT signaling response is very similar to ERK after GPCR stimulation.

        **Major points:**

        1) For this paper, the authors produced a new construct to express simultaneously the nuclear marker, the Akt and the ERK biosensors. The tree parts are connected by P2A peptides that determine their separation. Although, the biosensors are based on existing ones, the connection between them by P2A might create artifacts if the separation of the two parts is incomplete. For that, important controls are missing, such as treatment with an ERK and an Akt inhibitor. If the two parts are well separated the inhibitors should block the cytosol translocation of one of the two components and not of the other. This control is also important to check if in HeLa cells the Akt biosensors is not phosphorylated by ERK as well, as described in other reports. Alternatively, P2A separation can be quantified on a protein blot. *

      We agree that it is important to establish that the P2A sequence results in separation of the reporters. There are several observations that support our notion that the separation is efficient. First, we have been using the 2A-like sequences for over a decade in HeLa cells (first paper: doi:10.1038/nmeth.1415) and we have never encountered situations where the cleavage was problematic. Second, the distribution in signal of the nuclear Scarlet probe differs substantially from that of the mTurquoise2 and the mNeonGreen probe. Third, the dynamics of the ERK-KTR and Akt-KTR are different. Fourth, we have included new data with an ERK inhibitor, showing that the Akt-KTR responds independently of the ERK-KTR (figure S5). We have also added text to explain this: “Next, we examined the effect of the MEK inhibitor PD 0325901. Pre-incubation with the inhibitor for 20 minutes blocked the response of the ERK-KTR to FBS, but not that of Akt-KTR (Supplemental Figure S5). This supports previous observations [14] [15] that the P2A effectively separates the different components, since the Akt-KTR and ERK-KTR show independent relocation patterns.”

      This latter point is also supported by the co-incidence plot of the ERK versus Akt clusters (figure 8C) showing that the probes act independently (which is the main reason for using this strategy).

      Although any of the aforementioned points cannot exclude that a small fraction of the probe remains fused, we think that this potential issue is far outweighed by the benefits of the use of 2A peptides.

      2) The description of ERK and Akt should be reported in a more uniform way, such as using the same representations for both (e.g. the equivalent of figure 2 for Akt is missing) or the same number of clusters.

      We choose to concentrate first on ERK activity, that is why a similar plot for Akt activation is not shown. However, the Akt responses are detailed in figure 4 and supplemental figures S5 and S7.

      For the cluster analysis, we looked into the optimal number of clusters (as explained in Supplemental note S2). This number differs for ERK and Akt, since the complexity of the responses is different. We move supplemental note 2 to the main text, which also clarifies the different number of clusters that we used for the analysis.

      3) Figure 3 & Figure 5: It seems that the YM and YM+PTx data for the UK 14304 data is missing. This would be an interesting addition to the manuscript, and it is easy to add. A similar analysis for the Akt sensor is missing in figure 3 and should be added for consistency. Figure 4 shows data for Akt, but as timeseries and only for Histamine. See point 2, it would benefit the reader greatly if ERK and AKT are presented in a more uniform and complete fashion throughout the manuscript.

      We agree that it is valuable to add data for UK with YM. This data has been added, see also reply to reviewer 1, major point 3

      As for the Akt data, the response was largely similar albeit with less complexity and a lower amplitude. This is the reason to focus on ERK and this is explained in the discussion: “Therefore, the measurement of Akt does not add information. Moreover, the Akt response had a relatively poor amplitude.”

      4) In the results text of figure 4, the authors state that "...as shown in Figure 4C-D, which is in line with the effect of histamine on ERK.". It is unclear what the authors mean with this statement, the effects of single/double inhibition of Histamine stimulation on ERK are not quantified or discussed. Both responses can be quantified more carefully and compared.

      We agree that this is poorly formulated, and we rephrase it to make it clearer: “Inhibition of Gq (figure 4C) decreases the maximum activity up to ~70%, and simultaneous inhibition of Gq and Gi causes a decrease of the responses up to ~90%, as shown in Figure 4D. These Akt amplitudes and effects of inhibitors are largely similar to those observed for ERK.”

      5) This paper would benefit from a mechanistic investigation. For instance, the authors could investigate the pathways that lead to the generation of the pulse of ERK and Akt. These (preliminary) results presented call for deeper investigation into the signaling pathway from Gai and Gaq to ERK and AKT, and the authors are in a great position to probe this. One simple approach is to explore the upstream pathway, such as the MAPK cascade, PI3K, RTKs by means of inhibitors.

      We agree that there is much that can be done with the KTR technology. To this end, we deposit the probe and make all our data analysis methods available. We hope that others will benefit from our efforts and use the tools for mechanistic studies. 6) Since different G-proteins seem to elicit similar responses on ERK and especially for Akt, it is likely a B-arrestin / beta-gamma subunit mediated mechanism? It would be interesting to hear what the authors think of this, did they investigate/consider this possibility? E.g. Perhaps blocking RTK signaling / B-arrestin signaling would reduce heterogeneity?

      We appreciate this suggestion and have added a statement to the discussion: “Based on our data, we cannot exclude that beta-arrestin or RTKs play a role in the activation of ERK and Akt. To study the role of non-classical routes to ERK activation, inhibitor studies, or probes that interrogate these processes would be useful.”

      7) The authors should take a serious effort to summarize the data in the figures better. Many plots that can be merged/presented in a more concise way, which would improve the readability of the manuscript greatly.

      We will take care to improve the data visualization during the revision. We will address any specific points that are raised.

      \*Minor points:**

      1) The authors should spell out in the legend of each figure if they are representing the absolute C/N or the normalized C/N *

      Thanks for pointing this out. We added this information to the legends and it is also written in the materials and methods: "data was normalized by subtracting the average of two time points prior to stimulation (usually the 5th and 6th time point) from every data point."

      2) In Figure 2 the authors should show the control with no stimulus. Also would be informative to inform the reader about the stimulation protocol used, or indicate the stimulation time and length in the figure.

      We have added the no stimulus control and added the information to the legend.

      3) Figure 3: This figure would benefit from a different presentation of the data, it is currently confusing. E.g. Average curves per drug condition in a single graph would present the point the authors make more clear and concise, and this single cell overview can be moved to supplements.

      Our main focus is on single cell analysis and we think that the current plots convey the message in a clear and transparent fashion. It is in line with the recently proposed idea of “superplot” (https://doi.org/10.1083/jcb.202001064). We also provide scripts and data, enabling anyone to replot the data if that is desired.

      4) Figure 4 legend states "CN ERK" and "ERK C/N", but is depicting only Akt responses? Only in 4c the axes are labeled, this together is very confusing.

      Thanks for pointing this out. This is corrected

      *5) Figure 5 is missing the controls with ERK and Akt inhibitors, to show the loss of correlation between the AUC of the two

      *We have included data with a MEK inhibitor (new supplemental figure S5) to demonstrate the specificity of the probe and it also demonstrates that Akt can be independently activated

      6) Figure 6, the presumed lack of correlation between baseline activity and response should be confirmed statistically.

      We have improved the presentation of figure 6. We now show only the maximal response and how this varies between conditions. It is evident from the graphical representation that the curves are similar for the different start ratios. We feel that the use of statistics is not necessary here.

      7) It seems that in S1P treated cells there is a second oscillation in ERK activity well visible in figure 2 and also in S10. Could the authors comment on that?

      We add text to the discussion to address this: “We observed that activation of endogenous S1P receptors resulted in a strong, but highly heterogeneous ERK-KTR response, with two peaks in a population of cells.” and “When PTx is present, the biphasic response is abolished and the first peak of activation is reduced, suggesting that the initial response is due to Gi signaling.”

      *8) In the abstract it is unclear what authors mean with "UK".

      *Changed to brimonidine

      9) Figure 9, it would be helpful to visually repeat the typical curve of the different clusters here, to guide the reader.

      This is a good suggestion and we have added the typical curves for the different clusters to the plot.

      10) The observed heterogeneity in responses might be related to different cell cycle stages, did the authors investigated/consider this possibility (e.g. with a cell cycle biosensor)?

      This is a very valid comment. We do consider its importance, but we did not investigate the effects of cell cycle.

      *Reviewer #2 (Significance (Required)):

      The paper describes with high accuracy the dynamics of ERK and Akt biosensors downstream of several GPCRs.

      However, it feels like this is a preliminary report that leaves many important questions still open. It does not provide mechanistic insight and doesn't fully exploit the potential of single-cell technologies. The authors have the tools to investigate several important questions that are left open in the manuscript (e.g. connection Gaq/Gai to ERK/AKT, B-arrestin/betagamma involvement). Moreover, some important controls are missing. The authors should also consider the data presentation in the figures, to improve readability and interpretation of the manuscript.

      Properly revised, would be of interest for a broad audience in cell biology, specifically GPCR and RTK signaling fields.

      Expertise in cell biology, gpcr and rtk signaling, fluorescent biosensors.

      **Referee Cross-commenting**

      I agree with the assessments by the other reviewers.

      Indeed showing the dynamic range of the biosensors, as Reviewer #3 states, would strengthen the manuscript and put the S1P response heterogeneity in context.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This manuscript uses a live-cell biosensor approach to examine the activity kinetics of the ERK and Akt kinases in response to different GPCR ligands. The paper provides a detailed description of the development of a HeLa reporter cell line that expresses both Akt and ERK biosensors, along with a nuclear marker for use in cell tracking. The authors then catalog the individual responses from thousands of cells to three GPCR ligands. Individual cells show strong correlation in stimulated ERK and Akt activity. Using inhibitors for Gq and Gi proteins, it is shown that ERK and Akt activities are dependent on different G proteins. The authors also show that the heterogeneous responses within each population can be decomposed into several clusters representing similar dynamic behaviors; the frequencies of these clusters increase or decrease depending on treatments.

      Overall, this is a well documented extension of an existing biosensor approach to examine GPCR signaling, and the approach is clearly described. There are however, some control experiments that are essential to support the conclusions.

      **Major comments:**

      1. The maximal responses of ERK and Akt biosensors in the selected cell clone are not adequately shown. Although FBS responsiveness is used as a validation and selection criterion, it would be much more informative to show the distribution of single-cell responses for defined activators of ERK and Akt, such as EGF and IGF-1, respectively. Without seeing the variability in these responses, it is difficult to put the heterogeneity observed in GPCR responses into context. *

      The FBS is used as a (crude) way to examine responsiveness of the clones. We understand that treatment of the cells with growth factors would add more data and therefore more information to the manuscript. However, the main aim of the study is to examine whether KTR technology can be used to study endogenous GPCR signaling. It is clear that the answer is positive. Next, we asked whether we could detect differences for different GPCRs and that was the focus of this study. It is unclear how studies with EGF would add new information to our observations.

      It is not clear whether the basal activity for the biosensors represents actual activity or simply the measurement floor. This should be established by using saturating treatment inhibitors for ERK and Akt to determine the biosensor readings in the absence of any activity. Ideally, an approach such as the one shown by Ponsioen et al. (PMID: 33795873) should be used to determine the dynamic range of the sensors.

      We studied the basal levels and the effect of serum. We found that the basal levels are reduced by replacing the growth medium with serum free medium. The reduction in C/N ratio reaches a plateau after ~ 2hours of replacing the medium. This data is added as supplemental figure S4. Therefore, we have performed all experiments 2 hours after replacing the growth medium with serum free imaging medium.

      Because the biosensors are separated by self-cleaving peptides, there is the potential that incomplete cleavage could complicate the results. Cleavage efficiency should be assessed by western blot or an equivalent method.

      We agree that it is important to establish that the P2A sequence results in separation of the reporters. There are several observations that support our notion that the separation is efficient. First, we have been using the 2A-like sequences for over a decade in HeLa cells (first paper: doi:10.1038/nmeth.1415) and we have never encountered situations where the cleavage was problematic. Second, the distribution in signal of the nuclear Scarlet probe differs substantially from that of the mTurquoise2 and mNeonGreen probe. Third, the dynamics of the ERK-KTR and Akt-KTR are different. Fourth, we have included new data with an ERK inhibitor, showing that the Akt-KTR responds independently of the ERK-KTR (figure S5). We have also added text to explain this: “Next, we examined the effect of the MEK inhibitor PD 0325901. Pre-incubation with the inhibitor for 20 minutes blocked the response of the ERK-KTR to FBS, but not that of Akt-KTR (Supplemental Figure S5). This supports previous observations [14] [15] that the P2A effectively separates the different components, since the Akt-KTR and ERK-KTR show independent relocation patterns.”

      This latter point is also supported by the co-incidence plot of the ERK versus Akt clusters (figure 8C) showing that the probes act independently (which is the main reason for using this strategy).

      Although any of the aforementioned points cannot exclude that a small fraction of the probe remains fused, we think that this potential issue is far outweighed by the benefits of the use of 2A peptides.

      Ideally, an alternate method such as immunofluorescence for phosphorylated ERK/Akt or their substrates could be used in a subset of the conditions to validate the heterogeneity observed by the biosensors.

      We thank the reviewer for this suggestion. Since we see a lot of variability in the dynamics, which cannot be addressed by immunofluorescence, we do not think this will experiment be valuable. Of note, GPCR activity is known to induce ERK activity in a dose-dependent manner on a population level as determined with immunolabeling methods and that is what we observe with the ERK KTR as well.

      \*Minor comments:**

      1. In the introduction, more rationale and background could be provided for the examination of GPCR-stimulated ERK and Akt activity. There is not much information provided on why this is an interesting question. Other than the involvement of beta arrestin and RTK transactivation, which are mentioned, what mechanisms are known to be involved? Also, the importance of ERK and Akt in cancer is brought up, but it is not made clear how this approach or results would connect specifically to a cancer model. *

      We think that the connections between heterotrimeric G-proteins and kinase activity are not well established. Except for the classical Gq -> PKC -> ERK pathway, not so much is known and we add this to the discussion: “The classic downstream effector of Gq is PKC, which can activate ERK. On the other hand, it is not so clear how Gq would affect Akt. The molecular network that connects the activity of Gi with kinases also not so clear.”

      *It would be helpful to provide some explanation for why the UK+YM and UK+YM+PTx data are not shown in figure 3

      *

      We agree that this is valuable data to include. Unfortunately, this experiment was done in a slightly different condition than the other experiments (different spacing of the time intervals) and we initially skipped the data for these reasons. After careful examination of the data, we have decided to include these data (added to figures 3 & 5).

      We note that we still miss the data from the YM+PTx data for UK and we have currently no way to carry out these experiments (mainly due to lack of funding). We prefer to show the YM+PTx data for the other two conditions.

      • In the Abstract figure, it is not clear which samples "Inhibitor" and "Agonist" are referring to. **

        *

      Thanks for this comment. We will remove the visual abstract when the preprint is submitted to a journal.

      * Reviewer #3 (Significance (Required)):

      While similar reporter approaches have been used in a number of papers to examine growth factor signaling dynamics of ERK and Akt, this manuscript is the first I have seen to examine the responses of these kinases to different GPCR ligands. In doing so, it adds significantly to the growing body of literature on single-cell signaling responses. The mechanisms of ERK and Akt activation by GPCRs remain somewhat ambiguous, and the data reported here will be helpful in refining models for this signal transduction process. The findings that the GPCR ligands examined show different G protein dependencies than anticipated is an interesting facet, as is the observation that, while ERK and Akt are generally correlated, inhibition of Gi preferentially blocks S1P-induced ERK activity more so than Akt activity. However, the main findings of heterogeneity in signaling, and the observation of clusters that describe the different dynamic behaviors present within a population, are highly consistent with what has been shown in other systems. Overall, this study is a useful confirmation that GPCR signaling to ERK and Akt follows a similar pattern to other forms of stimulation.

      **Referee Cross-commenting**

      Regarding the dynamic range, I don't think it is necessary to do a western blot (though this would be nice) - I think it would be sufficient to show maximal activation using EGF/IGF and full suppression using MEK/ERK and Akt inhibitors. I also agree that all the points raised by the other reviewers. In particular, a deeper exploration and better visualization of the relationship between ERK and Akt would be very useful, as noted by both Reviewers #1 and #2.*

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The authors have performed highly quantitative analyses of GPCR signaling to reveal heterogenous ERK and Akt activation patterns by using kinase translocation reporters. Using a massive number of single-cell imaging data, the authors show heterogeneous responses to GPCR agonists in the absence or presence of inhibitors. By cluster analysis, the responses of ERK and Akt were classified into eight and three patterns. This paper is clearly written with sufficient information for the reproducibility. However, the conclusion may not be necessarily supported by the provided data as described below.

      Major comments:

      This work has been well done in an organized way and adds new insight into the regulation of protein kinases by GPCRs. The conclusion will be of great interest in the field of single-cell signal dynamics and quantitative biology. On a bit negative note, considering the complexity of the downstream of GPCRs, some of the conclusions may need revision.

      1. The conclusion of the title that "Heterogeneity and dynamics of ERK/Akt activation by GPCR depend on the activated heterotrimeric G proteins," may not be supported by the data. The authors compared just one pair each of GPCR and ligand. The heterogeneity may come from the nature of the ligand or the characteristics of the single clone chosen for this study.
      2. The obvious question is that why the authors did not analyze the correlation between ERK and Akt activity more extensively. Cell Profiler will be able to extract multiple cellular features. Linking the heterogeneous signals to cellular features will benefit readers in the broad cell biology field. If the authors wish to write another paper with that data, it should be at least discussed.
      3. Another apparent flaw of this work is that YM was not challenged to UK-stimulated cells. The authors probably assumed lack of effect. Nevertheless, I believe it is required to show. Or, remove the PTx data from the Histamine-stimulated cell data.
      4. The most interesting response is that of S1P. ERK is biphasically activated. Combined inhibition of Gq and Gi failed to suppress ERK activity. It may be discussed why the biphasic activation pattern was not identified by the classification.
      5. The authors argue that the brightness of the KTR reporter was not correlated with the dynamic range of ERK or Akt reporter (Supplementary Figure 3), but it is not clear. I had an impression that ERK-KTR brightness (Supplementary Figure 3A) has a slightly negative correlation with "maximum change in CN ratio" (Supplementary Figure 3B) (e.g., A6>B3>B5 in brightness and A6<B3<B5 in maximum change in CN ratio). The authors should show dot plots of average fluorescence vs. the maximum change in CN ratio.
      6. The authors have shown cluster analyses for the temporal patterns in kinase activations. However, the only difference of cluster 3 and 5 (Figure 7) seem to be amplitude. The authors have also shown the amplitude is dependent on the dose of the activators, which together makes it difficult to see the biological meaning of discriminating the two patterns in comparing different agonists, e.g., Histamine, UK, and S1P. The authors should discuss their views on how the clustering analyses will benefit biological interpretations together with possible limitations.
      7. Considering the importance of the content, the supplemental note 2 may be included in the main text.

      Minor comments:

      1. The authors should clarify the cell type they used (HeLa cells) in the main text and figure legends.
      2. Supplementary note1: The data-not-shown data (no correlation of KTR expression and its response to serum) should be very informative for the readers. The data should be shown as an independent supplementary figure.
      3. Supplementary Figure S2: The authors should clarify this image processing is about background subtraction. Also, the authors should clearly note "rolling ball with a radius of 70 pixels" is about an ImageJ function, "Subtract Background".
      4. Supplementary Figure S5: Figure labels are "A, A, B, B" not "A, B, C, D". Also the top two figures are lacking Y axis labels.
      5. Page6 (top): The authors should mention the description is about Supplementary Figure S5 (UK) and Supplementary Figure S6 (S1P).
      6. Figure 3: the figures are lacking x-axis labels (probably uM, nM and pM from left).
      7. Values in tables: The significant figure must be 2, at best. This should be consistent throughout the text. For example, "The EC50 values for histamine, S1P and UK were respectively 0.3 μM, 63.7 nM and 2.5 pM." This is somewhat awkward.
      8. Page 7, the first paragraph: No comments on S1P!
      9. Fig. 3: 100 mM must read as 100 micromolar.
      10. Fig. 9: Concentration unit is missing.
      11. Page 11, line 4: EKR should read as ERK.
      12. Page 13: "So far, only a couple of studies looked into kinase activation by GPCRs and these studies used overexpressed receptors [32,33]." Please describe precisely. Protein kinase activation by GPCR has been studied more than 20 years. Why are these two recent papers cited here?
      13. "This is in marked contrast to other fluorescent biosensors that typically require an overexpressed receptor for robust responses [34]." Following words should be included in the end: "in our hands".
      14. "Histamine is reported to predominantly activate Gq in HeLa cells [36] and UK activates Gi [37]." Describe the name of receptors for the better understanding.
      15. "S1P can activate a number of different GPCRs, all known to be expressed by HeLa cells [24]." Why is this paper chosen? The authors can easily find RNA-Seq data, if they wish to see the expression level. The cited paper did not scrutinize the S1P receptors expressed in HeLa cells.

      Significance

      The authors used biosensors for ERK and Akt to examine the kinetics of activation by GPCR ligands. Technical advancement is in the massive analysis method and cluster analysis. This is an important direction for the quantitative biology. GPCR signaling is complex because of multiple receptors coupled with different G proteins. The simple ones such as histamine receptor and alpha2-adrenergic receptor can be easily analyzed as shown in this study. However, there are many S1P receptors, which make the interpretation difficult. If the authors could have shown interesting proposal on this data, the paper may interest many researchers in the field of cell biology and systems biology.

      Expertise: Cell biology, signal transduction of protein kinases, fluorescence microscopy.

      Referee Cross-commenting

      1. I agree with the other two reviewers in that immunoblotting data is required to show the efficiency of P2A cleavage.
      2. All reviewers think it looks strange that the authors did not show UK + YM data.
      3. Showing the dynamic range of the biosensors will reinforce the data as Reviewer #3 states. ERK-KTR is quite sensitive and can be easily saturated. Ideally, the ratio of pERK vs ERK can be quantified by the different mobility in SDS-PAGE. But, I do not know how we can do it for Akt.
    1. Likewise, the filing cabinet cannot feed itself without user collaboration; indeed, without a user, the filing cabinet cannot even start its combinatory po-tential. Nevertheless, the card index is used as a true ‘communicative partner’ because it has proper autonomy. In a sense, the card index is fully dependent on and fully independent of the user. The inner structure is methodically ar-ranged so that the users, whoever they may be, can in principle use it; entries are linked so that once the combinatory potential begun, combinations repro-duce themselves and increase the available complexity in unexpected ways.34

      There is an interesting analogy here worth pursuing:

      This idea and its structure have lots of similarities to those of growth and evolution in Werner R. Loewenstein's The Touchstone of Life: Molecular Information, Cell Communication, and the Foundations of Life. What if we reframe RNA or mitochondria in the role of the filing cabinet? What emergent properties occur in these processes? What do these processes have in common?

      I need at least some shorthand idea or word for talking about the circular evolving processes of life in Loewenstein's book. Maybe evolution spirals?

      Think inputs and outputs.

    1. Author Response:

      Reviewer #2 (Public Review):

      Oberle et al. provide a detailed analysis of how descending projections from the auditory cortex interact with ascending auditory projections on neurons in the shell region of the inferior colliculus on a cellular basis. Using optogenetic activation of auditory cortical neurons or projections and electrical stimulation of fibres in combination with whole-cell patch clamp recordings in vivo and in vitro, they show that most neurons in the shell region of the inferior colliculus receive several monosynaptic cortical inputs. In vitro, these descending synapses show sublinear summation with a major tonic component for prolonged stimuli. Both in vivo and in vivo experiments support the idea that descending cortical inputs and ascending inputs from the central inferior colliculus temporally overlap and both activate NMDA and non-NMDA receptors. This cooperativity of inputs leads to supra-linear summation and boosting of the response.

      Strengths:

      • The manuscript provides a first detailed analysis of a loop between the cortex and midbrain. It elegantly combines in vivo and in vitro electrophysiological techniques to study this network on a cellular/synaptic level.

      • These experiments thoroughly characterize the nature of cortical and midbrain excitatory inputs onto shell IC neurons and elucidate how they integrate the ascending and descending inputs on a cellular level.

      Weaknesses:

      • A major weakness of this study is that they do not directly show that ascending and descending inputs to the IC shell neurons actually coincide, but only imply that this should be the case, considering different latency measurements. Latencies that are measured in the anesthetized preparation may change in the awake behaving animals which may change the timing of the respective inputs.

      We rectify this issue in our revision with new data showing that the latency of sound-evoked activity in the superficial IC is similar in anesthetized and awake mice. We acknowledge that the conduction velocity of descending axons may differ between anesthetized and awake state. However, existing data show that conduction velocities of cortical axons increase in the alert brain compared to non-alert conditions (Stoelzel et al., 2017). Taken together, we would expect an increased temporal coincidence of ascending and descending signals in awake compared to anesthetized animals, which all available evidence suggests would enhance NMDAR-dependent non-linearities such as those we described (Gasparini et al., 2004; Gasparini and Magee, 2006; Losonczy and Magee, 2006; Takahashi and Magee, 2009; Branco et al., 2010; Branco and Häusser, 2011). We now revise our Results to highlight that our latency measurements in anesthetized mice represent the upper bound for the arrival of auditory cortical EPSPs.

      In addition, the authors do not show to what extent coincidence of ascending and descending inputs to shell IC neurons is maintained for longer and more complex sounds as compared to click stimuli.

      Previous work shows that auditory cortico-collicular neurons sustain firing during long, complex sounds (Williamson and Polley, 2019), and our data show that descending transmission is maintained for extended periods of corticofugal activity both in vitro and in vivo (Figure 4E-H). Thus, we would expect temporal overlap of ascending and descending inputs to occur under these conditions as well. We agree that Reviewer #2 touches upon an important knowledge gap. However, we believe that a full investigation of which sounds do and do not engage descending modulation merits a separate, in-depth study.

      • The manuscript does not address the question of whether the different neuron types that they encounter in the shell region based on the firing pattern to current injections, vary in their input latencies, their number and distribution of NMDA receptors or their integrative properties. This may have some additional effect on how these neurons process ascending and descending information.

      We agree that correlating intrinsic and synaptic properties could reveal something interesting. However, our initial analyses (Figure 3) did not show any striking correlation between membrane biophysics and the half-width or amplitude of descending EPSPs. As such, we had no a priori basis to hypothesize that synaptic integration differs systematically with measurable membrane properties, and the low-throughput of dual pathway stimulation experiments (Figures 6 and 8) precluded collecting a large dataset needed to convincingly determine if any synaptic non-linearity does or does not meaningfully correlate with the cellular biophysics.

      We acknowledge this limitation of our study in our revised Discussion. Future studies, perhaps leveraging cell-type specific markers for different IC neurons (Goyer et al., 2019; Naumov et al., 2019; Silveira et al., 2020; Kreeger et al., 2021) will be required to clarify this issue.

      • The authors have not demonstrated that silencing of descending inputs from the AC affects IC shell activity.

      We did not initially perform this experiment given the extensive literature establishing that silencing auditory cortex modifies the magnitude, timing, and/or selectivity of IC neuron sound responses (Yan and Suga, 1999; Nwabueze-Ogbo et al., 2002; Popelár et al., 2003; Nakamoto et al., 2008, 2010; Anderson and Malmierca, 2013; Popelář et al., 2016; Weible et al., 2020). Indeed, these classic results were a major motivation for us to focus on the cellular mechanisms that support corticofugal transmission. We thus reasoned that a cortical inactivation experiment would be largely confirmatory of prior knowledge, and limited in its potential for mechanistic interpretation given the known caveats of cortical loss-of-function manipulations (Li et al., 2019; Andrei et al., 2021; Slonina et al., 2021). However, we acknowledge that such an experiment is useful to frame our cellular-level findings in a broader, systems-level context. As such, we address Reviewer #2’s concern in our revision with a new experiment demonstrating that auditory cortical silencing indeed affects sound-evoked activity in the IC of awake mice.

      Reviewer #3 (Public Review):

      Overall, this manuscript is generally nicely written and well-illustrated. I don´t really have any major issues. I like the manuscript but I have a few comments and some issues that need to be addressed.

      My main concern is that the authors claim several times that the projections to the central nucleus of IC are weak and they neglect their potential functional role. I think this is a little bit unfortunate. It is true that the large AC projection primarily targets the cortical regions or shell of IC, but it is beyond doubt that it also targets the central nucleus (e.g. Saldaña's studies) . We cannot know whether it is a weak projection or not without central nucleus recordings. Admittedly, these experiments would be challenging, so I would ask the authors to tone down a bit these comments throughout the ms. Also, the reason for the 'weak' projection to the central nucleus may be due to the size and location of the injections made in the auditory cortex. Thus, I would like to see the injections site of Chronos if possible. Likewise, fig 1B is too small and of low quality (at least in my pdf file for review) to appreciate details of labeling. I would suggest that the authors make a separate figure showing the injection site in the AC and larger and clearer labeling in the IC.

      We agree that in vitro recordings from the central IC in adult mice are quite challenging. As suggested we have toned down claims of the “weak” projection to central IC and provide micrographs of Chronos injection sites. However, we concur that this is an important point. Thus, we include a new transsynaptic tracing experiment showing the somata of presumptive postsynaptic targets of auditory cortex neurons in the IC. Although the data show that the majority of cortico-recipient IC neurons are located in the shell regions, a few central IC neurons are indeed clearly labeled. Future studies will be required to test the extent and potency of this direct auditory cortex->central IC projection, and to compare the synaptic properties with our results in the shell IC.

      Also I wonder if the title of the manuscript should refer to the non-lemniscal IC as most of the data is related to this area.

      We have changed the title of the paper to Synaptic Mechanisms of Top-Down Control in the Non-Lemniscal Inferior Colliculus.

      While the dogma is that the descending projections are glutamatergic, the authors may care to consider a recently published paper https://www.frontiersin.org/articles/10.3389/fncir.2021.714780/full, which challenges this view by showing that inhibitory long-range VIP-GABAergic neurons target the IC. It would be interesting if the authors could comment on how this projection may have influenced the results of the present study.

      We thank Reviewer #3 for pointing out this new study which does indeed relate to our work. However, we don’t think direct GABAergic projections contributed much, if at all to our results. Indeed, the experiments of Figure 5A did not reveal any inhibitory postsynaptic potentials following bath application of NBQX as one might expect from direct stimulation of VIP-GABA axons (these experiments were performed without SR95531 in the bath). Rather, it may be that the VIP-GABA synapses have low release probability, transmit mainly via non-synaptic diffusion (e.g., spillover), or may primarily release the neuropeptide VIP which would be difficult to detect via whole-cell patch-clamp electrophysiology. We now address the work of Bertero et al. in the Discussion section.

      References

      Anderson LA, Malmierca MS (2013) The effect of auditory cortex deactivation on stimulus-specific adaptation in the inferior colliculus of the rat. Eur J Neurosci 37:52–62.

      Andrei AR, Debes S, Chelaru M, Liu X, Rodarte E, Spudich JL, Janz R, Dragoi V (2021) Heterogeneous side effects of cortical inactivation in behaving animals. eLife 10:e66400.

      Branco T, Clark BA, Häusser M (2010) Dendritic discrimination of temporal input sequences in cortical neurons. Science 329:1671–1675.

      Branco T, Häusser M (2011) Synaptic integration gradients in single cortical pyramidal cell dendrites. Neuron 69:885–892.

      Gasparini S, Magee JC (2006) State-dependent dendritic computation in hippocampal CA1 pyramidal neurons. J Neurosci Off J Soc Neurosci 26:2088–2100.

      Gasparini S, Migliore M, Magee JC (2004) On the initiation and propagation of dendritic spikes in CA1 pyramidal neurons. J Neurosci Off J Soc Neurosci 24:11046–11056.

      Goyer D, Silveira MA, George AP, Beebe NL, Edelbrock RM, Malinski PT, Schofield BR, Roberts MT (2019) A novel class of inferior colliculus principal neurons labeled in vasoactive intestinal peptide-Cre mice. eLife 8:e43770.

      Kreeger LJ, Connelly CJ, Mehta P, Zemelman BV, Golding NL (2021) Excitatory cholecystokinin neurons of the midbrain integrate diverse temporal responses and drive auditory thalamic subdomains. Proc Natl Acad Sci U S A 118:e2007724118.

      Li N, Chen S, Guo ZV, Chen H, Huo Y, Inagaki HK, Chen G, Davis C, Hansel D, Guo C, Svoboda K (2019) Spatiotemporal constraints on optogenetic inactivation in cortical circuits. eLife 8:e48622.

      Losonczy A, Magee JC (2006) Integrative properties of radial oblique dendrites in hippocampal CA1 pyramidal neurons. Neuron 50:291–307.

      Nakamoto KT, Jones SJ, Palmer AR (2008) Descending projections from auditory cortex modulate sensitivity in the midbrain to cues for spatial position. J Neurophysiol 99:2347–2356.

      Nakamoto KT, Shackleton TM, Palmer AR (2010) Responses in the inferior colliculus of the guinea pig to concurrent harmonic series and the effect of inactivation of descending controls. J Neurophysiol 103:2050–2061.

      Naumov V, Heyd J, de Arnal F, Koch U (2019) Analysis of excitatory and inhibitory neuron types in the inferior colliculus based on Ih properties. J Neurophysiol 121:2126–2139.

      Nwabueze-Ogbo FC, Popelár J, Syka J (2002) Changes in the acoustically evoked activity in the inferior colliculus of the rat after functional ablation of the auditory cortex. Physiol Res 51 Suppl 1:S95–S104.

      Popelár J, Nwabueze-Ogbo FC, Syka J (2003) Changes in neuronal activity of the inferior colliculus in rat after temporal inactivation of the auditory cortex. Physiol Res 52:615–628.

      Popelář J, Šuta D, Lindovský J, Bureš Z, Pysanenko K, Chumak T, Syka J (2016) Cooling of the auditory cortex modifies neuronal activity in the inferior colliculus in rats. Hear Res 332:7–16.

      Silveira MA, Anair JD, Beebe NL, Mirjalili P, Schofield BR, Roberts MT (2020) Neuropeptide Y Expression Defines a Novel Class of GABAergic Projection Neuron in the Inferior Colliculus. J Neurosci 40:4685–4699.

      Slonina ZA, Poole KC, Bizley JK (2021) What can we learn from inactivation studies? Lessons from auditory cortex. Trends Neurosci:S0166-2236(21)00203-4.

      Stoelzel CR, Bereshpolova Y, Alonso J-M, Swadlow HA (2017) Axonal Conduction Delays, Brain State, and Corticogeniculate Communication. J Neurosci Off J Soc Neurosci 37:6342–6358.

      Takahashi H, Magee JC (2009) Pathway interactions and synaptic plasticity in the dendritic tuft regions of CA1 pyramidal neurons. Neuron 62:102–111.

      Weible AP, Yavorska I, Wehr M (2020) A Cortico-Collicular Amplification Mechanism for Gap Detection. Cereb Cortex N Y N 1991 30:3590–3607.

      Williamson RS, Polley DB (2019) Parallel pathways for sound processing and functional connectivity among layer 5 and 6 auditory corticofugal neurons. eLife 8:e42974.

      Yan J, Suga N (1999) Corticofugal Amplification of Facilitative Auditory Responses of Subcortical Combination-Sensitive Neurons in the Mustached Bat. J Neurophysiol 81:817–824.

    2. Reviewer #3 (Public Review):

      Overall, this manuscript is generally nicely written and well-illustrated. I don´t really have any major issues. I like the manuscript but I have a few comments and some issues that need to be addressed.

      My main concern is that the authors claim several times that the projections to the central nucleus of IC are weak and they neglect their potential functional role. I think this is a little bit unfortunate. It is true that the large AC projection primarily targets the cortical regions or shell of IC, but it is beyond doubt that it also targets the central nucleus (e.g. Saldaña's studies) . We cannot know whether it is a weak projection or not without central nucleus recordings. Admittedly, these experiments would be challenging, so I would ask the authors to tone down a bit these comments throughout the ms. Also, the reason for the 'weak' projection to the central nucleus may be due to the size and location of the injections made in the auditory cortex. Thus, I would like to see the injections site of Chronos if possible. Likewise, fig 1B is too small and of low quality (at least in my pdf file for review) to appreciate details of labeling. I would suggest that the authors make a separate figure showing the injection site in the AC and larger and clearer labeling in the IC.<br> Also I wonder if the title of the manuscript should refer to the non-lemniscal IC as most of the data is related to this area.<br> While the dogma is that the descending projections are glutamatergic, the authors may care to consider a recently published paper<br> https://www.frontiersin.org/articles/10.3389/fncir.2021.714780/full, which challenges this view by showing that inhibitory long-range VIP-GABAergic neurons target the IC. It would be interesting if the authors could comment on how this projection may have influenced the results of the present study.

    1. Author Response:

      Reviewer #2:

      In Zhang et al.'s paper, with 7T fMRI, they used different face parts as stimuli to explore the functional organization within the face specific areas, and found consistent patterns between different subjects in rFFA and rOFA. In these areas, the posterior region was biased to eye, and the anterior region was biased to mouth. To exclude potential confounds, they also ran several control experiments to show that the preference to eyes and mouth is not due to the eccentricity or upper-lower visual field preference. Based on what they found, they claim that there exists a finer scale functional organization within the face areas.

      In general, I think the whole study is carefully designed, and the results are solid and interesting. However, I am not very comfortable about the claim about the organization of the face areas. Typically, when we talk about the organization, it either has more than 2 subdivisions or it has a continuous representation of certain features. In this paper, the results are mainly about the comparison between two face parts, and they failed to find other distinctive subareas showing preference to other face parts. Therefore, I would suggest that the authors could tune down their claim from functional organization to functional preference.

      We have followed the advice from the reviewer to tune down the claim of functional organization in our manuscript. To emphasize both the functional preferences to different face parts within face-selective regions and the consistent spatial profile across different individuals, we now use “spatial tuning of face parts” in the manuscript.

      Reviewer #3:

      Zhang and colleagues investigated the spatial distribution of feature tuning for different face-parts within face-selective regions of human visual cortex using ultra-high resolution 7.0 T fMRI. By comparing the response patterns elicited by images of face-parts (hair, eyes, nose, mouth and chin) with whole faces, they report a spatial pattern of tuning for eyes and mouth along the posterior-anterior axis of both the pFFA and OFA. Within the pFFA this pattern spatial tuning appeared to track the orientation of the mid fusiform sulcus - an anatomical landmark for face-processing in ventral temporal cortex. Two additional control experiments are conducted to examine the robustness of the original findings and to rule out potentially confounding variables. These data are consistent with recent evidence for similar face-part tuning in the OFA and add to the growing body of work showing the topographical mapping feature based tuning within visual cortex.

      The conclusions of this paper are mostly supported by the data, but some aspects of the data acquisition, analysis and interpretation that require further clarification/consideration.

      1) It is currently unclear whether the current data are in full agreement with recent work (de Haas et al., 2021) showing similar face-part tuning within the OFA (or IOG) bilaterally. The current data suggest that feature tuning for eye and mouth parts progresses along the posterior-anterior axis within the right pFFA and right OFA. In this regard, the data are consistent. But de Haas and colleagues also demonstrated tuning for visual space that was spatially correlated (i.e. upper visual field representations overlapped upper face-part preferences and vice-versa). The current manuscript found little evidence for this correspondence within pFFA but does not report the data for OFA. For completeness this should be reported and any discrepancies with either the prior, or between OFA and pFFA discussed.

      In the current study, three participants had data from both retinotopic mapping and face part mapping experiments. Consistent and robust part clustering were found in the right pFFA and right OFA. Following the reviewer’s suggestion, we analyzed these data for the right OFA and found the spatial patterns of eyes vs. mouths are similar to the patterns of visual field sensitivity on the vertical direction (i.e., upper to lower visual field), which are consistent with de Haas and colleagues’ findings. Note that we used more precise functional localization of OFA, while de Haas et al’s analysis was based on anatomically defined IOG, for which OFA is a part of. We have added this result in the Results session (Page 16), and also added a supplemental Figure 4-figure supplement 1.

      2) It is somewhat challenging to fully interpret the responses to face-parts when they were presented at fixation and not in the typical visual field locations during real-world perception. For instance, we typically fixate faces either on or just below the eyes (Peterson et al., 2012) and so in the current experiment the eyes are in the typical viewing position, but the remainder of the face-parts are not (e.g. when fixating the eyes, the nose mouth and chin all fall in the lower visual field but in the current experimental paradigm they appear at fixation). Consideration of whether the reported face-part tuning would hold (or even be enhanced) if face-parts were presented in their typical locations should be included.

      Our early visual cortex and some of the object-selective visual areas are sensitive to visual field locations. To dissociate the visual field tuning and face part tuning in face processing regions, in the main experiment of the current study the face part stimuli were presented at fixation to avoid the potential confounding contribution from visual field location. The spatial correlation between face part tuning and visual field tuning has been observed in posterior part of the face network. It is unlikely that presenting the face parts at the fixation was responsible for the observed face part tuning. To directly test the role of stimulus location, we reanalyzed the data from control experiment 2 in which face parts were presented at their typical locations. Contrasting eyes above fixation vs. nose & mouth below fixation revealed similar anterior-posterior bias in the right pFFA, showing that the face part tuning in the right pFFA is invariant to the visual field location of stimuli. See comparison in the figure below, note that the maps of eyes on top vs. nose & mouth on bottom are unsmoothed:

      3) Although several experiments (including two controls) have been conducted, each one runs the risk of being underpowered (n ranges 3-10). One way to add reassurance when sample sizes are small is to include analyses of the reliability and replicability of the data within subjects through a split-half, or other cross-validation procedure. The main experiment here consisted of eight functional runs, which is more than sufficient for these types of analyses to be performed.

      Following the reviewer’s suggestion, we split the eight runs data from each participant in the main experiment into two data sets (odd-runs and even-runs), and estimated the eyes-mouth biases within each data set. Then we calculated the correlation coefficient between such biases across different voxels between the two data sets to estimate the reliability of the results in the right pFFA. The results demonstrate strong reliability of the data within participants. We have added these results in the Results session (Page 7 and Figure 2-figure supplement 1).

      4) The current findings were only present within the right pFFA and right OFA. Although right lateralisation of face-processing is mentioned in the discussion, this is only cursory. A more expansive discussion of what such a face-part tuning might mean for our understanding of face-processing is warranted, particularly given that the recent work by de Haas and colleagues was bilateral.

      The right lateralization of face-processing has been observed in face-selective network. Both the neural selectivity to faces (Kanwisher et al., 1997) and the decodable neural information of faces (Zhang et al., 2015) are higher in the right than in the left hemisphere. The neural clustering of face part tuning and consistent spatial patterns across individuals in the right rather than in the left face selective regions provides a potential computational advantage for right lateralization for face processing. The clustering of neurons with similar feature tuning have been found extensively in the ventral pathway, which may help to support a more efficient neural processing. Therefore, one of the neural mechanisms underlying the functional lateralization of face processing could be the existence of spatial clustering of face part tunings in the right hemisphere. We have added more discussion about the relevance between our results and lateralization of face processing.

    1. Reviewer #1 (Public Review):

      This paper reports features of the development (and subsequent loss) of the egg tooth of the short-beaked echidna (T. aculeatus) at the histological level. Based on these features, the authors then consider the homology of the egg tooth/caruncle of the echidna with those of avian and non-avian reptiles. The authors report that while the echidna egg tooth is first apparent as a Shh-expression epithelial placode, the tooth then takes shape by evagination, rather than invagination, of that placode. This is reminiscent of the first teeth of some reptiles. The authors also find that the echidna egg tooth is anchored directly to the bone of the premaxilla (again, reminiscent of the mechanism of attachment of some reptilian teeth, and unlike the thecodonty seen in mammalian teeth). The caruncle also forms near the premaxillary bone and is associated with a prematurely differentiated and cornified epithelium. Finally, the authors find that the egg tooth is lost via a combination of resorption (by multinucleated TRAP-positive clast cells) and by cell death within the egg tooth pulp, and that the caruncle is lost at some undetermined point between 11- and 50-days post-hatching. Taken together, these findings indicate that the only tooth (albeit a transient one) in the otherwise edentulate echidna more closely resembles the teeth of reptiles than those of eutherian mammals, indicative of remarkable conservation of dental features in monotremes and reptiles from the last common ancestor of amniotes.

      Strengths

      We commend the authors on acquiring a unique and impressive series of embryonic and post-embryonic echidna specimens, and on making the most of these precious specimens by sequentially imaging them for microCT, followed by processing for paraffin histochemistry and/or immunofluorescence. The quality of the histology and image data presented here is high, and the authors effectively use their various data types (CT and section) in combination to provide good and clear anatomical context for their observations. This histochemical stainings presented here are very clear, and easily allows the reader to distinguish tissue types and connectively between elements (e.g., between the dentine of the egg tooth and the premaxilla, and between the os caruncle and the premaxilla).

      Furthermore, by framing their work in a comparative context, the authors can propose homologies between the egg tooth of the echidna and the first forming teeth of some lizards and crocodilians. Monotremes possess a fascinating melange of anatomical features classically regarded as "mammalian" or "reptilian", but these are extremely difficult to study developmentally. This work is a significant contribution in this regard and highlights the importance of monotreme developmental data when reconstructing the nature of the last common ancestor of amniotes.

      Weaknesses

      The introduction of the paper is a bit too long (and, at times, unfocused). Given the succinct nature of the results, the paper would benefit from a more focused and streamlined introduction.

      While the embryonic samples studied here are understandably limited (and sample sizes necessarily small), there are nevertheless claims made here that are not fully supported by the figures. In most instances, this is a case of a lack of high-magnification panels in the plates illustrating, for example, the features of the odontoblast layer, the ameloblast-like cells at the tip of the tooth, etc. These features are discussed, but not shown.

      The story around the caruncle isn't fully developed. It is introduced as though there has been some debate about whether the element forms as a distinct condensation from the premaxilla, but then this is not revisited. Also, the rationale for the choice of molecular markers used to characterise the epithelial component of the caruncle isn't entirely clear. The authors state that Loricrin is a marker of "terminal differentiation" - but does this mean that the loricrin-expressing epithelium adjacent to the caruncle skeleton is just farther along in its development relative to adjacent epidermis? Or is loricrin a specific marker of "cornified epithelia"? And if the latter, has loricrin expression been examined in the developing caruncles of avian or non-avian reptiles?

      Finally, the evolutionary synthesis presented here seems reasonable with respect to the egg tooth but remains a bit less clear with respect to the caruncle. The authors conclude that the os caruncle may be a novelty of monotremes, but that the epithelial caruncle may be homologous between monotremes and reptiles - but then suggest that the last common ancestor of amniotes had both structures? It is difficult to follow this logic. I think that that paper would benefit from a more nuanced "final model" or hypothesis of homology of egg teeth and caruncles across amniotes.

    1. Author Response:

      Reviewer #3 (Public Review):

      In this manuscript the authors make several conclusions, according to the abstract:

      1 - LTG activity is essential by contributing to a process independent of PG recycling.

      2 - LTGs are important because of their catalytic activity rather than because of a protein-protein interaction.

      3 - LTG mutants are hypersusceptible to production of periplasmic polymers.

      4 - LTGs prevent toxic periplasmic crowding and their function is temporally separate from PG synthesis.

      The authors perform a series of genetic experiments that lead to their conclusions. Their first conclusion is well supported by data showing that a PG recycling mutant does not have the same defects as their LTG mutant.

      Their second conclusion needs more justification/explanation. They show a catalytic mutant of RlpA is unable to sustain growth as the only LTG in the cell. However, I am confused by their wording around RlpA in general. In the text they note that their delta_7 mutant, which encodes RlpA, 'has no highly active LTGs' (lines 130-131). Does that imply that RlpA is not an LTG? In the discussion they note that E.coli RlpA has no LTG activity. Is this enzyme known to have LTG activity in V.cholerae? One important control would be to show that the catalytically inactive protein is stable (i.e. that the defect is not due to protein misfolding). This could be supported by looking at protein stability via Western or even quantifying the fluorescence data in Figure S3b.

      Alignment of VcRlpA with P. aeruginosa RlpA, which has been demonstrated in vivo and in vitro to be an active LTG, suggests VcRlpA retains the active site residues required for PG cleavage. This, as well as the inability of a VcRlpA^D145A mutant (based on the alignment with catalytically inactive EcRlpA) to rescue native RlpA depletion from the ∆LTG mutants suggests that VcRlpA is an active LTG and that this activity is required in the absence of all other annotated V. cholerae LTGs. We agree that “no highly active LTGs” is confusing and we have changed the text to simply describe the ∆7 LTG mutant as being significantly depleted in LTG activity as measured by anhMurNAc abundance in the sacculus. Lastly, we have conducted Western Blots demonstrating in the revised manuscript that our catalytic site mutant is indeed produced and stable (Figure S3).

      Their third conclusion also needs more support. The authors do a series of experiments showing that delta7 is more susceptible to SacB. What are the data that show sacB produces large polysaccharides molecules in the periplasm rather than (or in addition to) the cytoplasm? This would be important to show as these data are the main test of the authors model.

      In native B. subtilis as well as in E. coli, SacB has a canonical Sec signal peptide which is annotated as being cleaved after residue Ala29 (Uniprot G3CAF6_BACIU) to be released extracellularly. A reference (Pereira, et al, 2001) has been added in support of SacB functioning extracellularly and not in the cytoplasm of its native host, B. subtilis.

      The authors have other data that all argue for their model that LTG deficient strains have an excess of periplasmic crowding. The suppressor of delta_opgH is intriguing, but does not restore the morphological defects in delta_7, suggesting that the increase in length during prolonged growth may not be caused by periplasmic crowding, or at least is not alleviated by deletion of OpgH. What then does the deletion of OpgH suppress? Here, I was confused by the experiments in low salt. The authors write that the cells lyse (line 222) but this is not shown anywhere. Growing the cells continually in low salt may not be the hypoosmotic challenge the authors presume. A challenge typically implies an acute change in osmolarity, rather than a prolonged exposure, which may allow cells to adapt.

      We do not fully understand the role of OpgH, but here is our working model: LTGs have at least two essential functions – 1) PG release and 2) mitigating periplasmic crowding, either or both of which can become more important based on osmotic conditions. Since MltG seems to be the main PG release factor (at least based on E. coli), which can be partially supplanted by collective action of other LTGs, the ∆7 suffers from both PG release defects and periplasmic crowding defects, perhaps more so in an osmotically challenging low salt medium. The evidence for lysis is that at high inoculum (10^-2) the ∆7 LTG mutant does grow for a short time, but then we observe a drop in OD_600, indicative of lysis. According to our model, ∆6, on the other hand, which still has MltG, likely suffers only (or mostly) from a periplasmic crowding defect. Deleting periplasmic glucans only mitigates periplasmic crowding (and probably only partially), which does not help the more defective ∆7, which additionally suffers from lack of the postulated second activity.

      The reviewers raise an interesting point regarding the word “challenge”. We indeed specifically make the point that this is not an acute challenge, but rather accumulating damage during prolonged growth, even in salt-free LB. We have thus removed the word “challenge” from the revised manuscript. Importantly, we only use the ∆opgH suppression phenotype as one of many puzzle pieces for our conclusion. The key assay is the direct demonstration of periplasmic soluble PG strands accumulating in both WT and, to a higher degree, the ∆6 LTG mutant (Fig. 6).

      I was also highly confused by the antibiotic + BADA staining experiments. Do the authors stain the cells, treat, and then visualize? Are they then studying the fate of old PG? How does BADA get incorporated into PG in V.cholerae? Is it through LDT activity or some other way? Without more explanation, it is hard to interpret the results.

      BADA does get incorporated through either LDT or PG synthesis activity in V. cholerae, but for these experiments, the specific incorporation pathway is inconsequential, since we only focus on the end product (stained PG). We think that what we visualize is not the fate of old PG (otherwise we would see similar strong stains with Fosfomycin, which inhibits cell wall synthesis upstream of PG strand generation by PBPs/SEDS), but rather visualizes the generation of long, uncrosslinked PG strands due to the inhibition of PBP transpeptidase activity. We have added more explanations of this assay to the revised manuscript.

      The last conclusion is not supported by data. There are no data showing that LTG activity is temporally separate from PG synthesis.

      We would like to point out that this is not framed as a conclusion per se, but rather a plausible speculation. Our data showing soluble strand accumulation in the WT strongly suggest that LTGs do not work in perfect harmony with synthesis, but rather degrade strands AFTER they accumulate (i.e., temporally separate). We further believe that complementation with a heterologous enzyme (MltE), which does not have a homolog in V. cholerae strongly argues that LTGs and PG synthesis do not have to associate through protein-protein interactions. All this adds to an emerging model that PG synthesis and LTG-mediated degradation are not as tightly co-ordinated as one might assume.

    1. Author Response:

      Reviewer #1:

      Authors introduce a deep learning-based toolbox (ELEPHANT) to provide ease in annotation and tracking for 3D cells across time. The study takes two datasets (CE and PH) to demonstrate the performance of their method and compare it with two existing 3D cell tracking methods on segmentation and accuracy metrics. 3D U-Nets are shown to be performing well in segmentation tasks in recent years, authors also utilize 3D U-Net for segmenting cells as well as linking the nuclei across time through optical flow. The variation in selected datasets is shown to be in the shape, size and intensity of cells. Beyond segmentation, authors also demonstrate the performance of ELEPHANT in exploring the tracking results with and without optical flow and regenerating their fate maps. A complete server-based implementation is provided with detailed codebase and docker images to implement and utilize ELEPHANT.

      Strengths:

      The paper is technically sound with detailed explanation of each methodological step and results. 3D U-Nets are optimized for the segmentation task in hand with large training sessions, efficiency of the pipeline is nicely demonstrated which serves this as a useful toolbox for real-time annotation and prediction of cell structures. The detailed implementation on a local and remote server is presented which is a need while handling and analyzing large scale bio-imaging datasets. Beyond smoothing, SSIM-based loss is effectively applied to make the model robust against intensity and structural variations which definitely helps in generalized performance of the segmentation and tracking pipeline.

      Segmentation results are validated on a large set of nuclei and links which is helpful to understand the limitation of the models. The advantage of using optical flow-based linking is clearly shown on top of using nearest neighbors. Spatio-temporal distribution of cells on a given data guides the users in using the framework for several biological applications such as tracking the lineage of newly born cells - a hard task in stem cell engineering.

      A detailed implementation on both remote and server as well as open-source codebase on Github is well provided for the scientific community which will help the users to easily use ELEPHANT for specific datasets. Although CE and PH datasets are used to demonstrate the performance, however, similar implementation can also be performed on neuronal datasets that would be of much use in exploring neurogenesis.

      Weaknesses:

      Authors use ellipse-like shapes to annotate the data, however, many cells are not elliptic or circular in shape but consist of varying morphology. If the annotation module is equipped with drawing free annotations then it will be better useful to capture the diverse shapes of cells in both training and validation. This also limits the scope of the study to be used only for cells' datasets that are circular/elliptical in shape.

      ELEPHANT can be used to track nuclei or cells of diverse shapes. Tracking is based on reliable detection of nuclei/cells but does not require precise segmentation of their shapes. We have now added results showing that ellipsoid approximations are sufficient for detection and cell tracking, even when tracking cells with complex and variable shapes (figure 3).

      As we now explain in the manuscript (page 4), we use ellipsoids for annotation because they are essential for rapid and efficient training and predictions, which are the backbone of interactive deep learning. In practice, using ellipsoids also reduces the amount of work required for annotating the data compared with precise drawing of cell outlines. Post-processing can be appended to our workflow if a user needs to extract the precise morphology of cells.

      Authors use 3D U-net for segmentation which is a semantic segmenter, perhaps, an instance-based 3D segmenter could be a better choice to track the identity of the cells across time and space. However, an instance-based segmenter may not be ideal for segmenting the cells boundaries but a comparison between a 3D U-Net and an instance-based 3D segmenter on the same datasets will be helpful to evaluate.

      Although the original 3D U-Net is a semantic segmenter, we use its architecture to estimate the center region of cells, which works as an instance-wise detector. A similar strategy was followed by recent techniques (Kok et al. 2020, PLoS One doi:10.1101/2020.03.18.996421, Scherr et al. 2020 PLoS One doi:10.1371/journal.pone.0243219) to identify cell instances. Instance-based segmenters (e.g. StarDist, Mask R-CNN) are particularly useful for precise segmentation but our primary focus here is detection and tracking, which can be done most efficiently with the current architecture. Because StarDist or Mask R-CNN do not support sparse annotations, a direct comparison of these methods is difficult at the moment.

      The selected datasets seem to be capturing the diversity in shape and intensity, however, the biological imaging datasets in practice often have low signal to noise ratio, cell density variation and overlapping, etc. It seems like the selected datasets lack these diversities and a performance on any other data of such kind would be useful for performance evaluation as well as providing a pre-trained model for the community usage. Moreover, it would also be useful to demonstrate the performance of the framework in segmenting+tracking any 3D neuronal nuclei dataset which will broaden the scope of the study.

      The PH dataset that we used for testing ELEPHANT presents many challenges, such as variations in intensity, areas of low signal to noise ratio, densely packed and overlapping nuclei (see manuscript page 7, Suppl. Figure 5). To add to this analysis, we have now applied our method to additional datasets that show diverse characteristics – including datasets with elongated/irregular-shaped cells from the Cell Tracking Challenge (Figure 3E) and organoids imaged by light and confocal microscopy (Figure 3C,D) – demonstrating the versatility of our method. We do not think that neuronal nuclei present a particular challenge for ELEPHANT (the PH dataset includes neurons).

      We now also provide a pre-trained model, trained with diverse image datasets, which can be applied by users as a starting point for tracking on new image data.

      The 3D U-Nets are used for linking by using the difference between two consecutive images (across time) as labels. However, this technique helps to track the cell in theory but may also result in losing cell identity when cells are overlapping or when boundary features are less prominent, etc. Perhaps, a specialized deep neural network such as FlowNet3D could be a better choice here.

      Our 3D U-Net does not directly generate links across consecutive images. Instead it produces voxel-wise optical flow maps for each of the three dimensions, which are then combined with detection results to predict the position for each object (see manuscript page 6 and Methods). This is then used for linking. The identity of the tracked objects is defined during detection.

      In the end, our approach is similar to FlowNet3D in that both estimate optical flow for each detected object, although we use two consecutive images as input instead of the sets of detected objects. FlowNet3D operates only on object coordinates, without taking into account image features that could be important cues for cell tracking (e.g. fluorescence intensity of nuclei during cell division).

      Reviewer #2:

      The authors created a cell tracking tool, which they claimed was user-friendly and achieved state-of-the-art performance.

      Would a user, particularly a biologist, be able to run the code from a set of instructions clearly defined on the readme? This was not possible for me. I am not familiar with Java or Mastodon, but I'm not sure we can expect the average biologist to be familiar with these tools either. I was very impressed by the interface provided though.

      We have updated the user manual and software interface to make the software more accessible for users. Moreover, ELEPHANT is now available as an extension on Fiji, which will greatly facilitate its adoption by non-expert users.

      Did the authors achieve state-of-the-art performance? It is unclear from the paper. It would be helpful to see comparisons of this tool with modern deep learning approaches such as Stardist. Stardist for instance reports performance on the parhyale dataset in their paper. Many people in the field are combining tools like Stardist with cell tracking tools like trackmate (e.g. see https://www.biorxiv.org/content/10.1101/2020.09.22.306233v1). It would be important to know whether one can get performance comparable to Stardist (at e.g. a 0.5 IoU threshold) on a single 3D with this sparse labelling and interactive approac. I still think this approach of using sparse labelling could be very useful for transferring to novel datasets, but it is difficult to justify the framework if there is a large drop in performance compared to a fully supervised algorithm.

      The novelty in ELEPHANT is making deep learning available for cell tracking and lineaging by users who do not have extensive annotated datasets for training. Existing deep learning applications (including StarDist) do not fulfill this purpose.

      The detection and tracking scores of ELEPHANT in the Cell Tracking Challenge (identified as IGFL-FR) were the best when applied to cell lineaging on C. elegans test datasets, compared to a large number of other tracking applications (http://celltrackingchallenge.net/latest-ctb-results/). This comparison includes methods that employ deep-learning.

      ELEPHANT models trained with sparse annotation perform similarly well to trained StarDist3D models for nuclear detection in single 3D stacks (see Supplementary Figure 8). For cell tracking over time, StarDist and Trackmate have so far only been implemented in 2D.

      Reviewer #3:

      This work describes a new open source tool (ELEPHANT, https://elephant-track.github.io/) for efficient and interactive training of a deep learning based cell detection and tracking model. It uses the existing Fiji plugin Mastodon as an interactive front end (https://github.com/mastodon-sc/mastodon). Mastodon is a large-scale tracking and track-editing framework for large, multi-view images. The authors contribution is an extension of Mastodon, adding automated deep learning based cell detection and tracking. Technically, this is achieved by connecting the Mastodon as a client (written in Java) to a deep learning server (written in Python). The server can run on a different dedicated computer, capable of the GPU based computations that are needed for deep learning. This framework makes possible the detection and tracking of cells in very large volumetric data sets, within a user friendly graphical user interface.

      Strengths:

      1) It is great to reuse an existing front-end framework like Mastodon and plug in a deep learning back-end! Such software design avoids reinvention of the wheel and avoids that users need to learn too many tools.

      2) The idea to use sparse ellipsoids as annotations for cell detection is in my view fantastic as it allows very efficient annotation. This is much faster than having to paint dense 3D ground truth as is required for most deep learning algorithms.

      3) It is great that the learning is so fast that it is essentially interactive!

      Opportunities for improvements:

      The software in its current form had a view issues that made it a little hard to use. It would be great if those could be addressed in future versions.

      1) There are several options for how to set up the ELEPHANT server. In any case this requires quite some technical knowledge that may prevent adoption by a broader user base. It would thus be great if this could be further streamlined.

      We thank reviewer 3 for the very useful and detailed suggestions on improving the user interface of ELEPHANT. We have implemented most of these suggestions and we plan to pursue additional ones in future versions of the software. In brief:

      • To facilitate the setting up of the ELEPHANT server, we have implemented a control panel that allows users to monitor the process and provides links to the relevant section of the user manual and to Google Colab.
      • ELEPHANT is now available as an extension on Fiji, which will greatly facilitate its use by non-expert users.
      • Pre-trained detection and linking models, trained on diverse image datasets, are now available on the ELEPHANT github.
      • Image data can be uploaded and converted automatically via the Fiji/Mastodon interface when the image data files are missing on the server.

      2) For a GUI based software it is becoming state-of-the-art to provide recorded videos that demonstrate how to use the software. This is much more telling than written text. The authors added very nice short videos to the documentation, but I think it would be essential to also provide a longer video (ideally with voice over) where the authors demonstrate the whole workflow in one go.

      We are preparing a demo video on YouTube, which will be embedded in the user manual.

      3) As a user one interacts with the Mastodon software which sends requests to the ELEPHANT client. It would be great if the feedback for what is going on server side could be improved. For example adding progress bars and metrics for the process of the deep learning training that are visualized within Mastodon would be, in my view, very important for the usability.

      We added a log window in which users can monitor the processes that are running on the server.

    1. a lot of people start with learning and then they build things and then they close the circle but there's one key piece missing here and some people hate the word but you 00:29:54 learn to love it eventually it's called marketing and marketing means a lot of things to a lot of people but what it means to me is getting the word out because someone else will if you don't and 00:30:05 you are awesome you just have to realize that maybe not everyone knows right away so you should really talk about it more maybe at conferences see what i did there 00:30:17 um maybe on twitter maybe you can just tell your friends and maybe you can ask people to contribute and to support you like what's wrong with that somehow it's frowned upon in the community that if you do 00:30:30 marketing you're not doing it for real but i think that's not true um i think that if smart people and patient and um passionate people as well 00:30:44 if they did marketing then the world would be a better place because i'm pretty sure the evil guys do marketing so do your homework

      Marketing is very critical but it has negative connotations in the open source community because it is associated with mainstream business , after all, marketing is derived from the word "market".

      Perhaps it is better to think in psychological terms. If we have a great idea, the internet is a way to reach billions of eyeballs. Everyone is, in a sense, forced to compete in an attention economy. Instead of marketing, we can also use the words "attracting attention", because that is really what we are trying to do, be an attention attractor.

      The Indieverse, being developed by knowledge architect Gyuri Lajos, offers an alternative to marketing. Marketing is an attention attractor that relies on a "push" strategy. We are making content and pushing it out to different parts of the world we think may resonate with us to attract attention.

      Instead, the Indieverse, with its built in read and write provenance can act like a "pull" attention attractor. People can discover you through the built in discoverability aspects of the indieverse. Unlike the private sector, which uses this pull method to try to match you to stuff they want to sell you, Indieverse inegrates tools that exposes relevant content to you. If that content has demonstrably improved your life, which can be tracked through your public sharing, you can sponsor or reward that content. Microsponsorship can even be built in.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their helpful, detailed and insightful comments. We have modified the figures and rewritten large sections of the manuscript following the reviewers’ suggestions. In addition, we have incorporated new data throughout the manuscript and figures to clarify and better support our conclusions. All of these changes have significantly improved the coherence, consistency and clarity of our data, and have allowed us to better communicate the advance our findings represent for the fields of splicing and muscle development.

      Please find a point-by-point response to the reviewers’ comments below. The reviewers’ comments are in black and italics.

      Response to Reviewer 1* Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Rbfox proteins regulate skeletal muscle splicing and function and in this manuscript, Nikonova et.al. sought to investigate the mechanisms by which Rbfox1 promotes muscle function in Drosophila.

      Using a GFP-tagged Rbfox1 line, the authors showed that Rbfox1 is expressed in all muscles examined but differentially expressed in tubular and fibrillar (IFM)muscle types, and expression is developmentally regulated. Based on RNA-seq data from isolated muscle groups, the authors showed that Rbfox1 expression is much higher in TDT (jump muscle) than IFM.

      Using fly genetics authors developed tools to reduce expression of Rbfox1 at different levels and the highest levels of muscle-specific Rbfox1 knockdown was lethal and displayed eclosion defects (deGradFP > Rbfox1-IRKK110518 > Rbfox1-RNAi > Rbfox1-IR27286). Consistently, Rbfox1 knockdown flies have reduced jumping and climbing phenotypes, due to tubular muscle defect where Rbfox1 is expressed at higher levels. Rbfox1 knockdown in IFM caused flight defects which have been shown previously. Further characterization of IFM and tubular muscles demonstrated a requirement of Rbfox1 for the development of myofibrillar structures in both fibrillar (IFM) and tubular fiber-types in Drosophila. Interestingly, knockdown or overexpression of Rbfox1 displayed hypercontraction phenotypes in IFMs which is often an end result of misregulation of acto-myosin interactions which was rescued by expression of force-reduction myosin heavy chain (Mhc, P401S), in the context of Rbfox1 knockdown (the rescue experiment could not be performed with Rbfox1 overexpression due to complex genetics).

      Authors also performed computation analyses of the Rbfox binding motifs in the fly genome and identified GCAUG motif in 3,312, 683, and 1184 genes in the intronic, 5'UTR, and 3'UTR, respectively. These genes are enriched for factors that play important roles in muscle function including transcription factors (exd, Mef2, Salm), RNA-binding proteins (Bru1), and structural proteins (TnI, encoded by wupA). Many of these gene transcripts and proteins are affected in flies with reduction or overexpression of Rbfox1. Using fly genetics, authors propose and test different mechanisms (co-regulation of gene targets by Rbfox1 and Bru1), and regulators of muscle function (exd, Me2, Salm) and structural proteins (TnI, Mhc, Zasp52, Strn-Mlck, Sls) by which these changes could affect the muscle function.

      *Overall, the characterization of Rbfox1 phenotypes and myofibrillar structure is very well elucidated, mechanisms by which Rbfox1 affects muscle function are not clear and remain largely speculative. We thank the reviewer for the positive evaluation of our phenotypic analysis of Rbfox1 knockdown in multiple muscle fiber types. This manuscript is the first detailed characterization of Rbfox1 in Drosophila muscle, extending far beyond our previous finding that Rbfox1-IR flies are flightless. Beyond behavioral and cellular phenotypes, we report that there are regulatory interactions between Rbfox1, Bruno1 and Salm and identify other Rbfox1 targets in flies. We acknowledge that there are molecular and biochemical details of specific regulatory mechanisms that remain to be elucidated, but this paper provides many foundational observations to guide future biochemical experiments and is thus important to the muscle field.

      \*Major comments**

      *1. The varying level of Rbfox1 knockdown (deGradFP > Rbfox1-IRKK110518 > Rbfox1-RNAi > Rbfox1-IR27286) was achieved by different strategies without validation at the protein level (likely due to lack of a Rbfox1 antibody). It is important to show different Rbfox1 protein level (at least with different RNAi), especially when authors propose that autoregulation of Rbfox1 causes increased level Rbfox1 transcript in case of Rbfox1-RNAi (mild knockdown). Autoregulation of Rbfox1 in mammalian cells may not be similar in flies.

      To address this comment, we have toned-down the discussion of level-dependent regulation throughout the manuscript, and have removed claims of Rbfox1 autoregulation. We appreciate the reviewer’s point that it would be ideal to be able to determine the protein levels of Rbfox1 in the different knockdown conditions. We have tested the published antibody against DmRbfox1, but it is very dirty and we see multiple bands in Western Blot. This background partially obscures the bands from 80-90 kDa at the molecular weight where we expect Rbfox1, and prevents accurate quantification (see Reviewer Figure 1). Verification of protein levels of Rbfox1 will require generation of a new antibody which is beyond the scope of this study. As we do not have a good antibody, we performed two experiments to demonstrate our ability to tune knockdown efficiency. First, we crossed Rbfox1-IRKK110518 and Rbfox1-IR27286 to UAS-Dcr2, Mef2-Gal4 and demonstrated we could enhance the phenotype (Figure 2A, B). Second, we performed knockdown with the same hairpins at different temperatures and demonstrate that stronger knockdown at higher temperature leads to stronger phenotypes with the same hairpin

      (Figure 2B). This data supports our knockdown series interpretation.

      Reviewer Figure 1. Western Blot of whole fly with anti-Rbfox1 (A2BP1) (Shukla et al., 2017). Tubulin was blotted as a loading control.

      • TnI and Act88F protein levels are inversely correlated with Rbfox1 level in IFM but did not correlate with the RNA level. Using RIP authors showed that Rbfox1 was shown to bound to wupA transcripts (has Rbfox binding sites) but not Act88F transcripts (does not have Rbfox binding sites). Authors performed Rbfox1 IP and identified co-IP of components of cellular translational machinery and propose that wupA (TnI) levels are regulated by translation or NMD (non-sense mediated decay). A follow up experiment was not performed to identify the mechanism by which TnI level is regulated by Rbfox1. *

      Further biochemical and genetic verification of the underlying mechanisms of Rbfox1 regulation in Drosophila muscle will be addressed in a future manuscript, as in vivo modulation of translation or NMD in an Rbfox1 knockdown background involves recombination to coordinate multiple genetic elements. We have modified the text to reflect this hypothesis remains to be explored in future experiments (Line 473-474).

      We have further added RT-PCR data for wupA transcript levels in IFM and TDT with Rbfox1-IRKK110518 knockdown (Figure S4 A), but as in Rbfox1-RNAi flies, there is not a significant change in expression. We do see significant downregulation of Act88F when we overexpress Rbfox1 in IFM (Figure S4 B), as well as in TDT when we knockdown Rbfox1 with either Rbfox1-IRKK110518 or Rbfox1-IR27286.

      It was known that TnI mutations (affects splice site, fliH or Mef2 binding site, Hdp-3) led to a reduction in TnI level and hypercontraction. Authors showed rescue of hypercontraction phenotype in hdp-3 background by knocking down Rbfox1, likely due to increase in wupA transcription (Mef2-dependent or independent manner). However, no rescue was observed in the fliH background. Reduced level of Rbfox1 in fliH background would be expected to cause worsening of phenotype as splicing of remaining wupA transcripts would be affected with reduced Rbfox1 level. The splicing of wupA of exon 4 is not affected in Rbfox1 knockdown (fig. 6U), it's not clear if the splicing of exon 6b1 is affected in Rbfox1 knockdown.

      We thank the reviewer for pointing out our lack of clarity regarding exon 6b1 and IFM-specific isoform 6b1. To address this comment and validate our previous data, we performed additional Sanger sequencing on RT-PCR products, added a diagram of the wupA gene region in Figure 4 A and improved the clarity of our discussion of the fliH and hdp3 alleles and our results in the text.

      To directly respond to the reviewer, first, it is unclear if the reduced level of Rbfox1 in a fliH background should actually cause a more severe phenotype. Our data suggests that Rbfox1 represses TnI expression through binding the 3’-UTR, and can likely indirectly regulate wupA expression level via Mef2. Thus, arguably, the reduced level of Rbfox1 in the fliH background might not affect splicing, as the mutations in the regulatory element should rather make wupA insensitive to increased Mef2 expression in the Rbfox-RNAi background.

      Second, we confirmed via Sanger sequencing of RT-PCR products that both IFM and TDT in control and Rbfox1-IR flies use exon 6b1 (current exon 7). The IFM isoform contains exon 3, 6b1 and 9, while the TDT isoform contains exon 3 and 6b1, but skips exon 9 (see Figure 4 A). In other tubular muscles, wupA isoforms skip exons 3 and 9, and use exon 6b2 instead of 6b1. Thus, to directly answer the reviewer’s question, no, splicing of exon 6b1 itself is not affected by Rbfox1. However, Rbfox1 does influence expression of the ”6b1 isoform”, or the wupA isoforms in IFM and TDT containing exon 6b1 and exon 3. Additionally, our data shows that Bru1, not Rbfox1, regulates alternative splicing of wupA exon 9 (Fig. S6 T).

      What the reviewer has correctly identified with this comment is that the effect on splicing in the hdp-3 allele also appears to be complex and to have not been fully clarified. Although hdp-3 results from mutation of a splice site in exon 6b1 (which based on (Barbas et al., 1993) results in aberrant use of 6b2 in IFM), it also results in a near complete absence of the longer isoform containing exon 3 in adult flies. hdp-3 is reported in the same paper to affect both IFM and TDT, which both express isoforms containing exon 3 and 6b1. It is not known how mis-splicing of exon 6b1 leads to loss of isoforms containing exon 3, but our data indicate that Rbfox1 is somehow involved. It is purely speculation and beyond the scope of this manuscript, but perhaps selection of alternative exons in wupA are not independent events (ie that the splicing of exon 3 depends on correct splicing of exon 6b1). This could be mediated with interactions with chromatin, the PolII complex or through a larger splicing factor complex (something like LASR, for example (Damianov et al., 2016)), that restricts choice in alternative events through higher-order interactions. Another possible mechanism is that a second mutation exists in the hdp-3 allele that affects splicing of exon 3, although this was not indicated in the extensive sequencing data in (Barbas et al., 1993).

      Bruno1 was identified as a co-regulator of Rbfox1 in different IFM and tubular muscle types. However, except Mhc, other Rbfox1 targets seem to be regulated by either Rbfox1 or Bruno1, not both. Analyses of RNA-seq datasets from single and double knockouts should identify additional targets to support the claim that - Rbfox1 and Bruno1 co-regulate alternative splice events in IFMs. Phenotypic changes with reduced Rbfox1 and Bruno1 double knockdowns are very severe, but the mechanistic basis of such genetic interaction resulting in synergistic phenotypes in IFMs is lacking as splicing changes in single vs double knockout is similar.

      We agree with the reviewer that RNA-seq data would be useful to obtain a genome-wide perspective on the regulatory interactions between Rbfox1 and Bru1, and we plan to generate this data as part of a future manuscript. However, the tissue-specific dissections to isolate enough material from all of the necessary genotypes will take months to complete, and are not realistic to wait to include in this manuscript. Instead, to address the reviewer’s question, we have expanded our RT-PCR experiments to cover a wider panel of events in 12 sarcomere genes (see new data in Figures 6 and S6 and summary in Figure 8). We now can show that splice events in Fhos and Zasp67 are Rbfox1 dependent, while events in sls, Strn-Mlck and wupA are Bru1 dependent. An event in Zasp66 responds to both Rbfox1 and Bru1, but in opposite directions. Events in Mhc, Tm1 and Zasp52 are regulated by both Rbfox1 and Bru1 (or are sensitive to changes in Bru1 expression in the Rbfox1 background), and change in the same direction. This data provides a clearer mechanistic basis for the synergistic phenotype observed between Rbfox1 and Bru1 in IFM.

      Rbfox1 is expressed at a high level in tubular muscle whereas Bruno1 is expressed at a high level in IFM. Rbfox1 binds to Bruno1 transcript and inversely regulates Bru1-RB level but knockdown of Bru1 does not affect Rbfox1 level (Fig. S5 G,I,J). Overexpression of Bruno1 decreased the Rbfox1 level, however, it's difficult to interpret these results as overexpression of Bruno1 may have other effects on IFM gene expression.

      The reviewer correctly pointed out that we did not observe significant changes in Rbfox1 mRNA levels in the mutant bru1M3 background, however, in the original version of this manuscript, we also showed a significant decrease in Rbfox1 expression in IFM from the bru1-IR background at both 72 h APF and 1 d adult in mRNA-Seq data. To clarify differences in Rbfox1 levels between bru1-IR and our bru1 mutant backgrounds, we have performed additional RT-PCR experiments. We examined Rbfox1 levels after knockdown of bru1 (bru1-IR), and we now show that Rbfox1 levels are significantly decreased in IFM and TDT after bru1-IR (Fig. 5S, Fig S5 I). We see a weaker effect in the bru1M2 hypomorphic mutant, which likely reflects differences in Bru1 expression levels in bru1-IR and the bru1M2 allele. These results are consistent with the mRNA-seq data we presented previously (now in Fig. 5R). These additional data suggest that loss as well as gain of Bru1 affects Rbfox1 expression levels.

      A dose-dependent effect of Rbfox1 knockdown was shown to regulate the expression of transcription factors that are important for muscle type specification and function including exd, Mef2, and Salm. However, it is not clear how Rbfox1 mechanistically regulates the expression of these transcription factors.

      We present two pieces of data suggesting possible regulatory mechanisms for Mef2. First, RIP data suggest Rbfox1 can directly bind the 3’-UTR region of Mef2, and this region contains two binding motifs identified in both the oRNAment database and in our PWMScan dataset. Second, we show that use of the 5’-UTR regions of Mef2 is altered in Rbfox1-IR muscle. Although not definitive, this suggests that regulation of alternative 5’-UTR use may influence transcript stability or translation efficiency. We feel the many experiments to elucidate the detailed mechanism of regulation (and indeed to determine the likely contribution of multiple, layered regulatory processes) are beyond the scope of this paper, and are better left for future studies. This manuscript is the first in-depth characterization of Rbfox1 function in Drosophila muscle, and we provide multiple lines of evidence suggesting that different regulatory mechanisms exist as a basis for future experiments to explore these interesting and important regulatory interactions.*

      **Minor comments**

      1. It is not described if the rescue of Rbfox1 knockout by expression of force-reduction myosin heavy chain (Mhc, P401S) led to rescue of phenotypes (jumping, climbing, flight). *

      Force-reduction myosin heavy chain MhcP401S is a mutation at the endogenous Mhc locus that results in a headless myosin and was previously characterized to be flightless (Nongthomba et al., 2003). It is however able to rescue jumping and walking defects observed with the hdp2 TnI allele, and supports largely normal myofibril assembly (Nongthomba et al., 2003). It is also important to note that fibrillar muscle function is very finely tuned, such that alterations that result in flightlessness in many cases do not alter myofibril structure as detected by confocal microscopy (Schnorrer et al., 2010). We therefore looked at myofiber and sarcomere structure as a more sensitive read-out of the rescue ability in the Rbfox1 knockdown, to be able to detect a partial-rescue of myofibrillar structure that may not be evident in a behavioral assay.

      Immunofluorescence (IF) and Western blotting are different techniques, and Bruno1 antibody was validated for specificity in IF but not in Western blots. Figure 5L and S5 E should include muscle samples from Bru1M2.

      We have added a Western Blot panel in Figure S5 D including bru1-IR, bru1M2 and samples of different wild-type tissues including abdomen, ovaries, testis and IFM.

      To quantify alternative splicing or percent spliced in (PSI), primers are typically designed in the exons flanking the alternative exons. A better primer design along with PSI calculation by RT-PCR will robustly validate alternative splicing changes in different genetic background (Fig 6U and S6 U).

      We do not yet have RNA-Seq data from these Rbfox1 knockdown samples to facilitate calculation of transcriptome-wide PSI values; thus, we rely on the results from our RT-PCR experiments. Our primers used to detect alternative splice events are indeed located within flanking exons or as close to the alternative exons as possible based on sequence design limitations (see schemes in Figure 6 and Figure S6). Many of the events we are detecting are complex, and not a simple “included” or “excluded” determination, and are therefore not amenable to RT-qPCR. To increase the robustness of our validation, we now provide RT-PCR gel-based quantification of exon use for the events we tested in Zasp52, Zasp66, Zasp67, wupA and Mhc (Figure 6 U-W and Figure S6 T-U).*

      Reviewer #1 (Significance (Required)):

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Understanding how muscle fiber type splicing and gene expression is regulated will conceptually move the field forward. How transcriptional and posttranscriptional programs coordinate to specify muscle fiber type gene expression is still lacking.

      Place the work in the context of the existing literature (provide references, where appropriate). Multiple RNA binding proteins and splicing factors have been shown to affect muscle function along with hundreds of gene expression and splicing changes in a complex fashion. Linking phenotypes with gene expression changes is still challenging as RNA binding proteins or RBPs are multifunctional and affect the function of other regulators that are important for muscle biology. *We thank the reviewer for recognizing the conceptual advance our findings represent, as well as the complexity in the regulatory network we are seeking to understand. A detailed understanding of the coordination of transcriptional and posttranscriptional programs is enabled by our work and will be the subject of future investigation.

      * State what audience might be interested in and influenced by the reported findings.

      Fly genetics, alternative splicing regulation, muscle specification and function.

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Regulation and function of alternative splicing in muscle. I do not have a thorough knowledge of Drosophila genetics.


      Response to Reviewer 2 Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary**

      This paper reports analysis of the function of RbFox1, an RNA-binding protein, best known for roles in the regulation of alternative splicing. It uses Drosophila as its in vivo model system, one that is highly suited to the analysis in vivo of complex biological events. In general, the authors present a very thorough approach with an impressive range of molecular analysis, genetic experiments and phenotypic assays. *We thank the reviewer for recognizing the suitability of our model organism as well as the time investment and diversity of experiments that were performed in this work. We have added and revised multiple experiments during this revision, which has greatly improved the manuscript.

      * The authors report that Rbfox1 is expressed in all Drosophila muscle types, and regulated in both a temporal and muscle type specific manner. Using inhibitory RNA to knock down gene function, they show that Rbfox1 is required in muscle for both viability and pupal eclosion, and contributes to both muscle development and function. A Bioinformatic approach then identifies muscle genes with Rbfox1-binding motifs. They show Rbfox1 regulates expression of both muscle structural proteins and the splicing factor Bruno1, interestingly preferentially targeting the Bruno1-RB isoform. They report functional interaction between Rbfox1 and Bruno1 and that this is expression level-dependent. Lastly, they report that Rbfox1 regulates transcription factors that control muscle gene expression.

      They conclude that the effect on muscle function of RbFox1 knock down is through mis-regulation of fibre type specific gene and splice isoform expression. Moreover, "Rbfox1 functions in a fibre-type and level-dependent manner to modulate both fibrillar and tubular muscle development". They propose that it does this by "binding to 5'-UTR and 3'-UTR regions to regulate transcript levels and binding to intronic regions to promote or inhibit alternative splice events." They also suggest that Rbfox1 acts "also through hierarchical regulation of the fibre diversity pathway." They provide further evidence to the field that Rbfox1's role in muscle development is conserved.

      **MAJOR COMMENTS**

      Are key conclusions convincing?

      In terms of presentation, I suggest ensuring a clear demarcation throughout of the evidence behind the main conclusions. This can get somewhat lost as a great deal of information is presented, including all the parallels with prior findings in other systems. I am not saying this is a major problem, just highlighting the importance of clarity. Conclusions to clearly evidence include: Rbfox1 functions in a fibre-type manner to modulate both fibrillar and tubular muscle development (e.g. L664); Rbfox1 functions in a level-dependent manner (e.g. L664); Rbfox1 functions by binding to 5'-UTR and 3'-UTR regions to regulate transcript levels (e.g. L670); Rbfox1 functions by binding to intronic regions to promote or inhibit alternative splice events" (e.g. L670); "Bru1 can regulate Rbfox1 levels in Drosophila muscle, and likely in a level-dependent manner" (L488) - Clearly evidence the level effect; "first evidence for negative regulation for fine tuning acquisition of muscle-type specific properties. Depending on its expression level, Rbfox1 can either promote or inhibit expression of" muscle regulators (L797). Lastly, the controlled stoichiometry of muscle structural proteins is known to be important, but all mechanisms are not known, so again make the supporting evidence as clear as possible for the interesting point of a role for Rbfox1 in this (e.g. L787). *Using the above comments from the reviewer as a guide, we have rewritten the manuscript, including large portions of the discussion, introduction and results. We thank the reviewer for pointing out where we could more effectively communicate our results, support our conclusions and highlight the significance of our findings.

      * Should some claims be qualified as preliminary or removed?

      P301 "complicated genetic recombination" - seems a bit weak to include. Either do it or don't include? *

      We have removed this statement from the text.*

      *

      Also, see section below on "adequate replication of experiments"

      Are additional exps essential? (if so realistic in terms of time and cost) None essential in my view. It depends on the authors' goals, but for the most impact of the project then following up these suggestions are possible. L369-372: mutate putative Rbfox1 binding site and ask does binding still occur or not. If it doesn't, then ask if this mutation affects the expression of the putative target gene. L775-777 "Our data thus support findings that Rbfox1 modulates transcription, but introduce a novel method of regulation, via regulating transcription factor transcript stability." It would be good to demonstrate this.

      We thank the reviewer for these suggestions, and agree they are indeed interesting experiments, but beyond the scope of this manuscript. We plan to pursue the detailed molecular and biochemical mechanisms of regulation in a future project including exploring Rbfox1 binding through use of reporters, identification of direct targets via CLIP and investigation of post-transcriptional regulation of translation or NMD.*

      Presented in such a way as to be reproduced

      Yes

      Are exps adequately replicated?

      A main area I would address is the authors frequent use of "may", "tend", "trend". This is confusing the picture they present. What is statistically significant and what is not? Only the former can be used as evidence. Examples include: L170: "may display preferential exon use" - does it or doesn't it? L272: "myofibrils tended to be thicker" - were they or weren't they? L350 "wupA mRNA levels tend towards upregulation in Rbfox1-RNAi". L353 "but tended towards upregulation (Fig. S4A)" L466 "Correspondingly, we see a trend towards increased protein-level expression of Bru1-PA" L474 "both Bru1-PA and Bru1-PB tend to increase" L485 "Overexpression of Bru1 in TDT with Act79B-Gal4 also tends to reduce Rbfox1" L595 "Rbfox1-IR27286 tended towards increased exd levels in IFM (Fig. 7A)" L614 "and a trend towards increased use of Mef2-Ex20 " Also, L487 "suggesting that Bru1 can also negatively regulate Rbfox1" - one cannot use a non-significant observation to suggest something. *

      We have modified the text to limit use of “may”, “tend” and “trend”, and have removed discussion of non-significant results. We thank the reviewer for the very helpful and detailed list of sentences to modify.

      \*MINOR COMMENTS**

      *

      Although individual samples are not significant, in aggregate there is a trend….

      * Specific exp issues that are easily addressable

      L162: "dip in Rbfox1 expression levels around 50h APF". The Fig indicates as early as 30h. Is this significantly less than the 24h data point? Comparisons in Figure 1G that are significant based on DESeq2 differential expression analysis with an adjusted p-value L427 "this staining was lost after Rbfox1 knockdown". This conflicts with Fig 5K which says no significant difference. Again in L429 "Rbfox1 knockdown leads to a reduction of Bru1 protein levels in IFMs and TDT." Fig says no significant difference in TDT. *

      We thank the reviewer for pointing out this inconsistency. We have revised the text accordingly. Our Western Blot (Figure 5L, M) and RT-PCR (Figure 5N, O) do show changes of Bru1 protein and mRNA expression levels after knockdown of Rbfox1KK110518. *

      Are prior studies referenced appropriately?

      This m/s is an authoritative presentation of the field as a whole with a comprehensive, impressive reference list. However, a point related to this area is one of the main things I would consider tackling. This is to have more clarity in the demarcation of what this study has found that adds to prior knowledge. It is worthwhile in itself to demonstrate the many similarities with previous work in other systems, as part of establishing the Drosophila system with all its analytical advantages for in vivo molecular genetics as an excellent model for future study in this area of research. However, the impact/strength of this m/s would be enhanced by clarity in presenting what is new to the field in all organisms. *We thank the reviewer for this suggestion. We have rewritten large portions of the manuscript, including the introduction and discussion, to improve the clarity of our findings and their importance to the field.

      * Are the text and Figs clear and accurate?

      TEXT

      L156: more precise language than "in a pattern consistent with the myoblasts" - maybe a simple co-expression with a myoblast marker? *

      We have revised this phrasing in the text. Rbfox1 expression in myoblasts was previously reported by (Usha and Shashidhara, 2010). *

      L181: at first use define difference between RNAi and IR*

      We use IR as an abbreviation for RNAi. In particular, we are trying to distinguish the two hairpins obtained from stock centers (27286 and KK110518) from the third, homemade RNAi hairpin, originally named UAS-dA2BP1RNAi, that was generated by Usha and Shashidhara (Usha and Shashidhara, 2010). We have better defined this in the text and methods. *

      L205: maybe clearly explain the link between eclosion and tubular muscle?? *

      We have added a sentence explaining the link between eclosion and tubular muscle (see Line 331).*

      L231: "Sarcomeres were not significantly shorter at 90h APF with the stronger Mef2-Gal4" - not clear why this is the case when the less strong knockdown conditions have shorter sarcomeres. *

      We have modified the text as well as the figure labeling to clarify that the other samples were tested in 1 d adult, while the KK110518 hairpin was tested at 90 h APF. This likely indicates that the short sarcomeres observed in 1 d adults reflect hypercontraction, which in IFM is classically first apparent after eclosion when the flies actively try to use the flight muscles. The difference in timing is due to pupal lethality of the KK110518 hairpin line, so we could not evaluate adult flies.*

      L234: "classic hypercontraction mutants in IFMs display a similar phenotype" - presumably not similar to the not significantly shorter sarcomeres of the previous sentence. *

      We have modified the text to clarify this statement. The change in sarcomere length from 90 h APF to 1 d adult is actually the relevant observation, as this reflects the progressive shortening of sarcomeres observed in classic hypercontraction mutants.*

      L244: "90h", should be "90h APF"? *

      Yes, we have modified the text.*

      L273: "Myofibrils in Act88F-Gal4 mediated knockdown only showed mild defects (Fig. 3 G, H, Fig. S2 C, D) despite adult flies being flight impaired". This seems worthy of discussion - the functional defect is not due to overt structure change? *

      In our own experience as well as observations included in a genome-wide RNAi screen in muscle (Schnorrer et al., 2010), there are a rather large number of knockdown conditions where few if any structural defects are observed at the level of light microscopy, but flies are completely flightless. We interpret this to reflect the narrow tuning of IFM function, where slight alterations in calcium regulation or sarcomere gene isoform expression result in dysfunction and a lack of flight. Ultrastructural evaluation might reveal defects in these cases, but the defect could also be with the dynamics of tropomyosin complex function, calcium regulation, mitochondrial function or even neuro-muscular junction structure. We have added a sentence to the text to discuss and clarify the Act88F result.*

      L281 "also known as Zebra bodies" - helpful to indicate these on the Fig, they are not. *

      We have added arrows to the figure to mark the Zebra bodies, and updated the figure legend.*

      L282: "we were unable to attempt a rescue of these defects" - I may have missed something, but what about rescue undertaken of the defects on previous pages? *

      This is the first point in the text where we introduced overexpression of Rbfox1, as preceding experiments where knockdown or using a GFP-tagged protein trap line at the endogenous locus. We have revised the sentence to focus on the overexpression phenotype with UH3-Gal4.*

      L283: "Over-expression of Rbfox1 from 40h APF" - this is the first over-expression experiment, so introduce why done now (and perhaps not earlier), and also explain the use of a different Gal4 driver.*

      We have reworded this section of the text. The UH3-Gal4 driver is restricted to expressing in IFM from 40h APF, so is first expressed after myofibrils have been generated and selectively in IFM. This avoids lethality observed from pan-muscle expression with Mef2-Gal4 (presumably due to severe defects in tubular muscles), and also allows us to image IFM tissue from adult flies. Later experiments with Mef2-Gal4 were performed with a later temperature shift to avoid this early lethality.*

      L290 "Interestingly, both Rbfox1 knockdown and Rbfox1 over-expression produce similar hypercontraction defects" - this could be interesting, worthy of discussion/explanation. *

      The most logical explanation is that Rbfox1 regulates the balance in fiber-type specific isoform expression. Loss of Rbfox1 would cause a shift in the relative ratio of the isoforms of structural genes, and overexpression of Rbfox1 would likely cause a similar shift in the opposite direction. This is supported by our RT-PCR panel, where we see co-regulation of different events with Bru1, and we see fiber-type specific difference in regulation of alternative splicing (Figure 8). Overexpression of Rbfox1 would be expected to make IFM look more like TDT, which would result in an isoform imbalance and lead to the observed hypercontraction phenotype. Interestingly, loss and overexpression of Bru1 also result in the same hypercontraction phenotype, similar to what we observe with Rbfox1. We have added a paragraph in the discussion about level-dependent regulation, to address this reviewer comment.*

      P305: Bioinformatic analysis. It is not clear what is taken as a potentially interesting result. On average a specific 5 base motif is found every 1000bps - so what is being looked for? How many sites in what length or position? A range of examples are described in the next pages of the m/s. For example: L337 "Bruno1.... contains 42 intronic and 2 5'-UTR Rbfox1 binding motifs" and L591 "exd contains three Rbfox1 binding sites," *

      We have redone the bioinformatic analysis completely, relying on data from oRNAment and the in-vitro determined PWM. We have also rewritten all portions of the text related to this analysis and no longer focus on the number of observed motifs in a given gene. As we unfortunately do not have RNA CLIP data, we do not know genome-wide which motifs are bound in muscle. Clustering of motifs may reflect binding, but a single, strong motif can also be bound, as we demonstrate via RIP of the wupA transcript. Thus, we identified interesting targets to test based on 1) a previously described role in the literature in myofibril assembly or contractility and 2) the presence of any Rbfox1 motif in that gene. A more elegant selection method of direct and indirect target exons will be designed for a future manuscript after integrating CLIP and mRNA-Seq data that have not yet been collected.

      L315: "many of these genes have binding or catalytic activity". "catalytic activity" seems very vague.

      For the original supplemental figure panel, we relied on Panther high-level ontology terms, which can unfortunately be rather vague, ie “catalytic activity” or “binding activity”. We have redone this analysis and rely rather on GO terms in the biological process and molecular function categories (Figure S3 B).

      L317 "When we look in previously annotated gene lists" - be more specific. What are they?

      This section of the text has been rewritten, and the “previously annotated gene lists” are described in greater detail in the Methods. *

      L327 "may also affect the neuro-muscular junction" - maybe better left for the Discussion? *

      We have removed this sentence from the Results.*

      L333 "extradenticle (exd) and Myocyte enhancer factor 2 (Mef2) contain 3 and 7 Rbfox1 motifs," Discuss the number and position of multiple motifs found in known targets? *

      We have removed the discussion of the number of binding sites for different target genes, instead incorporating this information graphically in Figure S3 C. It is not clear that the number of binding sites per gene has any influence on whether it is regulated in Rbfox1 knockdown. Thus, we have de-emphasized discussion of the number of binding sites throughout the text.*

      L350 "wupA mRNA levels " - clearer to stick to using TroponinI or WupA? *

      We have updated instances throughout the text to consistently refer to the protein as Troponin-I (TnI) and the gene or mRNA as wupA. *

      L376 "To check whether Rbfox1 regulates some target mRNAs such as wupA....." The suggestion here is more of a further indication than a "check". *

      We have reworded this section of the results to make the link between post-transcriptional regulation and our mass spectrometry results more salient.*

      L544 "In IFMs, knockdown of Rbfox1 and loss of Bru1 results in...." clarify if this is the two genes separately or the two genes together? *

      We have rewritten this entire section and present an expanded list of tested alternative events. We have taken care in this revision to clearly denote if the genotype is Rbfox1-IR or bru1M2 or a double knockdown background.*

      L580 "Our bioinformatic analysis identified Rbfox1 binding motifs in more than 40% of transcription factors genes" - is this all TFs or just "muscle" TF genes? *

      We have redone this analysis and changed this sentence in the text.*

      L598, what would be the mechanism of some decrease in Rbfox1 increasing mRNA levels and more of a decrease resulting in a decrease of the mRNA? The authors say "the nature of this regulation requires further investigation". *

      We have added more data to this section of the manuscript and repeated several of these experiments. After adding more biological replicates and additional data points, we have more consistent results that also demonstrate the variability in bru1 expression levels after Rbfox1 knockdown. Overall levels of bru1 assayed with a primer set in exons 14 and 17 now consistently show an increase in bru1 expression after Rbfox1 knockdown between all three hairpins (Rbfox1-RNAi, Rbfox1-IRKK110518 and Dcr2, Rbfox1-IR27286) (Figure 5 N).

      The relationship between expression level of Rbfox1 and expression level of bru1 and Bru1 protein isoforms is more complex. We now report a novel splice event in the annotated isoform bru1-RB that skips exon 7, resulting in a frame shift and generation of a protein that lacks all RRM domains, which we call bru1-RBshort (Figure S5). This short isoform is preferentially used in TDT, while the long isoform encoding the full-length protein is preferentially used in IFM (Figure 5 P). Presumably, this provides a mechanism, in addition to the use of different promoters, for muscle cells to regulate expression levels of different Bru1 isoforms. Knockdown of Rbfox1 in IFM results in a significant increase in the use of the long mRNA isoform, but paradoxically a decrease in the corresponding protein isoform (Figure 5, S5). We interpret this to mean that Rbfox1 regulates alternative splicing of Bru1, and likely independently a translational/post-translational mechanism regulates the expression level of Bru1-RB. This in theory could be mediated by interaction with translational machinery, post-translational modification, increased P-granule association, etc., and given the depth and breadth of experiments (as well as the multitude of isoform-specific expression reagents) required to isolate the responsible pathway, we deem it beyond the scope of this manuscript to biochemically demonstrate this specific regulatory mechanism. *

      L609 "The short 5'-UTR encoded by Mef2-Ex17". Ensure all abbreviations are defined. What does "Ex" mean here? Not straightforward to relate to the diagram in the Supplemental material that indicates the Mef2 gene has many fewer than 17 exons. In Fig7 legend too. *

      We have changed “Ex” to “exon” in the text. We apologize for the confusion. We have also added a diagram to Figure 7 E of the 5’-UTR region of Mef2, and a complete diagram of the locus in Figure S3 C. Based on the current annotation, Mef2 exons are numbered 1 to 21, corresponding to at least 16 distinct regions of the genome (18 if you include the variable 3’-UTR lengths). Exons sometimes will have more than one number in the annotation if a particular splice event causes a shift in the ORF, or if alternative splice sites or poly-adenylation sites are used. Mef2 is also on the minus strand, so as exons are numbered based on the genome scaffold, the exon numbering goes in reverse (ie exon 1 is the 3’-UTR).

      We strongly believe in following the numbering provided in the annotation, to increase reproducibility and transparency in working with complex gene loci for many different genes. Another researcher can go to Flybase, look-up the exon number from a given gene from a specific annotation, and get the exact location and sequence of the exons we name. It is incredibly challenging and time intensive to go through older papers and figure out which exon or splice event corresponds to those in the current annotation, and we aim to alleviate this difficulty (we illustrate this in Figure 4 A for the wupA locus, where we verified exon numbers in annotation FB2021_05 by BLASTing each individual sequence and primer provided in (Barbas et al., 1993).*

      L617 "Levels of Mef2 are known to affect muscle morphogenesis but not production of different isoforms" - clarify what is meant here by "different isoforms". *

      We have revised this section of the text. This statement was meant to reflect that Mef2 affects muscle morphogenesis through regulation of transcription levels, but not at the level of alternative splicing.*

      L638 "Salm levels were significantly increased in IFM from Rbfox1-RNAi animals, but significantly decreased in IFMs from flies with Dcr2 enhanced Rbfox1-IR27286 or Rbfox1-IRKK110518". This is worth discussion or further analysis. Normally would expect an allelic series, with an effect becoming more apparent with increased loss-of-function. *

      Dcr2, Rbfox1-IR27286 and Rbfox1-IRKK110518 produce a stronger knockdown than Rbfox1-RNAi, and indeed produce significantly decreased levels of salm, thus following the allelic series. We repeated this experiment, but obtained the same results. *

      L641 "This suggests that Rbfox1 can regulated Salm". How, if there are no Rbfox1 binding sites? Deserves further analysis? *

      Our new bioinformatic analysis suggests a possible answer, in that it identified possible Rbfox1 motifs in a salm exon and a site in an intron. Previously, we had focused on introns and UTR regions. In addition, using the PWM we now recover Rbfox1 binding sites of the canonical TGCATGA as well as AGCATGA sites. The intron site in salm is an AGCATGA site. Further experiments will be required to determine if Rbfox1 directly binds to salm mRNA, if it interacts with the transcriptional machinery to regulate salm expression, or if this regulation occurs through yet a different mechanism, and are beyond the scope of this manuscript.*

      L674: "We found the valence of several regulatory interactions..." I'm not sure the meaning of "valence" here and elsewhere will be readily understood. *

      Thank you for pointing this out. We have used a different phrasing throughout the text.*

      FIGURES

      Fig 1 it is difficult to see the green in A-F. Can this be improved? It is clearer in I-L. *

      We have replaced the images with better examples and increased the levels to make the green channel better visible. *

      Fig 2 legend (others too), say what the clusters of small black ellipses in P and Q are. *

      Thank you for pointing out this oversight. All boxplots are plotted with Tukey whiskers, such that they are drawn to the 25th and 75th percentile plus 1.5 the interquartile range. Dots represent outlying datapoints outside of this range. We have added statements in the relevant figure legends, as well as a more detailed explanation in the Methods. *

      Fig 3 it is not easy to see a shorter sarcomere in D, as the arrow partially obscures what is being indicated. Also, the data in G indicates that sarcomeres are not shorter in Mef2 GAL4 > KK110518, although the legend says this is shown in D. *We have rephrased the statement in the legend. The arrows are pointing to frayed or torn myofibrils.

      Fig 5 legend "-J). Bru1 signal is reduced with Rbfox1-IRKK110518 (C, F, I)". Clarify that this is only in IFM. It is not significant in TDT or Abd-M.

      Done.*

      Fig 7 legend "quantification of the fold change in exd transcript levels" - only KK110518 in IFM is significant. *

      This panel was moved to Figure S7. The relevant regions of the text and figure legend were modified to reflect that only Rbfox1-IRKK110518 results in a significant change in exd levels. C - "indicates Rbfox1 binds to Mef2 mRNA" - it is not easy to see the band.

      We replaced the image and adjusted the levels to make the band more visible. D - what do the different lanes on the gel below the histogram in D correspond to? We adjusted the labeling on the figure panel. The gel is a representative image of RT-PCR results that are quantified above in the histogram.

      *Suggestions that would help the presentation of their data and conclusion **

      There is a lot of good, thorough work here, but overall there is the impression that some of the presentation/writing could be improved (also see the above lists on clarity and accuracy). I admire the authors for their comprehensive presentation of what has already been found out in this field. As the authors summarise, a lot is already known in many other species, so (as also indicated above) it is crucial to emphasise what new is found in this work that advances overall knowledge in this field. This can be obscured in many places where they say because of what was found in vertebrate systems we looked in Drosophila. These include: L417: "This led us to investigate if Rbfox1 might regulate Bru1 in Drosophila." L452: "and we were curious if these interactions are evolutionarily conserved in flies." L528 "Thus, we next checked if Rbfox1 and Bru1 co-regulate alternative splicing in Drosophila muscle." L677 "Moreover, as in vertebrates, Rbfox1 and Bru1 exhibit cross-regulatory interactions" L683 "Rbfox1 function in muscle development is evolutionarily conserved" L697 "Here we extend those findings and show that as in vertebrates......" L702 "our observations are consistent with observations in vertebrates" L707 "Studies from both vertebrates and C. elegans suggest that Rbfox1 modulates developmental isoform switches." L746 "We see evidence for similar regulatory interactions between Rbfox1 and the CELF1/2 homolog Bru1 in our data from Drosophila." *We thank the reviewer for this honest and helpful assessment of the manuscript. Upon rereading the original text and with the guidance of the list of sentences above, we agreed with the reviewer and we have rewritten large segments of the manuscript. In particular in the introduction and discussion, we now better emphasize what is new in our findings and how they advance overall knowledge in this field.

      L185 paragraph. The knockdown series is important for the study. A lot is presented in this paragraph, especially for a non-specialist and it could be easier to follow. Perhaps present the four genetic conditions in the order of the severity of their phenotype on viability. Also, clearly state what each Gal4 driver is used for. What is the nature of the RNAi/IR lines such that Dcr2 could enhance their action? Also comment on off targets - are any predicted?

      We have rewritten this paragraph as the reviewer requested. The hairpins are ordered by decreasing phenotypic severity, and we have more clearly described each Gal4 driver as well as Dicer2. This information is also available in the Methods, along with the off targets for the hairpins. KK110518 has one predicted off-target ichor, but this gene is not expressed in IFM, TDT or leg based on mRNA-Seq data. 27286 has no predicted off-targets. *

      L227: "In severe examples". Be as clear as possible. Are the "severe examples" using the stronger RNAi line or are they the most severe examples with a single line? I'd suggest including the result in the main Fig rather than in the Supplemental. However, as I read more of the m/s I realise there is a great deal of important information in the Supplemental Figs, and so the case is not much stronger for this example than many others. The balance of what is included where could be looked at, because it is not straightforward for the reader to read the paper and quickly flick between the main and supplemental Figs. Later in the m/s is a substantial section that starts L450 (finishes L489) and which only refers to Supplemental Figs. L503 is another area where it is necessary, and difficult, for the reader to move between main Figs and supplemental Figs. *We have reorganized the figure panels in several figures, notably Figures 4, 5, 6, 7 and 8 and the corresponding supplementary figures, including moving panels from the supplemental figures to the main figures and generating more comprehensive quantification panels. In the specific case referenced here for Fig. S1 P and Q, we chose to keep the most representative images of the phenotype in the main figure (Fig. 2 I, N), and have reworded the text to reflect that the most severe phenotypic instances are in the supplement. As we do not have CLIP data, we chose to keep the bioinformatics analysis in the supplement and have shortened the paragraph in the results devoted to Figure S3. We hope our reorganization and rewriting have better streamlined the text and figures.

      L258: - perhaps a Table summarising this and other phenotype trends with the different RNA conditions might be helpful. It gets quite difficult to follow.

      We have revised the text and several figure panels to make the phenotypic trends with the different RNAi conditions easier to follow.*

      Reviewer #2 (Significance (Required)):

      The advance reported is mechanistic.

      The authors already do a very good job of placing their work in the context of prior research (see comment is Section A).

      Muscle biologists interested in its development and function will be interested in this work. More broadly, those intrigued by alternative splicing will be interested. Despite its very widespread occurrence, much about alternative splicing is still poorly understood in terms of regulation and significance. This is especially the case in vivo, and this paper uses an excellent in vivo model system (Drosophila) for the genetic and mechanistic analysis of complex biological problems. My field of expertise: cell differentiation, gene expression, muscle development, Drosophila.

      Response to Reviewer 3 Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **SUMMARY**

      This manuscript characterizes the role of splicing factor Rbfox1 in Drosophila muscle and explores its ability to modulate expression of genes important for fibrillar and tubular muscle development. The authors hypothesize that Rbfox1 binds directly to 5'-UTR and 3'-UTR regions to regulate transcript levels, and to intronic regions to promote or inhibit alternative splicing events. Because some of the regulated genes encode transcriptional activators and other splicing factors such as Bru1, the effects of Rbfox1 may encompass a complex regulatory network that fine-tunes transcript levels and alternative splicing patterns that shape developing muscle. Most likely the authors' hypothesis is correct that Rbfox1 is critical for muscle development in Drosophila, but overall the interesting ideas presented here are too often based only on correlations without further experimental validation. *

      We respectfully disagree with the reviewer that our hypothesis that Rbfox1 is critical for muscle development in Drosophila is based only on correlation without further experimental validation. In this manuscript we extensively characterize the knockdown phenotype of 3 RNAi hairpins against Rbfox1 as well as a GFP-tagged Rbfox1 protein in both fibrillar flight muscle and tubular abdominal and jump muscle. All hairpins produce similar phenotypes with defects in myofiber and myofibril structure and result in behavioral defects in climbing, flight and jumping, confirming this phenotype is due to loss of Rbfox1 and not a random off-target gene. We also convincingly demonstrate that Rbfox1 regulates Bru1, another splicing factor known to be critical for fibrillar specific splice events in IFM. Moreover, Rbfox1 and Bru1 genetically interact selectively in IFM and our RT-PCR data for 12 select structural genes reveals fiber-type specific alternative splicing defects regulated by Rbfox1 selectively, by Bru1 selectively, or by both Rbfox1 and Bru1. Thus, we conclude that Rbfox1 is indeed critical for muscle development, and this is the first report to demonstrate this requirement in Drosophila.*

      **MAJOR COMMENTS**

      The hypothesis that Rbfox1 plays an important role in regulating muscle development is based on previous studies in other species and supported by much new data in this manuscript. Initial bioinformatic analysis showed that many Drosophila genes, including 20% of all RNA-binding proteins, 40% of transcription factors, etc. have the motifs in introns or UTR regions. However, I think a deeper analysis is required. Any hexamer might be present about once every 4kb, and we do not expect all UGCAUG motifs are necessarily functional, so one might ask whether the association of Rbfox motifs with muscle development genes is statistically significant? Are the motifs conserved in other Drosophila species, which might support a functional role in muscle? Are the intronic motifs located as expected for regulatory effects, that is, proximal to alternative exons that exhibit changes in splicing when Rbfox1 expression is decreased or increased? *

      We appreciate the point of the reviewer that it would be ideal to distinguish genome-wide motifs that are actually bound directly by Rbfox1 from those that are unused, but our behavioral and phenotypic characterization of the knockdown phenotype in this manuscript is also valid without this data. The most effective approach to identify direct targets is to perform cross-linking immunoprecipitation, or CLIP, but we unfortunately do not have CLIP data from Drosophila muscle and it is beyond the scope of the current study to generate this data. It is not trivial to obtain the amount of material necessary to identify tissue-specific binding sites, as we would also likely expect differences in targeting specificity between tubular and fibrillar muscle. Genome-wide analysis of the evolutionary conservation of binding site motifs is also not trivial and is beyond the scope of this paper.

      Despite these limitations and to address the reviewer’s comment, we have done the following:

      1. We have completely redone our bioinformatic analysis using transcriptome data from the oRNAment database (Benoit Bouvrette et al., 2020), as well as searching genome-wide for instances of the in vitro determined PWM using PWMScan, to capture possible sites in introns (Figure S3). The oRNAment database was shown to reasonably predict peaks identified in eCLIP from human cell lines, which we assume would translate to a similar predictive capacity in the Drosophila
      2. We have calculated the expected distribution of Rbfox1 sites in a random gene list for Figure S3, and indeed the number of Rbfox1 sites in sarcomere genes is significantly enriched.
      3. We have looked more carefully at the distribution of Rbfox1 and Bru1 motifs in the transcriptome (in the oRNAment data), and find not only that these motifs frequently occur in the same muscle phenotype genes, but also that they are closer together than is expected by chance (Fig. S4 J).
      4. We marked the location of Rbfox1 and Bru1 motifs in the vicinity of select alternative splice events we tested via RT-PCR on the provided summary diagrams (Fig. 6, Fig. S6).
      5. We have tested additional alternative splice events in total from 12 structural genes, and of the 9 events misregulated after Rbfox1 or Bru1 knockdown, all but 1 are flanked by Rbfox1 or Bru1 binding motifs. This indicates that the motifs are indeed located as expected for a regulatory effect. Is it possible to knock out an Rbfox motif and show that splicing of the alternative exon is altered, or regulation of transcript levels is abrogated?


      The construction and mutation of reporter constructs is possible, but would take longer than the recommended revision time-frame, in particular to generate reporters that can be evaluated in vivo. We intend to address the biochemical mechanism(s) of Rbfox1 regulation with future experiments in a separate manuscript.

      Also, what was the background set of genes used for the GO enrichment analysis? Genes expressed in muscle or all genes?

      The background set of genes for GO enrichment (now Figure S3 B) was all annotated genes for the “all genes” label and all muscle phenotype genes for the “Muscle phenotype” label.

      The data on cross regulation between Rbfox1 and Bru1 are confusing and inconsistent, since mild knockdown and stronger knockdown of Rbfox1 seem to have different effects on Bru1 expression. New data suggest that Rbfox1 can positively regulate Bru1 protein levels (Fig.5), but this seems inconsistent with the lab's earlier studies indicating opposite temporal mRNA expression profiles for Rbfox1 and Bru1 across IFM development. 


      We apologize for the confusion, but the relationship between Rbfox1 and bru1 levels across IFM development has not been published previously. We previously generated that mRNA-Seq data, but presented here (now in Figure 5Q) is a new analysis of that data, specifically focused on Rbfox1 and bru1 expression. We have corrected the phrasing in the text.

      To address this comment, along with points raised above by Reviewer 2, we have revised this part of the manuscript, added more data to this section of the manuscript and repeated several of these experiments. After adding more biological replicates and additional data points, we have more consistent results that also demonstrate the variability in bru1 expression levels after Rbfox1 knockdown. Overall levels of bru1 assayed with a primer set in exons 14 and 17 now consistently show an increase in bru1 expression after Rbfox1 knockdown between all three hairpins (Rbfox1-RNAi, Rbfox1-IRKK110518 and Dcr2, Rbfox1-IR27286) (Figure 5 N). This is consistent with our observations of inversely correlated mRNA levels during IFM development, as when Rbfox1 levels decrease, bru1 transcripts increase.

      We agree with the reviewer that the relationship between the expression level of Rbfox1 and expression level of bru1 mRNA and Bru1 protein isoforms is more complex. We now report a novel splice event in the annotated isoform bru1-RB that skips exon 7, resulting in a frame shift and generation of a protein that lacks all RRM domains, which we call bru1-RBshort (Figure S5). Unknowingly, we had previously used a primer set from exon 7 to exon 8 as “common”, which lead to some confusion. This short isoform is preferentially used in TDT, while the long isoform encoding the full-length protein is preferentially used in IFM (Figure 5 P). Presumably, this provides a mechanism, in addition to the use of different promoters, for muscle cells to regulate expression levels of different Bru1 isoforms. Knockdown of Rbfox1 in IFM results in a significant increase in the use of the long mRNA isoform, but paradoxically a decrease in the corresponding protein isoform (Figure 5, S5). We interpret this to mean that Rbfox1 regulates alternative splicing of Bru1, and likely independently a translational/post-translational mechanism regulates the expression level of Bru1-RB. This in theory could be mediated by interaction with translational machinery, post-translational modification, increased P-granule association, etc., and given the depth and breadth of experiments (as well as the multitude of isoform-specific expression reagents) required to isolate the responsible pathway, we deem it beyond the scope of this manuscript to biochemically demonstrate this specific regulatory mechanism. *

      *

      Both Rbfox1 and Bru1 gene have many Rbfox motifs, but they are both large genes (>100kb) and would be expected to have many copies of all hexamers. How do we know whether any of them are functional?

      We do not know if all of the Rbfox1 binding sites in the Bru1 and Rbfox1 loci are bound, but the CLIP data required to assess this is beyond the scope of this manuscript, as discussed above. We do show, however, that changes in the expression level of Rbfox1 affect the expression of Bru1 on both the mRNA transcript and protein level, and changes in the expression level of Bru1 also can affect the expression level of Rbfox1. The direct or indirect nature of this regulation remains to be fully elucidated, although we do provide RIP data showing we can detect bru1 transcript bound to Rbfox1-GFP (Figure S4 I). We have modified the text to address this comment.

      Figure S4, section I, J: if changes in Bru1-RB isoform expression are correlated with Rbfox1 knockdown, it seems reasonable to test whether the Bru1-RB promoter can drive expression of GFP in an Rbfox1-dependent manner. But if I understand correctly, the assay as described on p. 19 uses the promoter region upstream of Bru1-RA. What is the logic for this experiment? It is not surprising that no effect was observed. The end result is that we have no idea whether Rbfox1 directly regulates bru1-RB. Even if it does, bru-Rb appears to be a minor component of Bru expression in IFM.

      Upon reevaluating this experiment and with respect to the reviewer’s comment, we have removed it from the manuscript to avoid confusion. Our new data indicate a switch in use of the bru1-RBlong and bru1-RBshort isoforms (Figure 5 N-P), suggesting that Rbfox1 regulation is on the level of splicing.

      Further experiments will be necessary to refine the indirect versus direct regulatory effects of Rbfox1 on Bru1, but our data do demonstrate that Bru1 levels are regulated in Rbfox1 knockdown conditions. We also provide a RIP experiment (Figure S4 I) showing that Rbfox1-GFP does directly bind bru1 mRNA, but we did not determine if this was isoform-specific. Multiple additional experiments would be necessary to distinguish between regulation of alternative splicing, direct binding to regulate transcript translation or stability, or transcriptional regulation via regulation of Salm, or some combination of these possible mechanisms. The data presented here are important to the field as they are the first report of isoform-specific regulation of Bru1 in muscle, even if we do not conclusively show if this regulation by Rbfox1 is direct or indirect.

      In the section "Rbfox1 and Bruno1 co-regulate alternative splice events in IFMs", the data show that splicing of several genes is altered by knockdown or over-expression of Rbfox1 and Bru1. The interesting conclusion is for a complex regulatory dynamic where Rbfox1 and Bru1 co-regulate some alternative splice events and independently regulate other events in a muscle-type specific manner. However, if we are to conclude that these activities are due to direct binding of Rbfox1 and Bru1 to the adjacent introns, we need information about the location of flanking Rbfox and/or Bru1 motifs. Do upstream or downstream binding sites correlate with enhancer or silencer activity, as reported in previous studies of these splicing factors in other species? For wupA, Figure S3 shows an intronic Rbfox site, but exon 4 is not labeled so the reader cannot correlate this information with the diagram in Figure 6U.

      As mentioned above, we have marked the location of Rbfox1as well as Bru1 binding motifs in the diagrams in Figure 6 and Figure S6. We have tested additional alternative splice events, and can now show events regulated only in the Rbfox1 knockdown, only after bru1 knockdown, or in double knockdown flies (Figure 8). 8 out of 9 events where we see clear changes in splicing are flanked by potential Rbfox1 or Bru1 motifs. Demonstration of direct binding and assay of genome-wide binding sites through CLIP studies is beyond the scope of this manuscript and will be pursued in the future.

      The evidence that Rbfox1 directly affects expression of transcription factor Exd seems to be based only a correlation between Rbfox1 knockdown and decreased expression of Exd. The observation that binding of Rbfox1 to the Exd 3'UTR in RIP experiments further weakens the case.

      We agree with the reviewer and have moved the data related to exd to the supplement (Figure 7 and S7). We still mention exd in the text as it is significantly decreased after knockdown with Rbfox1-IRKK110518, but we have removed it from larger claims of transcriptional regulation as well as from the summary in Figure 8. Also, just to note that although we failed to detect Rbfox1-GFP bound to exd, this experiment was performed with adult flies. Since Exd is functionally important early in pupal development during fate specification of the IFMs, it is possible we might detect binding to exd mRNA at a different developmental timepoint.

      Similarly, there is a correlation of Rbfox1 knockdown with expression of alternative 5'UTRs in the Mef2 gene. However, the changes in UTR expression appear mostly not statistically significant. Do the authors have a model to explain what mechanism might allow Rbfox to regulate expression of alternative 5'UTRs, which would seem to be a transcriptional process?

      Mef2 transcript levels are significantly increased after knockdown with Rbfox1-RNAi and decreased after overexpression of Rbfox1, and we can detect direct binding of Rbfox1-GFP to Mef2 RNA via RIP. This establishes Mef2 as a likely direct target of Rbfox1 regulation, likely through the two Rbfox1 motifs in the 3’-UTR (Figure S3 C). In addition to this regulation, we made an observation that has not been previously reported in the literature, that IFM expresses a particular isoform of Mef2 that uses a short promoter encoded by Exon 17. We see both tissue-specific use of Exon 17 (Figure 7 F) as well as developmental regulation of Exon 17 use in IFM (Figure S7 C). Surprisingly, we saw that use of exon 17 in the Mef2 promoter is altered in Rbfox1 knockdown muscle. We now provide a quantification of this data, to show the change is statistically significant. We also provide a scheme of the Mef2 locus and RT-PCR primers with exons 17, 20 and 21 labelled (Figure 7 E). We have also rewritten this section of the text to increase the impact and clarity of our finding.

      For Salm, there apparently are no Rbfox motifs in the gene, and there are statistically significant but apparently inconsistent changes in Salm expression when it is knocked down in IFM by Rbfox1-RNAi (Salm increases) vs knockdown by Rbfox1-IR27286 or Rbfox1-IRKK110518 (Salm decreases). These are potentially interesting observations but more data would be needed to make stronger conclusions. How would regulation occur in the absence of Rbfox motifs?


      The best explanation we can provide for why salm expression is increased with the weak hypomorph Rbfox1-RNAi condition, but decreased with the stronger hypomorph Rbfox1-IRKK110518 or Dcr2, Rbfox1-IR27286 conditions is that salm regulation is sensitive to Rbfox1 expression or activity level. We now discuss this in a new section of the discussion. We further attempted several experiments to address this question, including obtaining an endogenously tagged Salm-GFP line, as well as a UAS-Salm line (kindly provided by F. Schnorrer). Disappointingly, there is no GFP expressed in the Salm-GFP line, either live, by immunostaining or in Western Blot of multiple developmental stages, indicating that the line has fallen apart and we have not yet redone the CRISPR targeting to generate a new line. The UAS-Salm construct works (too well), in that overexpression with Mef2-Gal4 results in early lethality and we have not yet managed to optimize the experiment and obtain enough pupal muscle where we can evaluate the effect on Bru1 or Rbfox1 levels.

      Our new bioinformatic analysis further revealed possible Rbfox1 motifs in a salm exon and a site in an intron. Previously, we had focused on introns and UTR regions. Now, using the in vitro determined PWM, we can recover Rbfox1 binding sites of the canonical TGCATGA as well as AGCATGA sites. The intron site in salm is an AGCATGA site. Further experiments will be required to determine if Rbfox1 directly binds to salm pre-mRNA, if it interacts with the transcriptional machinery to regulate salm expression, or if this regulation occurs through yet a different mechanism. We feel the many required experiments are beyond the scope of the current manuscript. Our data provides an experimental basis for future studies on this topic.

      \*MINOR COMMENTS**

      1. In several figures there is a misalignment of the transcriptional driver information with the phenotype data in the bar graphs above. Please correct the alignments to make interpretation easier. *

      We have revised the layout of labels for many plots throughout the manuscript to avoid a category label associated with a genotype label at a 45-degree angle, and to make interpretation easier.

      On p. 14 Brudno et al. is cited as ref for Fox motifs near muscle exons, but this paper only focused on brain-specific exons.

      In addition to brain-specific exons, Brudno et al. also analyzed a set of muscle-specific exons, and thus this is the appropriate reference. For instance, from the Brudno paper, “As an additional control in some experiments we analyzed a smaller sample of muscle-specific alternative exons that were collected exactly as described above for the brain-specific exons” and “UGCAUG was also found at a high frequency downstream of a smaller group of muscle-specific exons.” Further details of the muscle-specific exon analysis can be found in (Brudno et al., 2001).

      For Mef2, why do exons described as 5'UTR have numbers 17, 20, and 21? One would normally expect these to be exon 1, 2 or 1A, 1B, etc.

      We rely on the Flybase annotation and numbering system to refer to exons. Per Flybase, all exons are labeled in the 5’ to 3’ direction of the sequenced genome, even for genes, such as Mef2 or wupA, that are encoded on the reverse strand. We strongly believe in following the numbering provided in the annotation, to increase reproducibility and transparency in working with complex gene loci for many different genes. Another researcher can go to Flybase, look-up the exon number from a given gene from a specific annotation, and get the exact location and sequence of the exons we name. It is incredibly challenging and time intensive to go through older papers and figure out which exon or splice event corresponds to those in the current annotation. We illustrate this in Figure 4 A for the wupA locus, where we verified exon numbers in annotation FB2021_05 by BLASTing each individual sequence and primer provided in (Barbas et al., 1993). The Mhc locus is even more complex, in particular regarding alternative 3’-UTR regions and historic versus current exon designations (Nikonova et al., 2020). For clarity and reproducibility, we therefore rely on the current Flybase designations.

      Fig 8: "regulation of regulators" seems to imply the Rbfox1 is impacting transcription?? Is there precedence for this type of regulation by Rbfox1? Yes, indeed, there is precedence for Rbfox1 impacting transcription, as we presented in the Discussion. Rbfox2 is reported to interact with the Polycomb repressive complex 2 to regulate gene transcription in mouse (Wei et al., 2016) and in flies Rbfox1 interacts with transcription factors including Cubitus interruptus and Suppressor of Hairless to regulate transcription downstream of Hedgehog and Notch signaling (Shukla et al., 2017; Usha and Shashidhara, 2010). In addition, Rbfox1 regulates splicing of Mef2A and Rbfox1 and Rbfox1 cooperatively regulate splicing of Mef2D during C2C12 cell differentiation (Gao et al., 2016). Our results provide a further piece of evidence implicating Rbfox1 either directly or indirectly in transcriptional regulation as well as regulation of alternative splicing.

      * Reviewer #3 (Significance (Required)):

      **SIGNIFICANCE**

      These studies of a major tissue-specific RNA binding protein, Rbfox1, are definitely important for our understanding of functional differences between muscle subtypes, and between muscle and nonmuscle tissues. The broad outlines of Rbfox1 alternative splicing regulation are known, but there is very little specific detail about the important targets in muscle subtypes that might help explain functional differences between subtypes. If more experimental validation can be obtained for regulation of transcript levels by binding 3'UTRs, this would also represent new information. *

      We thank the reviewer for recognizing the significance of our work and our detailed analysis of Rbfox1 phenotypes in different muscle fiber-types. Experimental validation of 3’-UTR binding will be a significant time investment in terms of building and testing in-vivo reporter constructs, assaying NMD and translation effects and performing the CLIP studies necessary for identification of directly-bound 3’-UTR regions, extending beyond the scope of this manuscript and the time allotted for revision. The data we present here represent an important advance in our understanding how Rbfox1 contributes to muscle-type specific differentiation, and form the basis for future experiments to explore the molecular and biochemical mechanisms underlying this regulation. *

      I am reviewing based on my experience studying alternative splicing in vertebrate systems, with an emphasis on Rbfox genes. Therefore I am unable to evaluate the functional data on different subtypes of muscle in Drosophila.

      *

      Reviewer Response References

      Barbas, J. A., Galceran, J., Torroja, L., Prado, A. and Ferrús, A. (1993). Abnormal muscle development in the heldup3 mutant of Drosophila melanogaster is caused by a splicing defect affecting selected troponin I isoforms. Mol Cell Biol 13, 1433–1439.

      Benoit Bouvrette, L. P., Bovaird, S., Blanchette, M. and Lécuyer, E. (2020). oRNAment: a database of putative RNA binding protein target sites in the transcriptomes of model species. Nucleic Acids Research 48, D166–D173.

      Brudno, M., Gelfand, M. S., Spengler, S., Zorn, M., Dubchak, I. and Conboy, J. G. (2001). Computational analysis of candidate intron regulatory elements for tissue-specific alternative pre-mRNA splicing. Nucleic Acids Res 29, 2338–2348.

      Damianov, A., Ying, Y., Lin, C.-H., Lee, J.-A., Tran, D., Vashisht, A. A., Bahrami-Samani, E., Xing, Y., Martin, K. C., Wohlschlegel, J. A., et al. (2016). Rbfox Proteins Regulate Splicing as Part of a Large Multiprotein Complex LASR. Cell 165, 606–619.

      Gao, C., Ren, S., Lee, J.-H., Qiu, J., Chapski, D. J., Rau, C. D., Zhou, Y., Abdellatif, M., Nakano, A., Vondriska, T. M., et al. (2016). RBFox1-mediated RNA splicing regulates cardiac hypertrophy and heart failure. J Clin Invest 126, 195–206.

      Nikonova, E., Kao, S.-Y. and Spletter, M. L. (2020). Contributions of alternative splicing to muscle type development and function. Semin. Cell Dev. Biol.

      Nongthomba, U., Cummins, M., Clark, S., Vigoreaux, J. O. and Sparrow, J. C. (2003). Suppression of muscle hypercontraction by mutations in the myosin heavy chain gene of Drosophila melanogaster. Genetics 164, 209–222.

      Schnorrer, F., Schönbauer, C., Langer, C. C. H., Dietzl, G., Novatchkova, M., Schernhuber, K., Fellner, M., Azaryan, A., Radolf, M., Stark, A., et al. (2010). Systematic genetic analysis of muscle morphogenesis and function in Drosophila. Nature 464, 287–291.

      Shukla, J. P., Deshpande, G. and Shashidhara, L. S. (2017). Ataxin 2-binding protein 1 is a context-specific positive regulator of Notch signaling during neurogenesis in Drosophila melanogaster. Development 144, 905–915.

      Usha, N. and Shashidhara, L. S. (2010). Interaction between Ataxin-2 Binding Protein 1 and Cubitus-interruptus during wing development in Drosophila. Dev Biol 341, 389–399.

      Wei, C., Xiao, R., Chen, L., Cui, H., Zhou, Y., Xue, Y., Hu, J., Zhou, B., Tsutsui, T., Qiu, J., et al. (2016). RBFox2 Binds Nascent RNA to Globally Regulate Polycomb Complex 2 Targeting in Mammalian Genomes. Mol Cell 62, 875–889.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      SUMMARY

      This manuscript characterizes the role of splicing factor Rbfox1 in Drosophila muscle and explores its ability to modulate expression of genes important for fibrillar and tubular muscle development. The authors hypothesize that Rbfox1 binds directly to 5'-UTR and 3'-UTR regions to regulate transcript levels, and to intronic regions to promote or inhibit alternative splicing events. Because some of the regulated genes encode transcriptional activators and other splicing factors such as Bru1, the effects of Rbfox1 may encompass a complex regulatory network that fine-tunes transcript levels and alternative splicing patterns that shape developing muscle. Most likely the authors' hypothesis is correct that Rbfox1 is critical for muscle development in Drosophila, but overall the interesting ideas presented here are too often based only on correlations without further experimental validation.

      MAJOR COMMENTS

      The hypothesis that Rbfox1 plays an important role in regulating muscle development is based on previous studies in other species and supported by much new data in this manuscript. Initial bioinformatic analysis showed that many Drosophila genes, including 20% of all RNA-binding proteins, 40% of transcription factors, etc. have the motifs in introns or UTR regions. However, I think a deeper analysis is required. Any hexamer might be present about once every 4kb, and we do not expect all UGCAUG motifs are necessarily functional, so one might ask whether the association of Rbfox motifs with muscle development genes is statistically significant? Are the motifs conserved in other Drosophila species, which might support a functional role in muscle? Are the intronic motifs located as expected for regulatory effects, that is, proximal to alternative exons that exhibit changes in splicing when Rbfox1 expression is decreased or increased? Is it possible to knock out an Rbfox motif and show that splicing of the alternative exon is altered, or regulation of transcript levels is abrogated?
 Also, what was the background set of genes used for the GO enrichment analysis? Genes expressed in muscle or all genes?

      1. The data on cross regulation between Rbfox1 and Bru1 are confusing and inconsistent, since mild knockdown and stronger knockdown of Rbfox1 seem to have different effects on Bru1 expression. Both Rbfox1 and Bru1 gene have many Rbfox motifs, but they are both large genes (>100kb) and would be expected to have many copies of all hexamers. How do we know whether any of them are functional? New data suggest that Rbfox1 can positively regulate Bru1 protein levels (Fig.5), but this seems inconsistent with the lab's earlier studies indicating opposite temporal mRNA expression profiles for Rbfox1 and Bru1 across IFM development. 

      2. Figure S4, section I, J: if changes in Bru1-RB isoform expression are correlated with Rbfox1 knockdown, it seems reasonable to test whether the Bru1-RB promoter can drive expression of GFP in an Rbfox1-dependent manner. But if I understand correctly, the assay as described on p. 19 uses the promoter region upstream of Bru1-RA. What is the logic for this experiment? It is not surprising that no effect was observed. The end result is that we have no idea whether Rbfox1 directly regulates bru1-RB. Even if it does, bru-Rb appears to be a minor component of Bru expression in IFM.
      3. In the section "Rbfox1 and Bruno1 co-regulate alternative splice events in IFMs", the data show that splicing of several genes is altered by knockdown or over-expression of Rbfox1 and Bru1. The interesting conclusion is for a complex regulatory dynamic where Rbfox1 and Bru1 co-regulate some alternative splice events and independently regulate other events in a muscle-type specific manner. However, if we are to conclude that these activities are due to direct binding of Rbfox1 and Bru1 to the adjacent introns, we need information about the location of flanking Rbfox and/or Bru1 motifs. Do upstream or downstream binding sites correlate with enhancer or silencer activity, as reported in previous studies of these splicing factors in other species? For wupA, Figure S3 shows an intronic Rbfox site, but exon 4 is not labeled so the reader cannot correlate this information with the diagram in Figure 6U.
      4. The evidence that Rbfox1 directly affects expression of transcription factor Exd seems to be based only a correlation between Rbfox1 knockdown and decreased expression of Exd. The observation that binding of Rbfox1 to the Exd 3'UTR in RIP experiments further weakens the case.
      5. Similarly, there is a correlation of Rbfox1 knockdown with expression of alternative 5'UTRs in the Mef2 gene. However, the changes in UTR expression appear mostly not statistically significant. Do the authors have a model to explain what mechanism might allow Rbfox to regulate expression of alternative 5'UTRs, which would seem to be a transcriptional process?
      6. For Salm, there apparently are no Rbfox motifs in the gene, and there are statistically significant but apparently inconsistent changes in Salm expression when it is knocked down in IFM by Rbfox1-RNAi (Salm increases) vs knockdown by Rbfox1-IR27286 or Rbfox1-IRKK110518 (Salm decreases). These are potentially interesting observations but more data would be needed to make stronger conclusions. How would regulation occur in the absence of Rbfox motifs?


      MINOR COMMENTS

      1. In several figures there is a misalignment of the transcriptional driver information with the phenotype data in the bar graphs above. Please correct the alignments to make interpretation easier.
      2. On p. 14 Brudno et al. is cited as ref for Fox motifs near muscle exons, but this paper only focused on brain-specific exons.
      3. For Mef2, why do exons described as 5'UTR have numbers 17, 20, and 21? One would normally expect these to be exon 1, 2 or 1A, 1B, etc.
      4. Fig 8: "regulation of regulators" seems to imply the Rbfox1 is impacting transcription?? Is there precedence for this type of regulation by Rbfox1?

      5. The data on cross regulation between Rbfox1 and Bru1 are confusing and inconsistent, since mild knockdown and stronger knockdown of Rbfox1 seem to have different effects on Bru1 expression. Both Rbfox1 and Bru1 gene have many Rbfox motifs, but they are both large genes (>100kb) and would be expected to have many copies of all hexamers. How do we know whether any of them are functional? New data suggest that Rbfox1 can positively regulate Bru1 protein levels (Fig.5), but this seems inconsistent with the lab's earlier studies indicating opposite temporal mRNA expression profiles for Rbfox1 and Bru1 across IFM development. 


      6. Figure S4, section I, J: if changes in Bru1-RB isoform expression are correlated with Rbfox1 knockdown, it seems reasonable to test whether the Bru1-RB promoter can drive expression of GFP in an Rbfox1-dependent manner. But if I understand correctly, the assay as described on p. 19 uses the promoter region upstream of Bru1-RA. What is the logic for this experiment? It is not surprising that no effect was observed. The end result is that we have no idea whether Rbfox1 directly regulates bru1-RB. Even if it does, bru-Rb appears to be a minor component of Bru expression in IFM.

      7. In the section "Rbfox1 and Bruno1 co-regulate alternative splice events in IFMs", the data show that splicing of several genes is altered by knockdown or over-expression of Rbfox1 and Bru1. The interesting conclusion is for a complex regulatory dynamic where Rbfox1 and Bru1 co-regulate some alternative splice events and independently regulate other events in a muscle-type specific manner. However, if we are to conclude that these activities are due to direct binding of Rbfox1 and Bru1 to the adjacent introns, we need information about the location of flanking Rbfox and/or Bru1 motifs. Do upstream or downstream binding sites correlate with enhancer or silencer activity, as reported in previous studies of these splicing factors in other species? For wupA, Figure S3 shows an intronic Rbfox site, but exon 4 is not labeled so the reader cannot correlate this information with the diagram in Figure 6U.

      8. The evidence that Rbfox1 directly affects expression of transcription factor Exd seems to be based only a correlation between Rbfox1 knockdown and decreased expression of Exd. The observation that binding of Rbfox1 to the Exd 3'UTR in RIP experiments further weakens the case.

      9. Similarly, there is a correlation of Rbfox1 knockdown with expression of alternative 5'UTRs in the Mef2 gene. However, the changes in UTR expression appear mostly not statistically significant. Do the authors have a model to explain what mechanism might allow Rbfox to regulate expression of alternative 5'UTRs, which would seem to be a transcriptional process?

      10. For Salm, there apparently are no Rbfox motifs in the gene, and there are statistically significant but apparently inconsistent changes in Salm expression when it is knocked down in IFM by Rbfox1-RNAi (Salm increases) vs knockdown by Rbfox1-IR27286 or Rbfox1-IRKK110518 (Salm decreases). These are potentially interesting observations but more data would be needed to make stronger conclusions. How would regulation occur in the absence of Rbfox motifs?


      MINOR COMMENTS

      1. In several figures there is a misalignment of the transcriptional driver information with the phenotype data in the bar graphs above. Please correct the alignments to make interpretation easier.

      2. On p. 14 Brudno et al. is cited as ref for Fox motifs near muscle exons, but this paper only focused on brain-specific exons.

      3. For Mef2, why do exons described as 5'UTR have numbers 17, 20, and 21? One would normally expect these to be exon 1, 2 or 1A, 1B, etc.

      4. Fig 8: "regulation of regulators" seems to imply the Rbfox1 is impacting transcription?? Is there precedence for this type of regulation by Rbfox1?

      Significance

      SIGNIFICANCE

      These studies of a major tissue-specific RNA binding protein, Rbfox1, are definitely important for our understanding of functional differences between muscle subtypes, and between muscle and nonmuscle tissues. The broad outlines of Rbfox1 alternative splicing regulation are known, but there is very little specific detail about the important targets in muscle subtypes that might help explain functional differences between subtypes. If more experimental validation can be obtained for regulation of transcript levels by binding 3'UTRs, this would also represent new information.

      I am reviewing based on my experience studying alternative splicing in vertebrate systems, with an emphasis on Rbfox genes. Therefore I am unable to evaluate the functional data on different subtypes of muscle in Drosophila.

    1. Author Response:

      Reviewer #1 (Public Review):

      [...] While the study is addressing an interesting topic, I also felt this manuscript was limited in novel findings to take away. Certainly the study clearly shows that substitution saturation is achieved at synonymous CpG sites. However, subsequent main analyses do not really show anything new: the depletion of segregating sites in functional versus neutral categories (Fig 2) has been extensively shown in the literature and polymorphism saturation is not a necessary condition for observing this pattern.

      We agree with the reviewer that many of the points raised were appreciated previously and did not mean to convey another impression. Our aim was instead to highlight some unique opportunities provided by being at or very near saturation for mCpG transitions. In that regard, we note that although depletion of variation in functional categories is to be expected at any sample size, the selection strength that this depletion reflects is very different in samples that are far from saturated, where invariant sites span the entire spectrum from neutral to lethal. Consider the depletion per functional category relative to synonymous sites in the adjoining plot in a sample of 100k: ~40% of mCpG LOF sites do not have T mutations. From our Fig. 4 and b, it can be seen that these sites are associated with a much broader range of hs values than sites invariant at 780k, so that information about selection at an individual site is quite limited (indeed, in our p-value formulation, these sites would be assigned p≤0.35, see Fig. 1). Thus, only now can we really start to tease apart weakly deleterious mutations from strongly deleterious or even embryonic lethal mutations. This allows us to identify individual sites that are most likely to underlie pathogenic mutations and functional categories that harbor deleterious variation at the extreme end of the spectrum of possible selection coefficients. More generally, saturation is useful because it allows one to learn about selection with many fewer untested assumptions than previously feasible.

      Similarly, the diminishing returns on sampling new variable sites has been shown in previous studies, for example the first "large" human datasets ca. 2012 (e.g. Fig 2 in Nelson et al. 2012, Science) have similar depictions as Figure 3B although with smaller sample sizes and different approaches (projection vs simulation in this study).

      We agree completely: diminishing returns is expected on first principles from coalescent theory, which is why we cited a classic theory paper when making that point in the previous version of the manuscript. Nonetheless, the degree of saturation is an empirical question, since it depends on the unknown underlying demography of the recent past. In that regard, we note that Nelson et al. predict that at sample sizes of 400K chromosomes in Europeans, approximately 20% of all synonymous sites will be segregating at least one of three possible alleles, when the observed number is 29%. Regardless, not citing Nelson et al. 2012 was a clear oversight on our part, for which we apologize; we now cite it in that context and in mentioning the multiple merger coalescent.

      There are some simulations presented in Fig 4, but this is more of a hypothetical representation of the site-specific DFE under simulation conditions roughly approximating human demography than formal inference on single sites. Again, these all describe the state of the field quite well, but I was disappointed by the lack of a novel finding derived from exploiting the mutation saturation properties at methylated CpG sites.

      As noted above, in our view, the novelty of our results lies in their leveraging saturation in order to identify sites under extremely strong selection and make inferences about selection without the need to rely on strong, untested assumptions.

      However, we note that Fig 4 is not simply a hypothetical representation, in that it shows the inferred DFE for single mCpG sites for a fixed mutation rate and given a plausible demographic model, given data summarized in terms of three ranges of allele frequency (i.e., = 0, between 1 and 10 copies, or above 10 copies). One could estimate a DFE across all sites from those summaries of the data (i.e., from the proportion of mCpG sites in each of the three frequency categories), by weighting the three densities in Fig 4 by those proportions. That is, in fact, what is done in a recent preprint by Dukler et al. (2021, BioRxiv): they infer the DFE from two summaries of the allele frequency spectrum (in bins of sites), the proportion of invariant sites and the proportion of alleles at 1-70 copies, in a sample of 70K chromosomes.

      To illustrate how something similar could be done with Fig. 4 based on individual sites, we obtain an estimate of the DFE for LOF mutations (shown in Panel B and D for two different prior distributions on hs) by weighting the posterior densities in Panel A by the fraction of LOF mutations that are segregating (73% at 780K; 9% at 15K) and invariant (27% and 91% respectively); in panel C, we show the same for a different choice of prior. For the smaller sample size considered, the posterior distribution recapitulates the prior, because there is little information about selection in whether a site is observed to be segregating or invariant, and particularly about strong selection. In the sample of 780K, there is much more information about selection in a site being invariant and therefore, there is a shift towards stronger selection coefficients for LOF mutations regardless of the prior.

      Our goal was to highlight these points rather than infer a DFE using these two summaries, which throw out much of the information in the data (i.e., the allele frequency differences among segregating sites). In that regard, we note that the DFE inference would be improved by using the allele frequency at each of 1.1 million individual mCpG sites in the exome. We outline this next step in the Discussion but believe it is beyond the scope of our paper, as it is a project in itself – in particular it would require careful attention to robustness with regard to both the demographic model (and its impact on multiple hits), biased gene conversion and variability in mutation rates among mCpG sites. We now make these points explicitly in the Outlook.

      Similarly, I felt the authors posed a very important point about limitations of DFE inference methods in the Introduction but ended up not really providing any new insights into this problem. The authors argue (rightly so) that currently available DFE estimates are limited by both the sparsity of polymorphisms and limited flexibility in parametric forms of the DFE. However, the nonsynonymous human DFE estimates in the literature appear to be surprisingly robust to sample size: older estimates (Eyre-Walker et al. 2006 Genetics, Boyko et al. 2008 PLOS Genetics) seem to at least be somewhat consistent with newer estimates (assuming the same mutation rate) from samples that are orders of magnitude larger (Kim et al. 2017 Genetics).

      We are not quite sure what the reviewer has in mind by “somewhat consistent,” as Boyko et al. estimate that 35% of non-synonymous mutations have s>10^-2 while Kim et al. find that proportion to be “0.38–0.84 fold lower” than the Boyko et al. estimate (see, e.g., Fig. 4 in Kim et al., 2017). Moreover, the preprint by Dukler et al. mentioned above, which infers the DFE based on ~70K chromosomes, finds estimates inconsistent with those of Kim et al. (see SOM Table 2 and SOM Figure S5 in Dukler et al., 2021).

      More generally, given that even 70K chromosomes carry little information about much of the distribution of selection coefficients (see our Fig. 4), we expect that studies based on relatively sample sizes will basically recover something close to their prior; therefore, they should agree when they use the same or similar parametric forms for the distribution of selection coefficients and disagree otherwise. The dependence on that choice is nicely illustrated in Kim et al., who consider different choices and then perform inference on the same data set and with the same fixed mutation rate for exomes; depending on their choice anywhere between 5%-28% of non-synonymous changes are inferred to be under strong selection with s>=10^-2 (see their Table S4).

      Whether a DFE inferred under polymorphism saturation conditions with different methods is different, and how it is different, is an issue of broad and immediate relevance to all those conducting population genomic simulations involving purifying selection. The analyses presented as Fig 4A and 4B kind of show this, but they are more a demonstration of what information one might have at 1M+ sample sizes rather than an analysis of whether genome-wide nonsynonymous DFE estimates are accurate. In other words, this manuscript makes it clear that a problem exists, that it is a fundamental and important problem in population genetics, and that with modern datasets we are now poised to start addressing this problem with some types of sites, but all of this is already very well-appreciated except for perhaps the last point.

      At least a crude analysis to directly compare the nonsynonymous genome-wide DFE from smaller samples to the 780K sample would be helpful, but it should be noted that these kinds of analyses could be well beyond the scope of the current manuscript. For example, if methylated nonsynonymous CpG sites are under a different level of constraint than other nonsynonymous sites (Fig. S14) then comparing results to a genome-wide nonsynonymous DFE might not make sense and any new analysis would have to try and infer a DFE independently from synonymous/nonsynonymous methylated CpG sites.

      We are not sure what would be learned from this comparison, given that Figure 4 shows that, at least with an uninformative prior, there is little information about the true DFE in samples, even of tens of thousands of individuals. Thus, if some of the genome-wide nonsynonymous DFE estimates based on small sample sizes turn out to be accurate, it will be because the guess about the parametric shape of the DFE was an inspired one. In our view, that is certainly possible but not likely, given that the shape of the DFE is precisely what the field has been aiming to learn and, we would argue, what we are now finally in a position to do for CpG mutations in humans.

      Reviewer #2 (Public Review):

      This manuscript presents a simple and elegant argument that neutrally evolving CpG sites are now mutationally saturated, with each having a 99% probability of containing variation in modern datasets containing hundreds of thousands of exomes. The authors make a compelling argument that for CpG sites where mutations would create genic stop codons or impair DNA binding, about 20% of such mutations are strongly deleterious (likely impairing fitness by 5% or more). Although it is not especially novel to make such statements about the selective constraint acting on large classes of sites, the more novel aspect of this work is the strong site-by-site prediction it makes that most individual sites without variation in UK Biobank are likely to be under strong selection.

      The authors rightly point out that since 99% of neutrally evolving CpG sites contain variation in the data they are looking at, a CpG site without variation is likely evolving under constraint with a p value significance of 0.01. However, a weakness of their argument is that they do not discuss the associated multiple testing problem-in other words, how likely is it that a given non synonymous CpG site is devoid of variation but actually not under strong selection? Since one of the most novel and useful deliverables of this paper is single-base-pair-resolution predictions about which sites are under selection, such a multiple testing correction would provide important "error bars" for evaluating how likely it is that an individual CpG site is actually constrained, not just the proportion of constrained sites within a particular functional category.

      We thank the reviewer for pointing this out. One way to think about this problem might be in terms of false discovery rates, in which case the FDR would be 16% across all non-synonymous mCpG sites that are invariant in current samples, and ~4% for the subset of those sites where mutations lead to loss-of-function of genes.

      Another way to address this issue, which we had included but not emphasized previously, is by examining how one’s beliefs about selection should be updated after observing a site to be invariant (i.e., using Bayes odds). At current sample sizes and assuming our uninformative prior, for a non-synonymous mCpG site that does not have a C>T mutation, the Bayes odds are 15:1 in favor of hs>0.5x10^-3; thus the chance that such a site is not under strong selection is 1/16, given our prior and demographic model. These two approaches (FDR and Bayes odds) are based on somewhat distinct assumptions.

      We have now added and/or emphasized these two points in the main text.

      The paper provides a comparison of their functional predictions to CADD scores, an older machine-learning-based attempt at identifying site by site constraint at single base pair resolution. While this section is useful and informative, I would have liked to see a discussion of the degree to which the comparison might be circular due to CADD's reliance on information about which sites are and are not variable. I had trouble assessing this for myself given that CADD appears to have used genetic variation data available a few years ago, but obviously did not use the biobank scale datasets that were not available when that work was published.

      We apologize for the lack of clarity in the presentation. We meant to emphasize that de novo mutation rates vary across CADD deciles when considering all CpG sites (Fig. 2-figure supplement 5c), which confounds CADD precisely because it is based in part on which sites are variable. We have edited the manuscript to clarify this.

      Reading this paper left me excited about the possibility of examining individual invariant CpG sites and deducing how many of them are already associated with known disease phenotypes. I believe the paper does not mention how many of these invariant sites appear in Clinvar or in databases of patients with known developmental disorders, and I wondered how close to saturation disease gene databases might be given that individuals with developmental disorders are much more likely to have their exomes sequenced compared to healthy individuals. One could imagine some such analyses being relatively low hanging fruit that could strengthen the current paper, but the authors also make several reference to a companion paper in preparation that deals more directly with the problem of assessing clinical variant significance. This is a reasonable strategy, but it does give the discussion section of the paper somewhat of a "to be continued" feel.

      We apologize for the confusion that arose from our references to a second manuscript in prep. The companion paper is not a continuation of the current manuscript: it contains an analysis of fitness and pathogenic effects of loss-of-function variation in human exomes.

      Following the reviewer’s suggestion to address the clinical significance of our results, we have now examined the relationship of mCpG sites invariant in current samples with Clinvar variants. We find that of the approximately 59,000 non-synonymous mCpG sites that are invariant, only ~3.6% overlap with C>T variants associated with at least one disease and classified as likely pathogenic in Clinvar (~5.8% if we include those classified as uncertain or with conflicting evidence as pathogenic). Approximately 2% of invariant mCpGs have C>T mutations in what is, to our knowledge, the largest collection of de novo variants ascertained in ~35,000 individuals with developmental disorders (DDD, Kaplanis et al. 2020). At the level of genes, of the 10k genes that have at least one invariant non-synonymous mCpG, only 8% (11% including uncertain variants) have any non-synonymous hits in Clinvar, and ~8% in DDD. We think it highly unlikely that the large number of remaining invariant sites are not seen with mutations in these databases because such mutations are lethal; rather it seems to us to be the case that these disease databases are far from saturation as they contain variants from a relatively small number of individuals, are subject to various ascertainment biases both at the variant level and at the individual level, and only contain data for a small subset of existing severe diseases.

      With a view to assessing clinical relevance however, we can ask a related question, namely how informative being invariant in a sample of 780k is about pathogenicity in Clinvar. Although the relationship between selection and pathogenicity is far from straightforward, being an invariant non-synonymous mCpG in current samples not only substantially increases (15-10fold) the odds of hs > 0.5x10-3 (see Fig. 4b), it also increases the odds of being classified as pathogenic vs. benign in Clinvar 8-51 fold. In the DDD sample, we don’t know which variants are pathogenic; however, if we consider non-synonymous mutations that occur in consensus DDD genes as pathogenic (a standard diagnostic criterion), being invariant increases the odds of being classified as pathogenic 6-fold. We caution that both Clinvar classifications and the identification of consensus genes in DDD relies in part on whether a site is segregating in datasets like ExAC, so this exercise is somewhat circular. Nonetheless it illustrates that there is some information about clinical importance in mCpG sites that are invariant in current samples, and that the degree of enrichment (6 to 51-fold) is very roughly on par with the Bayes odds that we estimate of strong selection conditional on a site being invariant. We have added these findings to the main text and added the plot as Supplementary Figure 13.

      Reviewer #3 (Public Review):

      [...] The authors emphasize several times how important an accurate demographic model is. While we may be close to a solid demographic model for humans, this is certainly not the case for many other organisms. Yet we are not far off from sufficient sample sizes in a number of species to begin to reach saturation. I found myself wondering how different the results/inference would be under a different model of human demographic history. Though likely the results would be supplemental, it would be nice in the main text to be able to say something about whether results are qualitatively different under a somewhat different published model.

      We had previously examined the effect of a few demographic scenarios with large increases in population size towards the present on the average length of the genealogy of a sample (and hence the expected number of mutations at a site) in Figure 3-figure supplement 1b, but without quantifying the effect on our selection inference. Following this suggestion, we now consider a widely used model of human demography inferred from a relatively small sample, and therefore not powered to detect the huge increase in population size towards the present (Tennessen et al. 2012). Using this model, we find a poor fit to the proportion of segregating CpG sites (the observed fraction is 99% in 780k exomes, when the model predicts 49%). Also, as expected, inferences about selection depend on the accuracy of the demographic model (as can be seen by comparing panel B to Fig 4B in the main text).

      On a similar note, while a fixed hs simplifies much of the analysis, I wondered how results would differ for 1) completely recessive mutations and 2) under a distribution of dominance coefficients, especially one in which the most deleterious alleles were more recessive. Again, though I think it would strengthen the manuscript by no means do I feel this is a necessary addition, though some discussion of variation in dominance would be an easy and helpful add.

      There's some discussion of population structure, but I also found myself wondering about GxE. That is, another reason a variant might be segregating is that it's conditionally neutral in some populations and only deleterious in a subset. I think no analysis to be done here, but perhaps some discussion?

      We agree that our analysis ignores the possibilities of complete recessivity in fitness (h=0) as well as more complicated selection scenarios, such as spatially-varying selection (of the type that might be induced by GxE). We note however that so long as there are any fitness effects in heterozygotes, the allele dynamics will be primarily governed by hs; one might also imagine that under some conditions, the mean selection effect across environments would predict allele dynamics reasonably well even in the presence of GxE. Also worth exploring in our view is the standard assumption that hs remains fixed even as Ne changes dramatically. We now mention these points in the Outlook.

      Maybe I missed it, but I don't think the acronym DNM is explained anywhere. While it was fairly self-explanatory, I did have a moment of wondering whether it was methylation or mutation and can't hurt to be explicit.

      We apologize for the oversight and have updated the text accordingly.

    1. I’m not thinking the way I used to think. I can feel it most strongly when I’m reading. Immersing myself in a book or a lengthy article used to be easy.

      I thought this sentence was very interesting because it is how I feel also. The Internet is here to make things easier for us, we don't have to remember anything or to thing as much as we used to. But loosing the habit to read and thing by ourselves may also by synonyme of not being as critical as we were.

    1. SciScore for 10.1101/2021.11.30.21266810: (What is this?)

      Please note, not all rigor criteria are appropriate for all manuscripts.

      Table 1: Rigor

      <table><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Ethics</td><td style="min-width:100px;border-bottom:1px solid lightgray">IRB: Trials had to be approved by the institutional review boards, and competent authorities of the countries involved, and all patients gave written informed consent.<br>Consent: Trials had to be approved by the institutional review boards, and competent authorities of the countries involved, and all patients gave written informed consent.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Sex as a biological variable</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Randomization</td><td style="min-width:100px;border-bottom:1px solid lightgray">Study patients and selection criteria: Although the exact inclusion and exclusion criteria could vary across the trials, all the subjects had to fulfill the following criteria; 1) Participant of a trial that joined the COMPILEhome consortium, 2) Confirmed COVID-19 diagnosis by a diagnostic PCR or antigen test, 3) Neither hospitalized nor at the emergency room department of a hospital before or at the time of randomization, 4) Symptomatic with illness onset ≤7 days at the time of screening for the study, and 5) Age 50 or older.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Blinding</td><td style="min-width:100px;border-bottom:1px solid lightgray">Overview of Study Design and research partners: Beginning in November 2020, we systematically searched for RCTs recruiting outpatients that compared treatment with CP with a blinded or unblinded control arm in the European (https://www.clinicaltrialsregister.eu/) and American (www.clinicaltrials.gov) trial register.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Power Analysis</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr></table>

      Table 2: Resources

      <table><tr><th style="min-width:100px;text-align:center; padding-top:4px;" colspan="2">Antibodies</th></tr><tr><td style="min-width:100px;text=align:center">Sentences</td><td style="min-width:100px;text-align:center">Resources</td></tr><tr><td style="min-width:100px;vertical-align:top;border-bottom:1px solid lightgray">Pre-planned subgroup analyses assessed the efficacy of the 2 primary outcomes in the following subgroups: 1) days since disease onset (1-5 or >5days), 2) level of neutralizing antibody anti-SARS-CoV-2 titers in transfused plasma and 3) Negative serum anti-SARS-CoV-2 IgG status (Trimeric Spike antibody test, Liaison, Diasorin, Saluggia, Italy).</td><td style="min-width:100px;border-bottom:1px solid lightgray"><div style="margin-bottom:8px"><div>anti-SARS-CoV-2</div><div>suggested: None</div></div><div style="margin-bottom:8px"><div>anti-SARS-CoV-2 IgG</div><div>suggested: None</div></div><div style="margin-bottom:8px"><div>Diasorin, Saluggia, Italy).</div><div>suggested: None</div></div></td></tr><tr><th style="min-width:100px;text-align:center; padding-top:4px;" colspan="2">Software and Algorithms</th></tr><tr><td style="min-width:100px;text=align:center">Sentences</td><td style="min-width:100px;text-align:center">Resources</td></tr><tr><td style="min-width:100px;vertical-align:top;border-bottom:1px solid lightgray">Overview of Study Design and research partners: Beginning in November 2020, we systematically searched for RCTs recruiting outpatients that compared treatment with CP with a blinded or unblinded control arm in the European (https://www.clinicaltrialsregister.eu/) and American (www.clinicaltrials.gov) trial register.</td><td style="min-width:100px;border-bottom:1px solid lightgray"><div style="margin-bottom:8px"><div>https://www.clinicaltrialsregister.eu/</div><div>suggested: (EU Clinical Trials Register, RRID:SCR_005956)</div></div></td></tr></table>

      Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


      Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
      Several limitations should be mentioned. Although we only included patients aged ≥50, and most of them also had comorbidities, the hospital admission rate was relatively low at 9.3%. Therefore, the study was not powered to exclude a small overall treatment effect. However, administering CP to infectious and symptomatic outpatients is complex and labor-intensive. Hence, we think that small CCP’s clinical role is significantly diminished if unable to establish something greater than “a small effect” because it ceases to be practical. As vaccination uptake progressed in patients aged 50 or older and monoclonal antibody-based therapy with proven effectiveness in high-risk outpatients became available, the recruitment dropped dramatically as of June 2021. This resulted in the recommendation by the individual and COMPILEhome DSMBs that further enrollment was unlikely to change the results, and both studies were discontinued. Regarding the advent of the SARS-CoV-2 variants that may be less susceptible to antibodies induced by the original SARS-CoV-2 virus or the alpha variant, it is reassuring that >95% of the patients in both countries were included at a time when the delta variant was still rare (<5%) (Appendix Figure 3 and 4). The last limitation of our study (and all studies on CP for COVID-19 so far) is the lack of a proper phase 2 dose-finding study. In a recent study, we administered 600 mL of CP to 25 SARS-CoV-2 antibody-negative B-cell depleted patients diagnosed with COVID...

      Results from TrialIdentifier: We found the following clinical trial numbers in your paper:<br><table><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Identifier</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Status</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Title</td></tr><tr><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">NCT04621123</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Completed</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Plasma for Early Treatment in Non-hospitalised Mild or Moder…</td></tr><tr><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">NCT04589949</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Recruiting</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Early Convalescent Plasma Therapy for High-risk Patients Wit…</td></tr></table>


      Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


      Results from JetFighter: We did not find any issues relating to colormaps.


      Results from rtransparent:
      • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
      • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
      • No protocol registration statement was detected.

      Results from scite Reference Check: We found no unreliable references.


      <footer>

      About SciScore

      SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.

      </footer>

    1. Ineliminable Inscrutability Scrutinized and Eliminated

      Brandom rejects two possible theories of normativity (rule-following).

      Regularism:

      If a given performance conforms to some pre-existing pattern of performances, then we call that performance correct or competent. If it doesn’t so conform, then we call it incorrect or incompetent

      Brandom's objection: Regularists can't distinguish between what happens and what ought to happen. We don't say "gravity ought to work", so a Regularist must somehow explain why the Law of Gravity is not normative, while the Law of US is a normative.

      Everything in nature ‘follows’ the ‘rules of nature,’ the regularities isolated by the natural sciences. So what does the normativity that distinguishes human rule-following consist in?

      Regulism: rules are certain declarative sentences like "No smoking.", and rule-following is behavior that is described by the rules.

      Brandom's objection: rules can't be made entirely explicit. There must always some unsaid rule to avoid an infinite regress, like "the rule about following rules" and "the rule about the rule about following rules" etc.

      Wittgenstein said this about how the infinite regress is cut off by unspoken rules that are followed in practice.

      If I have exhausted the justifications I have reached bedrock, and my spade is turned. Then I am inclined to say: “This is simply what I do.”

      Thus, there is a necessary implicitness, or "blindness", in rule-following behavior. At some level, we simply follow rules without understanding.

      But then, a challenge! How is "implicit norm" even possible?

      How can a performance be nothing but a ‘blind’ reaction to a situation, not an attempt to act on interpretation?

      Unconscious rule-following is automatic, therefore not normative, much like a sneeze, or falling in gravity is not normative.

      Or is it? Perhaps we are forced to conclude that fundamentally, norms are based on mechanical, thoughtless behaviors. In this way, we can naturalize norms in a norm-less theory (such as neuroscience).

      Brandom refused this, and insists that we must then admit "nonconscious norms". He even proposes a kind of non-natural metaphysics, where non-natural normativity is baked into the metaphysics.

      But Bakker has a better idea: explain norms in a norm-less scientific theory

      The history of the social sciences is a history of emancipation from the intellectual propensity to intentionalize social phenomenon—this was very much part of the process that Weber called the disenchantment of the world. Brandom proposes to re-enchant the world by re-instating the belief in normative powers, which is to say, powers in some sense outside of and distinct from the forces known to science.

      Bakker's Blind Brain Theory

      Now Bakker begins his own philosophy, using Blind Brain Theory.

      Note how important is implicit/blindness in Wittgenstein's and Brandom's explanations of how norms work. But they never paused to consider it deeper than a simple "Such implicitness means implicit normativity exists." They then went on to consider normativity without studying further just what are implicit, and how they are implicit.

      This is a grave error. To explain normativity, we must study what are implicit and how they are implicit in the brain when people think normative thoughts and do normative actions. We must study the neglect structure of the brain, and that brings us to Blind Brain Theory.

      According to BBT, all cognition is heuristic and depends critically on the environment to play nice (that is, remain stable). Heuristic algorithms can skip many steps and come out right, as long as the environment rarely challenges it with difficult examples that exposes the error of the heuristic.

      Normative cognition is also heuristic -- what features of the human environment does it depend on?

      Wittgenstein again

      If I have exhausted the justifications I have reached bedrock, and my spade is turned. Then I am inclined to say: “This is simply what I do.”

      The "bedrock" is the stable normative behaviors of other humans I live with. In other words, Regularism is actually the right approach to explaining normativity.

      Brandom was wrong to reject Regularism, but to see why he was wrong, we must do some psycho-philosophy. We must understand why the human animal is psychologically prone to reject Regularism (just like how it is psychologically prone to think souls exist). It is, again, because of BBT.

      We think "This rule is normative." when some normativity-detection cognitive module is triggered. If the module keeps quiet, and we have the distinct feeling of "Wait, that's not normative...", no matter how much information processing the other modules do. And it just so happens that thinking about causes and statistical correlations cannot trigger this module.

      There are roughly two types of explanations: causal/natural and normative/supernatural. Causal/natural explanations are those step-by-step explanations that intrinsically allows you to break it down further ("how does this step work?"), push it forwards and backwards in time ("and what happened before/after?"). Normative/supernatural explanations are those brute assertions about what to do and not to do ("This is simply what I do."), accompanied with an anosognosia, a blindness to the blindness, a feeling that the assertions are sufficient with no further explanations possible ("What do you mean I must explain why it is what I do? I have explained myself sufficiently. There is nothing left to explain!")

      Since Regularism involves solving normative cognition using the resources of natural cognition, it simply follows that it fails to engage resources specific to normative cognition.

      Bakker is in no danger of self-contradiction, because the problem "how does normative cognition work?" is perfectly possible to be the kind of problem that causal cognition can solve. Sure, causal cognition can't solve all problems, but it can solve some... like "how to build a plane?" and "how the brain works?" Science works, and that shows the power of causal cognition. In contrast, nothing sophisticated like science has been built upon normative cognition. This shows that causal cognition can solve normative cognition, while normative cognition can't.

      What doesn’t follow is that normative cognition thus lies outside the problem ecology of natural cognition, let alone inside the problem ecology of normative cognition.

      In short, Brandom failed because he tried to solve normativity with normative cognition. Bakker may succeed, because he is trying to solve normativity with causal cognition. The feeling that "normativity can't be solved causally" misguided Brandom, and it is just an illusion generated by the fractured nature of cognition, described above.

      normative cognition seems unlikely to theoretically solve normative cognition in any satisfying manner. The very theoretical problems that plague Normativism—supernaturalism, underdetermination, and practical inapplicability—are the very problems we should expect if normative cognition were not in fact among the problems that normative cognition can solve.

      Here is Bakker's explanation of normative cognition, and how it leads to Brandom's mistake:

      normative cognition belongs to social cognition more generally, and that... has evolved to solve astronomically complicated biomechanical problems involving the prediction, understanding, and manipulation of other organisms absent detailed biomechanical information. Adapted to solve in the absence of this information, it stands to reason that the provision of that information, facts regarding biomechanical regularities, will render it ineffective...

      ... intentional cognition has evolved to overcome neglect, to solve problems in the absence of causal information. This is why philosophical reflection convinces us we somehow stand outside the causal order via choice or reason or what have you. We quite simply confuse an incapacity, our inability to intuit our biomechanicity, with a special capacity, our ability to somehow transcend or outrun the natural order.

    1. Author Response:

      Reviewer #1 (Public Review):

      The introduction felt a bit short. I was hoping early on I think for a hint at what biotic and abiotic factors UV could be important for and how this might be important for adaptation. A bit more on previous work on the genetics of UV pigmentation could be added too. I think a bit more on sunflowers more generally (what petiolaris is, where natural pops are distributed, etc.) would be helpful. This seems more relevant than its status as an emoji, for example.

      We had opted to provide some of the relevant background in the corresponding sections of the manuscript, but agree that it would be beneficial to expand the introduction. In the revised version of the manuscript, we have modified the introduction and the first section of Results and Discussion to include more information about wild sunflowers, possible adaptive functions of floral UV patterns, and previous work on the genetic basis of floral UV patterning. More generally, we have strived to provide more background information throughout the manuscript.

      The authors present the % of Vp explained by the Chr15 SNP. Perhaps I missed it, but it might be nice to also present the narrow sense heritability and how much of Va is explained.

      Narrow sense heritability for LUVp is extremely high in our H. annuus GWAS population; four different software [EMMAX (Kang et al., Nat Genet 2010), GEMMA (Zhou and Stephens, Nat Genet. 2012), GCTA (Yang et al., Am J Hum Genet 2011) and BOLT_LMM (Loh et al., Nat Genet 2015)] provided h2 estimates of ~1. While it is possible that these estimates are somewhat inflated by the presence of a single locus of extremely large effect, all individuals in this populations were grown at the same time under the same conditions, and limited environmental effects would therefore be expected. The percentage of additive variance explained by HaMYB111 appears therefore to be equal to the percentage of phenotypic variance (~62%).

      We have included details in the Methods section – Genome-wide association mapping, and added this information to the relevant section of the main text:

      “The chromosome 15 SNP with the strongest association with ligule UV pigmentation patterns in H. annuus (henceforth “Chr15_LUVp SNP”) explained 62% of the observed phenotypic and additive variation (narrow-sense heritability for LUVp in this dataset is ~1).”

      A few lines of discussion about why the Chr15 allele might be observed at only low frequencies in petiolaris I think would be of interest - the authors appear to argue that the same abiotic factors may be at play in petiolaris, so why don't we see this allele at frequencies higher than 2%? Is it recent? Geographically localized?

      That is a very interesting observation, and we currently do not have enough data to provide a definitive answer to why that is. From GWAS, HaMYB111 does not seem to play a measurable role in controlling variation for LUVp in H. petiolaris; Even when we repeat the GWAS with MAF > 1%, so that the Chr15_LUVp SNP would be included in the analysis, there is no significant association between that SNP and LUVp (the significant association on chr. 15 seen in the Manhattan plot for H. petiolaris is ~20 Mbp downstream of HaMYB111). The rarity of the L allele in H. petiolaris could complicate detection of a GWAS signal; on the other hand, the few H. petiolaris individuals carrying the L allele have, on average, only marginally larger LUVp than the rest of the population (LL = 0.32 allele).

      The two most likely explanations for the low frequencies of the L allele in H. petiolaris are differences in alleles, or their effect, between H. annuus and H. petiolaris; or, as suggested by the reviewer, a recent introgression. In H. annuus, the Chr15_LUVp SNP is likely not the actual causal polymorphism affecting HaMYB111 activity, but is only in LD with it (or them); this association might be absent in H. petiolaris alleles. An alternative possibility is that downstream differences in the genetic network regulating flavonol glycosides biosynthesis mask the effect of different HaMYB111 alleles.

      H. annuus and H. petiolaris hybridize frequently across their range, so this could be a recent introgression that has not established itself; alternatively, physiological differences in H. petiolaris could make the L allele less advantageous, so the introgressed allele is simply being maintained by drift (or recurring hybridization). Further analysis of genetic and functional diversity at HaMYB111 in H. petiolaris will be required to differentiate between these possibilities.

      We have added a few sentences highlighting some of these possible explanations at the end the main text of the manuscript, which now reads:

      “Despite a more limited range of variation for LUVp, a similar trend (larger UV patterns in drier, colder environments) is present also in H. petiolaris (Figure 4 – figure supplement 4). Interestingly, while the L allele at Chr_15 LUVp SNP is present in H. petiolaris (Figure 1 – figure supplement 2), it is found only at a very low frequency, and does not seem to significantly affect floral UV patterns in this species (Figure 2a). This could represent a recent introgression, since H. annuus and H. petiolaris are known to hybridize in nature (Heiser, 1947, Yatabe et al., 2007). Alternatively, the Chr_15 LUVp SNP might not be associated with functional differences in HaMYB111 in H. petiolaris, or differences in genetic networks or physiology between H. annuus and H. petiolaris could mask the effect of this allele, or limit its adaptive advantage, in the latter species.“

      Page 14: It's unclear to me why there is any need to discretize the LUVp values for the analyses presented here. Seems like it makes sense to either 1) analyze by genotype of plant at the Chr15 SNP, if known, or 2) treat it as a continuous variable and analyze accordingly.

      We designed our experiment to be a comparison between three well-defined phenotypic classes, to reduce the experimental noise inherent to pollinator visitation trials. As a consequence, intermediate phenotypic classes (0.3 < LUVp < 0.5 and 0.8 < LUVp < 0.95) are not represented in the experiment, and therefore we believe that analyzing LUVp as a continuous variable would be less appropriate in this case. In the revised manuscript, we have provided a modified Figure 4 – figure supplement 1 in which individual data points are show (colour-coded by pollinator type), as well as a fitted lines showing the general trend across the data.

      The individuals in pollinator visitation experiments were not genotyped for the Chr15_LUVp SNP; while having that information might provide a more direct link between HaMYB111 and pollinator visitation rates, our main interest in this experiment was to test the possible adaptive effects of variation in floral UV pigmentation.

      Page 14: I'm not sure you can infer selection from the % of plants grown in the experiment unless the experiment was a true random sample from a larger metapopulation that is homogenous for pollinator preference. In addition, I thought one of the Ashman papers had actually argued for intermediate level UV abundance in the presence of UV?

      We have removed mentions of selection from the sentence - while the 110 populations included in our 2019 common garden experiment were selected to represent the whole range of H. annuus, we agree that the pattern we observe is at best suggestive. We have, however, kept a modified version of the sentence in the revised version of the manuscript, since we believe that is an interesting observation. The sentence now reads:

      “Pollination rates are known to be yield-limiting in sunflower (Greenleaf and Kremen, 2006), and a strong reduction in pollination could therefore have a negative effect on fitness; consistent with this plants with very small LUVp values were rare (~1.5% of individuals) in our common garden experiment, which was designed to provide a balanced representation of the natural range of H. annuus.”. (new lines 373-378)

      It is correct that Koski et al., Nature Plants 2015 found intermediate UV patterns to increase pollen viability in excised flowers of Argentina anserina exposed to artificial UV radiation. However, the authors also remark that larger UV patterns would probably be favoured in natural environments, in which UV radiation would be more than two times higher than in their experimental setting. Additionally, when using artificial flowers, they found that pollen viability increased linearly with the size of floral UV pattern.

      More generally, as we discuss later on in the manuscript, the pollen protection mechanism proposed in Koski et al., Nature Plants 2015 is unlikely to be as important in sunflower inflorescences, which are much flatter than the bowl- shaped flowers of A. anserina; consistent with this, and contrary to what was observed for A. anserina, we found no correlation between UV radiation and floral UV patterns in wild sunflowers (Figure 4c).

      I would reduce or remove the text around L316-321. If there's good a priori reason to believe flower heat isn't a big deal (L. 323) and the experimental data back that up, why add 5 lines talking up the hypothesis?

      We had fairly strong reasons to believe temperature might play an important role in floral UV pattern diversity: a link between flower temperature and UV patterns has been proposed before (Koski et al., Current Biol 2020); a very strong correlation exists between temperature and LUVp in our dataset; and, perhaps more importantly, inflorescence temperature is known to have a major effect on pollinator attraction (Atamian et al., Science 2016; Creux et al., New Phytol 2021). While it is known that UV radiation is not particularly energetic, we didn’t mean line 323 to imply that we were sure a priori that there wouldn’t be any effect of UV patterns of inflorescence temperature.

      In the revised manuscript, we have re-organized that section and provided the information reported in line 323 (UV radiation accounts for only 3-7% of the total radiation at earth level) before the experimental results, to clarify what our thought process was in designing those experiments. The paragraph now reads:

      “By absorbing more radiation, larger UV bullseyes could therefore contribute to increasing temperature of the sunflower inflorescences, and their attractiveness to pollinators, in cold climates. However, UV wavelengths represents only a small fraction (3-7%) of the solar radiation reaching the Earth surface (compared to >50% for visible wavelengths), and might therefore not provide sufficient energy to significantly warm up the ligules (Nunez et al., 1994). In line with this observation, different levels of UV pigmentation had no effect on the temperature of inflorescences or individual ligules exposed to sunlight (Figure 4e-g; Figure 4 – figure supplement 3).”

      Page 17: The discussion of flower size is interesting. Is there any phenotypic or genetic correlation between LUVP and flower size?

      This is a really interesting question! There is no obvious genetic correlation between LUVp and flower size – in GWAS, HaMYB111 is not associated to any of the floral characteristics we measured (flowerhead diameter; disk diameter; ligule length; ligule width; relative ligule size; see Todesco et al., Nature 2020). There is also no significant association between ligule length and LUVp (R^2 = 0.0024, P = 0.1282), and only a very weak positive association between inflorescence size and LUVp (R^2 = 0.0243, P = 0.00013; see attached figure). There is, however, a stronger positive correlation between LUVp and disk size (the disk being the central part of the sunflower inflorescence, composed of the fertile florets; R^2 = 0.1478. P = 2.78 × 10-21), and as a consequence a negative correlation between LUVp and relative ligule size (that is, the length of the ligule relative to the diameter of the whole inflorescence; R^2 = 0.1216, P = 1.46 × 10-17). This means that, given an inflorescence of the same size, plants with large LUVp values will tend to have smaller ligules and larger discs. Since the disk of sunflower inflorescences is uniformly UV- absorbing, this would further increase the size of UV-absorbing region in these inflorescences.

      While it is tempting to speculate that this might be connected with regulation of transpiration (meaning that plants with larger LUVp further reduce transpiration from ligules by having smaller ligules - relative ligule size is also positively correlated with summer humidity; R^2 = 0.2536, P = 2.86 × 10_-5), there are many other fitness-related factors that could determine inflorescence size, and disk size in particular (seed size, florets/seed number...). Additionally, in common garden experiments, flowerhead size (and plant size in general) is affected by flowering time, which is also one of the reason why we use LUVp to measure floral UV patterns instead of absolute measurements of bullseye size; in a previous work from our group in Helianthus argophyllus, size measurements for inflorescence and UV bullseye mapped to the same locus as flowering time, while genetic regulation of LUVp was independent of flowering time (Moyers et al., Ann Bot 2017). Flowering time in H. annuus is known to be strongly affected by photoperiod (Blackman et al., Mol Ecol 2011), meaning that the flowering time we measured in Vancouver might not reflect the exact flowering time in the populations of origin of those plants – with consequences on inflorescence size.

      In summary, there is an interesting pattern of concordance between floral UV pattern and some aspects of inflorescence morphology, but we think it would be premature to draw any inference from them. Measurements of inflorescence parameters in natural populations would be much more informative in this respect.

      Reviewer #2 (Public Review):

      The genetic analysis is rigorously conducted with multiple Helianthus species and accessions of H. annuus. The same QTL was inputed in two Helianthus species, and fine mapped to promotor regions of HaMyb111.

      While there is a significant association at the beginning of chr. 15 in the GWAS for H. petiolaris petiolaris, we should clarify that that peak is unfortunately ~20 Mbp away from HaMYB111. While it is not impossible that the difference is due to reference biases in mapping H. petiolaris reads to the cultivated H. annuus genome, the most conservative explanation is that those two QTL are unrelated. We have clarified this in the legend to Fig. 2 in the revised manuscript.

      The allelic variation of the TF was carefully mapped in many populations and accessions. Flavonol glycosides were found to correlate spatially and developmentally in ligules and correlate with Myb111 transcript abundances, and a downstream flavonoid biosynthetic gene. Heterologous expression in Arabidopsis in Atmyb12 mutants, showed that HaMyb111 to be able to regulate flavonol glycoside accumulations, albeit with different molecules than those that accumulate in Helianthus. Several lines of evidence are consistent with transcriptional regulation of myb111 accounting for the variation in bullseye size.

      Functional analysis examined three possible functional roles, in pollinator attraction, thermal regulation of flowers, and water loss in excised flowers (ligules?), providing support for the first and last, but not the second possible functions, confirming the results of previous studies on the pollinator attraction and water loss functions for flavonol glycosides. The thermal imaging work of dawn exposed flower heads provided an elegant falsification of the temperature regulation hypothesis. Biogeographic clines in bullseye size correlated with temperature and humidity clines, providing a confirmation of the hypothesis posed by Koski and Ashmann about the patterns being consistent with Gloger's rule, and historical trends from herbaria collections over climate change and ozone depletion scenarios. The work hence represents a major advance from Moyers et al. 2017's genetic analysis of bullseyes in sunflowers, and confirms the role established in Petunia for this Myb TF for flavonoid glycoside accumulations, in a new tissue, the ligule.

      Thank you. We have specified in the legend of Fig. 4i of the revised manuscript that desiccation was measured in individual detached ligules, and added further details about the experiment in the Methods section.

      While there is a correlation between pigmentation and temperature/humidity in our dataset, it goes in the opposite direction to what would be expected under Gloger’s rule – that is, we see stronger pigmentation in drier/colder environments, contrary to what is generally observed in animals. This is also contrary to what observed in Koski and Ashman, Nature Plants 2015, where the authors found that floral UV pigmentation increased at lower latitudes and higher levels of UV radiation. While possibly rarer, such “anti-Gloger” patterns have been observed in plants before (Lev-Yadun, Plant Signal Behav 2016).

      Weakness: The authors were not able to confirm their inferences about myb111 function through direct manipulations of the locus in sunflower.

      That is unfortunately correct. Reliable and efficient transformation of cultivated sunflower (much less of wild sunflower species) has eluded the sunflower community (including our laboratories) so far – see for example discussion on the topic in Lewi et al. Agrobacterium protocols 2016, and Sujatha et al. PCTOC 2012. We had therefore to rely on heterologous complementation in Arabidopsis; while this approach has limitations, we believe that its results, given also the similarity in expression patterns between HaMYB111 and AtMYB111, and in combination with the other experiments reported in our manuscript, make a convincing case that HaMYB111 regulates flavonol glycosides accumulation in sunflower ligules.

      Given that that the flavonol glycosides that accumulate in Helianthus are different from those regulated when the gene is heterologously expressed in Arabidopsis, the biochemical function of Hamyb111, while quite reasonable, is not completely watertight. The flavonol glycosides are not fully characterized (only Ms/Ms data are provided) and named only with cryptic abbreviations in the main figures.

      We believe that the fact that expression of HaMYB111 in the Arabidopsis myb111 mutant reproduces the very same pattern of flavonol glycosides accumulation found in wild type Col-0 is proof that its biochemical function is the same as that of the endogenous AtMYB111 gene – that is, HaMYB111 induces expression of the same genes involved in flavonol glycosides biosynthesis in Arabidopsis. Differences in function between HaMYB11 and AtMYB111 would have resulted in different flavonol profiles between wild type Col-0 and 35S::HaMYB111 myb111 lines. It should be noted that the known direct targets of AtMYB111 in Arabidopsis are genes involved in the production of the basic flavonol aglycone (Strake et al., Plant J 2007). Differences in flavonol glycoside profiles between the two species are likely due to broader differences between the genetic networks regulating flavonol biosynthesis: additional layers of regulation of the genes targeted by MYB111, or differential regulation (or presence/absence variation) of genes controlling downstream flavonol glycosylation and conversion between different flavonols.

      In the revised manuscript, we have added the full names of all identified peaks to the legend of Figures 3a,b,e.

      This and the differences in metabolite accumulations between Arabidopsis and Helianthus becomes a bit problematic for the functional interpretations. And here the authors may want to re-read Gronquist et al. 2002: PNAS as a cautionary tale about inferring function from the spatial location of metabolites. In this study, the Eisner/Meinwald team discovered that imbedded in the UV-absorbing floral nectar guides amongst the expected array of flavonoid glycosides, were isoprenilated phloroglucinols, which have both UV-absorbing and herbivore defensive properties. Hence the authors may want to re-examine some of the other unidentified metabolites in the tissues of the bullseyes, including the caffeoyl quinic acids, for alternative functional hypotheses for their observed variation in bullseye size (eg. herbivore defense of ligules).

      This is a good point, and we have included a mention of a more explicit mention possible role of caffeoyl quinic acid (CQA) as a UV pigment in the main text, as well as highlighted at the end of the manuscript other possible factors that could contribute to variation for floral UV patterns in wild sunflowers.

      We should note, however, that CQA plays a considerably smaller role than flavonols in explaining UV absorbance in UV-absorbing (parts of) sunflower ligules, and the difference in abundance with respect to UV-reflecting (parts of) ligules is much less obvious than for flavonols (height of the absorbance peak is reduced only 2-3 times in UV- reflecting tissues for CQA, vs. 7-70 fold reductions for individual quercetin glycosides). Therefore, flavonols are clearly the main pigment responsible for UV patterning in ligules. This is in contrast with the situation for Hypericum calycinum reported in Gronquist et al., PNAS 2002, were dearomatized isoprenylated phloroglucinols (DIPs) are much more abundant than flavonols in most floral tissue, including petals. The localization of DIPs accumulation, in reproductive organs and on the abaxial (“lower”) side of the petals (so that they would be exposed when the flower is closed), is also more consistent with a role in prevention of herbivory; no UV pigmentation is found on the adaxial (“upper”) part of petals in this species, which would be consistent with a role in pollinator attraction.

      The hypotheses regarding a role for the flavonoid glycosides regulated by Myb111 expression in transpirational mitigation and hence conferring a selective advantage under high temperatures and low and high humidities, are not strongly supported by the data provided. The water loss data from excised flowers (or ligules-can't tell from the methods descriptions) is not equivalent to measures of transpiration rates (the stomatal controlled release of water), which are better performed with intact flowers by porometry or other forms of gas-exchange measures. Excised tissues tend to have uncontrolled stomatal function, and elevated cuticular water loss at damaged sites. The putative fitness benefits of variable bullseye size under different humidity regimes, proposed to explain the observed geographical clines in bullseye size remain untested.

      We have clarified in the text and methods section that the desiccation experiments were performed on detached ligules. We agree that the results of this experiments do not constitute a direct proof that UV patterns/flavonol levels have an impact on plant fitness under different humidities in the wild – our aim was simply to provide a plausible physiological explanation for the correlation we observe between floral UV patterns and relative humidity. However, we do believe they are strongly suggestive of a role for floral flavonol/UV patterns in regulating transpiration, which is consistent with previous observations that flowers are a major source of transpiration in plants (Galen et al., Am Nat 2000, and other references in the manuscript). As suggested also by other reviewers, we have softened our interpretation of these result to clarify that they are suggestive, but not proof, of a connection between floral UV patterns, ligule transpiration and environmental humidity levels.

      “While desiccation rates are only a proxy for transpiration in field conditions (Duursma et al. 2019, Hygen et al. 1951), and other factors might affect ligule transpiration in this set of lines, this evidence (strong correlation between LUVp and summer relative humidity; known role of flavonol glycosides in regulating transpiration; and correlation between extent of ligule UV pigmentation and desiccation rates) suggests that variation in floral UV pigmentation in sunflowers is driven by the role of flavonol glycosides in reducing water loss from ligules, with larger floral UV patterns helping prevent drought stress in drier environments.” (new lines 462-469)

      Detached ligules were chosen to avoid confounding the results should differences in the physiology of the rest of the inflorescence/plant between lines also affect rates of water loss. Desiccation/water loss measurements were performed for consistency with the experiments reported in Nakabayashi et al Plant J. 2014, in which the effects of flavonol accumulation (through overexpression of AtMYB12) on water loss/drought resistance were first reported. It should also be noted that the use of detached organs to study the effect of desiccation on transpiration, water loss and drought responses is common in literature (see for example Hygen, Physiol Plant 1951; Aguilar et al., J Exp Bot 2000; Chen et al., PNAS 2011; Egea et al., Sci Rep 2018; Duursma et al., New Phytol 2019, among others). While removing the ligules create a more stressful/artificial situation, mechanical factors are likely to affect all ligules and leaves in the same way, and we can see no obvious reason why that would affect the small LUVp group more than the large LUVp group (individuals in the two groups were selected to represent several geographically unrelated populations).

      We have included some of the aforementioned references to the main text and Methods sections in the revised manuscript to support our use of this experimental setup.

      Alternative functional hypotheses for the observed variation in bullseye size in herbivore resistance or floral volatile release could also be mentioned in the Discussion. Are the large ligules involved in floral scent release?

      We have added sentences in the Results and Discussion, and Conclusions section in the revised manuscript to explore possible additional factors that could influence patterns of UV pigmentation across sunflower populations, including resistance to herbivory and floral volatiles. While some work has been done to characterize floral volatiles in sunflower (e.g. Etievant et al. J. Agric. Food Chem; Pham-Delegue et al. J. Chem. Ecol. 1989), to our knowledge the role of ligules in their production has not been investigates.

      In the revised manuscript, the section “A dual role for floral UV pigmentation” now includes the sentences:

      “Although pollinator preferences in this experiment could still affected by other unmeasured factors (nectar content, floral volatiles), these results are consistent with previous results showing that floral UV patterns play a major role in pollinator attraction (Horth et al., 2014, Koski ad Ashman, 2014, Rae and Vamosi, 2013, Sheehan et al., 2016).” (new lines 378-381)

      And the Conclusions sections includes the sentence:

      “It should be noted that, while we have examined some of the most likely factors explaining the distribution of variation for floral UV patterns in wild H. annuus across North America, other abiotic factors could play a role, as well as biotic ones (e.g. the aforementioned differences in pollinator assemblages, or a role of UV pigments in protection from herbivory (Gronquist et al., 2001)).” (new lines 540-544)

      Reviewer #3 (Public Review):

      Todesco et al undertake an ambitious study to understand UV-absorbing variation in sunflower inflorescences, which often, but not always display a "bullseye" pattern of UV-absorbance generated by ligules of the ray flowers. [...] I think this manuscript has high potential impact on science on both of these fronts.

      Thank you! We are aware that our experiments do not provide a direct link between UV patterns and fitness in natural populations (although we think they are strongly suggestive) and that, as pointed out also by other reviewers, there are other possible (unmeasured) factors that could explain or contribute to explain the patterns we observed. In the revised manuscript we have better characterized the aims and interpretation of our desiccation experiment, and modified the main text to acknowledge other possible factors affecting pollination preferences (nectar production, floral volatiles) and variation for floral UV patterns in H. annuus (pollinator assemblages, resistance to herbivory).

    1. Author Response:

      Reviewer #3 (Public Review):

      1) The two algorithms presented are essentially a low-pass and high-pass filter on binarized odor. As such, it may not be so surprising that there is a tradeoff between which algorithm works better depending on the frequency content of different environments. The low-pass filter (algorithm 1) works better in environments with mostly low-frequency fluctuations (boundary layer plume, low wind-speed, high diffusivity) while the high-pass filter (algorithm 2) works better in environments with mostly high-frequency fluctuations (high windspeed, low diffusivity). To understand what is essential in these algorithms I think it would be useful to (1) compare the two algorithms to a "null" algorithm that drives upwind orientation whenever odor is present (i.e. include thresholding and binarization but no filtering), (2) compare navigation success metrics directly to the frequency content of different environments, (3) examine how navigation success depends on the filtering cutoff of the two algorithms (tau_on and tau_w). Comparing to the null algorithm with no filtering I think is important to determine whether there is actually a tradeoff to be made, or whether a system that can approximate a flat transfer function (or at least capture all relevant frequencies in the environment) is ideal and must be approximated with biological parts.

      For (1) and (3), we have now added simulations of the models for a range of different timescales, including an integrator with an infinitely fast timescale corresponding to the “null” model the reviewer describes (Results lines 376-380, Figure 4—figure supplement 2 and Materials and methods lines 1008-1025). We find that changing the timescale of the intermittency filter largely leaves performance unchanged whereas changing the timescale of the frequency filter is akin to changing the gain on the frequency filter, as predicted by Equations 24 and 29. Since we do find a local maximum in the frequency filter timescale, we conclude that there are benefits to filtering in time. For (2), many plumes we simulate in Fig. 5 span a wide range of frequencies and intermittencies; we chose to plot performance as a function of diffusivity / windspeed to emphasize how performance depends on environment parameters that shape the statistics of the plume (flow and odor dynamics). Note that we renamed 𝜏! to 𝜏".

      2) While the two algorithms presented here present a nice conceptual division, biological filtering algorithms are likely to incorporate elements of both. For example, the adaptive compression algorithm of Alvarez-Salvado (which is eliminated in the simplification used here) provides some sensitivity to odor onsets and is based on well-described adaptation at the olfactory periphery. Synaptic depression algorithms likewise provide sensitivity to derivatives as well as integration over time, and synaptic depression with multiple timescales has been described in detail at various stages of the olfactory system. A productive extension of the work done here would be to explore the utility of biophysically-motivated filtering algorithms for navigation in different environments.

      Thank you for this suggestion, which led us to extend our work in that interesting direction. We have now generalized our model to respond to odor intensity (rather than its binarized version) by implementing an adaptive compression taken from prior modeling efforts (Alvarez-Salvado et al, eLife 2018) (added to Fig. 3; also see additional Fig. 3 Supplement 1). Moreover, we now also consider navigators that respond to odor signals using a biophysical model of odor transduction, ORN firing, and PN firing, in addition to synaptic depression within the ORN-PN synapse, which combines modeling efforts from prior works (Gorur-Shandilya, Demir, et al, eLife 2017; Nagel & Wilson, Nat. Neurosci. 2015; Fox & Nagel, “Synaptic control of temporal processing in the Drosophila olfactory system” arXiv 2021). This realistic circuit model produced exciting results that indicate that the natural ORN-PN circuitry can, to some degree, satisfy the dual demands of intermittency and frequency sensing. These results are shown in the new Fig. 6.

      3) It would be helpful in the Discussion to present a clearer picture of what the frequency content of natural environments is likely to be. For example, flies stop walking at windspeeds above ~70cm/s (Yorozu 2009). In contrast, flies in flight are likely to encounter much sparser and high frequency plume encounters, as they are moving through the air at much faster speeds and because odors encountered here would be away from the boundary layer. Therefore the best test of the tradeoff hypothesis would likely be to compare temporal filtering of odor plumes by neural circuitry in flying vs walking flies. This would connect to the literature in motion detection as well, where octopamine release during flight causes a speeding of the motion detection algorithm.

      We have added lines 47-48 to the introduction describing the natural frequency content of plumes and lines 574-578 discussing how one might see evidence of this tradeoff when comparing between walking and flying flies.

    1. Author Response:

      Reviewer #2:

      What the authors attempt to achieve, and their approaches:

      The author attempt to establish by which mechanisms cholesterol influences the function of the GPCR A_{2A}R, an adenosine receptor. The role of cholesterol on GPCRs has been reported in a number of studies, primarily in cellular experiments, and the authors set out here to clarify the molecular mechanisms.

      To this end, they build upon their recent achievements to produce this protein and reconstitute it in nanodiscs, i.e. discoidal objects comprised of the membrane protein (here: A_{2A}R), lipids (here: POPC, POPG and cholesterol) and a membrane-scaffold protein (MSP) which wraps around this disc of protein+lipid. Nanodiscs allow studying proteins in solution, and are thought to be much more native-like than e.g. detergent micelles.

      The authors first use GTP hydrolysis experiments to quantify the basal activity and agonist potency at cholesterol concentrations from 0 to 13%. The cholesterol effects are weak but detectable. Then they use a single 19F label that reports on the protein's conformation (active, inactive) to show that the protein populates slightly more active states with cholesterol. (again, weak effects). Then they investigate G-protein binding to A_{2A}R in the nanodisc, and find (very!) weak enhancement at 13% cholesterol. These data point to weak positive allosteric modulation by cholesterol. They then use molecular dynamics simulations to probe the allosteric communication, using a recently proposed framework (Rigidity-transmission allostery). Doing these simulations in the presence of cholesterol (postions of cholesterol from X-ray structure) and absence. This analysis shows again only very weak effects of cholesterol, and this time the effect is opposite, i.e. negative allosteric modulation by cholesterol. Then they use 19F-labeled cholesterol analogues to probe by NMR the state of cholesterol (bound to protein?). Lastly, they use Laurdan fluorescence experiments and pressure NMR to establish that (i) the lipids become more ordered when cholesterol is present, and (ii) if one achieves such ordering even without cholesterol - namely by pressure - one may achieve similar effects as those that cholesterol has.

      Collectively, these data lead them to conclude that cholesterol has a (weak) positive allosteric effect on this receptor, and this effect is not a direct one, but goes via modulation of the membrane properties.

      We thank the reviewer for his comments and critique. A lot of his comments have to do with the nanodisc as a model system. We have therefore included an additional paragraph as discussed above, highlighting the advantages and disadvantages of the nanodisc. We’ve also included references to papers that have characterized nanodiscs or membrane proteins in nanodiscs. In our hands, 31P NMR spectra of POPC/POPG nanodiscs and their melt behavior is very similar to liposomes. We’ve tried to add to the discussion on nanodiscs without distracting too much from the focus in the paper.

      Major strengths and weaknesses of methods and results:

      The study addresses an important question, which inherently is difficult to answer: the effect of cholesterol is poorly understood and such studies require to work in an actual membrane. The authors do a careful combination of different methods to achieve their goal of identifying the mechanisms.

      Despite combining several methods, several of them have their inherent problems:

      (i) the nanodisc is too small to properly mimic the membrane environment, and it does not allow reaching relevant cholesterol concentrations. Moreover, it is not clear (to me) if one can exclude e.g. interactions of the protein with the surrounding MSP, or of cholesterol with MSP (see (iii) below).

      We agree. In principle, we should worry about MSP. On the other hand, this is a constant in all of the samples and we focus instead on the cholesterol-dependent effects. These nanodiscs are unarguably small. We’ve commented on this in the paper now. However, we’d expect that the confinement would if anything emphasize the cholesterol bound state. Yet, the NMR studies of F-cholesterol interactions at best identified transient bound states.

      (ii) the state of the protein (inactive, active) is probed with a single NMR-active site. The effects are small and I am not convinced that one shall interpret changes as small as the ones in Figures 3 and 4. In particular, how does this single probe behave at high pressure? Does it reflect an active state at 2000 bar pressure - where possibly other effects (unfolding?) may occur?

      Here we can be quite confident. The spectra are predicated on a recent paper (Huang, et al, 2021) published in Cell in the spring of this year. Each state was carefully correlated with specific functional assays and conditions in a self-consistent way. The labeling site used on TM6 was strategically chosen based on earlier crystallographic studies of inactive and active A2AR. We have other labeling sites (TM7 and TM5) but the point was to use the chemical shift signatures to talk about cholesterol-induced changes to the conformational ensemble assigned in the Cell paper. The differences are small, but the fact that PAM effects are observed across conditions (apo, inverse agonist-bound, agonist-bound, and G protein-bound) reassures us that the spectral differences between low and high cholesterol samples are real. Unfolding by 19F NMR is in this case easy to see – the effects become irreversible and independent of ligand and the chemical shift ends up as one upfield peak. We also see a stabilization of the A1 (active) state, and a slight downfield shift of the active ensemble with increased pressure, consistent with reduced exchange dynamics (and coalescence) associated with the active state. We’ve commented on this in the revised version while trying not to distract from the flow of the paper.

      (iii) the data in Figure 6 (19F of cholesterol analogs) are hard to interpret. Is cholesterol bound to the protein? Does the 19F shift reflect binding to the protein? or interactions within the confined space of the disc? or with MSP? The two analogs do not tell a coherent story.

      It is confusing. We agree. We were fully expecting to see a clear A2AR bound state of cholesterol either through a concentration-dependent shift or a new peak. We also looked for “hidden” bound states through 19F NMR CEST experiments. We never identified a bound state in the presence of a range of cholesterol concentrations, as a function of receptor drug. We did observe small shifts although often these effects were as prominent with inverse agonist as agonist, possibly pointing to the existence of multiple weak binding sites. We’ve added some of this to the conversation. It’s also certainly possible that cholesterol exhibits some interaction with MSP, although again MSP is a constant presence in all the samples while we are focusing on cholesterol-dependent effects. In any case, we never detected a bound signature characteristic of slow exchange. That’s significant to the study despite the ambiguity of the measurements.

      (iv) the pressure NMR study (Fig 7D) has weaknesses. The authors implicitly assume that pressure acts on the membrane, leading to more ordering. (They do recognize the possibility that pressure may have an effect on the protein directly, but consider that this direct effect on the protein is minor.) I think that their arguments are possibly incorrect: they apply here pressure onto a sample of nanodiscs, but all studies they cite to justify the use of pressure on membranes dealt with extended lipid bilayers (liposomes). To me it is not clear what is the lateral effect of pressure onto a nanodisc. Can water laterally enter into the bilayer and thus modify the lipid structure? I also note that previous pressure-NMR studies on a GPCR in micelles (rather than nanodiscs) showed a shift toward the active state. As a micelle is a very different thing than a nanodisc, this suggests that the pressure effect is, at least in part or predominantly, on the protein itself.

      On top of the weakness of the pressure NMR experiment to identify what actually happens to the disc, it is not clear either how to interpret the 19F shift at very high pressure (Fig 7D). Given that there is only a single NMR probe, far out in an artificial side chain, it is difficult to assess the state of the protein.

      These are good questions. Firstly, lipid bilayers (be it in liposomes, bicelles, or nanodiscs) are super soft and compressible systems – all known to change in hydrophobic thickness to pressure much more readily than proteins – be they membrane embedded or soluble. Secondly, the 19F NMR spectra are well-known to be representative of fully functional receptor as discussed above. Thirdly, even detergent micelles are susceptible to pressure (much more so than the receptor itself) See J. Phys. Chem. B 2014, 118, 5698−5706 (now referenced in the paper). Pressure will enhance hydrophobic thickness, even in a detergent host, by ordering the acyl chains. The lower specific volume states, selected by higher pressure, have a larger hydrophobic dimension. Thus, the effects seen earlier are equally an effect of environment. In the revised version, we simply make the point that the protein isn’t unfolded and that both cholesterol or pressure give rise to enhanced hydrophobic thickness and corresponding shifts in equilibria to the active states.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the reviewers

      We would like to thank the two reviewers for the valuable comments and suggestions on improvements. We addressed each reviewer’s comments individually. We have carefully revised the manuscript to incorporate new data and to make necessary clarifications.

      Overall we made the following major modifications:

      1. We investigated the relevance of BHRF1 expression in the context of EBV infection, in B cells and epithelial cells. We observed that EBV reactivation leads to MT hyperacetylation and subsequent mito-aggresome formation in both cell types. An EBV+ B cell line deficient for BHRF1 was generated and allowed us to demonstrate the involvement of BHRF1 in this phenotype. These results were added to Figures 2, 3 and Figure 1 – S1 in the revised version of the manuscript.
      2. We better characterized the mechanism leading to MT hyperacetylation, by demonstrating that BHRF1 colocalizes and interacts with the tubulin acetyltransferase ATAT1. These results were added to Figure 5 and Figure 5 – S2 in the revised manuscript.
      3. We generated stable HeLa cells KO for ATG5. Using these autophagy-deficient cells, we demonstrated the involvement of autophagy in BHRF1-induced MT hyperacetylation and mito-aggresome formation. We added these results to Figure 8 in the revised version of the manuscript.
      4. We compared the impact of BHRF1 with other mitophagy inducers on MT hyperacetylation, mitochondrial morphodynamics and the inhibition of IFN production, to demonstrate the specificity of the mechanism of action of BHRF1 (Figure 4 – S1).
      5. We demonstrated that MT hyperacetylation requires mitochondrial fission, using a Drp1-deficient HeLa cell line that we have previously described (Vilmen et al., 2020). This result was added to the revised version of the manuscript in Figure 3 – S2A. Moreover, we confirmed this result in the context of EBV infection (Figure 3 – S2B). ## Reviewer#1 Reviewer #1 (Evidence, reproducibility and clarity)

      Major comments:

      1. In the presented manuscript the authors characterize mainly BHRF1 overexpression in HeLa cells. Does BHRF1 also block type I IFN responses by microtubule hyperacetylation in the context of EBV infection? Do alpha-tubulin K40A overexpressing B cells produce more type I IFN after EBV infection?

      In the revised version of the manuscript, we added several experiments to explore the phenotype of BHRF1 during EBV infection, as requested by the two reviewers. Since EBV infects both B cells and epithelial cells, we used two different approaches. In latently-infected B cells, coming from Burkitt lymphoma (Akata cells), we induced EBV reactivation by anti-IgG treatment. To explore the importance of BHRF1 in this cell type, we constructed a cell line knocked down for BHRF1 expression, thanks to a lentivirus bearing an shRNA against BHRF1. In parallel, HEK293 cells harboring either EBV WT or EBV ΔBHRF1 genome were transfected with ZEBRA and Rta plasmids to induce the viral productive cycle in epithelial cells.

      We demonstrated that EBV infection induces MT hyperacetylation and subsequent mito-aggresome formation, both dependent on autophagy. Moreover, this phenotype requires BHRF1 expression in B cells and epithelial cells. We also observed that the expression of alpha-tubulin K40A in EBV+ epithelial cells blocks mito-aggresome formation induced by EBV reactivation. These results are now presented in Figures 2 and 3 in the revised version of the manuscript.

      Regarding regulation of IFN response during infection, several EBV-encoded proteins and non-coding RNAs have been described to interfere with the innate immune system. For example, BGLF4 and ZEBRA bind to IRF3 and IRF7, respectively, to block their nuclear activity (Hahn et al., 2005; Wang et al., 2009). Moreover, Rta expression decreases mRNA expression of IRF3 and IRF7 (Bentz et al., 2010; Zhu et al., 2014). We therefore think that studying the inhibitory role of BHRF1 on IFN response in the context of EBV reactivation will be arduous. Indeed, the lack of BHRF1 could be compensated by the activity of other viral proteins acting on innate immunity.

      1. The authors document that the observed microtubule hyperacetylation is due to the acetyltransferase ATAT1. How does BHRF1 activate ATAT1? Is there any direct interaction?

      As requested by reviewer#1, we explored a possible interaction of BHRF1 and ATAT1. First, we observed by confocal microscopy that GFP-ATAT1 colocalized with BHRF1 in the juxtanuclear region of HeLa cells (Figure 5 – S2). Second, we demonstrated by two co-immunoprecipitation assays that BHRF1 binds to exogenous ATAT1 (Figures 5E and 5F). These new results have been added to the revised version of the manuscript and clarify the mechanism of action of BHRF1.To go further, we explored whether BHRF1 was able to stabilize ATAT1 because it was recently reported that p27, an autophagy inducer that modulates MT acetylation, binds to and stabilizes ATAT1 (Nowosad et al., 2021). However, BHRF1 expression does not impact the expression of ATAT1 (data not shown).

      1. Furthermore, the authors demonstrate with pharmacological autophagy inhibitors that autophagy is increased in a BHRF1 dependent and microtubule acetylation independent manner but required for microtubule hyperacetylation. How does autophagy stimulate ATAT1 dependent microtubule hyperacetylation? Is this dependency also observed with a more specific ATG silencing or knock-out?

      We generated a stable autophagy-deficient HeLa cell line KO for ATG5, using an ATG5 CRISPR/Cas9 construct delivered by a lentivirus. The lack of ATG5 expression and LC3 lipidation was verified by immunoblot (Figure 8B). We observed that BHRF1 was unable to increase MT acetylation in this autophagy-deficient cell line (Figure 8C) in accordance with our data reported in the original manuscript using treatment with spautin 1 or 3-MA (previously Figure S5C and Figure 8A in the revised version). Moreover, the lack of hyperacetylated MT in BHRF1-expressing cells led to a dramatic reduction of mito-aggresome formation (Figures 8D and 8E). These new results demonstrate that autophagy is required for BHRF1-induced MT hyperacetylation.

      Minor comments:

      1. "Innate immunity" and "innate immune system", but not "innate immunity system" are in my opinion better wordings.

      We thank reviewer #1 for this useful comment. The term “innate immunity system” in the introduction section has been replaced by “innate immune system”. Elsewhere, we used “innate immunity”.

      1. The reader would benefit from a discussion on the role of type I IFNs during EBV infection and how important the authors think their new mechanism could be in this context.

      We thank the reviewer for this suggestion. However, we already discussed the different strategies developed by EBV to counteract IFN response induction, in our previous study, suggesting the importance of IFN in the control of EBV infection (Vilmen et al., 2020). In this study, we have focused the discussion on the role of mitophagy in the control of IFN production.

      Reviewer #1 (Significance):

      The significance of the described pathway for type I IFN production needs to be documented in the context of EBV infection.

      The revised version of the manuscript now explored the role of BHRF1 in the context of EBV infection See above for details (major comment 1).

      Reviewer#2

      Reviewer #2 (Evidence, reproducibility and clarity)

      The work presented is a relatively straightforward cell biological dissection of a subset of the previously described functions of BHRF1, focusing on the mitochondrial aggregation phenotype. The approaches and analysis are performed in cell lines mainly using overexpression and some siRNA experiments and appear well done throughout.

      We thank reviewer #2 for this comment and would like to underline that the revised version of the manuscript includes now a study of BHRF1 in the context of infection in both B cells and epithelial cells, the generation of a stable EBV positive B cells KD for BHRF1 by using shRNA approach and the generation of a stable autophagy-deficient cell line, using CRISPR/cas9 against ATG5.

      Reviewer #2 (Significance):

      The current study unpicks one of the phenotypes induced by BHRF1 over expression: namely the previously reported mitochondrial aggregation phenotype. The findings that peri-nuclear mitochondrial aggregation are dependent on microtubules and retrograde motors are useful but could perhaps have been predicted. Overexpression of many proteins (or indeed chemical treatments) causing cellular and / or mitochondrial stress have been shown to cause mitochondrial perinuclear aggregation.

      To explore the specificity of BHRF1 activity on mito-aggresome formation, we decided to investigate the impact of AMBRA1-ActA, a previously characterized mitophagy inducer, on MT (Strappazzon et al., 2015). We observed that expression of AMBRA1-ActA leads to mito-aggresome formation but does not modulate acetylation of MTs, contrary to BHRF1. This result was added to the revised version of the manuscript (Figure 4 - S1A and S1B). Moreover, chemical treatments with either oligomycin/antimycin or CCCP, which induce mitochondrial stress and mitophagy (Lazarou et al., 2015; Narendra et al., 2008), do not cause mitochondrial juxtanuclear aggregation (Figure 4 - S1C). We also observed that a hyperosmotic shock-induced by NaCl leads to MT hyperacetylation (Figure 4 - S1D) but not to the mito-aggresome formation (data not shown), suggesting that MT hyperacetylation per se is not sufficient to induce the clustering of mitochondria. Altogether, these new results demonstrated the originality of the mechanism used by BHRF1 to induce mito-aggresome formation.

      The findings linking the process to altered tubulin acetylation are more novel and interesting and may add a new dimension to understanding of BHRF1 function. However what is lacking here is really advancing our understanding of how BHRF1 does this.

      We thank the reviewer for underlining the fact that regulation of mitochondrial morphodynamics by BHRF1 via MT hyperacetylation is novel and interesting.

      In the original version of the manuscript, we have demonstrated that autophagy and ATAT1 are required for BHRF1-induced hyperacetylation. In the revised version, we uncovered that BHRF1 interacts and colocalizes with ATAT1 (Figures 5E, 5F and Figure 5 – S2). Moreover, we demonstrated that MT hyperacetylation is involved in the localization of autophagosomes next to the nucleus, thus close to the mito-aggresome. Therefore, we better characterized the mechanism of action of BHRF1 in the revised manuscript.

      Although some downstream processes are identified in the current and previous study it still remains unclear what the exact underlying mechanisms are. Is BHRF1 doing this by disrupting mitochondrial function and making the organelles sick or by causing cellular stress indirectly leading to mitochondrial pathology? Previous studies have shown that cellular stress such as altered proteostasis can also cause stress-induced mitochondrial retrograde trafficking and aggregation. Is BHRF1 causing the same phenotype by generally stressing the cell and if it is more specifically through mitochondrial disruption what is the mechanism? As demonstrated by the authors in their previous work, BHRF1 does a number of things to cell signalling. Which of these are leading to a general disruption of cell signalling versus having specific effects on the cell or mitochondria still seems somewhat unclear.

      We previously reported that BHRF1 expression does not alter the mitochondrial membrane potential (Vilmen et al., 2020). contrary to treatment by O/A or CCCP. Moreover, we observed that these treatments do not induce mitochondrial clustering (Figure 4 – S1). Therefore, BHRF1 modulates mitochondrial dynamics in a specific and regulated manner.

      Our study clearly demonstrated that BHRF1 uses an original strategy to modulate IFN response, via a regulated pathway of successive steps, from mitochondrial fission to mitophagy, via MT hyperacetylation, rather than “a general disruption of cell signalling”.

      It would be interesting to know whether the role of microtubule hyperacetylation and ATAT1 are more generally involved in other previously described processes of stress induced mitochondrial aggregation.

      In the revised version of the manuscript, we observed that AMBRA1-ActA does not change the level of MT acetylation, whereas it induces mito-aggresome formation. These data reinforce the originality of the BHRF1 mechanism.

      Currently while this is a nicely performed follow up study to their 2020 paper, the present study neither provides in depth mechanistic advance of BHRF1 function, nor a better understanding of the molecular steps in a more generally relevant pathway (e.g. mitophagy).

      We disagree with the reviewer’s comment. Indeed, in this new study, we uncovered and characterized a new mechanism of action for BHRF1 via ATAT1-dependent MT hyperacetylation. More generally, we reported for the first time that innate immunity can be regulated by the level of MT acetylation.

      In addition, all the experiments were performed in cell lines and rely on the overexpression of a viral protein. But this is a significant over-simplification of the viral pathological process. It therefore remains unclear how pathophysiologically relevant the findings are (e.g. to EBV pathology) without further extending this element of the work.

      To address this comment, we extended our results in the infectious context, by adding several experiments performed in EBV-infected cell lines (see above reviewer#1 for details). The same phenotype was observed after reactivation of the EBV productive cycle as in BHRF1 ectopic expression. Moreover, we demonstrated that the phenotype is BHRF1-dependent. This suggests the importance of BHRF1 in EBV pathogenesis by participating in innate immunity control.

      An additional minor issue is the authors naming of the process as Mito-aggresome formation. Although this might sound catchy it is somewhat unclear what the biological basis for this is. Aggresomes are defined structures that occur in cells during pathology and due to the peri-nuclear accumulation of misfolded protein. Since the process here is simply the description of aggregated mitochondria next to the nucleus but doesn't seem to have anything to do with protein misfolding it's really unclear how this labelling is helpful to the field. The process of perinuclear mitochondrial aggregation e.g. during mitochondrial stress or damage has been described many times before without the need for calling it a mito-aggresome. This term is likely to cause unhelpful confusion.

      We understand the comment of reviewer #2, but since 2010 the term “mito-aggresome” was previously used in other studies and refers to a clustering of mitochondria next to the nucleus, similarly to what we observed with BHRF1 (D’Acunzo et al., 2019; Lee et al., 2010; Springer and Kahle, 2011, 2011; Strappazzon et al., 2015; Van Humbeeck et al., 2011; Yang and Yang, 2011).

      However, we took into consideration the risk of confusion for the readers, by changing how we introduced the term “mito-aggresome” in the revised version of the manuscript (page 5 line 94).

      References

      Bentz GL, Liu R, Hahn AM, Shackelford J, Pagano JS. 2010. Epstein–Barr virus BRLF1 inhibits transcription of IRF3 and IRF7 and suppresses induction of interferon-β. Virology 402:121–128. doi:10.1016/j.virol.2010.03.014

      D’Acunzo P, Strappazzon F, Caruana I, Meneghetti G, Di Rita A, Simula L, Weber G, Del Bufalo F, Dalla Valle L, Campello S, Locatelli F, Cecconi F. 2019. Reversible induction of mitophagy by an optogenetic bimodular system. Nat Commun 10:1533. doi:10.1038/s41467-019-09487-1

      Hahn AM, Huye LE, Ning S, Webster-Cyriaque J, Pagano JS. 2005. Interferon regulatory factor 7 is negatively regulated by the Epstein-Barr virus immediate-early gene, BZLF-1. J Virol 79:10040–10052. doi:10.1128/JVI.79.15.10040-10052.2005

      Lazarou M, Sliter DA, Kane LA, Sarraf SA, Wang C, Burman JL, Sideris DP, Fogel AI, Youle RJ. 2015. The ubiquitin kinase PINK1 recruits autophagy receptors to induce mitophagy. Nature 524:309–314. doi:10.1038/nature14893

      Lee J-Y, Nagano Y, Taylor JP, Lim KL, Yao T-P. 2010. Disease-causing mutations in Parkin impair mitochondrial ubiquitination, aggregation, and HDAC6-dependent mitophagy. J Cell Biol 189:671–679. doi:10.1083/jcb.201001039

      Narendra DP, Tanaka A, Suen D-F, Youle RJ. 2008. Parkin is recruited selectively to impaired mitochondria and promotes their autophagy. J Cell Biol 183:795–803. doi:10.1083/jcb.200809125

      Nowosad A, Creff J, Jeannot P, Culerrier R, Codogno P, Manenti S, Nguyen L, Besson A. 2021. p27 controls autophagic vesicle trafficking in glucose-deprived cells via the regulation of ATAT1-mediated microtubule acetylation. Cell Death Dis 12:1–18. doi:10.1038/s41419-021-03759-9

      Springer W, Kahle PJ. 2011. Regulation of PINK1-Parkin-mediated mitophagy. Autophagy 7:266–278. doi:10.4161/auto.7.3.14348

      Strappazzon F, Nazio F, Corrado M, Cianfanelli V, Romagnoli A, Fimia GM, Campello S, Nardacci R, Piacentini M, Campanella M, Cecconi F. 2015. AMBRA1 is able to induce mitophagy via LC3 binding, regardless of PARKIN and p62/SQSTM1. Cell Death Differ 22:419–32. doi:10.1038/cdd.2014.139

      Van Humbeeck C, Cornelissen T, Hofkens H, Mandemakers W, Gevaert K, De Strooper B, Vandenberghe W. 2011. Parkin Interacts with Ambra1 to Induce Mitophagy. J Neurosci 31:10249–10261. doi:10.1523/JNEUROSCI.1917-11.2011

      Vilmen G, Glon D, Siracusano G, Lussignol M, Shao Z, Hernandez E, Perdiz D, Quignon F, Mouna L, Poüs C, Gruffat H, Maréchal V, Esclatine A. 2020. BHRF1, a BCL2 viral homolog, disturbs mitochondrial dynamics and stimulates mitophagy to dampen type I IFN induction. Autophagy 17:1296–1315. doi:10.1080/15548627.2020.1758416

      Wang J-T, Doong S-L, Teng S-C, Lee C-P, Tsai C-H, Chen M-R. 2009. Epstein-Barr Virus BGLF4 Kinase Suppresses the Interferon Regulatory Factor 3 Signaling Pathway. J Virol 83:1856–1869. doi:10.1128/JVI.01099-08

      Yang J-Y, Yang WY. 2011. Spatiotemporally controlled initiation of Parkin-mediated mitophagy within single cells. Autophagy 7:1230–1238. doi:10.4161/auto.7.10.16626

      Zhu L-H, Gao S, Jin R, Zhuang L-L, Jiang L, Qiu L-Z, Xu H-G, Zhou G-P. 2014. Repression of interferon regulatory factor 3 by the Epstein-Barr virus immediate-early protein Rta is mediated through E2F1 in HeLa cells. Mol Med Rep 9:1453–1459. doi:10.3892/mmr.2014.1957

  8. Nov 2021
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewer’s comments

      We thank the three reviewers for their positive comments and constructive feedback. We have addressed the issues raised through additional experiments and text changes which have helped to improve the manuscript. Below, we address the specific points with detailed responses (reviewer comments are provided in italic).

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Rodriguez-Lopez et al describes the analysis of long intergenic non-coding RNA (lincRNA) function in fission yeast using both deletion and overexpression methods. The manuscript is very well presented and provides a wealth of lincRNA functional information for the field. This work is an important advance as there is still very little known about the function of lincRNAs in both normal and other conditions. An impressive array of conditions were assessed here. With a large scale analysis like this there is really not one specific conclusion. The authors conclude that lincRNAs exert their function in specific environmental or physiological conditions. This conclusion is not a novel conclusion, it has been proposed and shown before, but this manuscript provides the experimental proof of this concept on a large scale.

      The lincRNA knock-out library was assessed using a colony size screen, a colony viability screen and cell size and cell cycle analysis. Additionally, a lincRNA over-expression library was assessed by a colony size screen. These different functional analysis methods for lincRNAs were than carried out in a wide variety of conditions to provide a very large dataset for analysis. Overall, the presentation and analysis of the data was easy to follow and informative. Some points below could be addressed to improve the manuscript.

      There were 238 protein coding gene mutants assessed in parallel, to provide functional context, which was a very promising idea. But, unfortunately, the inclusion of 104 protein coding genes of unknown function restricted the use of the protein coding genes in the integrated analysis to connect lincRNAs to a known function using guilt by association.

      Reply: Yes, the unknown coding-gene mutants did certainly not help to provide functional context through guilt by association. These mutants were included to generate functional clues for the unknown proteins and compare phenotype hits with unknown lincRNA mutants. Nevertheless, because the known coding-gene mutants included broadly cover all high-level biological processes (GO slim), we could make several useful functional inferences for certain lincRNAs as discussed.

      The colony viability screen is not described well throughout the manuscript. Firstly, the use of phloxine B dye to determine cell viability needs to be described better when first introduced at the bottom of page 6. What exactly is this viability screen and red colour intensity indicating? Please define what the different levels of red a colony would indicate as far as viability. I assume an increase in red colour indicates more dead cells? So it is confusing that later the output of this assay is described as giving a resistant/sensitive phenotype or higher/lower viability. How can you get a higher viability from an assay that should only detect lower viability? Shouldn't this assay range from viable (no, or low red, colour) to increasing amounts of red indicating increasingly less viability? Figure 4D is also confusing with the "red" and "white" annotations. These should be changed to "lower viability" and "viable" or "not viable" and "viable".

      Reply: The colony-viability screen is described in detail in our recent paper (Kamrad et al, eLife 2020). We have now better explained how phloxine B works to determine cell viability (p. 6). The reviewer’s assumption is correct: an increase in red colour indicates more dead cells. However, all phenotypes reported are relative to wild-type cells under the same condition. Many conditions lead to a general increase in cell death, but some mutants show a lower increase in cell death compared to wild-type cells. These mutants, therefore, have a higher viability than wild-type cells, i.e. they are more resistant than wild-type under the given condition. We have tried to clarify this in the text, including the legend of Fig. 4. We agree that the ‘red’ and ‘white’ annotations in Fig. 4D could be confusing. We have now changed these to ‘low viability’ and ‘high viability’. Again, this is relative to wild-type cells.

      How are you sure that when generating the 113 lincRNA ectopic over-expression constructs by PCR that the sequences you cloned are correct? Simply checking for "correct insert size", as stated in the methods, is not really good practice and these constructs should be fully sequenced to be sure they contain the correct sequence and that constructs have not had mutations introduced by the PCR used for cloning. Without such sequence confirmation one cannot be completely confident that the data produced is specific for a lincRNA over-expression. Additionally, a selection of strains with the overexpression constructs should be tested by qRT-PCR and compared to a non-over-expressing strain to confirm lincRNA overexpression.

      Reply: To minimize errors during PCR amplification, we used the high-fidelity Phusion DNA polymerase which features an >50-fold lower error rate than Taq DNA Polymerase. We had confirmed the insert sequences for the first 17 lincRNAs cloned using Sanger sequencing (but did not report this in the manuscript). We have now checked additional inserts of the overexpression plasmids by Sanger sequencing in 96-well plate-format using a universal forward primer upstream of the cloning site. This high-troughput sequencing produced reliable sequence data for 80 inserts, including full insert sequences for 62 plasmids and the first ~900 bp of insert sequences for 18 plasmids). Of these, only the insert for SPNCRNA.601 showed a sequence error compared to the reference genome: T to C transition in position 559. This mutation could reflect either an error that occurred during cloning or a natural sequence variant among yeast strains (lincRNA sequences are much more variable than coding sequences). So, in general, the PCR cloning accurately preserved the sequence information. We have added this information in the Methods (p. 27-28). Please note that lincRNAs depend much less on primary nucleotide sequence than mRNAs, and a few nucleotide changes are highly unlikely to interfere with lincRNA function.

      Minor comments:

      Page 4, lines 19-20 - "A substantial portion of lincRNAs are actively translated (Duncan and Mata, 2014), raising the possibility that some of them act as small proteins." This sentence does not make sense, lincRNAs can't "act as" small proteins, they can only "code for" small proteins. Wording needs to be changed here.

      Reply: We agree and have changed the wording as suggested.

      Figure 1A is a nice representation but what are the grey dots? Are they all ncRNAs including lincRNAs? This needs to be stated in the legend.

      Reply: The grey dots represent all non-coding RNAs across the three S. pombe chromosomes as described by Atkinson et al., 2018. This has now been clarified in the legend.

      How many lincRNAs are there in total in pombe and what percentage did you delete? These numbers should be stated in the text.

      Reply: There are 1189 lincRNAs and we mutated ~12.6% of them. These numbers are now stated at the end of the Introduction, page 5.

      It would be nice if Supplementary Figure 1 included concentrations or amounts of the conditions used. This info is buried in a Supplementary table and would be better placed here.

      Reply: Supplemental Fig. 1 provides a simple overview for the different conditions and drugs used. For most stresses and drugs, we used multiple different doses. So the figure would become cluttered if we indicated all these concentrations, detracting from the main message. Colleagues who are interested in the different concentration ranges used for specific conditions can readily obtain this information from Supplemental Dataset 1. We have now added a statement in this respect to the legend of Supplemental Fig. 1

      Page 6, last sentence. What is a "biological repeat"? Three distinct deletion strains (ie three different deletion strains made by CRISPR) or one deletion strain used three times?

      Reply: Biological repeat means that one deletion strain was assayed three times independently, each with at least two colonies (technical repeats). In most cases, we had two or more independently generated deletion strains for each lincRNA (using the same or different gRNAs), and we performed at least three biological repeats for each strain. The numbers of independent strains for each lincRNA are provided in Supplemental Dataset 1 (sheet: lincRNA_metadata, column: n_independent_ko_mutants). The total numbers of repeats carried out for each condition after QC filtering are available in Supplemental Dataset 2 (columns: observation_count). We have clarified this on p. 7, and the details are now provided in the Methods on p. 28-29 (deletion mutants) and p. 32 (overexpression mutants).

      There is no mention in the manuscript of how other researchers can get access to the deletion strains and over-expression plasmids.

      Reply: As is usual, all strains and plasmids will be readily available upon request.

      Reviewer #1 (Significance (Required)):

      The production of lincRNA deletion strains and overexpression plasmids, and their analysis under an impressive number of conditions, provides key resources and data for the ncRNA field. This work complements nicely the analysis of protein coding gene deletion strains and provides the tools and data for future mechanistic studies of individual lincRNAs. This work would be of interest to the growing audience of ncRNA researchers in both yeast and other systems.

      Field of expertise:

      Yeast deletion strain construction and analysis, RNA functional analysis

      \*Referee Cross-commenting** *

      Reviewer #3 makes an important point that the stability of each lincRNA over expressed from plasmid is not known and therefore some lincRNAs may not be overexpressed as predicted. RT-qPCR would be required to assess lincRNA expression levels from the plasmids. It also appears that we both agree that it is important to determine the sequence of the cloned lincRNAs in the over expression plasmids.

      Reply: See reply in response to Reviewer 3.

      Reviewer #3 also makes an important point in his review that where it is predicted that a lincRNA deletion influences an adjacent gene in cis then the expression of that gene should be tested.

      Reply: See reply in response to Reviewer 3.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      \*Summary:** *

      The Rodriguez-Lopez manuscript from the Bahler lab present the phenotypical and functional profiling of lincRNA in fission yeast. This is the first large-scale, extensive work of this nature in this model organism and it therefore nicely complement the well-documented examples of lincRNA already reported in S.pombe.

      The work is very solid using seamless genome deletion and overexpression followed by colony-based assay in respone to a very wide set of conditions.

      \*Major comments:** *

      - considering that this is a descriptive work by nature and that the experiments were properly conducted as far as I can judge, I don't have major issues with this paper.

      To me the only thing that is missing is a gametogenesis assay, for two reasons: First, several reported cases of lincRNAs in pombe critically regulates meiosis, and second many of the analysed lincRNAs are upregulated durig meiosis. Figure 6B already points to three obvious candidates. I don't think it would take to much time to look at the deletion and OE in an h90 strain and see the effect of gametogenesis for the entire set or at least the 3 candidates from Figure 6.

      If the already broad set of lincRNAs implicated in meiosis would grow, this would be another evidence that eukaryotic cell differentiation relies on non-coding RNAs even in simpler models.

      Reply: We agree that this is a meaningful analysis to add. We have now deleted the three unstudied lincRNA genes, along with the meiRNA gene, from the sub-cluster of Figure 6B in the homothallic h90 background (to allow self-mating). We have analysed meiosis and spore viability of these four deletion strains together with a wild-type h90 control strain. These experiments indicate that cell mating is normal in the deletion mutants, but meiotic progression is somewhat delayed in SPNCRNA.1154, SPNCRNA.1530 and, most strongly, meiRNA mutants (the latter has been reported before (reviewed by Yamashita 2019). Notably, we detected significant reductions in spore viability for all four deletion mutants compared to the control strain. These results point to roles of SPNCRNA.1154, SPNCRNA.1530, and SPNCRNA.335 in meiotic differentiation, as predicted by the clustering analyses. This is a nice addition to the manuscript. We now report these results on p. 23, with a new Supplemental Figure 10, and describe the experimental procedures in the Methods (p. 34-35).

      \*Minor comments:** *

      - A reference to the recent work of the Rougemaille lab on mamRNA is necessary

      Reply: Yes, we now cite this reference in the Introduction (p. 4).

      - a discussion of the possibility to perfom large-scale genetic interactions searches (as done by Krogan for protein-coding genes) would add to the discussion of futue plans

      Reply: We have added a sentence about the potential of SGA screens in the Conclusions (p. 26).

      Reviewer #2 (Significance (Required)):

      The work unambigously shows that that most of the lincRNAs analyzed exert cellular functions in specific environmental or physiological contexts. This conclusion is critical because the biological relevance this so-called « dark matter » is still debated despite a few well-established cases. This is an important addition to the field and the deep phenotyping work already points to some directions to analyse some of these lincRNA in the context of cell cycle progression, metabolism or meiosis.

      \*Referee Cross-commenting** *

      - I agree with the issues raised by referees 1 and 3 but I am concerned about the added value of a RT-qPCR. First, this is a significant amout of work considering the large set of targets. Second a more importantly, what you ll end up with is a fold change. What will be considered as overexpression? Which threshold? This is why I prefer a biological read-out (a phenotype) because whatever the fold change, it tells us that there is an effect. It is very likely indeed that some targets are not overexpressed because of their rapid degradation. To me, this is the drawback of any large-scale studies.

      - Also, looking at the expression of the adjacent gene in the case of a cis-effect is interesting though this is likely condition-dependent (because most phenotypes appear in specific conditions). So, what would be the conclusion if there is no effect in classical rich media?

      - The sequence of the insert should be specified, I agree. Most likely, it is the sequence available from pombase (this is what I understood) but that should be clarified indeed.

      Reply: Yes, the sequences of the inserts are available from PomBase, and we provide the primer sequences used for cloning in the Supplemental Dataset 1. We have now clarified this in the Methods (p. 27).

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this work from the group of Jurg Bahler, the authors take advantage of the high throughput colony-based screen approach they recently developed (Kamrad et al, eLife 2020) to perform a functional profiling analysis on a subset of 150 lincRNAs in fission yeast. Using a seamless CRISPR/Cas9-based method, they created deletion mutants for 141 lincRNAs. In addition, the authors also generated strains ectopically overexpressing 113 lincRNAs from a plasmid (under the control of the strong and inducible nmt1 promoter).

      The viability and growth of all these mutants was then assessed across benign, nutrient, drug and stress conditions (149 conditions for the deletion mutants, 47 conditions for the overexpression). For the deletion mutants, the authors also assayed in parallel mutants of 238 protein-coding genes (PCGs) covering multiple biological processes and main GO classes.

      In benign conditions, deletion of 5 and 10 lincRNAs resulted in a reduced growth phenotype (rich and minimal medium, respectively). Morphological characterization by microscopy also revealed cell size defects for 6 lincRNA mutants (2 shorter, 4 longer). In addition, 27 mutants displayed phenotypes pointing defects in the cell cycle.

      Remarkably, the nutrient/drug/stress conditions revealed more phenotypes, with 60 of the 141 lincRNA mutants showing a growth phenotype in at least one condition, and 25 mutants showing a different viability compared to the wild-type in at least one condition.

      Also remarkable is the observation that 102/113 lincRNA overexpression strain displayed a growth phenotype in at least one condition, 14 lincRNAs showing phenotypes in more than 10 conditions.

      The clustering analyses performed by the authors also provide functional insight for some lincRNAs.

      Overall, this is an important study, well conducted and well presented. Together, the data described by the authors are convincing and highlight that most lincRNAs would function in very particular conditions, and that deletion/inactivation and overexpression are complementary approaches for the functional characterization of lncRNAs. This has been demonstrated here, in a very elegant manner.

      I think this manuscript will be acknowledged as a pioneer work in the field.

      \*A. Major comments** *

      - A.1. Are the key conclusions convincing?

      To my opinion, the key conclusions of this study are convincing.

      - A.2. Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      No. The authors are careful in their claims and conclusions.

      - A.3. Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      This study is based on systematic lincRNA deletion/overexpression.

      - For the deletion strains, I could not find any information about the control of the deletions. Are the authors sure that the targeted lincRNAs were indeed properly deleted?

      Reply: Yes, we had carefully checked the correctness of the deletions using several controls as described by Rodriguez-Lopez et al. 2017. All deletion strains were checked for missing open-reading frames by PCR. For 20 strains, we also sequenced across the deletion scars. We re-checked all strains by PCR after arraying them onto the 384 plates to ensure that no errors occurred during the process. We have now specified this in the Methods (p. 27).

      - For the overexpression, there is only a control of the insert size by PCR. Sanger sequencing would have been preferable to confirm that the targeted lincRNAs were properly cloned, without any mutation. In addition, the authors did not check that the lincRNAs were indeed overexpressed (at least in the benign conditions). Is the overexpression fold similar for all the lincRNAs? Do the 14 lincRNAs showing the most consistent phenotypes in at least 10 conditions display different expression levels than the other lincRNAs?

      - A.4. Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      - Validating the deletion strains requires genomic DNA extraction and then PCR. This is repetitive and tedious, but this control is important, I think. The time needed depends on the possibility of automating the process. I think this is feasible in this lab.

      - Controlling the insert sequence into the overexpression vector requires plasmid DNA (available as it was used for PCR) and one/several primer(s), depending on the insert size. The sequencing itself is usually done by platforms.

      - Analysing lincRNA overexpression at the RNA level requires yeast cultures, RNA extraction and then RT-qPCR. Again, the time needed depends on the possibility of automating the process.

      Reply: We have now checked most overexpression constructs by Sanger sequencing of the inserts as described in response to Reviewer 1. Moreover, we have tested the overexpression levels for eight selected overexpression constructs using RT-qPCR analysis. These eight constructs feature the entire range of associated phenotypes hits, including 3 lincRNAs with the highest number of phenotypes in 14 conditions, 3 with no phenotypes, and 2 with intermediate numbers of phenotypes. The RT-qPCR results show that the lincRNAs were 35- to 2200-fold overexpressed relative to the empty-vector control strain (which expresses the lincRNA at native levels). No clear pattern was evident between expression levels and phenotype hits, e.g. lincRNAs without phenotypes when overexpressed showed similar fold-changes as a lincRNA showing 13 phenotypes. We present these results on p. 21/22 and in the new Supplemental Figure 9A, and describe the experiment in the Methods (p. 28).

      As pointed out by Reviewer 2, these fold changes in expression are actually of limited value compared to the phenotype read-outs. The important result is that we detected phenotypes for over 90% of the overexpression strains, indicating that overexpression generally worked. Given that this is a large-scale study, there might be some lincRNA constructs that are faulty or are not overexpressed. It would not be realistic or meaningful to test all constructs. Any follow-on studies focusing on a specific lincRNAs will need to first validate the large-scale results as is common practice.

      - A.5. Are the data and the methods presented in such a way that they can be reproduced?

      The methods are clearly and extensively explained. If necessary, the reader can find more details about the high-throughput colony-based screen approach in the original paper (Kamrad et al, eLife 2020); a very interesting technical discussions can also be found in the reviewers reports and in the authors response published alongside.

      - A.6. Are the experiments adequately replicated and statistical analysis adequate?

      The experiments are replicated. However, I feel confused regarding the number of replicates used in each analysis.

      In the first part of the Results, it is mentioned that all colony-based phenotyping was performed in at least 3 independent replicates, with a median number of 9 repeats per lincRNAs. In the Methods section, I read that for the high-throughput microscopy and flow cytometry for cell-size and cell-cycle phenotypes, over 80% of the 110 lincRNA mutants screened for cellular phenotypes were assayed in at least 2 independent biological repeats. For the overexpression, I read that each strain was represented by at least 12 colonies across 3 different plates and experiments were repeated at least 3 times. Each condition was assayed in three independent biological repeats, together with control EMM2 plates, resulting in at least 36 data points per strain per condition.

      Perhaps I missed something. If not, could the authors clarify this? In addition, I suggest to indicate the number of replicates used for each lincRNA/condition/assay in Supplemental Dataset 2 (I could only find the information for the Flow Cytometry) and in Supplemental Dataset 6.

      Reply: For all colony-based phenotyping, we performed at least three biological repeats, meaning that the strains were assayed three times independently, each with at least two colonies (technical repeats). In most cases, we had two or more independently generated deletion strains for each lincRNA, and we performed at least three biological repeats for each strain (hence the higher median number of nine repeats per lincRNA). The numbers of independent deletion strains for each lincRNA are provided in Supplemental Dataset 1 (sheet: lincRNA_metadata, column: n_independent_ko_mutants). The total numbers of repeats carried out for each condition after QC filtering are available in Supplemental Dataset 2 (columns: observation_count). We have now clarified this on p. 6, and the details are provided in the Methods on p. 28-29 (for deletion mutants) and p. 32 (for overexpression mutants). For the high-throughput microscopy and flow cytometry experiments, we performed the repeats as described in the text.

      \*B. Minor comments** *

      - B.1. Specific experimental issues that are easily addressable.

      - The pattern of the SPNCRNA.1343 and SPNCRNA.989 mutants is consistent with the idea that these lincRNAs act in cis and that their deletion interferes with the expression of the adjacent tgp1 and atd1 genes, respectively. The authors could easily test by RT-qPCR or Northern Blot that the lincRNA deletion leads to the induction of the adjacent gene. Also, if the hypothesis of the authors is correct, the ectopic expression of these two lincRNAs in trans should not complement the phenotypes of the corresponding mutants. These experiments would reinforce the conclusion of the authors about the specific regulatory effect of the SPNCRNA.1343 and SPNCRNA.989 lincRNAs.

      Reply: It would actually not be as easy as suggested to obtain conclusive results in this respect. For SPNCRNA.1343 and its neighbour, atd1, the mechanisms involved have already been shown in detail based on several mechanistic studies (Ard et al., 2014; Ard and Allshire, 2016; Garg et al., 2018; Shah et al., 2014; 2014; Yague-Sanz et al., 2020). But these studies did require multiple precise genetic constructs and specialized approaches to interrogate the complex regulatory relationships between the overlapping transcripts which can be both positive and negative. As correctly pointed out by Reviewer 2, we do not know the particular conditions where any cis-regulatory interactions take place, and a negative result would not be conclusive. We have interrogated our RNA-seq data obtained under multiple genetic and environmental conditions (Atkinson et al. 2018) to analyse the regulatory relationship between SPNCRNA.1343 and atd1 (studied before) as well as SPNCRNA.989 and tgp1 (proposed in our manuscript). Depending on the specific conditions, both of these gene pairs show positive or negative correlations in expression levels. So it is not possible to just perform the easy experiment as suggested to reach a clear conclusion.

      - Is there any possibility that some nutrient/drug/stress conditions interfere with the expression from the nmt1 promoter?

      Reply: This seems unlikely as this widely used promoter is known to be specifically regulated by thiamine. Consistent with this, we actually detected phenotypes for over 90% of the overexpression strains. But we cannot exclude the possibility that some conditions might interfere with nmt1 function.

      - Supplemental Figure 7 refers to unpublished data from Maria Rodriguez-Lopez. Is this still allowed?

      Reply: These are just control RNA-seq data from wild-type cells growing in rich medium. It does not seem that meaningful, but if required we could submit these data to the European Nucleotide Archive (ENA).

      - Supplemental Figure 8 shows drop assays to validate the growth phenotypes revealed by the screen for lincRNAs of clusters 1 and 3. As admitted by the authors in the text, in most cases, the effects are quite difficult to see to the naked eye. Did the authors consider the possibility to use growth curves (for the lincRNAs/conditions they would like to highlight), which might be more appropriate to visualize weak effects?

      Reply: We have tried a few experiments in liquid medium using our BioLector microfermentor. However, the doses need to be substantially changed for liquid media (in which cells typically are more sensitive than on solid media). So the situation with the altered conditions would become too confusing and could not be used as a direct validation of our results from solid media.

      - B.2. Are prior studies referenced appropriately?

      Yes. The authors could have cited the work of Huber et al (2016) Cell Rep. (PMID: 27292640) as another pioneer study where systematic lncRNA deletion was performed, even if in this case, these were antisense lncRNAs.

      Reply: Agreed, we now cite this paper in the Introduction (p. 4).

      - B.3. Are the text and figures clear and accurate?

      Overall, I found the text and figures clear.

      Reviewer #3 (Significance (Required)):

      Eukaryotic genomes produce thousands of long non-coding RNAs, including lincRNAs which are expressed from intergenic regions and do not overlap PCGs. Several lincRNAs have been extensively studied and characterized, showing that they function in different cellular processes, such as regulation of gene expression, chromatin modification, etc. However, beside these well documented lincRNAs, the function of most lincRNAs remains elusive. In addition, under the standard growth conditions used in labs, many of them are expressed to very low levels, and for the few cases for which it has been tested, the deletion and/or overexpression in trans often failed to display in a detectable phenotype.

      High throughput approaches for lncRNA functional profiling are currently emerging. The lab of Jurg Bahler recently developed a high throughput colony-based screen approach enabling them to quantitatively assay the growth and viability of fission yeast mutants under multiple conditions (Kamrad et al, eLife 2020). Here, they take advantage of this approach to characterize mutants of 150 lincRNAs in fission yeast, including not only deletion mutants generated using the CRISPR/Cas9 technology, but also overexpression mutants, tested in 149 and 47 growth conditions, respectively. This systematic approach allowed the authors to reveal specific phenotypes for a large fraction of the lincRNAs, emphasizing the fact that they are likely to be functional in particular nutrient/drug/stress conditions, acting in cis but also in trans.

      As I wrote in the summary above, I think that this study is important and constitutes a significant contribution in the lncRNA field.

      My field of expertise: long non-coding RNAs, yeast, genetics.

      \*Referee Cross-commenting** *

      I can see that reviewer #1 and I have raised the same concerns about the lack of insert sequencing for the overexpression plasmids, which is crucial to control that the correct lincRNAs were cloned and that no mutation has been introduced by the PCR. We are also both asking for RT-qPCR controls to show that the lincRNAs are indeed overexpressed. Again, this control is very important as many long non-coding RNAs are rapidly degraded by the nuclear and/or ctyoplasmic RNA decay machineries. So expressing a lincRNA from a plasmid, under the control of a strong promoter, does not guarantee increased RNA levels.

      I see that reviewer #2 is asking for a gametogenesis assay. I think it should be limited to the 3 lincRNAs which belong to the same sub-cluster as meiRNA.

    1. Author Response:

      Reviewer #1 (Public Review):

      Hickey et al. studied chromatin landscape changes in early Zebrafish embryos at three distinct stages: preZGA, ZGA and postZGA. Using ChIP-seq on these time-course samples, they examined developmental genes at their regulatory elements, including promoters and enhancers, that carry nucleosomes enriched with histone variant H2A.Z, as well as post-translational modifications H3K4me1 and H3K27ac, but with low DNA methylation, in early-stage embryos prior to turning on zygotic gene expression. During embryogenesis, this group of elements recruit a Polycomb Repressive Complex 1 (PRC1) component Rnf2 to "write" the ubiquitinated H2A or H2A.Z. The mono-ubH2A/Z then recruits a PRC2 component Aebp2 to further "write" the H3K27me3 repressive mark to silent these developmentally regulated genes in later stage embryos. Using a small molecule to inhibit Rnf2 abolishes H3K27me3 and leads to ectopic gene expression.

      Most of the data for the first half of this manuscript are presented in a clear and logic manner. The conclusions based on these correlation assays are quite obvious and well supported (except a few minor points raised below for clarifications, #2-#3). The major concern is for the second half of the manuscript where a drug is used to draw causal relationships (see point #1 below).

      1. Using small molecule could have secondary effects. It also seems that the drug-induced defects cannot be reversed after being washed away. Furthermore, this drug treatment eliminates almost all H3K27me3 genome-wide, regardless of their occupancy status with mono-ubH2A/Z, making it difficult to make the causal connection between the prerequisite mono-ubH2A/Z occupancy and the subsequent de novo H3K27me3. I think it is important for the authors to address this point more directly as this is the main conclusion of this work. Could the authors perform genetic analyses to confirm the specificity of the phenotypes?

      2. Page 8, line 160-163: "Curiously, enhancer cluster 5 (Figure 2A) was unique - displaying high H3K4me1, very high H3K27ac, and open chromatin (via ATAC-seq analysis; Figure 2 - figure supplement C, D) - but bore DNA methylation - an unusual combination given the typical strong correlation between high H3K4me1 and DNA hypomethylation." I suspect that the authors are talking about the chromatin state at pre-ZGA stage as this is the only stage DNA methylation pattern was included, but it is hard to tell that this cluster displays high H3K4me1 at all.

      We now see the confusion, and are happy to clarify this. We were intending to refer to to the histone marking at postZGA, and the DNAme at postZGA (for cluster #5) – as postZGA is the time when H3K4me1 is high, H3K27ac is very high, and DNAme remains high. The reviewer is right that we do not show the DNAme pattern at post ZGA, only preZGA. However, the DNAme pattern stays almost constant between preZGA (2.5 hpf) and postZGA (4.3 hpf) – a result we published previously in Potok et al., 2010 (note: the maternal genome shows DNA reprogramming prior to 2.5hr, and is then constant through ZGA). We did not include DNAme at every stage simply to save space in Panel A, which was getting crowded. However, to avoid the reader misunderstanding our point, we have taken care to make this clear in the revised manuscript. We thank the reviewer for raising this point.

      1. Page 10, line 206-207: "PRT4165 treatment also conferred limited new/ectopic Aebp2 peaks (Figure 4C, clusters 4, 6, 7,8)", it seems that clusters 4, 6, 7, 8 together are not "limited" compared to clusters 1, 3, and 5, and could be even more abundant.

      Thank you for this comment - we agree with the reviewer and have clarified this in the text and Figure 4. In the initial version, the section where we mention ‘limited’ additional sites was intend to refer to promoters, and although as only a modest fraction of the ectopic sites are at promoters, but we did not provide that context in the text. Indeed, if one looks at all sites in the genome, there are a large number of ectopic sites after PRT4165 treatment. This is shown clearly in the revised Figure 4 (which shows all genomic sites) and we have clarified this in the text.

      We were curious whether there is any feature that helps us understand what might unify the ectopic binding, and therefore underlie the mechanism(s). First, we tested whether binding sites for particular transcription factors might be enriched; however, we did not find a class of binding sites that represented more than 3% of the total sites. We note that others have reported some affinity of mammalian Aebp2 for DNA and some limited sequence specificity (Kim et al., NAR 2009), and in the absence of a high-affinity H2AUb target, that shadow DNA binding function may become more apparent. Furthermore, we did not observe chromatin marks that showed a highly significant degree of overlaps. Thus, although intriguing, there does not appear to yet be a logic to the ectopic binding observed.

      1. In the context of studying the chromatin state of developmental genes in early vertebrate embryos, there are two recent publications in mouse embryos which also investigated the crosstalk between mono-ubH2A and H3K27me3 at the ZGA transition in mouse (https://doi.org/10.1038/s41588-021-00821-2 and https://doi.org/10.1038/s41588-021-00820-3). It would be informative to add some discussion for comparisons between these two vertebrate organisms.

      Reviewer #2 (Public Review):

      One model for polycomb domain establishment suggests that PRC2 adds H3K27me3 first, and then recruits PRC1 for silencing. The key evidence for this model is the H3K27me3-binding module CBX proteins in canonical PRC1 complexes. This model has been revised by recent studies, and it is now well recognized that the polycomb domains can be de novo established in a different order. In other scenarios, including X inactivation, a non-canonical PRC1 complex that lacks CBX proteins catalyzes ubH2A first, and PRC2 complex is subsequently recruited through recognizing ubH2A modification by its Jarid2 and Aebp2 subunits.

      In this manuscript, Hickey and co-workers analyzed the temporal change of various epigenetic marks around ZGA stages during zebrafish early embryo development. Based on their experimental data and bioinformatic analysis, they suggest that polycomb establishment in zebrafish embryo is following the 'non-canonical' order, in which H3K27me3 establishment is dependent on ubH2A pre-deposition and the following recruitment of Aebp2-PRC2 complex. Moreover, they suggest that polycomb-silenced developmental genes are solely repressed by ubH2A, independent of H3K27me3. Overall, the functional analysis (RNF2 inhibitor experiments) conducted in the current study highlights the critical function of PRC1 and ubH2A in silencing developmental genes during early embryo development. Moreover, this study provides clues that could reconcile with the earlier observations that H3K27me3 seems largely dispensable for silencing developmental genes in zebrafish early embryo (e.g. PMID: 31488564).

      The main concern is two similar studies have just been published in Nature Genetics using mouse early embryos, and the observation of this manuscript largely agree with the two mouse studies, rendering the novelty of this study.

      In addition, certain conclusions in the manuscript requires further experimental support:

      1. While the authors claim that H3K27me3 is established after ZGA, it is quite surprising to me that they did NOT analyzed the H3K27me3 pattern before ZGA. While IF staining suggests a minimal level of H3K27me3 before ZGA (Fig1 S2B), previous ChIP-seq analysis demonstrate that H3K27me3 are present (e.g. PMID: 22137762).

      Briefly, in our own work, we do not detect H3K27me3 by IF prior to ZGA, and we could not detect H3K27me3 peaks by ChIP during preZGA (also mentioned as ‘data not shown’ in Murphy et al., 2018).

      1. While the RNF2 inhibitor experiment clearly demonstrates that PRC1 is required for the deposition of both ubH2A and H3K27me3, that does not necessarily mean that PRC1-mediated ubH2A deposition precedes H3K27me3. The establishment and maintenance of polycomb domain usually requires the crosstalk and reinforcement between polycomb complexes. Therefore, the deficiency in either PRC1 or PRC2 complex may lead to the decreased level of both marks. To clarify a hierarchical order of the polycomb domain establishment, a phenotypic analysis of PRC2 deficiency is also necessary.

      Here, we emphasize that prior to performing the inhibitor experiment, we addressed the temporal order of addition in Figure 1 and in Figure 1 – figure supplement 1. H2Aub1 is added extensively to thousands of developmental genes during preZGA, well before H3K27me3 is detected. We interpret this as evidence that H2Aub1 temporally precedes H3K27me3 during embryonic development. We will also mention (described in the Discussion) that maternal zygotic loss of Ezh2, which eliminates all H3K27me3 in the genome at all embryo stages does not result in the activation of developmental genes.

      1. Parental difference. As shown in Fig.1B, ubH2A level varies greatly in sperm and egg, which suggests that the reprogramming process of ubH2A (and perhaps H3K27me3) distribution could be significantly different for the two parental alleles. It would be interesting to analyze the ubH2A and H3K27me3 distribution in germ cells before fertilization.

      We appreciate the reviewer’s comment and agree that this would be an interesting line of inquiry. However, this would require genomics analyses from reciprocal crosses of highly polymorphic fish strains. This would involve very considerable additional work. Therefore, we will consider this in our future studies.

      1. The role of Aebp2 subunit. Given the well-characterized function of Aebp2 in recognizing ubH2A, an involvement of Aebp2-PRC2 complex in establishing H3K27me3 on PRC1 pre-deposited regions is not unexpected. Indeed, Aebp2 co-localized well with ubH2A marked regions (Fig.3). However, an issue not clarified in the manuscript is whether Aebp2 is the sole subunit for the recruitment of PRC2 to ubH2A marked regions. Paralleled analysis of the changes for Aebp2 and H3K27me3 upon RNF2 inhibitor treatment is necessary, and Aebp2-dependent and -independent regions should be separately classified for analysis.

      2. Role of PRC1 on the temporal regulation of gene expression during early development. The authors only analyzed the RNA-seq results for RNF2i treated embryos post ZGA. Therefore, it is currently not clear if the role of PRC1 in transcriptional repression is restricted to post-ZGA stages. RNA-seq analysis of RNF2i treated embryos on those stages are also warranted.

    1. Author Response:

      Reviewer #1 (Public Review):

      The model proposed here is the first large-scale model that actually performs a cognitive task, which in this case is working memory but could easily extend to decision making in general as is acknowledged by the authors. Briefly, each of the 30 areas are simulated as a rate, Wong-Wang circuit (i.e. two excitatory pools inhibit each other through a third, inhibitory population). The authors use previously collected anatomical data to constrain the model and show qualitatively match with the data, in particular how mnemonic activity emerges somewhat abruptly along the brain hierarchy.

      Strengths Previous models have focused on neural dynamics during the so-called "resting state", in which subjects are not performing any cognitive task - thus, resting. This study is therefore an important improvement in the field of large-scale modelling and will certainly become an influential reference for future modelling efforts. As typically done in large-scale modelling, some anatomical data is used to constrain the model. The model shows several interesting characteristics, in particular how distributed working memory is more resilient to distractors and how the global attractors can be turned off by inhibition of only top areas.

      Weaknesses Some of these results are not clear how they emerge, and some "biological constraints" do not seem to constrain. Moreover, some claims are slightly exaggerated, in particular how the model matches the data in the literature (which in some cases it does not) or how somatosensory working memory can be simulated by simply stimulating the "somatosensory cortex".

      This paper has two different models, one being a simplified version of the main model. However, it is not very clear what the simplified model adds the main findings, if not to show that the empirical anatomical connectivity does not constrain the full model.

      We thank the reviewer for this evaluation, and for appreciating the innovative character of our study in implementing a cognitive function in a data-constrained large-scale brain model. We hope that it will be useful for future studies planning to add cognitive functions to their large-scale models, and also for experimentalists who might benefit from this insight.

      In response to the detailed comments of the reviewer, and to address the weaknesses identified above, we have rewritten parts of the text, clarified important concepts and included a new simulations. Briefly:

      -We have clarified the nature and effects of the ‘biological constraints’ that we use. The full model that we use is indeed data-constrained, in the sense that we use real data to determine the values of many parameters. Having a data-constrained model, however, does not mean that all the results will be equally constrained. Some model results will critically depend on (some) data used to constrain the model, while other results will be more robust to changes in these parameters. We have highlighted this point and we also added explanations for each of the results presented.

      -We have corrected several claims along the text to make it more in line with experimental evidence, and included the new references suggested by the reviewer to this effect. For example, for the case of somatosensory WM mentioned by the reviewer, we have indicated that the existence of a ‘gating’ mechanism (explored in a supplementary figure) is important for achieving an accurate match with the experimentally observed effects of somatosensory stimulation.

      -Finally, we have highlighted the complementary benefits of the full and simplified models, and improved our motivation for the latter. Briefly, the simplified model allows us to identify the key ingredients needed for distributed WM (useful to generalize to other animal models), while the full model ensures that the main findings are still present when more realistic assumptions are made. A good example is the counterstream inhibitory bias, which is in principle not necessary for a simplified model but becomes a crucial factor to implement the distributed WM mechanism in our macaque model.

      Reviewer #2 (Public Review):

      There is a lot to like about this manuscript. It provides a large-scale model of a well-known phenomenon, the "delay activity" underlying working memory, our oldest and most enduring model of a cognitive function. The authors correctly state that despite the ubiquity of delay activity, there is little known about the macro and micro circuitry that produces it. The authors offer a computational model with testable hypotheses that is rooted in biology. I think this will be of interest to a wide variety of researchers just as delay activity is studied across a variety of animal models, brain systems, and behavior. It is also well-written.

      My main concern is the authors may be self-handicapping the impact of their model by not taking into account newer observations about delay activity. For a number of years now, evidence has been building that working memory is more complicated than "persistent activity" alone. Stokes, Pasternak, Dehaene, Miller and others have been mounting considerable evidence for more complex dynamics and for "activity-silent" mechanisms where memories are briefly held in latent (non-active) forms between bouts of spiking. There is also mounting evidence that the thalamus plays a key role in working memory (and attention). In particular, higher thalamic nuclei are critical for regulating cortical feedback. Cortical feedback plays a central role in the model presented here. The model presented in this manuscript just deals with persistent attractor states and the cortex alone.

      This is not to say that this manuscript does not have good value as is. No one disputes that some form of elevated, sustained, activity underlies working memory. This work adds insights into how that activity gets sustained and the role of, and interactions between, different cortical areas. The observation that the prefrontal and parietal cortex are more critical than other areas, that there are "hidden" attractor states, and "counterstream inhibitory bias" are important insights (and, importantly, testable). They will likely remain relevant even as the field is moving beyond persistent attractor states alone as the model for working memory. The new developments do not argue against the importance of delay activity in working memory. They show that it is more to the story, as inevitably happens in brain science.

      The authors do include a paragraph in the Discussion referencing the newer developments. Kudos to them for that. However, it presented as "new stuff to address in the future". Well, that future is now. These "newer" developments have been mounting over the past 10 years. The worry here is that by relying so heavily on the older persistent attractor dynamics model and presenting it as the only model, the authors are putting an early expiration date on their work, at least in terms of how it will be received and disseminated.

      We thank the reviewer for a careful and positive evaluation of our work. We consider that the main point raised here is indeed crucial: classical explanations of WM based on elevated and constant firing are an important part of the story, however other alternative or complementary approaches developed in the past years also deserve attention. These approaches include, to name a few, activitysilent mechanisms (Mongillo et al. 2008, Trübutschek et al. 2017), dynamic hidden states (Wolff et al. 2017), persistent activity without feedback (Goldman 2009), and paradigms relying on gamma bursts (Miller et al. 2018).

      It’s important to highlight, however, that our approach is “attractor network theory” not “persistent activity theory”, and an attractor does not have to be a steady state (tonic firing) but may display complex spatiotemporal patterns (fluid turbulence with tremendously rich temporal dynamics and eddies on many spatial scales is an attractor). We now have largely eliminated the use of “persistent” in the manuscript. On the other hand, for lack of a better word it’s fine to still use that term, if it is understood in a more general sense, which also includes stable representations in which the activity of individual neurons varies along the delay period (Goldman, 2009; Murray et al. 2017) or rhythmic activity which persists over time (Miller et al. 2018). The attractor network theory should be contrasted conceptually with mechanisms based on intrinsically transient memory traces (see Wang TINS 2021 for a more elaborated discussion on this).

      Our proposal for distributed WM has a general aim and it’s not restricted to the classical ‘elevated constant firing’ scenario. Following the reviewer’s suggestion, we have rewritten the text to make sure that multiple mechanisms of WM are acknowledged in different parts of the text, not only on a paragraph in the discussion. We have also acknowledged the importance of thalamocortical interactions and cited previous relevant studies in this sense (such as Guo et al. 2017), also as a response to comments from Reviewer 1.

      In addition, we have attempted to go beyond a simple rewriting and, using a variation of our simplified model, we now show that distributed WM representations can also happen in the context of activitysilent models (Figure 3 –figure supplement 1). In particular, we use a simplified network model with reduced local and long-range connectivity strength and incorporate short-term synaptic facilitation in synaptic projections. Our model results show that, while activity-silent memory traces can’t be maintained when areas are isolated from each other, inter-areal projections reinforce the synaptic efficacy levels and lead to a distributed representation via activity-silent mechanisms.

      We hope that this result serves to prove the generality of our distributed WM framework, and opens the door to subsequent studies focusing not only on distributed activity-silent mechanisms, but in distributed frameworks relying on other WM mechanisms as well.

    1. Author Response:

      Reviewer #3 (Public Review):

      The paper contains a substantial amount of novel experimental work, the experiments appear well done, and the analysis of the data makes sense. Raw data and analysis scripts have been made fully available.

      I have two specific comments:

      • While the paper talks extensively about deep mutational scanning, I don't think this is a deep mutational scanning study. In deep mutational scanning, we usually make every possible single-point mutation in a protein. This is not what was done here, as far as I can tell.

      In the revised manuscript, we have avoided using deep mutational scanning to describe our experimental design. Instead, we described our approach as “a high-throughput experimental approach that coupled combinatorial mutagenesis and next-generation sequencing”

      • For the analysis of epistasis vs distance (Fig 4d, e, f), it would be better to look at side-chain distances rather than C_alpha distances. In covariation analyses, it can be seen that C_alpha distances are not a good predictor of pairwise interactions. Similar patterns may be observable here.

      See e.g.: A. J. Hockenberry, C. O. Wilke (2019). Evolutionary couplings detect side-chain interactions. PeerJ 7:e7280.

      Thank you for the suggestion. In the revised manuscript, we replaced the Cα analysis by a side-chain analysis according to Hockenberry and Wilke (see response to Essential Revisions above).

  9. wt3fall2021.commons.gc.cuny.edu wt3fall2021.commons.gc.cuny.edu
    1. And then we go right in after you see after that montage then you see it's Sideshow Bob in court

      This kind of dialogue between friends discussing something they all love is relatable. However, I find it quite odd that they can recount detail by detail, montage by montage, as if they have each watched it more than a thousand times. Its not naturalistic at times and forces me to think they may have been restricted from such entertainments in their futuristic world. Why are they discussing the Simpsons in such great detail? Is this to feed the audience with information that will be later relevant in Act 3 (As they are noted to wear Simpson costumes) Hmmm we will see

    1. The diversity of human values and the methods by means of which they may be realized is so vast, and many of them remain so unacknowledged, that they cannot fail but lead to conflicts in human relations.  Indeed, to say that human relations at all levels -- from mother to child, through husband and wife, to nation and nation -- are fraught with stress, strain, and disharmony is, once again, making the obvious explicit.  Yet, what may be obvious may be also poorly understood. This I think is the case here.  For it seems to me that -- at least in our scientific theories of behavior -- we have failed to accept the simple fact that human relations are inherently fraught with difficulties and that to make them even relatively harmonious requires much patience and hard work. I submit that the idea of mental illness is now being put to work to obscure certain difficulties which at present may be inherent -- not that they need be unmodifiable -- in the social intercourse of persons.  If this is true, the concept functions as a disguise; for instead of calling attention to conflicting human needs, aspirations, and values, the notion of mental illness provides an amoral and impersonal "thing" (an "illness") as an explanation for problems in living (Szasz, 1959).  We may recall in this connection that not so long ago it was devils and witches who were held responsible for men's problems in social living.  The belief in mental illness, as something other than man's trouble in getting along with his fellow man, is the proper heir to the belief in demonology and witchcraft. Mental illness exists or is "real" in exactly the same sense in which witches existed or were "real."  

      This section sets the tone for ensuring that we do not disguise what would be a normal problem with day to day efforts in society as a mental illness. This is hugely impactful on modern society and is key again in treatment. Professionals must be able to decipher the reality of mental illness from daily life struggles. To be fair, any truly trained psychologist or psychiatrist should be able to do this given they have the proper training and credentials. This however is something that must be at the forefront of a professionals mind.

    2. To recapitulate: In actual contemporary social usage, the finding of a mental illness is made by establishing a deviance in behavior from certain psychosocial, ethical, or legal norms.  The judgment may be made, as in medicine, by the patient, the physician (psychiatrist), or others.  Remedial action, finally, tends to be sought in a therapeutic -- or covertly medical -- framework, thus creating a situation in which psychosocial, ethical, and/or legal deviations are claimed to be correctible by (so-called) medical action.   Since medical action is designed to correct only medical deviations, it seems logically absurd to expect that it will help solve problems whose very existence had been defined and established on nonmedical grounds.  I think that these considerations may be fruitfully applied to the present use of tranquilizers and, more generally, to what might be expected of drugs of whatever type in regard to the amelioration or solution of problems in human living.  

      If we are to advance we must question who is making the standards which is something that is being questioned here by the author. The problem that the author is showing is the definitions of illness along with health regarding someone's mental state. This is also showing a disdain for the idea of medicating someone on grounds that are not established within physicality. This methodology is rather skewed in accordance to modern testing and understanding but serves as a stellar check and balance to those who intend to practice now.

    3. "Mental illnesses" are thus regarded as basically no different than all other diseases (that is, of the body).  The only difference, in this view, between mental and bodily diseases is that the former, affecting the brain, manifest themselves by means of mental symptoms; whereas the latter, affecting other organ systems (for example, the skin, liver, etc.), manifest themselves by means of symptoms referable to those parts of the body.  This view rests on and expresses what are, in my opinion, two fundamental errors. In the first place, what central nervous system symptoms would correspond to a skin eruption or a fracture?  It would not be some emotion or complex bit of behavior. Rather, it would be blindness or a paralysis of some part of the body. The crux of the matter is that a disease of the brain, analogous to a disease of the skin or bone, is a neurological defect, and not a problem in living. For example, a defect in a person's visual field may be satisfactorily explained by correlating it with certain definite lesions in the nervous system.  On the other hand, a person's belief -- whether this be a belief in Christianity, in Communism, or in the idea that his internal organs are "rotting" and that his body is, in fact, already "dead" -- cannot be explained by a defect or disease of the nervous system.  Explanations of this sort of occurrence -- assuming that one is interested in the belief itself and does not regard it simply as a "symptom" or expression of something else that is more interesting -- must be sought along different lines. The second error in regarding complex psycho-social behavior, consisting of communications about ourselves and the world about us, as mere symptoms [p. 114] of neurological functioning is epistemological.  In other words, it is an error pertaining not to any mistakes in observation or reasoning, as such, but rather to the way in which we organize and express our knowledge. In the present case, the error lies in making a symmetrical dualism between mental and physical (or bodily) symptoms, a dualism which is merely a habit of speech and to which no known observations can be found to correspond. Let us see if this is so. In medical practice, when we speak of physical disturbances, we mean either signs (for example, a fever) or symptoms (for example, pain). We speak of mental symptoms, on the other hand, when we refer to a patient's communications about himself, others, and the world about him.  He might state that he is Napoleon or that he is being persecuted by the Communists. These would be considered mental symptoms only if the observer believed that the patient was not Napoleon or that he was not being persecuted[sic] by the Communists. This makes it apparent that the statement that "X is a mental symptom" involves rendering a judgment. The judgment entails, moreover, a covert comparison or matching of the patient's ideas, concepts, or beliefs with those of the observer and the society in which they live.  The notion of mental symptom is therefore inextricably tied to the social (including ethical) context in which it is made in much the same way as the notion of bodily symptom is tied to an anatomical and genetic context (Szasz, 1957a, 1957b). To sum up what has been said thus far: I have tried to show that for those who regard mental symptoms as signs of brain disease, the concept of mental illness is unnecessary and misleading.  For what they mean is that people so labeled suffer from diseases of the brain; and, if that is what they mean, it would seem better for the sake of clarity to say that and not something else.

      This component of the passage hold great value via the idea of not having any physical implications that can be reversed to the naked eye. While flawed as mindset, this is something that we must regularly keep in mind as psychologist when attempting to treat patients. When looking into the mind and the effects that we have on society, we must think of how we are going to better the interactions with those around us. Many members of society that suffer from mental illness are in fact almost incapable due to chemical imbalance. This is something that is in fact falsifiable due to testing and treatment results (Parekh, 2018). This is still a great thought process to keep in mind especially for the year it was released.

      Reference, Parekh, R. (2018, August). What Is Mental Illness? What is mental illness? Retrieved November 28, 2021, from https://www.psychiatry.org/patients-families/what-is-mental-illness.

    1. we have a very 00:38:26 unwieldy process of more than close to 200 countries with very stark differences sometimes and very different starting points so i think all of this doesn't really 00:38:39 make a good sort of negotiation process and if we if we go to the next cup my sense is that the process is extremely slow and we are 00:38:50 more or less at say setting ourselves up for failure but also you know we are going to one cup after another we with a great sense of a predictability of something that we know it's not going 00:39:03 to work at the pace at which it needs to work

      countries negotiating may not be as effective as working at the individual / civil society level to appeal to the wealthy demographics, who are responsible for the lions share of emissions.

    1. Reviewer #1 (Public Review):

      Todesco et al. investigate the genetic causes of variation in UV pigmentation in sunflowers as well as the possible biotic and abiotic factors that play a role in natural variation for the trait among populations. Overall I am very enthusiastic about this manuscript as it does an elegant job of going from phenotype to a key locus and then presenting a solid foray into the factors causing variation. I have only a fe relatively minor comments.

      The introduction felt a bit short. I was hoping early on I think for a hint at what biotic and abiotic factors UV could be important for and how this might be important for adaptation. A bit more on previous work on the genetics of UV pigmentation could be added too. I think a bit more on sunflowers more generally (what petiolaris is, where natural pops are distributed, etc.) would be helpful. This seems more relevant than its status as an emoji, for example.

      The authors present the % of Vp explained by the Chr15 SNP. Perhaps I missed it, but it might be nice to also present the narrow sense heritability and how much of Va is explained.

      A few lines of discussion about why the Chr15 allele might be observed at only low frequencies in petiolaris I think would be of interest - the authors appear to argue that the same abiotic factors may be at play in petiolaris, so why don't we see this allele at frequencies higher than 2%? Is it recent? Geographically localized?

      Page 14: It's unclear to me why there is any need to discretize the LUVp values for the analyses presented here. Seems like it makes sense to either 1) analyze by genotype of plant at the Chr15 SNP, if known, or 2) treat it as a continuous variable and analyze accordingly.

      Page 14: I'm not sure you can infer selection from the % of plants grown in the experiment unless the experiment was a true random sample from a larger metapopulation that is homogenous for pollinator preference. In addition, I thought one of the Ashman papers had actually argued for intermediate level UV abundance in the presence of UV?

      I would reduce or remove the text around L316-321. If there's good a priori reason to believe flower heat isn't a big deal (L. 323) and the experimental data back that up, why add 5 lines talking up the hypothesis?

      Page 17: The discussion of flower size is interesting. Is there any phenotypic or genetic correlation between LUVP and flower size?

    1. Author Response:

      Reviewer #1:

      Summary:

      Moody et al. presented a comprehensive investigation into the choice of marker genes and its impact on the reconstruction of the early evolution of life, especially regarding the length of the branch that separates domains Bacteria and Archaea in the phylogenetic tree. Specifically, this work attempts to resolve a debate raised by a previous work: Zhu et al. Nat Commun. 2019, that the evolutionary distance between the two domains is short as estimated using an expanded set of marker genes, in contrast to conventional strategies which involve a small number of "core" genes and indicate a long branch.

      Through a series of analyses on 1000 genomes, Moody et al. defended the use of core genes, and reinforced the conventional notion that the inter-domain branch (the AB branch) is long, as inferred by the core gene set. They proposed that with the 381 marker genes (the "expanded" set) used by Zhu et al., the observed short branch length is an artifact due to inter-domain gene transfer and hidden paralogy. Through topology tests, they ranked the markers by "verticality", and showed that it is positively correlated with the AB branch length. They also conducted divergence time estimation and showed that even the most vertical genes led to an implausible estimate of the origin of life.

      In parallel, Moody et al. surveyed the best marker genes using a set of 700 genomes. They recovered 54 markers, and demonstrated that ribosomal markers do not indicate a longer AB branch than non-ribosomal markers do. With the better half (27) of these marker genes, they conducted further phylogenetic analyses, which shows that potential substitutional saturation and the use of site-homogeneous models could contribute to the underestimation of the AB branch. Using this taxon set and marker set, they reconstructed the prokaryotic tree of life, which revealed a long AB branch, a basal placement of DPANN in Archaea, and a derived placement of CPR in Bacteria.

      Prokaryotic tree of life:

      The scope(s) of the manuscript is somehow split. First, it is posed as a point-to-point rebuttal to the Zhu et al. paper, on the long vs. short AB branch question. Second, it introduces a new phylogeny of prokaryotes using 27 "good" marker genes, and demonstrates that DPANN is basal to Archaea, and CRP is derived within Bacteria.

      Thanks for the summary. The two aspects of the manuscript identified by the reviewer are closely related, because the different issues boil down to the same underlying question: which genes should we use to infer the deep structure of the tree of life? The provocative work of Zhu et al. acted as an impetus to compare and evaluate the properties of several published marker gene sets, and then to identify (what our analyses suggest are) the subset best-suited for deep phylogeny, which we then use to infer an updated tree of life. We have clarified this logical structure in the revised manuscript, writing (at the end of the Introduction):

      “Here, we investigate these issues in order to determine how different methodologies and marker sets affect estimates of the evolutionary distance between Archaea and Bacteria. First, we examine the evolutionary history of the 381 gene marker set (hereafter, the expanded marker gene set) and identify several features of these genes, including instances of inter-domain gene transfers and mixed paralogy, that may contribute to the inference of a shorter AB branch length in concatenation analyses. Then, we re-evaluate the marker gene sets used in a range of previous analyses to determine how these and other factors, including substitutional saturation and model fit, contribute to inter-domain branch length estimations and the shape of the universal tree. Finally, we identify a subset of marker genes least affected by these issues, and use these to estimate an updated tree of the primary domains of life and the length of the stem branch that separates Archaea and Bacteria.”

      The second scope has inadequate novelty. A recent paper (Coleman et al. Science. 2021), which was from a partially overlapping group of authors, was dedicated to the topic of CPR placement, and indicated the same conclusion (CPR being derived and sister to Chloroflexi) as the current work does, albeit using more sophisticated approaches. The paper also addressed the debate of CPR placement (including citing the Zhu et al. paper). Additionally, the basal placement of DPANN has also been suggested by previous works (such as Castelle and Banfield. Cell. 2018). Therefore, re-addressing these two topics using a largely well-established and repeatedly adopted method on a relatively small taxon set does not constitute a significant extension of current knowledge.

      We disagree. Resolving the deep structure of the tree of life is an important topic --- this is what we, Zhu et al. (2019), and of course many others have been trying to achieve, in different and sometimes conflicting ways. Most of the published work is based on limited or biased taxon sampling (see Figure 1 Figure Supplement 14,15,16) or else focused on just one of the two prokaryotic domains of life. Furthermore, deep phylogeny is uncertain, and new results become convincing only when they receive support from multiple datasets and approaches. For instance, Coleman et al. (2021) recently found support for the placement of CPR as a sister clade to Chloroflexota rather than as a basal branch within the Bacteria. Notably, this work focused only on Bacteria, and made use of a different rooting method (with its own strengths and limitations) and taxon sampling. Most previous analyses using Archaea as an outgroup to root the bacterial tree recovered CPR as a deeply branching lineage within Bacteria, a placement likely resulting from LBA. In turn, our present findings represent an important confirmation of the CPR+Chloroflexi clade. Similarly, the basal placement of DPANN within Archaea remains controversial despite a number of studies on the topic, and our study also contributes to that ongoing debate.

      The debate:

      The first scope appears to be the more important goal of this manuscript, as it extensively discusses the claims made by Zhu et al. and presents a point-to-point rebuttal, including counter evidence. This may narrow the interest of this work to a small audience of specialists. Nevertheless, to best evaluate the current work, it is necessary to review the Zhu et al. paper and compare individual analyses and conclusions of the two studies.

      In doing so, I found that the two articles have distinct scopes that appear similar but not actually inline. To a large extent, the current work does not constitute actual rebuttal to the points made by Zhu et al. In contrast, some of the analyses presented in the current work support those by Zhu et al., despite being interpreted in a different way. For the claims that directly contest Zhu et al., I do not see sufficient evidence that they are supported by the analyses.

      Below is a summary of the comparison, which I will explain point-by-point in later paragraphs.

      • Moody et al. assessed AB branch length, while Zhu et al. assessed AB evolutionary distance (which is different).
      • Moody et al. evaluated the phylogeny indicated by a small number of core markers, while Zhu et al. evaluated the genome average using hundreds of global markers.
      • Zhu et al.'s results also showed that gene non-verticality, substitutional saturation, and site-homogeneous models shorten the AB distance, which is consistent with Moody et al.'s.
      • However, Zhu et al. found that some core markers are outliers in the genome-wide context, and the long AB distance indicated by them cannot be compensated for by the aforementioned effects. Moody et al. hasn't addressed this. Therefore, the novelty and potential impact of the current work is less compelling: It used a classical method (a few dozen core genes) and found a pattern that has been found many times by some of the same authors and others (including Zhu et al., who also analyzed core genes).

      Thanks for this detailed comparison of the two studies --- the points raised here and elaborated on below have prompted us to perform additional analyses which provide further insight into the properties and behaviour of the various marker gene sets analyzed. We nonetheless disagree that “the current work does not constitute actual rebuttal to the points made by Zhu et al.”: our finding that ribosomal and other “core” proteins are among the best phylogenetic markers for resolving both within- and between-domain relationships, estimating the length of the AB stem, and performing divergence time estimation, challenges an important claim of Zhu et al.’s study, and will be of broad interest to the community of researchers working on early life/early evolution.

      That said, we do also agree that one aspect of the disagreement between our study and that of Zhu et al. has to do with what is meant by evolutionary distance, and we have now discussed these issues in detail in the revised manuscript (as detailed below). In revising the manuscript, we have also sought to avoid a reductive focus on rebuttal, have revised the text to acknowledge important strengths and interesting features of the Zhu et al. analyses, and have made text revisions to ensure a consistent constructive tone: these are fundamental and challenging questions, and different perspectives and analyses are valuable in making progress. We also note that there has been an ongoing debate about the suitability of ribosomal genes for deep phylogeny in the literature (e.g. Petitjean et al. 2014, discussed in more detail below). Our analyses, and those of Zhu et al. (2019) previously, contribute to that broader discussion.

      Detailed responses to each of the above points follow below.

      AB distance metric:

      There is a subtle but critical difference between the scopes of the two papers: The Zhu et al. paper "reveals evolutionary proximity between domains Bacteria and Archaea". By stating "evolutionary proximity", it investigated two metrics: The length of the branch separating Archaea from Bacteria in the phylogenetic tree, i.e., the "AB branch". This was the main focus of the current work.

      The average tip-to-tip distance (sum of branch lengths) between pairs of Archaea and Bacteria taxa in the tree. A significant proportion of the Zhu et al. work was discussing this metric, and it led to several important conclusions (e.g., Figs. 4F, 5). The current work has not explored this metric.

      Thanks for raising the point about relative AB distance. In our revised manuscript, we have expanded Figure 1 and the associated analyses to include this metric. These analyses demonstrate that relative AB distance behaves similarly to AB branch length: they are positively correlated with each other; both are reduced by inter-domain HGT, and both are negatively correlated with ΔLL and with split score, an additional metric of within- and between-domain marker gene verticality which we have included in the revised Figure 1. Taken together, these results suggest that high-verticality marker genes (as judged both by the recovery of reciprocal AB monophyly, and of established within-domain relationships) support a longer AB branch and show a higher relative AB distance.

      These two metrics implicate distinct research strategies: For 1), HGTs and paralogy are usually considered problematic (as the current and many previous works argued). However, 2) is naturally compatible with the presence (and prevalence) of HGTs and paralogy.

      Authors of the current work equate "genetic distance" to "branch length" (line 70), and only investigated the latter. This equation is misleading. If organism groups A and B diverged early, but then exchanged many genes post-divergence, then this is indisputable evidence that their "genetic distance" is close. This point needs to be clearly explained in the manuscript.

      We agree with the reviewer that various definitions of evolutionary distance are possible, and some may be more useful than others for particular applications. The reviewer’s argument that “If organism groups A and B diverged early, but then exchanged many genes post-divergence, then this is indisputable evidence that their "genetic distance" is close” makes the case for a kind of phenetic distance: a distance based on overall similarity, regardless of how that similarity was brought about in terms of evolutionary process. We appreciate the democratic appeal of such a metric, and we have no desire to impose any particular philosophy of classification on the reader. However, the key point here is that methods that rely on concatenation for branch length or divergence time estimation (as used by Zhu et al., and in our current study) make the assumption that all of the sites in the concatenate evolved on the same underlying tree and if this assumption is not met, analyses can be misled. Thus, the shorter AB branch length and the more recent Archaea-Bacteria divergence times estimated from concatenations of incongruent marker genes result from unmodelled gene transfers which are misinterpreted as evidence for more recent common ancestry. Gene transfer is an important aspect of genome evolution, but none of the currently available methods, including those used by Zhu et. al., allow for genome-scale comparisons to be made in a way that accounts for our understanding of the underlying evolutionary processes.

      The point about different possible definitions of evolutionary distance made by the reviewer is valid, and we have now revised the opening of our conclusion to discuss these issues in more detail, writing:

      “We note that alternative conceptions of evolutionary distance are possible; for example, in a phenetic sense of overall genome similarity, extensive HGT will increase the evolutionary proximity (Zhu et al., 2019) of the domains so that Archaea and Bacteria may become intermixed at the single gene level. While such data can encode an important evolutionary signal, it is not amenable to concatenation analysis.”

      Core vs genome:

      This difference between "AB distance" and "AB branch length" is relevant to a more fundamental question: What defines the "evolutionary distance" between two groups of organisms? Both papers did not explicitly discuss this topic. It likely cannot be resolved in one article (as many scholars have continuously attempted on related topics in the past decades). But the discordance in understanding led to very different research strategies in the two papers, and rendering them incongruent in methodology.

      Specifically, the current work (and multiple previous works) based phylogenetic inference on only genes that demonstrate a strong pattern of vertical evolution. HGTs were considered deleterious, and needed to be excluded from the analysis. This left a few dozen genes at most, and many are spatially syntenic and functionally related (e.g. ribosomal proteins). In this work, the final number is 27. Previous critiques of this methodology have suggested that this is not a tree of life, but a "tree of one percent" (Dagan and Martin, Genome Biol. 2006).

      In contrast, Zhu et al. (and related previous works) attempted to evaluate the evolution of whole genomes by "maximizing the included number of loci.". They used a "global" set of 381 genes. They faced the challenge of "reconciling discordant evolutionary histories among different parts of the genome", because "HGT is widespread across the domains". To resolve this, they adopted the gene tree summary method ASTRAL.

      Therefore, the "AB distance" estimated by Zhu et al. is a genome-level distance, calculated by merging conflicting gene evolutions (which itself can be disputed, see below). Whereas the "AB branch" evaluated in this work is strictly the branch length in the core gene evolution. Therefore, the results presented in the two papers do not necessarily conflict, because of the different scopes.

      This point is closely related to the previous one, and the new section (final paragraphs of the Conclusion, quoted directly above) goes some way to addressing this comment. Regarding the issue of a focus on just a small proportion of vertically-evolving genes, the critical point is as above: current methods for branch length and divergence time estimation (including those used by Zhu et al.) require such vertically-evolving genes, because they make the assumption that all of the sites evolve on the same tree, i.e. trace back to the same origin via vertical evolution. We agree that most prokaryotic gene families do not evolve under these restrictive assumptions and therefore cannot be analysed using concatenation methods for branch length estimation. Indeed, one of the main points of our study is that most of the genes in the 381-gene set of Zhu et al. do not meet these assumptions and are thus unsuited for estimating evolutionary distance and divergence times.

      There is much ongoing method development which will allow more of the genome to be used in deep-time comparative analyses; Astral-Pro, FastMulRFS and SpeciesRax, among others, are recent promising steps in this direction. However, our central critique of Zhu et al. is that inferences under concatenation-based methods can be misled by HGT and other sources of incongruence, and indeed our analyses show that these unmodelled signals underlie the difference between the conclusions of Zhu et al. and other studies (e.g. (Liu et al., 2021; Spang et al., 2015; Williams et al., 2020) that have instead supported a deep divergence between Archaea and Bacteria. In our revised manuscript, we have shown that the relative AB distance, like the AB branch length, is shortened by unmodelled gene transfers (Figure 1), and that estimates of the AB stem length from different studies are similar when the congruent subset of the data is analysed with the best available substitution models (Figure 6). We therefore disagree that the scopes are distinct: richer, broader measures of genomic diversity can be proposed and, with the development of new methods, estimated; but so far, the vertical signal is the only signal that can be harnessed to infer divergence times using concatenations.

      The expanded marker set:

      The authors made a valid critique (line 121-135) that many of the 381 genes in the "expanded marker set" adopted by Zhu et al., are under-represented in Archaea. According to the PhyloPhlAn paper (Segata et al. Nat Commun. 2013) which originally developed the 400 markers (a superset of the 381 markers), these genes were selected from ~3,000 bacterial and archaeal genomes available in IMG at that time time (note that it was 2013). Zhu et al. also admitted, in the discussion section, that this marker set falls short in addressing some questions (such as the placement of DPANN). What is important in the current context, is that they were not specifically selected to address the AB distance question.

      We agree that the taxon sampling of archaea and the choice of marker genes in the Zhu et al. study were not ideal for estimating the evolutionary distance between the domains. However, we note that this distance (or proximity), and the hypothesis that traditional core genes over-estimate the Archaea-Bacteria divergence, was one of the main results of the paper (c.f. the title of that paper, “Phylogenomics of 10,575 genomes reveals evolutionary proximity between domains Bacteria and Archaea”).

      However, note that Zhu et al.'s Fig. 5A, B presented the AB distance informed by 161 out of the 381 genes. These genes have at least 50% taxa represented in both domains - the same threshold discussed in the current work (line 132).

      While the 50% sampling criterion indeed enriches for the genes of the expanded set that were present in LUCA and on the AB branch, we note that the 50% criterion represents a minimum of 4953 bacteria and 335 archaea; that is, it still reflects the unbalanced sampling of the dataset overall. For example, 30 of the genes had fewer than two archaeal homologues, and in 100 of the trees there were fewer than 50 archaea reflecting the large disparity in taxon sampling (Supplementary Information Table S1). The phylogenetic signal in these genes is discussed in more detail below. Looking at the subsampled versions of these 161 genes, we found the majority of these genes (123/161) to have no discernible AB branch length. The 38/161 genes which had an arguable AB branch length (but still with transfers/paralogs) possessed a range of AB lengths: 0.0814:5.26, with a mean AB length of 1.03 and a median of 0.635.

      Even with those sufficiently represented genes, they still found that ribosomal proteins and a few other core genes are "outliers" in the far end of the AB distance spectrum.

      The reviewer raises an interesting point about outliers with high relative AB distances, which gets to the heart of the debate about how best to estimate the evolutionary distance between Archaea and Bacteria. The new analyses of relative AB distance introduced in our revised manuscript (Figure 1) demonstrate that this metric is affected by HGT in a similar manner to AB branch length (that is, high-verticality marker genes have a greater relative AB distance (relative AB vs ΔLL: p = 0.0001051 & R = -0.2213292, relative AB vs between-domain split score: p = 2.572e-06 & R = -0.2667739). Thus, core genes can be viewed as “outliers” compared to other prokaryotic genes in the sense that they have experienced an unusually low amount of HGT. This high verticality makes them among the few prokaryotic gene families that can be analysed by concatenation methods, which make the assumption that all sites evolve on the same underlying tree topology.

      Domain monophyly in gene trees:

      The authors' efforts in manually checking the gene trees are appreciable (Table S1), considering the number and size of those trees. They found (line 147) "Archaea and Bacteria are recovered as reciprocally monophyletic groups in only 24 of the 381 published (Zhu et al., 2019) maximum likelihood (ML) gene trees of the expanded marker set."

      The domain monophyly check was valid, however the result could be misleading because any sporadical A/B mixture was considered evidence of non-monophyly for the entire gene tree. As the taxon sampling grows, the opportunity of observing any A/B mixture also increases. For example, in Puigbò et al. J. Biology. 2009, 56% (a much higher ratio) of nearly universal genes trees had perfect domain monophyly based on merely 100 taxa. This is because even the "perfect" marker genes (such as ribosomal proteins) are not completely free from HGTs (e.g., Creevey et al. Plos One. 2011), let alone the fact that there are many artifacts in the published reference genomes (Orakov et al. Genome Biol. 2021).

      Therefore, to have an objective assessment of this topic, it would be better to have a metric that allows some imperfection and reports an overall "degree" of separation (also see below).

      We agree that complementing the monophyly check with a more nuanced metric is useful. In our revised manuscript, we now also evaluate the split score (Dombrowski et al. (2020) Nat Commun) of each marker, which reflects the degree to which a gene recovers the monophyly of established taxonomic ranks (a higher score reflects the splitting of monophyletic groups into a number of smaller clades in the gene tree, and so the metric permits a degree of “imperfection”, as suggested; in addition, the metric is averaged over bootstrap replicates, so that lack of resolution or poorly-supported disagreements with the reference taxonomy do not disproportionately affect the score). This expanded analysis (Figure 1) indicates that both within- and between-domain split score and ΔLL are significantly positively correlated (R = 0.836679, p < 2.2✕10-16), and that phylogenetic markers that more strongly reject domain monophyly (higher delta-LL) also perform worse at recovering between-domain (and within-domain) relationships (higher split score) and support a shorter AB branch length.

      AB branch by gene: correlation and outliers

      Figure 1 is the single most important result in this work, because it argues that the short AB branch observed in Zhu et al. is an artifact due to "inter-domain gene transfer and hidden paralogy" (line 202). This argument is based on the observation that the indicated AB branch length is negatively correlated with "verticality" (measured by ΔLL and split score) of the gene.

      Our argument that the short AB branch results from inter-domain gene transfer and hidden paralogy is based on three main lines of evidence: (i) documentation of extensive transfers and intermixing of paralogues in the gene trees for the 381 gene set; (ii) the analyses in Figure 1, which demonstrate that verticality positively correlates with AB branch length and AB distance; (iii) the demonstration that the incremental addition of low-verticality markers to a concatenate results in a concomitant decrease in AB branch length.

      However, Zhu et al. also investigated the impact of verticality on AB distance, and they also found that they are negatively correlated (Fig. 5E). Therefore, the current result does not appear to deliver new information (as do multiple other analyses, see below).

      Zhu et al. indeed identified a weak positive relationship between gene verticality and AB distance. Our analyses go beyond that work by showing, using a variety of complementary metrics of verticality, that AB branch length and relative AB distance are strongly positively correlated with verticality (see Figure 1), and that the low verticality of the genes in the 381 gene set largely explains the difference in stem length inference between that dataset and earlier analyses (Figure 6). An additional factor not considered in the analyses of Zhu et al. was the question of whether a gene was present in LUCA, and so can provide information on the AB branch length. Our analyses (detailed below) suggest that the majority of genes (317) in the 381 gene set do not contain an unambiguous AB branch, and so do not contribute interpretable signal to estimates of the AB branch length.

      An important finding in Zhu et al., which is largely not discussed in the current work, is that a handful of "core" genes are outliers in the spectrum of AB distance, as compared to the majority of the genome (Fig. 5A). The AB distance indicated by these core genes is so long compared with the genome average that it cannot be compensated for by the impact of non-verticality, substitutional saturation, site-homogeneous model, etc (see below).

      Fig. 1A of the current work also clearly shows that many long-AB branch genes are outliers compared with the majority of the genome (the bottom of the blue bar).

      Figs. 3 and 4 attempted to show that ribosomal proteins are not outliers, but that analysis was based on a very small set of core genes, and the figures clearly show that there are outliers even in this small set (to be further discussed below).

      This comment re-iterates the reviewer’s earlier points about “core” genes as outliers compared to the majority of the genome. The key issue is that “most of the genome”, and a significant portion (317 genes) of the 381-gene set, contain features that make them unsuitable for estimation of AB branch length by concatenation, or indeed estimation of an interpretable relative AB distance. We have documented the cases of HGT and mixing of paralogues in the 381-gene dataset; this information is summarised in the main text and presented in more detail in Supplementary Information Table S1.

      Focusing on the 161 genes with >50% representation in both Archaea and Bacteria, manual inspection of gene trees inferred on the 1000-species subsample under the LG+G+F model indicate that 123/161 do not have a clear AB branch (that is, a branch that separates most or all Archaea from Bacteria). While distinguishing such cases from early gene transfers is not straightforward, there is no compelling reason to think that these genes were present in LUCA. The simplest explanation for these gene phylogenies is instead an origin within Bacteria and subsequent transfer on one or multiple occasions into Archaea. As a result, estimates of AB branch length or relative AB distance inferred from these genes cannot be straightforwardly compared to those of the traditional “core” or other genes for which the evidence of a pre-LUCA origin is stronger. Considering only the 38/161 genes for which a LUCA origin appears, from the gene phylogeny, to be likely, the mean AB branch length is 1.03, greater than that estimated from the concatenation of the most vertical genes in the expanded set (0.56), and suggesting that phylogenetic incongruence, combined with (for some families) a more recent origin explains the shorter AB distances inferred from the 381 gene set. Thus, it is not the case that the AB branch lengths (or relative distances) estimated from the majority of genes form a null distribution” against which “core” genes can be seen as outliers; instead, our analyses suggest that “core” genes are among the limited number of genes that trace vertically to LUCA.

      Regarding Figures 3 and 4, see the more detailed discussion below.

      Verticality is not causative of short AB branch:

      In spite of the outlier question, there is an important logic problem in these analyses: The authors observed that gene verticality (measured by negative ΔLL) is correlated with AB branch length (Fig. 1), and concluded that HGTs and paralogy shortened the AB branch (line 202). However, they did not directly assess the rate of evolution in this model. It is totally possible that the most vertical genes happen to be those that evolved faster at the AB split. In order to support the claim made in this work, it is important to separate the effect of the rate of evolution from the effect of HGT / paralogy.

      The ideal solution would be to include ALL genes (not just "good" ones), build gene trees, identify parts of the gene trees that once experienced HGT or paralogy, and prune off these PARTS, instead of excluding the entire gene tree. The remaining data are thus free of HGT / paralogy, based on which one can quantify the "true" AB branch length, and further assess how much it is correlated with "verticality", and whether there are still "outliers". This solution is likely not trivial in implementation, though. However, without such assessment, the observed short AB branch still only applies to the "tree of one percent", not the "tree of life".

      Thanks for this comment --- the reviewer raises a subtle and valid point. Our analyses indicate that vertically-evolving genes have longer AB branch lengths, but in the first version of our manuscript we did not test the alternative hypothesis that this relationship might simply result from a faster rate of evolution in vertically-evolving genes. To evaluate the relationship between evolutionary rate, verticality, AB branch length and relative AB distance on as broad a set of genes as possible, we took the 302 genes from the 381-gene expanded set, excluding 56 genes for which the 1000-species subsample included no archaea, and another 23 which included only 1 archaeon. To estimate per-gene evolutionary rate, we rooted each gene tree using MAD (Tria et al. 2017) and calculated the mean root- to-tip distance on the MAD-rooted gene tree, then evaluated the relationship between rate and verticality. This analysis indicated that vertically-evolving genes evolve more slowly (have shorter mean root-to-tip distances) than less vertical genes (using deltaLL and between-domain split score as proxies for marker verticality, with a Pearson’s product- moment correlation: MAD rooted mean root-to-tip distance against deltaLL: R = 0.1397803, p = 0.01506 or against split score: R = 0.1902056 p = 0.000893), despite having longer AB branches and relative AB distances (using a Pearson’s product moment correlation of MAD rooted mean root-to-tip distance against AB length: p = 0.2025, R= 0.1143076, or against relative AB distance p = 0.007435, R=0.1537479). Thus, the longer AB branches of vertically evolving genes do not appear to be the indirect result of faster evolution of those genes. These analyses are reported in the main text, where we write:

      An alternative explanation for the positive relationship between marker gene verticality and AB branch length could be that vertically-evolving genes experience higher rates of sequence evolution. For a set of genes that originate at the same point on the species tree, the mean root-to-tip distance (measured in substitutions per site, for gene trees rooted using the MAD method (Tria et al., 2017)) provides a proxy of evolutionary rate. Mean root-to-tip distances were significantly positively correlated with ∆LL and between-domain split score (∆LL: R = 0.1397803, p = 0.01506, split score: R = 0.1705415 p = 0.002947; Figure 1 Figure Supplement 5,6, indicating that vertically-evolving genes evolve relatively slowly (note that large values of ∆LL and split score denote low verticality)). Thus, the longer AB branches of vertically-evolving genes do not appear to result from a faster evolutionary rate for these genes. Taken together, these results indicate that the inclusion of genes that do not support the reciprocal monophyly of Archaea and Bacteria, or their constituent taxonomic ranks, in the universal concatenate explain the reduced estimated AB branch length.

      Differential metric for verticality:

      In spite of the similarity between the current result and Zhu et al.'s (see above), the two works approached this goal using different metrics.

      First, the authors attempted to quantify the AB branch length in individual gene trees, including those that do not have Archaea and Bacteria perfectly separated. To do so they performed a constrained ML search (line 210). I am wary of this treatment because it could force distinct sequences (due to HGT or paralogy) to be grouped together, and the resulting branch length estimates could be highly inaccurate.

      We agree with the reviewer that estimating AB branch lengths in this way might lead to inaccuracy. We note that this is, in effect, what was done in the published analysis (Zhu et al. 2019): a topology in which Archaea and Bacteria were reciprocally monophyletic was inferred using ASTRAL (a reasonable analysis, given the robustness of ASTRAL to some degree of HGT/gene tree incongruence), and then the AB branch length was estimated from the concatenation of these 381 genes, fixing on the ASTRAL topology. We performed this experiment (inferring AB branch length on constrained trees) in order to evaluate how incongruence between the gene and species trees might affect AB branch length inference.

      In contrast, Zhu et al. quantifies the average taxon-to-taxon phylogenetic distance between the two domains, regardless of the overall domain monophyly. That method was free of this concern, although it computed a different metric.

      Thanks for raising this point. As described above, in the revised manuscript we have also evaluated the relative AB distance metric used by Zhu et al., and show that it behaves similarly to the AB branch metric we evaluated in the first version of the manuscript (see revised Figure 1).

      Second, the authors assessed "marker gene verticality" using two metrics: a) AU test result (rejected or not) (Fig. 1A), c) ΔLL, the difference in log likelihood between the constrained ML tree and ML gene tree (line 222, Fig. 1B, C). I am concerned that they are sensitive to taxon sampling and stochastic events, as I explained above regarding domain monophyly. It is possible that a single mislabeling event would cause the topology test to report a significant result. In addition, they evaluate how severely domain monophyly is violated, but they do not account for intra-domain HGTs and other artifacts, which are also part of "verticality", and they can potentially distort the AB branch as well.

      In the revised manuscript, we also evaluate a complementary metric for marker gene verticality, the split score (see above), which measures the extent to which marker genes recover established relationships at a given taxonomic level (we computed both within-domain and between-domain split scores). The split score is a more granular measure than ΔLL and, by summing over bootstrap replicates, it also better accommodates phylogenetic uncertainty. The two metrics (ΔLL and split score) are positively correlated and the analyses come to the same conclusions regarding the impact of HGT and other sources of incongruence on estimates of the AB branch length and relative AB distance.

      I did not find the ΔLL values of individual markers in any supplementary table. I also did not find any correlation statistics associated with Fig. 1B.

      The ΔLL values for individual markers can be found in the data supplement in: “Expanded_Bacterial_Core_Nonribosomal_analyses/Individual_gene_tree_analyses/Expanded//Expanded_AB_AU.csv”

      We have now updated the readme.txt file for clarity and included all the new results from the analyses which we have undertaken as part of the review process in the latest version of the supplemental available on figshare (10.6084/m9.figshare.13395470) as well as updated the directory and file names for clarity. We also have added the statistics associated with the correlations in Figure 1 to the Figure Legend.

      Statistical test:

      Line 157: "For the remaining 302 genes, domain monophyly was rejected (p < 0.05) for 232 out of 302 (76.8%) genes." Did the authors perform multiple hypothesis correction? If not, they probably should.

      Thanks for this suggestion. We have now used a Bonferonni correction to account for multiple testing. As a result, fewer marker genes are rejected at the 5% level (151/302), although the overall conclusions are unaffected.

      Line 217: "This result suggests that inter-domain gene transfers reduce the AB branch length when included in a concatenation." and Fig. 1A. If I understand correctly, this analysis was performed on individual gene trees, rather than in a concatenated setting. Therefore, the result does not directly support this conclusion.

      Thanks for pointing this out. The reviewer is correct that this inference depends not only on the single gene analyses, but also on the subsequent concatenation results presented in this section. We have therefore moved this sentence later in the section, after the concatenation analysis.

      Line 224: "Furthermore, AB branch length decreased as increasing numbers of low-verticality markers were added to the concatenate (Figure 1(c))". While this statement is likely true, Zhu et al. also presented similar results (Fig. 5) despite using a different metric, and they concluded that the impact is moderate and cannot explain the status of some core genes as outliers.

      Zhu et al. did identify some of these trends, as we acknowledge in our manuscript ("The original study investigated and acknowledged (Zhu et al., 2019) the varying levels of congruence between the marker phylogenies and the species tree, but did not investigate the underlying causes.“ --- line 178; “These results are consistent with (Zhu et al., 2019), who also noted that AB branch length increases as model fit improves for the expanded marker dataset.” --- line 337) and as discussed above. Our analysis (Figure 6, Table 1) goes further in showing that the most vertical subset of the 381-gene set supports an inter-domain branch length closely similar (2.4 subs/site compared to e.g. 2.5 subs./site for the 27-gene dataset) to analyses using the traditional marker gene set.

      Concatenation and branch length:

      The authors pointed out that "Concatenation is based on the assumption that all of the genes in the supermatrix evolve on the same underlying tree; genes with different gene tree topologies violate this assumption and should not be concatenated because the topological differences among sites are not modelled, and so the impact on inferred branch lengths is difficult to predict." (line 187).

      This argument is valid. In my opinion, this is the one most important potential issue of Zhu et al.'s analysis. In that work, they inferred genome tree topology through ASTRAL, which resolves conflicting gene evolutions. However ASTRAL does not report branch lengths in the unit of number of mutations. Therefore, they plugged the concatenated alignment into this topology for branch length estimation, hoping that it will "average out" the result. That workaround was apparently not ideal.

      Yes, we agree --- this is our main critique of the Zhu et al. analyses.

      However, the practice of molecular phylogenetics is complicated. Theoretically, every gene, domain, codon position and site may have its unique evolutionary process, and there have been efforts to develop better partition and mixture models to address these possibilities. But there is a trade off; these technologies are computationally demanding and have the risk of overfitting. It is plausible that in some scenarios, the gain of concatenating many loci (despite conflicting phylogeny) may outweigh the cost of having unpredictable effects.

      But this dilemma needs to be analyzed rather than just being discussed. The Zhu et al. paper did not assess the impact of such concatenation on branch length estimation. The best answer is to conduct an analysis to show that concatenating genes with conflicting phylogeny would result in an AB branch that is shorter than the mean of those genes, and the reduction of AB branch length is correlated with the amount of conflict involved. The current work has not done this.

      Thanks for raising this point. We agree that phylogenetics is complicated and that

      we lack methods that can account for all possible factors. With respect to the impact of gene transfers on the AB branch length, and as touched on above, there are two issues here.

      The first is with the analysis actually performed by Zhu et al: of the 381 extended set genes, 79 have one or no archaea in the 1000-taxon subsample, and a further 176 have an AB branch length close to 0 (<0.00001) in the constrained analyses. To investigate further, we manually inspected ML gene trees for the 381 genes (1000 taxon subsample). Allowing for recent gene transfer, we nevertheless identified only 64 genes with an unambiguous branch separating most Archaea from most Bacteria that might correspond to the ancestral AB divergence (Supplementary File 1).

      Taken together, these analyses suggest that there is no strong evidence that these genes were present in LUCA or evolved along the AB branch, and so they do not provide information on its length. Since the branch length in the concatenation is an average over the branch length per site, the inclusion of this set of genes in the analysis did reduce the AB branch length, as demonstrated by our analyses (Figure 1(H)).

      The second issue is: for genes which were likely present in LUCA and evolved on the AB branch, does gene transfer cause a reduction in the AB branch length inferred from their concatenation? To test this, we initially tested iterative concatenations of increasing numbers of non-vertical markers (Figure 1H), as well as a comparison of the most vertical genes to the whole expanded marker set (Figure 3 Figure Supplement 2). This revealed that as more markers were added (with lower verticality), the inferred AB branch length from the concatenate was reduced. We also found an increased AB branch length when only the 20 most vertical markers were used as opposed to the whole (381 marker) dataset (0.56 vs 0.16 substitutions/site, Figure 6).

      The reviewer proposes an additional test of the impact of marker gene incongruence on branch length inference from concatenations: to compare the AB branch length before and after pruning of HGTs from individual marker gene alignments. To do this, we took the 54 marker genes from our new dataset and concatenated them before and after pruning of unambiguous HGTs. The AB branch length inferred from the concatenation with HGTs removed was 1.946 substitutions/site, compared to 1.734 substitutions/site without pruning HGTs, demonstrating the impact of even a relatively small number of HGTs on branch length estimation from concatenates.

      Divergence time estimation:

      The manuscript dedicates one section (line 230-266) to argue that the divergence time estimation analysis performed by Zhu et al. was not good evidence for marker gene suitability. Zhu et al. showed congruence of the expanded marker set with geological records whereas ribosomal proteins were conflicting with the geologic record.To support their argument, the authors estimated divergence times using the top 20 most "vertical" genes measured by ΔLL.

      It would be good to clarify which genes they are, and it would be important to check whether they include some of the most "AB-distant" ones found by Zhu et al. Their Fig. 5A shows that there are genes that divide the two domains several folds further than the ribosomal proteins (such as rpoC). If they are among the 20 genes, it will not be surprising that the estimated AB split is older than it should be.

      We now include the annotations for these 20 genes in Supplementary File 5a. The 20 most vertical genes include two of the “AB-distant” outliers identified by Zhu et al., tuf and infB, and one ribosomal marker, rpsG.

      Overall, I think this section is logically questionable. Zhu et al. suggested that "They show the limitation of using core genes alone to model the evolution of the entire genome, and highlight the value in using a more diverse marker gene set.". The current work showed that using another set of a few genes (I do not know if they include multiple "core" genes, as discussed above, but it is plausible) also did not work well. This does not refute Zhu et al.'s claim.

      What's important in Zhu et al.'s analysis is this: they demonstrated that using a small set of genes in DTE may cause artifacts due to them significantly violating the molecular clock at certain stages of evolution. Instead, using a larger set of markers that represent a portion of the entire genome would help to "smooth out" these artifacts. This of course is not the ideal solution, likely because concatenating conflicting genes and modelling them uniformly is not the best idea (see above). But as an operational workaround, it was not challenged by the analysis in the current work.

      Finally, I agree with the authors' statement that more and reliable calibrations are the best way to improve divergence time estimation.

      The dating analyses presented in the first version of our manuscript demonstrated that the apparent agreement between molecular clock estimates using the 381-gene set and the fossil record was the result of artifactual shortening of the AB branch, as discussed in detail above. Once the subset of the data least affected by these issues (that is, the most vertical subset) was used, the limitations of current clock methods, particularly with few calibrations, for dating deep nodes became clear.

      That said, we agree with the reviewer (and also R3) that the dating section in the first version of our manuscript was somewhat unsatisfactory: it identified an important limitation of the published analysis, but did not explore the underlying question of why molecular clock methods infer unrealistically old divergence times from vertically-evolving genes. In the revised manuscript we have reworked and improved this section extensively, including new analyses on the 27-gene dataset, with more fossil calibrations, that help to diagnose how and why clocks struggle to date the archaeal and bacterial stems from the available data. We now show that the old ages result from a combination of low rates of molecular evolution across the tree inferred from “shallow” calibrations, combined with a lack of age maxima for nodes other than the root of the tree; when the rate distribution is informed in this way, the long AB branch is interpreted as representing a long period of time and estimates of LUCA age are strongly influenced by prior assumptions about root maximum age. These analyses now suggest how the difficulties might be overcome in the future, for example using better calibrations (particularly maximum ages, and indeed any fossil calibrations within the Archaea), or alternatively other sources of time information such as from gene transfers. Reflecting the new, broader focus, we have moved this section to the end of the manuscript.

      AB branch by ribosomal and non-ribosomal genes:

      Two figures (Figs. 3 and 4) are two sections (line 270-303) dedicated to the argument that ribosomal markers do not indicate a longer AB branch than a non-ribosomal one. However, this is a small scale test (38 ribosomal markers vs. 16 non-ribosomal markers) compared with the similar analysis in Zhu et al. (30 ribosomal markers vs. 381 global markers). A closer look at Figs. 3 and 4 suggests that while the AB lengths indicated by the ribosomal markers are within a relatively narrow range, those by the non-ribosomal ones are very diverse, including ones that are several folds longer than the ribosomal average. This result is in accordance with that of Zhu et al.'s Fig. 5A, although the latter was describing a different metric. Do these genes also overlap the ones found by Zhu et al.?

      Nevertheless, this analysis does not falsify Zhu et al.'s, because it compared a different, much smaller, and deliberately chosen group of genes.

      As the reviewer indicates, the purpose of the analyses presented in Figures 3-4 is to evaluate the hypothesis of accelerated ribosomal protein evolution: that is, the idea that ribosomal proteins over-estimate the AB branch length due to accelerated evolution during the divergence of Archaea and Bacteria. Although this hypothesis was independently proposed in Zhu et al., to our knowledge it actually originates with Petitjean et al. (2014) GBE (https://academic.oup.com/gbe/article/7/1/191/601621; see their Figure 2), and has been at play in analyses of deep evolution and in particular the position of DPANN Archaea in the phylogeny since that time. Thus, this section of our manuscript (indeed, all but the first section) is not a critique of Zhu et al.’s work, but a contribution to the broader ongoing discussion about which marker genes are best to use in deep phylogeny. We compare only vertically-evolving genes in Figures 3-4 so as to distinguish the impact of gene function (ribosomal versus non-ribosomal) from confounding factors such as HGT, paralogy, and gene origination time.

      To clarify this point, we have modified our main text discussion to make it clear that we are making a comparison between ribosomal genes and other vertically-evolving members of the traditional “core” gene set, rather than a broader genome-wide claim. We now write:

      “If ribosomal proteins experienced accelerated evolution during the divergence of Archaea and Bacteria, this might lead to the inference of an artifactually long AB branch length (Petitjean et al., 2014; Zhu et al., 2019). To investigate this, we plotted the inter-domain branch lengths for the 38 and 16 ribosomal and non-ribosomal genes, respectively, comprising the 54 marker genes set. We found no evidence that there was a longer AB branch associated with ribosomal markers than for other vertically-evolving “core” genes (Figure 2(b); mean AB branch length for ribosomal proteins 1.35 substitutions/site, mean for non-ribosomal 2.25 substitutions/site).”

      Substitutional saturation:

      The comparative analysis of slow- and fast-evolving sites is interesting. The result (Fig. 5) is visually impactful. In my view, this analysis is valid, and the conclusion is supported. It would be better to explain the rationale with more detail to facilitate understanding by a general audience.

      Thanks for this assessment. We have now expanded on the rationale of this analysis in the main text, writing:

      “It is interesting to note that the proportion of inferred substitutions that occur along the AB branch differs between the slow-evolving and fast-evolving sites. As would be expected, the total tree length measured in substitutions per site is shorter from the slow-evolving sites, but the relative AB branch length is longer (1.2 substitutions/site, or ~2% of all inferred substitutions, compared to 2.6 substitutions/site, or ~0.04% of all inferred substitutions for the fastest-evolving sites). Since we would not expect the distribution of substitutions over the tree to differ between slow-evolving and fast-evolving sites, this result suggests that some ancient changes along the AB branch at fast-evolving sites have been overwritten by more recent events in evolution --- that is, that substitutional saturation leads to an underestimate of the AB branch length.”

      Zhu et al. also tested the impact of substitution saturation on the AB branch, using a more traditional approach (Fig. S19). They also found that the inter-domain distance is more influenced by potential substitution saturation, but the difference is minor. They concluded that (AB distance) "is not substantially impacted by saturation."

      Like other analyses, these two analyses involved very different locus sampling (27 most "vertical" genes vs. 381 expanded genes). They also differ by the metric being measured (AB branch length vs. average distance between AB taxa). Therefore, the analysis in the current work does not falsify the analysis by Zhu et al. In contrast, it is inline with (though not in direct support of) Zhu et al. and others' suggestion that there was "accelerated evolution of ribosomal proteins along the inter-domain branch" (line 25) in the 27 core genes (of which 15 are ribosomal proteins).

      We disagree that our analysis is consistent with the hypothesis of accelerated ribosomal protein evolution. The analysis that directly addresses this point is Figure 3, where we show that the distributions of AB branch lengths in single gene trees are not significantly different between ribosomal and non-ribosomal datasets (Figure 3; mean AB branch length for ribosomal proteins 1.35 substitutions/site, mean for non-ribosomal 2.25 substitutions/site).

      Evolutionary model fit:

      The authors compared the AB branch length indicated by the standard, site-homogeneous model LG+G4+F vs. the site-heterogeneous model LG+C60+G4+F, and found that the latter recovered a longer AB branch (2.52 vs. 1.45). The author's reasoning for using a site-heterogeneous model is valid, and this analysis is sound.

      However, Zhu et al. also analyzed their data using the site-heterogeneous model C60 -- the same as in this work, but through the PMSF (posterior mean site frequency) method. Zhu et al. also compared it with two site-homogeneous models (Gamma and FreeRate). The results were extensively presented and discussed (Figs. 3, 4E, F, S23, S24, Note S2). They also found that C60+PMSF elongated the AB branch compared with the site-homogeneous models (Fig. S24A). As for the average AB distance (another metric evaluated by Zhu et al., as discussed above), C60+PMSF increased this metric when using ribosomal proteins, but not much when using the expanded marker set (Fig. S25A). And overall, the elongation by C60+PMSF with the expanded markers cannot compensate for the long branch indicated by the ribosomal proteins.

      Therefore, similar to the point I made above, this analysis is sound but it does not logically falsify the conclusion made by Zhu et al., as it only concerns a small set of markers, and it recovered a previously described pattern.

      Thanks for this comment. As above, note that the second part of our manuscript presents a general analysis of the issues around marker gene and model selection using our meta-analysis and new dataset, and is not a direct response to Zhu et al’s work. On reflection, we agree that this was not sufficiently clear in the first version of the paper, and we have now modified the text to acknowledge the model fitting analyses of Zhu et al.

      The manuscript also did not clarify what the phrase "poor model fit" refers to (line 34 and line 304). If this is addressing the Gamma model evaluated by the authors, then this claim is valid though not novel (but see my previous comment on the trade-off). If that is a general reference to Zhu et al.'s methodology, then the authors should at least include the C60+PMSF model in the analysis, and show that C60 indicates a significantly longer AB branch than C60+PMSF does (if that's the case, which is doubtful). Admittedly, C60+PMSF is cheaper than the native C60 in computation, but "In some empirical and simulation settings PMSF provided more accurate estimates of phylogenies than the mixture models from which they derive." (Wang et al. Syst Biol. 2018).

      Thanks for this comment. We did not intend the phrase “poor model fit” to imply a critique of Zhu et al.’s work; as the reviewer notes, those authors carried out a range of analyses to investigate the impact of model choice on their inferences. Rather, the title of the section is intended to summarise its main conclusion, which is that substitutional saturation and poor model fit (on any dataset, and even with the best available models) can lead to under-estimation of the AB branch length. Note that the analyses in Table 1 illustrating the impact of model fit are from the new dataset that is assembled and analysed in the second part of the manuscript. As above, we agree that this was not sufficiently clear in the first version of the paper. We think the title of this section is accurate and so we have not changed it, but we have changed the final two paragraphs of the section (as quoted immediately above) so as to acknowledge the model fitting analyses of Zhu et al., and to clarify that the results are general (and based on our new dataset), rather than a critique of Zhu et al’s work.

      Finally, Zhu et al. also performed an analysis using the native C60 model on a further reduced taxon set. That result was not presented in the published paper, but it can be found in the "Peer Review File" posted on the Nature Communications website. That tree also recovered a short AB distance, and placed CPR at the base of Bacteria, and showed that this placement was not impacted by the removal of Archaea.

      Thanks for pointing us to this additional analysis. The unrooted, bacteria-only tree referred to by the reviewer (panel B) recovers a clan (that is, a cluster of branches on the unrooted tree) comprising CPR+Chloroflexi, in agreement with the analysis on the new marker dataset we present here (Figure 6). The disagreement between that analysis and the new tree presented here relates to the position of the archaeal outgroup, which in the Peer Review File panel A connects to the bacterial tree between CPR and Chloroflexi. If, as recently suggested, the bacterial root lies between Gracilicutes and Terrabacteria (Coleman et al. 2021), then CPR and Chloroflexi represent monophyletic sister lineages. We note that the CPR+Chloroflexi relationship recovered here and in Peer Review File Panel (B) has also been obtained in several other recent analyses (Taib et al. 2020, Coleman et al. 2021, Martinez-Gutierrez and Aylward 2021), as cited in the main text.

      Taxon sampling:

      My final comment is about taxon sampling. Zhu et al. developed an algorithm for less biased taxon sampling, and they argued that extensive taxon sampling is important in resolving the early evolution of life. They presented evidence showing that reduced taxon sampling changed overall topology and basal relationships (Figs. S13, S14, S23, Note S2). The analyses were performed in combination with the assessment of site sampling, locus sampling, substitution model and other factors. The importance of less biased and/or extensive taxon sampling was also noted by previous works, especially in a phylogenomic framework (e.g., Hedtke et al. Syst Biol. 2006; Wu and Eisen. Genome Biol. 2008; Beiko. Biol Direct. 2011). The current work is based on a smaller set of taxa, and it has not addressed the impact of taxon sampling. As I suggested above, some results may be sensitive to taxon sampling.

      We agree that taxon sampling is important for phylogenetics. While the analyses of Zhu et al. (2019) included a very large number of genomes, sampling of genomes (and indeed marker genes) was biased, both towards Bacteria compared to Archaea, and also within Bacteria. In our revised manuscript, we now compare the taxon sampling between Zhu et al.’s work and our new analyses (see Figure 1 Figure Supplements 13,14,15 and Figure 4 Figure Supplements 1,2). Balanced sampling is important for phylogenetic inference (Heath et al., 2008; Hillis, 1998) and, by this criteria, the taxon sampling in the analyses of Zhu et al. was not ideal. Our new analyses made use of fewer genomes (700), but these sample the known diversity of Archaea and Bacteria in a more representative way (Figure 4 Figure Supplement 1,2).

      Reviewer #3:

      Moody and coworkers principally address a recent paper presented by Zhu et al. (Nature Communications, 2019). In their paper, Zhu and coworkers claim that (i) ribosomal protein genes, commonly used in resolving deep phylogenies, have experienced an increased rate of evolution right after LUCA, and (ii) that an expanded set of markers show that the branch separating archaea from bacteria (AB-branch) is 10-fold shorter than previously thought. Moody et and coworkers first demonstrate flaws in the Zhu et al. analysis: first, the expanded gene set is biased towards bacteria, with 25% of the single-gene trees having very few archaeal counterparts. Second, that over 75% of the single-gene trees from Zhu et al are not monophyletic at domain level, suggesting a large influence of horizontal gene transfers (HGT), inter-domain exchanges, and inclusion of paralogous sequences in the original datasets. Third, they show that genes with fewer HGT display longer AB-branches. Fourth, they show that the argument by Zhu et al. that the longer AB-branch yields absurd LUCA datation is not relevant. Fifth, and maybe most important, they show that the shorter AB-branches recovered by Zhu et al in their expanded dataset result from inadequate substitution models, which lead to underestimating rates and thus branch lengths.

      Going further, they select a set of 54 manually curated markers (showing mostly monophyletic archaea and bacteria), both from ribosomal proteins (36) and non-ribosomal proteins (18) and retrieve these in a balanced set of 350 archaea and 350 bacteria. With this set, they show that ribosomal protein markers do not display longer AB-branches than non-ribosomal ones. They also show that diversity among Archaea and Bacteria, as measured as the total tree length within each domain, is very similar, when sampling equal number of genomes in both domains.

      Strengths:

      The paper is well-written and well structured. In general, the methodology chosen here is adapted to the question at hand and very rigorously followed. The balanced dataset (with equal amounts of bacteria and archaea) of 54 carefully selected genes is also appropriate to explore diversity differences between the two domains of life.

      Although all arguments presented in Zhu et al are carefully re-evaluated, the part where Moody et al show that substitutional saturation and poor model fit is artifactually producing short AB-branches is quite compelling and elegantly presented.

      Weaknesses:

      One potential weakness, more in terms of significance than in terms of scientific soundness is that the paper is mostly "reactive", responding to a single other paper. The authors might have used the data and methodology presented here to give the paper a broader scope. An example would be to provide the audience with a solid protocol or general guidelines on how to avoid artifacts in making deep phylogenies. I believe that the authors have demonstrated that they have the authority to do that.

      Thanks for this suggestion. We considered including guidelines of this type in the first version of the manuscript, but we were --- and remain --- wary of attempting to promote one particular way of doing deep phylogeny over others. These are difficult and slippery questions, and different approaches and perspectives (including ones we might disagree with) are, in a broader sense, useful in refining ideas and helping the field to make progress as a whole. That said, a recurring issue appears to be the question of the fit between model and data, both in terms of substitution model fit (as with the impact of site-heterogeneous models on branch length inferences) and the broader issue of using models that, for example, account for gene duplication or transfer. There are several recent reviews (including one by some of us) which treat these topics in detail and provide detailed advice. We have now raised and discussed these issues in our conclusion. We have also updated Figure 6 to illustrate the approach we used in assembling the new 27-gene dataset, which may be of use to others, and goes some way towards the suggestion of providing guidelines for future analyses. We now write:

      “Our analysis of a range of published marker gene datasets (Petitjean et al., 2014; Spang et al., 2015; Williams et al., 2020; Zhu et al., 2019) indicates that the choice of markers and the fit of the substitution model are both important for inference of deep phylogeny from concatenations, in agreement with an existing body of literature (reviewed in (Kapli et al., 2021, 2020; Williams et al., 2021). We established a set of 27 highly vertically evolving marker gene families and found no evidence that ribosomal genes overestimate stem length; since they appear to be transferred less frequently than other genes, our analysis affirms that ribosomal proteins are useful markers for deep phylogeny. In general, high-verticality markers, regardless of functional category, supported a longer AB branch length. Furthermore, our phylogeny was consistent with recent work on early prokaryotic evolution, resolving the major clades within Archaea and nesting the CPR within Terrabacteria. Notably, our analyses suggested that both the true Archaea-Bacteria branch length (Figure 6A), and the phylogenetic diversity of Archaea, may be underestimated by even the best current models, a finding that is consistent with a root for the tree of life between the two prokaryotic domains.”

      In the figure 6 legend, we also expand on guidelines for future analyses, writing:

      “(B) Workflow for iterative manual curation of marker gene families for concatenation analysis. After inference and inspection of initial orthologue trees, several rounds of manual inspection and removal of HGTs and distant paralogues were carried out. These sequences were removed from the initial set of orthologues before alignment and trimming. For a detailed discussion of some of these issues, and practical guidelines on phylogenomic analysis of multi-gene datasets, see (Kapli et al., 2020) for a useful review.”

      The authors use the difference in log-likelihood between the constrained and unconstrained gene trees as a proxy for verticality and thus marker gene quality (Figure 1b). However, they don't demonstrate that that metric is actually appropriate. Could the monophyly (or split score) be also involved here? The authors might want to comment on that.

      Thanks for this suggestion, which has substantially improved our analysis of Archaea-Bacteria distance and marker gene verticality (see the revised Figure 1 and associated text). We have now evaluated the relationship between AB branch length and split score (both within- and between-domain level relationships) for the expanded marker set and have updated our results and discussion accordingly. We found that deltaLL and split score (both within- and between-domains) are positively correlated with each other, and negatively correlated with AB length (that is, high-verticality markers have longer AB branch lengths). These analyses also revealed that within-domain and between-domain split scores are strongly positively correlated, implying that genes that recover domain monophyly also do better at resolving within-domain relationships.

      The argument about the age of LUCA an ad absurdum one, showing that using better suited genes one gets impossible time estimates. However, the argument presented by Zhu et al is also a "just so" argument (if we get a time estimate that doesn't make sense then the phylogeny must be wrong), which doesn't give it much weight. The authors themselves note well that this part of the paper is more revealing of the limitations of the strict clock method, or of the relaxed clock with one single calibration point, than of the quality or appropriateness of the dataset.

      We agree that the dating section in the first version of our manuscript was somewhat unsatisfactory. We have now expanded it to include new analyses on our 27-gene dataset, using more fossil calibrations, in order to diagnose why current clock methods struggle to estimate evolutionary rate near the root of the tree, and how this impacts on the age of LUCA and other deep nodes. These analyses add substantial value to this section, which has been moved to the end of the manuscript to reflect its expanded focus.

      Another small weakness (or loose end) is that manual curation of the 95 genes dataset is not consistently reducing the percentage of non-monopyhletic genes (e.g. 62 to 69% from the 95 to the 54 genes dataset for non-ribosomal genes; 21 to 33% from the 95 to the 27 genes dataset for ribosomal genes). The author don't discuss how this impacts the manual curation they perform on the datasets; however, they state that "manual curation of marker genes is important". The authors might want to discuss that aspect further.

      Thanks for raising this point. We were not sufficiently clear in describing the logic of our approach in the first version of the manuscript, and have now revised the text to clarify. In this analysis, we used a strict binary definition of monophyly --- that is, even a single inter-domain transfer leads to non-monophyly (note that this is in contrast to the re- analysis of the expanded set, where we considered whether each marker statistically rejected domain monophyly). For some genes scored as non-monophyletic in this way, manual removal of a small number of unambiguous recent transfers is sufficient restore domain monophyly; for others, HGT is extensive and it is difficult to know how to filter the sequences so as to obtain a reliable marker gene alignment; it was these latter cases that we set aside. We have now revised this section to make the logic of the approach clear, writing:

      “Prior to manual curation, non-ribosomal markers had a greater number of HGTs and cases of mixed paralogy. In particular, for the original set of 95 unique COG families (see ‘Phylogenetic analyses’ in Methods), we rejected 41 families based on the inferred ML trees, either due to a large degree of HGT, paralogous gene families or LBA. For the remaining 54 markers, the ML trees contained evidence of occasional recent HGT events. Strict monophyly was violated in 69% of the non-ribosomal and 29% of the ribosomal families. We manually removed the individual sequences which violated domain monophyly before re-alignment, trimming, and subsequent tree inference (see Methods). These results imply that manual curation of marker genes is important for deep phylogenetic analyses, particularly when using non-ribosomal markers. Comparison of within-domain split scores for these 54 markers indicated that markers that better resolved established relationships within each domain also supported a longer AB branch length (Figure 2A).”

      In summary and despite the small weaknesses listed above, my opinion is that the authors reach their goal of showning that the AB-branch is indeed a long one, and that the results support the conclusion.

      Impact:

      The main point addressed by the authors here, the time of divergence between Archaea and Bacteria, is crucial to our understanding of early evolution. The long branch separating Bacteria and Archaea has long been thought to be a long one, and the paper by Zhu et al casted a doubt about the validity of this long-standing hypothesis. Here, Moody et al convincingly establish that the divergence between archaea and bacteria is a profound one. The paper also has profound implications on the validity of the commonly used core-gene phylogenies, particularly those based on ribosomal protein genes. Indeed, it shows that the these proteins are appropriate for deep phylogenies. They also show the impact of model violations on deep phylogenies, and how to avoid them.

      We thank the reviewer for this positive assessment of impact.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all reviewers for their thorough assessment and constructive comments.

      For clarity, their comments have been numbered.

      Reviewer #1

      Evidence, reproducibility and clarity:

      Summary:

      Acetylation/Deacetylation controls G1/s transition in budding yeast. The lysine acetyl transferase Esa1 is here shown to play a role, in part via acetylation of the nuclear pore complex basket component Nup60, which stimulates mRNA export.

      Major comments:

      1 • Figure 1C: The curve for esa1-ts in this figure and the curve in the supplementary figure S2B are not similar, while the first shows 10% cells budding after 60 minutes it is about 50% after 60 min in S2B. Another helpful way of presenting the data could be the length of the G1 phase (from cytokinesis to budding) in the WT, esa1-ts, gcn5delta cells over time.

      We thank the reviewer for pointing this out. Indeed, there is some day-to-day variability in the budding kinetics of the temperature-sensitive esa1 mutant, and the text referred to one individual experiment. Therefore, we have changed the text to better reflect the observed variability (p. 7) and added a graph (supplementary Figure S2C) including all individual replicates. This shows that in spite of small differences between experiments, esa1-ts cells always bud slower and less efficiently than wild-type cells. We note that the data cannot be shown in the way suggested (time from cytokinesis to budding, presumably from individual cells) because cells in these experiments were released from a G1 block (after cytokinesis), and samples from cell cultures were imaged at time intervals (and not single cells over time). Time-lapse data of single cells is shown in figure 2E.

      2 • What is the rational of creating the Nup60-KN mutation. Does it prevent acetylation of Nup60, at least by GCN5 and/or esa1?

      The biophysical properties of asparagine resemble those of acetylated lysine. Therefore, the Nup60-KN mutant (lysine 467 to asparagine) is expected to mimic acetylation of Nup60 K467, which was found to be acetylated in earlier studies. Supporting the conclusion that Nup60-KN is indeed an acetyl-mimic, the nup60-KN mutation partially rescues the Start and mRNA export defects on Esa1-deficient cells. We make the rationale of the Nup60-KN mutation clearer in the current version (p. 8).

      3 • Given the much stronger phenotype of the esa1-ts+GCN5 delta condition for G1/S transition as compared to esa1-ts and that GCN5 seems to strongly acetylate Nup60 I do not understand the sole focus on esa1 in the study. The fact that the Nup60-KN cells do not show G1/S transition under esa1-ts+GCN5 delta conditions in experiments presented in Fig. S3 argues that esa1 meaidted acetylation of Nup60 is only one, probably minor aspect of G1/S transition. This should be much balanced discussed.

      We focus on Esa1 because this allows us to dissect the specific role of Nup60 acetylation and mRNA export during the G1/S transition. Of course, Esa1-dependent acetylation of Nup60 is not the only process controlling the G1/S transition, which is regulated at several levels. For example, the concentration of multiple Start activators and inhibitors scales differentially with cell size (PMID: 26390151, 32246903). In addition, daughter-specific factors inhibit Start through a pathway parallel to Nup60 deacetylation (Ace2/Ash1-dependent repression of Cln3 transcription; PMID: 19841732, 19841732). We discuss these studies in the current version (p. 17).

      As for the relative contribution of Esa1 and Gcn5 to the G1/S transition and mRNA export: both of these KATs have overlapping roles in promoting transcription, probably through distinct substrates (such as histone H2 for Gcn5, H4 for Esa1) and this may contribute to their role in Start. Consistent with this, deletion of GCN5 causes a minor delay in transcription of G1/S genes (Kishkevich, Sci. Rep 2019). On the other hand, gnc5 mutants have no detectable mRNA export defects, unlike esa1-ts (our Figure 3E). This suggests that whereas Gcn5 and Esa1 may have overlapping roles in transcription of G1/S genes, Esa1 is more specifically involved in mRNA export. The ability of Nup60-KN to rescue the single mutant esa1 but not the double gcn5 esa1 is consistent with this view: the transcription defects in the double mutant may be so severe as to prevent Start even in the presence of Nup60-KN. We have modified the discussion to mention these points. In addition, we will investigate the transcription defects of esa1 and gcn5 single and double mutants to test this possibility and include the results in a revised version.

      4 • Suppl: Fig 2: I miss the hat1delta+gcn5delta condition.

      We will include the budding index of the hat1 gcn5 double mutant in a revised version.

      Minor comments:

      5 • Figure legend 2C "at least 200 cells were scored": please state number of replicates

      Figure 2C shows RT-qPCR data. The reviewer probably means figure 1C, which shows the budding index of one experiment comparing wild type, esa1, gcn5 and esa1 gcn5 strains. This experiment was repeated 3 times, as is now mentioned in the figure 1 legend.

      6 • Figure 2E: X axis "impor" should be corrected to "import"

      We have corrected this.

      7 • Would Mex67 and/or Mrt2 overexpression recue the esa1-ts and esa1-ts+GCN5 delta phenotype?

      We will include this experiment in a revised version.

      8 • Figure 4 A: The size of the daughter cells in the hos3delta condition seems smaller as compared to esa1-ts. Is this true and can you comment this? Is a premature onset of S phase observed here?

      Since Fig 4A features only wild type and hos3∆ cells, the reviewer is probably referring to esa1-ts cells shown in figure 4B. These two figure panels are not directly comparable: cells in 4A are freely cycling, whereas those in 4B were released from a mitotic arrest using nocodazole. The mitotic arrest was done in order to avoid potentially confounding effects due to inactivation of Esa1 during S phase. However, the arrest also causes daughter cells to grow larger, explaining the size differences pointed out by the reviewer. That being said, it is true that cell size and G1 duration are intimately linked and thus the reviewer question raises a relevant point. We previously showed that although hos3 daughter cells enter S phase prematurely, their size is not significantly different from wild type (Kumar et al., Figure 1d-g). Premature onset of S phase can lead to smaller cell size but this is not the case for hos3 cells, probably due to the slightly faster growth rate of the hos3∆ mutant relative to wild type specifically during S/G2/M phases (Kumar et al., Supplementary Fig. 1b).

      9 • Figure 4D: The still images in figure 2E and 4D do not correspond with the quantitation. E.g. in Fig 2E the esa1ts cells shows Whi5 export at t=81 min, which is according to the shown quantitation unusual late.

      We will modify Figures 2E-4D in a revised version to include cells that export Whi5 at times closer to the median.

      10 • Figure 4B: it is not clear why for the quantitation a different representation is chosen as compared to 4A. It would be better to show the nuclear intensities of mother/daughter as in Figure 4A.

      The reason for the different representation between figures 4A and 4B is that 4A depicts freely cycling cells and in 4B, cells were released from a nocodazole-induced mitotic arrest (as mentioned in our response to point 8). A mitotic arrest perturbs M/D size asymmetries, as daughter cells (but not mothers) continue growing during the arrest, leading to larger nuclear size. In addition, esa1-ts daughters are smaller than wt daughters in this condition, further complicating M/D asymmetries. We thought that in this case, a better metric for protein association with the NPC is the fluorescence intensity relative to a nuclear pore component. We agree that using different types of graphs is confusing, and therefore we have removed M/D comparisons from figure 4A and now represent these data as in figure 4B: the intensity of Sac3 relative to Nup49. Finally, a good control for these experiments is the quantification of total protein levels, which we have added for Sac3. We have also removed Mtr2-GFP data until our analysis of Mtr2 total levels is complete. We hope this simplifies this figure.

      11 • Figure 4D: To strengthen these results, it would be good to perform this assay with esa1-ts Nup60-KN cells as in figure 2a. The release of Whi5-GFP is expected to behave in a similar way to the WT. This would ensure that Nup60 acetylation is a pre-requisite for Whi5 release

      I’m afraid we don't understand this suggestion. Figure 4D shows time-lapse fluorescence microscopy of Whi5 nuclear export when Sac3 is recruited to the nuclear basket. Figure 2a shows western blots of Nup60 acetylation status. Therefore it is not clear how these two assays could be done in similar ways. Perhaps the reviewer refers to a different figure panel. The purpose of the suggested experiment, if we understand properly, is to test whether Nup60 acetylation is required for Whi5 export. This is the hypothesis tested in figure 2D: Whi5-GFP export is delayed in esa1-ts, and this delay is partially rescued in esa1-ts nup60-KN, which mimics acetylation. In fact, the advance in Whi5 export observed in Figure 4D upon Sac3 anchoring to NPC is similar to that observed in a nup60-KN (Figure 2E).

      12 • Page 13 "Finally, we tested whether Esa1 targets Sac3 to G1 nuclei": The effect of esa1 knockdown on Sac3 fit with the story line and the effect esa1 imposes on mRNA export. However targeting of Sac3 which is part of a bigger complex by esa1 is a misleading statement, given that you don't show a proof of direct interactions shown, e.g. by immunoprecipiations.

      We meant to say “we tested whether Esa1 function promotes the localisation of Sac3 to the nuclear basket”. We agree that it is unknown whether this involves direct interactions between Sac3 and Esa1. We have changed the text to make this point clearer.

      13 • Page 18: "Nevertheless, our findings suggest that mammalian nucleoporins may represent a novel category of substrates for KATs and for the multiprotein complexes in which these enzymes reside, with important roles in gene expression." Given that there is little experimental evidence this statement is for my taste too strong. Rather indicate that this is a possibility which needs to be tested...

      We have changed the text as suggested.

      14 • Page 3: "Nuclear pores are macromolecular assemblies composed of approximately 30-50 different Nucleoporins": it is rather approximately 30 different nucleoporins in the species so far analyzed.

      We have corrected this as suggested.

      Significance:

      The concept of acetylation/deacetylation regulation of G1/S transition in budding yeast is very appealing. The specific (and important) contribution of Esa1, especially in comparison to GCN5 and Hat1 remains unclear as well as its precise effect on Nup60. Clarifying this, also in a more balanced way of presentation of discussion, would be of interest for the field.

      My research centers around NPC function.

      Audience: experts in the nuclear structure/function fields and cell cycle regulation.

      A more detailed characterisation of the specific roles of Esa1, Gcn5 and Hat1 in the G1/S transition and mRNA export will be included in a revised version, as mentioned in our response to point 3.

      Reviewer #2

      Evidence, reproducibility and clarity:

      In this manuscript, Gomar-Alba et al. follow up on previous work from the lab that showed that the KDAC Hos3 is targeted to the bud neck and daughter cell nuclear pore complexes in budding yeast where it slows cell cycle progression by influencing gene positioning and nucleo-cytoplasmic transport. Overall, the current manuscript describes a well-conducted study that dissects the role of acetylation and deacetylation on Nup60 during the cell cycle using genetics and microscopy. The authors conclusively identify Esa1 as counteracting Hos3 in the nucleus (Figure 1) and show that part of their effect on cell cycle progression and gene expression is mediated by acetylation of Nup60 at K467 (Figure 2). They also demonstrate that this leads to a differential localization of several mRNA export factors and suggest that deacetylation of Nup60 blocks mRNA export in daughter cells. Although this work is overall carefully done, the last conclusion is still somewhat speculative.

      I have a number of minor suggestions to improve the manuscript, but only one major concern, which revolves around the role of chromatin tethering to NPCs. The authors have shown in their previous paper that this plays a role for CLN2 and it is known that active GAL1 interacts with the nuclear periphery, but in the current manuscript this aspect is largely disregarded although I think it could play a major role in the observed mRNA export phenotypes. Therefore, I think some additional experiments and controls as well as additional analysis are required to substantiate especially the results shown in figure 5.

      Major points:

      1) Figure 2: The authors claim that the mechanism by which Nup60 acetylation promotes cell cycle progression is the enhancement of mRNA export through the NPC. In Figure 2, the authors look at the expression levels of four candidate mRNAs which all show disturbed expression in esa1-ts which is not rescued by the nup60-KN mutation, but expression of the protein of one of these candidates (CLN2) is improved. In their previous paper, the same lab has shown that the CLN2 gene is tethered to the NPC in daughter cells with deacetylated Nup60 and that this is relieved in a Nup60 K467N mutant. I think it would be important here to investigate the protein levels of additional candidates that are not regulated at the level of gene localization. Is it a general effect that protein expression is higher in the nup60KN mutant?

      We agree this is an important point. To establish if Nup60-KN regulates only genes that interact with the NPC (such as CLN2), the reviewer suggests determining the cell cycle levels of proteins encoded by other G1/S genes that do not bind NPCs. The main problem with this approach is that with the exception of CLN2, the nuclear localisation of the (about 200) G1/S regulon genes is not yet known. In addition, establishing connections between mRNA and protein levels during the first cell cycle is only possible for short-lived proteins such as Cln2. For instance, amongst the G1/S genes shown in Figure 2, Cdc21 and Rnr1 have protein half-lives of 10 and 4 h, much longer than the 90-minute yeast cell cycle (PMID 25466257). We think a more direct approach to investigate the connection between gene position and mRNA synthesis / export would be to directly visualise the localisation of single mRNAs upon perturbation of the Nup60 acetylation pathway, using single mRNA labeling techniques (smFISH or PP7). We aim to do this for CLN2 and also for GAL1 (see point 2d of this reviewer). We will attempt these experiments for a revised version of our paper.

      2) Figure 5: In figure 5, the authors investigate the expression of a different inducible RNA (GAL1) to test whether the observed effect on mRNA export is more general. Since this is a crucial point for generalizing the finding, this data needs to be presented in a more convincing manner.

      2a. GAL1 is known to be tethered to the NPC upon transcription. Whether this tethering is affected by the Nup60-KN mutant is unclear, but since Nup60 has been implicated in GAL1 tethering in the literature, this possibility is not unlikely. GAL1 therefore becomes a similar case to CLN2, where it is difficult to disentangle effects directly due to mRNA export from the effects of gene tethering on mRNA transcription and processing. Therefore, this experiment should be repeated with a system that is independent of gene tethering. For example, induction of the GAL promoter via a b-estradiol inducible VP16 transactivator does not seem to induce tethering.

      This is an excellent idea. We are not aware of studies on the localisation of the GAL1 locus induced by a VP16 transactivator, but this was investigated for the HXK1 gene. This subtelomeric gene localises to NPCs in non-glucose carbon sources, and its localisation is perturbed by VP16 transactivation in glucose (PMID: 16760983). We will investigate whether the same is true for GAL1, and if so, perform the suggested experiments.

      2b. The activation kinetics in all mutants analyzed is very different from the wildtype. Therefore, the quantification made in Figure 5C is difficult to interpret. Therefore, it might be more fair to quantify for the mutant strains at an earlier timepoint after activation when the levels are similar to the levels in the wildtype strain. E.g. in the hos3d strain at around 250 min.

      This is a good point - indeed, persistent mother/daughter asymmetry in GAL1 expression in hos3 and nup60-KN mutants could be masked by saturated levels of GFP at late time points. An alternative way to test this is to determine the time of GAL1 induction in mother and daughter cells. We have done this in wild-type and hos3 mutant cells; our results indicate that GAL1 expression occurs first in wildt-type mothers and later in their daughters, whereas it is almost simultaneous in nup60-KN mother/daughter mutant pairs (as shown for a single M-D pair in the new figure 5A). In a revised version, we will include data of GAL1 expression for M-D pairs at different times after galactose addition for cells in figures 5C and 5E.

      2c. Similarly - although not as drastic - , in figure 5E, quantification should be done at a timepoint when the induction level is similar between DMSO and Rapamycin treated samples to make conclusions about differences between mother and daughter cell.

      We agree. See our response to the previous point.

      2d. The major claim of the paper is that mRNA export is inhibited by Nup60 deacetylation. In this figure, the mRNA levels need to be quantified to validate that it is not transcription that is affecting expression.

      We agree. In addition to regulating mRNA export (as suggested by the effect of Sac3 anchoring to NPCs) Nup60 deacetylation may also inhibit GAL1 transcription (directly, and/or indirectly via disruption of Gal1-based transcriptional feedback; PMID 23150580). To directly assess the role of Nup60 acetylation in GAL1 transcription and mRNA export, it would be ideal to determine the levels of GAL1 mRNA in both the nucleus and the cytoplasm, using smFISH and/or PP7 tools, in wild type and in mutants of the Nup60 acetylation pathway as we proposed to do for CLN2 (see our response to point 1 of this reviewer). These or equivalent experiments will be included in a revised version.

      3) The manuscript investigates in detail the effects of a KN mutant, however, a non-acetylatable mutant is not investigated. Is such a mutant viable?

      We have obtained a Nup60-KR mutant, which is predicted to behave as a non-acetylatable mimic, and it is viable. We will describe its phenotype in a revised version.

      Minor comments:

      4) Figure 2E: Is the rescue really specific to daughter cells? The dynamic range in the daughter cells is much higher due to the slower and more heterogenous timepoint of Whi5 export. However, zoom-in on the early timepoints after Whi5 import before the 30 min when 50% of the cells have exported Whi5, might reveal a significant increase of mother cells with shortened time to S phase entry. I suggest that the authors test this possibility. The cells shown in the image panels also suggest that the acetyl mimic might shorten mother cell time to S phase entry. If this is not the case, the authors might want to show a different example cell. Interestingly, it appears from the supplementary figure S5, that while Nup60 K647N partially rescues the export of Whi5, budding does not seem to be different to Nup60 wt. This appears to contradict the budding after alpha factor arrest shown in figure 2.

      We thank the reviewer for this suggestion. Indeed, zooming into the first 30 minutes shows a slight increase in the fraction of nup60-KN mother cells that export Whi5; however this change is not statistically significant when considering the entire cell population (p=0.6017, Mann-Whitney test). Therefore, we will replace the cell shown in figure 2E with a more representative example.

      As for figure S5, the reviewer is correct that in these experiments nup60-KN partially rescues Whi5 export (a marker of Start) but not budding (a downstream event), and this is indeed in variance with the experiment shown in figure 2B. Different experimental conditions may contribute to this apparent discrepancy: as noted in the text, the duration of G1 phase in cells synchronised with alpha factor is not directly comparable with that of freely cycling cells.

      5) Figure 3C: The authors use a truncated version of SAC3 for overexpression, since the full length is toxic (Figure S6A). I think it would be important to include this information in the main text.

      We agree, and have included this information in the main text.

      6) Figure 4B: Is there simply less Sac3 protein in the esa1-ts mutant? Although the authors address this question in figure S9, the very low expression levels of Sac3 may make this difficult to conclude from fluorescence quantification. A Western Blot would be an important control. The relative level of Sac3 still seems to be lower in esa1-ts daughter cells compared to mother cells, but no statistical test is shown.

      We are confident that the total Sac3-GFP levels are sufficient to make accurate comparisons, in both the nucleus and the entire cell. However, we will be happy to include western blot controls for Sac3 total levels in a revised version as the reviewer suggests. As for the levels of Sac3 in M vs D cells: Sac3 is indeed asymmetrically distributed in both wild-type and esa1-ts cells (p

      7) Analysis of mother daughter pairs (e.g. figure 5C): a paired t-test would be appropriate.

      We agree. Results do not change with this new analysis (in fact, p values are even lower for wild-type M-D pairs in figure 5C).

      8) Figure 5A: Can some representative mother-daughter pairs be shown as images for both wt and mutant in the timelapse? It is difficult to see in 5A whether there are any mother daughter pairs.

      We have modified the figure to include clearly identifiable mother-daughter pairs, as requested.

      9) Figure 4C: Please show image of localization of Sac3-GFP-FRB +/- rapamycin to the NPC.

      We have added this.

      Significance:

      This manuscript describes an important advance in understanding the role of non-histone protein modification on the regulation of cell cycle progression and gene expression. It is a logical follow-up on a previous paper from the lab (Kumar et al. 2018) and beautifully builds on this work. It is to my knowledge the first mechanistic description of regulation of nuclear pore complex function by a post-translational modification. This will therefore be a very interesting paper for anyone interested in nuclear pore complex regulation and biology, non-histone protein acetylation, asymmetric cell division, and cell cycle regulation.

      Reviewer #3

      Evidence, reproducibility and clarity:

      The pre-print is dedicated to mRNA export and G1/S transition control in mother and daughter cells of budding yeasts through acetylation/deacetylation of nuclear pore component Nup60 (hsNup153). In particular, authors found that Esa1(hsTip60/KAT5) acetylates the basket nucleoporin Nup60, and this event promotes recruitment of mRNA export factors to the nuclear basket and export of polyA RNA to the cytosol. This export event promotes entry of cells into S phase; in particular, Nup60 is deacetylated by histone deacetylase Hos3 that displaces mRNA export complexes from the NPC and inhibits Start specifically in daughter cells.

      The manuscript is a well-designed and well-written study.

      Please, see my major and minor suggestions below:

      Major comments:

      1. P4-5. "deacetylation of the nuclear basket nucleoporin Nup60 does not affect Whi5 nuclear accumulation". I was confused by this statement because, in the previous article Kumar et al., 2018, both main text and abstract have the following phase "nuclear basket and central channel nucleoporins establish daughter-cell-specific nuclear accumulation of the transcriptional repressor Whi5.." Could you please address this discrepancy?

      Thank you for pointing this out. We should have written: “deacetylation of Nup60 does not strongly affect Whi5 nuclear accumulation”. The Kumar et al. paper shows that deacetylation of central channel nucleoporins (such as Nup49) is important to increase accumulation of Whi5 in daughter cells, whereas deacetylation of the basket nucleoporin Nup60 plays a relatively minor role (see Kumar et al, Figure 7c). We have corrected this in the main text.

      Fig.2A: In addition to increased Nup60 acetylation, I noticed an overall increased level of Nup60 after overexpression of Esa1 and Gcn5. Is it a statistically significant increase in the Nup60 level? It is not mentioned in the main text or figure legend. Does the acetylation level of Nup60 influence its stability?

      We don’t know if acetylation of Nup60 affects its stability, although it is an intriguing possibility. Although it´s true that Nup60 levels in the IP fraction seem to increase upon Esa1 and Gcn5 overexpression, nuclear levels of Nup60-mCherry are similar in wild-type, hos3∆ and nup60-KN (Supplementary Figure S11A). Therefore it is unlikely that changes in Nup60 acetylation affect its stability. We have added this information to the text.

      Authors determined the mRNA level of four representative genes in esa1-ts and esa1-ts nup60-KN cultures.

      3a. Do authors know if Nu60-KN expression affects the perinuclear positioning of these transcripts?

      We did not investigate the localisation of individual transcripts in this study. However, as mentioned in our replies to reviewer 2, we propose to do so for the CLN2 and GAL1 mRNAs, in order to test directly the effect of Nup60 acetylation in the positioning of specific mRNAs.

      3b.I also suggest authors investigate if Nup60-KN affects other transcripts using the RNAseq approach. Nup60-KN might improve the transcription output of other transcripts and it will be interesting to know if these transcripts share similar features.

      We agree that investigating the impact of Nup60 acetylation in mRNA synthesis genome-wide is an exciting challenge. We speculate that Nup60-KN is likely to have some effect in transcription, either directly or indirectly through perturbation of feedback regulatory loops caused by mRNA export defects (for instance, transcription of both CLN2 and GAL1 is regulated by positive feedback). However we think that these experiments are beyond the scope of our study, which is focused on mRNA export.

      3c. Do authors know if GAL1pr:HOS3-NLS expression affects specifically G1-dependent transcripts?

      Answering this question would require RNA sequencing experiments. As mentioned in the previous point, we think these are beyond the scope of our study. That being said, it is likely that the Hos3-Nup60 pathway downregulates gene expression during G1, because Nup60 deacetylation is largely restricted to this phase. Note that this is not the same as regulating expression of the G1/S regulon specifically, because Hos3 also regulates GAL1 expression (Figure 5). We mention this important point in the discussion (p. 17).

      3d. Another interesting question will be to define if there is a group of transcripts that respond specifically to the status of Nup60 acetylation during G1/S transition. Is it possible to make ts-driven Nup60-KN expression to turn in ON/OFF? However, this question is beyond the scope of this paper.

      Thank you for this interesting suggestion. The proposed experiment is technically possible (for example, expression of Nup60-KN could be induced in G1 using a GAL1 promoter, followed by RNA sequencing). We agree that this is beyond the scope of our paper but would like to explore the question in future studies.

      1. Fig.2D It is not mentioned that Cln2 is not cycling anymore upon Nup60-KN overexpression.

      The Cln2 protein peaks at 30 minutes in this experiment, and is degraded at approximately 120 minutes. This corresponds to the slow, incomplete G1/S transition wave of the esa1-ts nup60-KN mutant, as indicated in the budding index at the bottom of the panel. We added this in the figure 2 legend. Note that Nup60-KN is not overexpressed, since the KN mutation is inserted in the endogenous gene under the control of its native promoter.

      Fig.2E. Arrows indicating Whi5 export timing do not match to the numbers in the main text. For example, yellow arrows indicate Whi5 export in wt strain at 30 and 78 min, but it is stated 15 and 59 min in the text. Also, do I understand right that Whi5-mCherry is not visible in the cytosol?

      See our reply to reviewer 2, point 4: we will replace the cell shown in figure 2E with a more representative example. As for Whi5-mCherry, it is visible in the cytoplasm but only weakly (since it is diluted into the larger cytoplasmic volume), and not at all in the images shown due to the overlay with the brightfield channel.

      Did the authors analyze where SAC3 and MTR2 are localized in hos3del, Nup60KN, and Esa-ts strains once their localization was affected in the nucleus? Is the overall level Sac3 level is affected in hos3del and Nup60KN strains?

      We have imaged the localisation of Sac3-GFP and Mtr2-GFP during the whole cycle using time-lapse microscopy. Our impression is that in wild type cells, their perinuclear levels increase during S phase in daughter cells, which mirrors the increase in Nup60 acetylation. In contrast, Sac3 and Mtr2 perinuclear levels seem more stable in hos3 and nup60-KN cells. We will include these analyses in a revised version. The total level of Sac3 is not affected, as shown in the updated figure 4; see our reply to reviewer 2, point 6.

      Fig4C. "Sac3-GFP-FRB partitioned equally to M and D nuclei, in the presence of Nup60-mCherry-FKBP and rapamycin (Figure 4C)." Sac3-GFP-FRB is slightly elevated in mother cells. Did you run a statistical test between the first and the third column on the box plot?

      Comparing the first and third columns in Fig 4C (Nup60 and Sac3 in control cells) shows that the mother cell accumulation is higher for Sac3 than for Nup60 (p

      P15. "GAL1 expression levels were higher in wild-type mother cells than in their daughter, and these differences were absent in cells lacking Hos3 or expressing Nup60KN". GAL1-10 promoter contains information necessary and sufficient for recruitment to the nuclear periphery (PMID: 27489341). I wonder if GAL1pr-driven transgenes of HOS3, spt10, hat1, and etc., contain DNA sequences sufficient for targeting genes to the nuclear periphery, and these genes are asymmetrically expressed in mother and daughter cells because of the presence of GAL1pr?

      We agree that these genes may be expressed at different levels in mother and daughter cells. We don’t think this asymmetric expression affects our conclusions. Indeed, the phenotypes scored (growth on plates) apply to the population and not to individual cells. The one exception is figure 3D, in which mRNA nuclear accumulation is scored in single cells. In this case, it remains possible that some of the variability observed corresponds to differences between mothers and daughters. In this case, our measurements could under-estimate the effect of Hos3-NLS in inhibition of mRNA export. However, since we cannot differentiate M and D cells in this experiment, we prefer not to speculate on this possibility in the text.

      Minor comments:

      1. Supplementary Fig. S1, it will be easy to read cell viability assays if 1A, S1A and S1B figures have the same orientation.

      We have changed the figure as suggested.

      Could you please clarify the difference between HOS3-NLS and GAL1pr:HOS3-NLS in the text of figure legend? P.33

      We have fixed this (figure 1 legend).

      P6. I recommend adding the following sentence to help clarity of the text: "To understand how NPC acetylation regulates the G1/S transition (Start), we sought to identify the lysine acetyl-transferases (KATs) counteracting the activity of the Hos3 deacetylase. Hos3 displays asymmetric distribution between mother and daughter cells in wild type Saccharomyces cerevisiae. Overexpression of a version of Hos3 fused to a nuclear localization signal (GAL1pr-HOS3-NLS) leads to targeting of Hos3 to mother and daughter cell nuclei, deacetylation of nucleoporins, and inhibition of cell proliferation (Kumar et al, 2018)."

      We thank the reviewer for this suggestion. This has been added.

      P8. Misspelling: Though Nup60 acetylation

      This has been fixed.

      FigS7. Description of polyA distribution is missing for single gcn5del strain.

      Thank you for pointing this out. This has been added.

      Misspelling: We conclude that Esa1 and Nup60 acetylation promotes Start, at least in part, by targeting Sac3 to the nuclear basket, where it mediates mRNA export.

      This has been fixed.

      Significance

      Authors of this pre-print overview and try to resolve a fundamental and not well-studied question about NPC acetylation status and S phase entry. This work is a logical extension of their previously published work (PMID: 29531309). However, this study for the first-time links status of NPC acetylation to mRNA export through lysine acetyl transferases. It will be interesting to address this question in mammalian cells considering interaction of basket nucleoporins with Tip60/KAT5 (PMID: 24302573).

      This work might be of interest to researchers investigating RNA export, transcription regulation, and nuclear pores.

      My fields of expertise are RNA export, nucleoporins, transcription regulation.

      I do not have expertise to evaluate yeast strains used in this study.

    1. Author Response:

      Reviewer #1 (Public Review):

      5.The reported data point to an important role of the premotor and parietal regions of the left as compared to the right hemisphere in the control of ipsilateral and contralateral limb movements. These are also the regions where the electrodes were primarily located in both subgroups of patients. I have 2 concerns in this respect. The first concern refers to the specific locus of these electrodes. For premotor cortex, the authors suggest PMd as well as PMv as potential sites for these bilateral representations. The other principal site refers to parietal cortex but this covers a large territory. It would help if more specific subregions for the parietal cortex can be indicated, if possible. Do the focal regions where electrodes were positioned refer to the superior vs inferior parietal cortex (anterior or posterior), or intra-parietal sulcus. Second, the manuscript's focus on the premotor-parietal complex emerges from the constraints imposed by accessible anatomical locations in the participants but does not preclude the existence of other cortical sites as well as subcortical regions and cerebellum for such bilateral representations. It is meaningful to clarify this and/or list this as a limitation of the current approach.

      On the first issue, we have updated the manuscript to specify the subregion within the parietal cortex in which we see stronger across-arm generalization - namely, the superior parietal cortex. On the second issue, we have added text in the Discussion that reference subcortical areas shown to exhibit laterality differences in bimanual coordination, providing a more holistic picture of bimanual representations across the brain. In addition, we acknowledge that with our current patient population we are limited to regions with substantial electrode coverage, which does not include all areas of the brain.

      6.The evidence for bilateral encoding during unilateral movement opens perspectives for a better understanding of the control of bimanual movements which are abundant during every day life. In the discussion, the authors refer to some imaging studies on bimanual control in order to infer whether the obtained findings may be a consequence of left hemisphere specialization for bimanual movement control, leading to speculations about the information that is being processed for each of both limb movements. Another perspective to consider is the possibility that making a movement with one limb may require postural stabilization in the trunk and contralateral body side, including a contribution from the opposite limb that is supposedly resting on the start button. Have the authors considered whether this postural mechanism could (partly) account for this bilateral encoding mechanism, in particular, because it appears more prominent during movement execution as compared to preparation. Furthermore, could the prominence of bilateral encoding during movement execution be triggered by inflow of sensory information about both limbs from the visual as well as the somatosensory systems.

      Thank you for these comments. We have added a paragraph to the Discussion to address the hypothesis that some component of ipsilateral encoding may be related to postural stabilization.

      In response to the final point in this comment, we agree that bilateral information during execution could be reflective of afferent inputs (somatosensory and/or visual). However, the encoding model shows that activity in premotor and parietal regions are well predicted based on kinematics during the task. While visual and somatosensory system information are likely integrated in these areas, the kinematic encoding would point to a more movement-based representation.

      Reviewer #2 (Public Review):

      Weaknesses:

      1. Although the current human ECoG data set is valuable, there is still large variability in electrode coverage across the patients (I fully acknowledge the difficulty). This makes statistical assessment a bit tricky. The potential factors of interest in the current study would be Electrode (=Region), Subject, Hemisphere, and their interactions. The tricky part is that Electrode is nested within Subject, and Subject is nested within Hemisphere. Permutation-based ANOVA used for the current paper requires proper treatment of these nested factors when making permutations (Anderson and Braak, 2003). With this regard, sufficient details about how the authors treated each factor, for instance, in each pbANOVA, are not provided in the current version of the manuscript. Similarly, the scope of statistical generalizability, whether the inference is within-sample or population-level, for the claims (e.g., statement about the hemispheric or regional difference) needs to be clarified.

      We discuss at length the issue of electrode variability and have addressed this in the revised manuscript. Graphically, we have added a Supplemental Figure (S2). Statistically, we appreciate the point about the need for the analysis to address the nested structure of the data. We have redone all of the statistics, now using a permutation-based linear mixed effects model with a random effect of patient. This approach did not change any of the findings.

      As to the comment about hemispheric or regional differences, the data show that both are important factors. Our hemispheric effect is characterized by stronger ipsilateral encoding in the left hemisphere and subsequently better across-arm generalization (Figures 2-4). We then examine the spatial distribution of electrodes that generalized well or poorly and found clusters in both hemispheres of electrodes that generalize poorly. In contrast, only in the left hemisphere did we find clusters of electrodes that generalize well. These electrodes were localized to PMd, PMv and superior parietal cortex (Fig 5D). In summary, we argue that activity patterns in M1 are similar in the left and right hemispheres, but there is a marked asymmetry for activity patterns over premotor and parietal cortices.

      Additional contexts that would help readers interpret or understand the significance of the work: The greater amount of shared movement representation in the left hemisphere may imply the greater reliance of the left arm on the left hemisphere. This may, in turn, lead to the greater influence of the ongoing right arm motion on the left arm movement control during the bimanual coordination. Indeed, this point is addressed by the authors in the Discussion (page 15, lines 26-41). One critical piece of literature missing in this context is the work done by Yokoi, Hirashima, and Nozaki (2014). In the experiments using the bimanual reaching task, they in fact found that the learning by the left arm is to the greater degree influenced by the concurrent motion of the right arm than vice versa (Yokoi et al., J Neurosci, 2014). Together with Diedrichsen et al. (2013), this study will strengthen the authors' discussion and help readers interpret the present result of left hemisphere dominance in the context of more skillful bimanual action.

      The Yokoi paper is a very important paper in revealing hemispheric asymmetries during skilled bimanual movements. However, we think it is problematic to link the hemispheric asymmetries we observe to the behavioral effects reported in the Yokoi paper (namely, that the nondominant, left arm was more strongly influenced by the kinematics of the right arm). One could hypothesize that the left hemisphere, given its representation of both arms, could be controlling both arms in some sort of direct way (and thus the action of the right arm will have an influence on left arm movement given the engagement of the same neural regions for both movements). It is also possible that the left hemisphere is receiving information about the state of both the right and left arms, and this underlies the behavioral asymmetry reported in Yokoi.

      Reviewer #3 (Public Review):

      In the present work, Merrick et al. analyzed ECoG recordings from patients performing out-and-back reaching movements. The authors trained a linear model to map kinematic features (e.g., hand speed, target position) to high frequency ECoG activity (HFA) of each electrode. The two primary findings were: 1) encoding strength (as assessed by held-out R2 values) of ipsilateral and contralateral movements was more bilateral in the left hemisphere than in the right and 2) across-arm generalization was stronger in the left hemisphere than in the right. As the authors point out in the Introduction, there are known 'asymmetries between the two hemispheres in terms of praxis', so it may not be surprising to find asymmetries in the kinematic encoding of the two hemispheres (i.e., the left hemisphere contributes 'more equally' to movements on either side of the body than the right hemisphere).

      There is one point that I feel must be addressed before the present conclusions can be reached and a second clarification that I feel will greatly improve the interpretability of the results.

      First, as is often the case when working with patients, the authors have no control over the recording sites. This led to some asymmetries in both the number of electrodes in each hemisphere (as the authors note in the Discussion) and (more importantly) in the location of the recording electrodes. Recording site within a hemisphere must be controlled for before any comparisons between the hemispheres can be made. For example, the authors note that 'the contralateral bias becomes weaker the further the electrodes are from putative motor cortex'. If there happen to be more electrodes placed further from M1 in the left hemisphere (as Supplementary Figure 1 seems to suggest), than we cannot know whether the results of Figures 2 and 3 are due to the left hemisphere having stronger bilateral encoding or simply more electrodes placed further from M1.

      The reviewer makes a very valid point and this comment has led to our inclusion of a new Supplementary Figure, S2, in which we quantify the percentage of electrodes in each subregion.

      Second, it would be useful if the authors provided a bit of clarification about what type of kinematic information the linear model is using to predict HFA. I believe the paragraph titled 'Target modulation and tuning similarity across arms' suggests that there is very little across-target variance in the HFA signal. Does this imply that the model is primarily ignoring the Phi and Theta (as well as their lagged counterparts) and is instead relying on the position and speed terms? How likely is it that the majority of the HFA activity around movement onset reflects a condition-invariant 'trigger signal' (Kaufman, et al., 2016). This trigger signal accounts for the largest portion of neural variance around movement onset (by far), and the weight of individual neurons in trigger signal dimensions tend to be positive, which means that this signal will be strongly reflected in population activity (as measured by ECoG). This interpretation does not detract from the present results in any way, but it may serve to clarify them.

      To address this comment, we have added a new figure (Fig 6) which shows the relative contribution of each kinematic feature as well as their average weights across time for both contralateral and ipsilateral movements. This figure also addresses the reviewer’s question about the contribution of the target position to the model. As can be seen, features that reflect timing/movement initiation (position, speed) make a larger contribution compared to the two features which capture directional tuning (theta, phi). As the reviewer suggested, this result is in line Kaufman et al. (2016) which reported that a condition-invariant ‘trigger signal’ comprises the largest component of neural activity. We note that the target dependent features theta and phi still make a substantial contribution to the model (relative contribution: contra = 32%, ipsi = 37%). Previously, we have tested the contribution of the theta and phi features by comparing two models, one that only used position and speed (Movement model) and one that also included the two angular components phi and theta (Target Model). For a subset of electrodes, the held-out predictions were significantly better using the Target Model, a result we take as further evidence of electrode tuning within our dataset.

      The figure below shows an electrode located in M1 that is tuned to targets when the patient reached with their contralateral arm as an example. We believe that having an explicit depiction of how the four features contribute to the HFA predictions will help the reader evaluate the model. These points are now addressed in the text in the results section discussing Figure 6.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      I found this an exceptionally impressive manuscript. The evolution of Y chromosomes has until recently been nearly impossible, and this research group have pioneered approaches that can yield reliable results in Drosophila. The study used an innovative heterochromatin-sensitive assembly pipeline on three D. simulans clade species, D. simulans, D. mauritiana and D. sechellia, which diverged less than 250 KYA, allowing comparisons with the group's previous results for the D. melanogaster Y.

      The study is both technically impressive and extremely interesting (an highly unusual combination). It includes a rich set of interesting results about these genome regions, and furthermore the results are discussed in a well-organised way, relating both to previous observations and to understanding of the genetics and evolution of Y chromosomes, illuminating all these aspects. It is a rare pleasure to read such a study. I believe that this study will inspire and be a model for future work on these chromosomes. It shows how these difficult genome regions can be studied.

      Thank you for the positive evaluation of our paper. While we did not make any specific revisions in response to these comments, we did attempt to improve the writing.

      **Major comments:**

      The conclusions are convincing. The methods are explained unusually clearly, and the reasoning from the results is convincing. When appropriate, the caveats, the caveats are clearly explained. The material is clearly organised and the questions studied are well related to the results. I had a few minor comments concerning the English. Even the figure (often a major problem to understand) are very clear and helpful, with proper explanations. I have very rarely read such a good manuscript, and almost never (in a long career) found a manuscript that could be published without revision being necessary.

      Thank you for pointing out that there were minor concerns with the English. We have carefully gone through the manuscript and fixed some minor issues with the writing. The analysis found 58 exons missed in previous assemblies (as well as all previously known exons of the 11 canonical Y-linked genes, which are present in at least one copy across the group). FISH on mitotic chromosomes using probes for 12 Y-linked sequences was used to determine the centromere locations, and to determine gene orders and relate them to the cytological chromosome bands, demonstrating changes in satellite distribution, gene order, and centromere positions between their Y chromosomes within the D. simulans clade species. It also confirmed previous results for Y-linked ribosomal DNA,genes, which are responsible for X-Y pairing in D. melanogaster males. Although 28S rDNA has been lost in D. simulans and D. sechellia (but not in D. mauritiana), the intergenic spacer (IGS) repeats between these repeats are retained on both sex chromosomes in all three species. Only sequencing can reliably reveal this, as their abundance is below the detection level by FISH in D. sechellia. The 11 canonical Y-linked genes' copy numbers vary between the species, and some duplicates are expressed and have complete open reading frames, and may therefore be functional because they, but most include only a subset of exons, often with duplicated exons flanking the the presumed functional gene copy. Mega-introns and Y-loops were found, as already seen in Drosophila species, but this new study detects turn overs in the ~2 million years separating D. melanogaster and the D. simulans clade. 49 independent duplications onto the Y chromosome were detected, including 8 not previously detected. At least half show no expression in testes, or lack open reading frames, so they are probably pseudogenes. Testis-expressed genes may be especially likely to duplicate into the Y chromosome due to its open chromatin structure and transcriptional activity during spermatogenesis, and indeed most of the new Y-linked genes in the species studied clade have likely functions in chromatin modification, cell division, and sexual reproduction. The study discovered two new gene families that have undergone amplification on D. simulans clade Y chromosomes, reaching very high copy numbers (36-146). Both these families appear to encode functional protein-coding genes and show high expression. The paper described intriguing results that illuminate Y chromosome evolution. First, SRPK, arose by an autosome-to-Y duplication of the sequence encoding the testis-specific isoform of the gene SR Protein Kinase (SRPK), after which the autosomal copy lost its testis-specific exon via a deletion. In D. melanogaster, SRPK is essential for both male and female reproduction, so the relocation of the testis-specific isoform to the Y chromosome in the D. simulans clade suggests that the change may have been advantageous by resolving sexual antagonism. The paper presents convincing evidence that the Y copy evolved under positive selection, and that gene amplification may confer advantageous increased expression in males. The second amplified gene family is also potentially related to an interesting function. Both X-linked and Y-linked duplicates are found of a gene called Ssl located on chromosome 2R. In D. simulans, the X-linked copies were previously known, and called CK2ßtes-like. In D. melanogaster, degenerated Y-linked copies are also found, with little or no expression, contrasting with complete open reading frames and high expression in the D. simulans clade species in testes, consistent with the possibility of an arms race between sex chromosome meiotic drive factors. Other interesting analyses document higher gene conversion rates compared to the other chromosomes, and evidence that these Y chromosomes may differ in the DNA-repair mechanisms (preferentially using MMEJ instead of NHEJ), perhaps contributing to their high rates of intrachromosomal duplication and structural rearrangements. The authors relate this to evidence for turnover of Y-linked satellite sequences, with the discovery of five new Y-linked satellites, whose locations were validated using FISH. The study also documented enrichment of LTR retrotransposons on the D. simulans clade Y chromosomes relative to the rest of the genome, together with turnovers between the species.

      Reviewer #1 (Significance (Required)):

      As described above, the advances are both, technical and conceptual for the field. The manuscript itself does an excellent job of placing the work in the context of the existing literature.

      • Anyone working on sex chromosomes and other non-recombining genome regions should be interested in the findings reported.

      • My field of expertise is the evolution of sex chromosomes, and the evolution of genome regions with suppressed recombination. I have experience of genomic analyses. I have less expertise in analyses of gene expression, but I understand enough about such approaches to evaluate the parts of this study that use them.

      Reviewer #2:

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript describes a thorough investigation of the Y-chromosomes of three very closely related Drosophila species (D. simulans, D. sechellia, and D. mauritiana) which in turn are closely related to D. melanogaster. The D. melanogaster Y was analysed in a previous paper by the same goup. The authors found an astonishing level of structural rearrangements (gene order, copy number, etc.), specially taking into account the short divergence time among the three species (~250 thousand years). They also suggest an explanation for this fast evolution: Y chromosome is haploid, and hence double-strand breaks cannot be repaired by homologous recombination. Instead, it must use the less precise mechanisms of NHEJ and MMEJ. They also provide circumstantial evidence that MMEJ (which is very prone to generate large rearrangements) is the preferred mechanism of repair. As far as I know this hypothesis is new, and fits nicely on the fast structural evolution described by the authors. Finally, the authors describe two intriguing Y-linked gene families in D. simulans (Lhk and CK2ßtes-Y), one of them similar to the Stellate / Suppressor of Stellate system of D. melanogaster, which seems to be evolving as part of a X-Y meiotic drive arms race. Overall, it is a very nice piece of work. I have four criticisms that, in my opinion, should be addressed before acceptance.

      Thank you for your positive comments. We respond to your concerns point-by-point below.

      The suggestion/conclusion that MMEJ is the preferential repair mechanism (over NHEJ) should be better supported and explained. At line 387, the authors stated "The pattern of excess large deletions is shared in the three D. simulans clade species Y chromosomes, but is not obvious in D. melanogaster (Fig 6B). However, because all D. melanogaster Y-linked indels in our analyses are from copies of a single pseudogene (CR43975), it is difficult to compare to the larger samples in the simulans clade species (duplicates from 16 genes). ". Given that D. melanogaster has many Y-linked pseudogenes (described by the authors and by other researchers, and listed in Table S6), there seems to be no reason to use a sample size of 1 in this species.

      We only used pseudogenes with large alignable regions (>300 bp) to prevent the potential bias toward small indels and increase our confidence in indel calling. As a result, we excluded most of the duplicates on the D. melanogaster Y chromosome. We now include 5 additional D. melanogaster Y-linked indels in the manuscript, however, the majority of indels in this species (36/41) are still from the same gene.

      Furthermore, given that D. melanogaster is THE model organism, it is the species that most likely will provide information to assess the "preferential MMEJ" hypothesis proposed by the authors.

      A previous paper has shown that male flies deficient in MMEJ have a strong bias toward female offspring (McKee et al. 2000), suggesting that MMEJ is necessary for successfully producing Y-bearing sperm, consistent with our hypothesis. We agree with the reviewer that careful genetic and cytological experiments in D. melanogaster could further clarify the role of MMEJ in the repair of Y-linked mutations. Even more revealing would be experiments using the simulans clade species, where we hypothesize the MMEJ bias is even more pronounced on the Y chromosome. We believe, however, that these experiments are beyond the scope of this study and should merit their own papers.

      Still on the suggestion/conclusion that MMEJ is the preferential repair mechanism (over NHEJ). Y chromosome in heterochromatic, haploid and non-recombining. In order to ascribe its mutational pattern to the haploid state (and the consequent impossibility of homologous recombination repair), the authors compared it to chromosome IV (the so called "dot chromosome"). This may not be the best choice: while chr IV lacks recombination in wild type flies, it is not typical heterochromatin. E.g., " results from genetic analyses, genomic studies, and biochemical investigations have revealed the dot chromosome to be unique, having a mixture of characteristics of euchromatin and of constitutive heterochromatin". Riddle and Elgin, FlyBook 2018 (https://doi.org/10.1534/genetics.118.301146). Given this, it seems appropriate to also compare the Y-linked pseudogenes with those from typical heterochromatin. In Drosophila, these are the regions around the centromeres ("centric heterochromatin"). There are pseudogenes there; e.g., the gene rolled is known to have partially duplicated exons.

      Thank you for the suggestion. We now include the data from pericentric heterochromatin and pseudogenes in supplemental data (see Fig 7). Both data types support our conclusion that indel size is only larger on Y chromosomes, which is consistent with the comparison between the dot chromosome and pericentric heterochromatin reported by Blumenstiel et al. 2002.

      In some passages of the ms there seems to be a confusion between new genes and pseudogenes, which should be corrected. For example, in line 261: "Most new Y-linked genes in D. melanogaster and the D. simulans clade have presumed functions in chromatin modification, cell division, and sexual reproduction (Table S7)".. Who are these "new genes"? If they are those listed in Table S6 (as other passages of the text suggest), most if not all of them are pseudogenes. If they are pseudogenes, it is not appropriate to refer to them as "new genes". The same ambiguity is present in line 263: "Y-linked duplicates of genes with these functions may be selectively beneficial, but a duplication bias could also contribute to this enrichment (...) " Pseudogenes can be selectively beneficial, but in very special cases (e.g.. gene regulation). If the authors are suggesting this, they must openly state this, and explain why. Pseudogenes are common in nearly all genomes, and should be clearly separated from genes (the later as a shortcut for functional genes). The bar for "genes" is much higher than simple sequence similarity, including expression, evidences of purifying selecion, etc., as the authors themselves applied for the two gene families they identified in D. simulans (Lhk and CK2ßtes-Y)

      Thank you for the suggestion. We now state our criteria for calling genes based on the expression and long CDS and correct the sentences that the reviewer refers to. The protein evolution rates of many Y-linked duplicates were surveyed in Tobler et al. 2017, who found that most are not under strong purifying selection. Our study supports this previous report. We think that protein evolution rate alone may not be a good indicator for functionality. Our current study does not focus on the potential function of these genes, and we think further population studies are required to get a solid conclusion. We changed the text to clarify this point: “Most new Y-linked duplications in D. melanogaster and the D. simulans clade are from genes with presumed functions in chromatin modification, cell division, and sexual reproduction (Table S7), consistent with other Drosophila species [17, 77].” (p15 L281-284)

      The authors center their analysis on "11 canonical Y-linked genes conserved across the melanogaster group ". Why did they exclude the CG41561 gene, identified by Mahajan & Bachtrog (2017) in D. melanogaster? Given that most D. melanogaster Y-linked genes were acquired before the split from the D. simulans clade (Koerich et al Nature 2008), the same most likely is true for CG41561 (i.e., it would be Y-linked in the D. simulans clade). Indeed, computational analysis gave a strong signal of Y-linkage in D. yakuba (unpublished; I have not looked in the other species). If CG41561 is Y-linked in the simulans clade, it should be included in the present paper, for the only difference between it and the remaining "canonical genes" was that it was found later. Finally, the proper citation of the "11 canonical Y-linked genes" is Gepner and Hays PNAS 1993 and Carvalho, Koerich and Clark TIG 2009 (or the primary papers), instead of ref #55.

      Thank you for the suggestion. CG41561 is indeed a relatively young Y-linked gene because it’s not Y-linked in D. ananassae (Muller’s element E). We already have CG41561 in Table S6 and we think that it is reasonable to separate a young Y-linked gene from the others. We also fixed the reference as suggested (p5 L116).

      Other points/comments/suggestions:

      1. a) Possible reference mistake: line 88 "For example, 20-40% of D. melanogaster Y-linked regulatory variation (YRV) comes from differences in ribosomal DNA (rDNA) copy numbers [52, 53]." reference #53 is a mouse study, not Drosophila. Thank you for pointing out this error, we fixed the reference (p4 L91).

      2. b) Possible reference mistake: line 208 "and the genes/introns that produce Y-loops differs among species [75]". ref #75 is a paper on the D. pseudoobscura Y. Is it what the authors intended? Yes, our previous paper (ref 75) found that Y-loops do not originate from the kl-3, kl-5, and ORY genes in D. pseudoobscura because they don’t have large introns in this species.

      c) line 113. "We recovered all known exons of the 11 canonical Y-linked genes conserved across the melanogaster group, including 58 exons missed in previous assemblies (Table S1; [55])." Please show in the Table S1 which exons were missing in the previous assemblies. I guess that most if not all of these missing exons are duplicate exons (and many are likely to be pseudogenes). If they indeed are duplicate exons, the authors should made it clear in the main text, e.g., "We recovered all known exons of the 11 canonical Y-linked genes conserved across the melanogaster group, plus 58 duplicated exons missed in previous assemblies."

      Thank you for the suggestion. However, the 58 exons did not include the duplicated exons. We are similarly surprised how much we will miss if we don’t assemble the Y chromosome carefully. We now mark these exons in red in Table S1 to make this point clearer.

      d) line 116 "Based on the median male-to-female coverage [22], we assigned 13.7 to 18.9 Mb of Y-linked sequences per species with N50 ranging from 0.6 to 1.2 Mb." The method (or a very similar one) was developed by Hall et al BMC Genomics 2013, which should be cited in this context. e) line 118: "We evaluated our methods by comparing our assignments for every 10-kb window of assembled sequences to its known chromosomal location. Our assignments have 96, 98, and 99% sensitivity and 5, 0, and 3% false-positive rates in D. mauritiana, D. simulans, and D. sechellia, respectively (Table S2). The procedure is unclear. Why break the contigs in 10kb intervals, instead of treating each as an unity, assignable to Y, X or A? The later is the usual procedure in computational identification of suspect Y-linked contigs (Carvalho and lark Gen Res 2013; Hall et al BMC Genomics 2013). The only reason I can think for analyzing the contigs piecewise is a suspicion of misassemblies. If this is the case, I think it is better to explain.

      Thank you for the suggestion. We did not break the contigs into 10kb intervals when we assigned the Y-linked contigs. As you suspect, our motivation for evaluating our methods and analyzing the contigs in 10kb intervals was to detect possible misassemblies. We rewrote the sentence to make this point clearer (p6 L129-132).

      1. f) Fig. 1. It may be interesting to put a version of Fig 1 in the SI containing only the genes and the lines connecting them among species, so we can better see the inversions etc. (like the cover of Genetics , based on the paper by Schaeffer et al 2008). Thank you for the suggestion. We would like to make a figure like that fantastic cover image you refer to, but the repetitive nature of the Y chromosome makes it difficult to illustrate rearrangements based on alignments at the contig-level. We instead opted to update Figure 1 to better highlight the rearrangements, still based on the unique protein-coding genes which are supported by the FISH experiments.

      2. g) Table S6 (Y-linked pseudogenes). Several pseudogenes listed as new have been studied in detail before: vig2, Mocs2, Clbn, Bili (Carvalho et al PNAS2015) Pka-R1, CG3618, Mst77F (Russel and Kaiser Genetics 1993; Krsticevic et al G3 2015) . Note also that at least two are functional (the vig2 duplication and some Mst77 duplications). Thank you for the suggestion. We now include a column to indicate the potential function of Y-linked duplicates (see Table S6).

      h) line 421: "one new satellite, (AAACAT)n, originated from a DM412B transposable element, which has three tandem copies of AAACAT in its long terminal repeats." The birth of satellites from TEs has been observed before, and should be cited here. Dias et al GBE 6: 1302-1313, 2014.

      Thank you for the suggestion. We now include a sentence to cite this reference (p27 L467-468).

      1. i) Fig S2 shows that the coverage of PacBio reads is smaller than expected on the Y chromosome. Any explanation? This has been noticed before in D. melanogaster, and tentatively attributed to the CsCl gradient used in the DNA purification (Carvalho et al GenRes 2016). However, it seems that the CsCl DNA purification method was not used in the simulans clade species (is it correct?). Please explain the ms, or in the SI. The issue is relevant because PacBio sequencing is widely believed to be unbiased in relation to DNA sequence composition (e.g., Ross et al Genome Biol 2013). Yes, we used Qiagen's Blood and Cell Culture DNA Midi Kit for DNA extraction. We suspect that the underrepresentation of Y-linked reads is driven by the presence of endoreplicated tissue in adults. Heterochromatin is underreplicated in endoreplicated cells, and thus there may simply be less heterochromatin in these tissues. Consistent with this idea, we find that all heterochromatin seems to be underrepresented in the reads, not just the Y chromosome (see Chakraborty et al. 2021; Flynn et al. 2020). We now include this discussion in the SI of our paper (see supplementary text p75).

      2. j) I may have missed it, but in which public repository have the assemblies been deposited? We link to the assemblies in Github (https://github.com/LarracuenteLab/simclade_Y) and they will also be in the Dryad Digital Repository (doi forthcoming).

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Due to suppressed recombination, Y chromosomes have degenerated, undergone extensive structural rearrangements, and accumulated ampliconic gene families across species. The molecular processes and selective pressures guiding dynamic Y chromosome evolution are not well understood. In this study, Chang et al. generate updated Y assemblies of three closely related species in the D. simulans complex using long-read PacBio sequencing in combination with FISH. Despite having diverged only 250,00 years ago, the authors find structural rearrangements, two newly amplified gene families and evidence of positive selection across D. simulans. The authors also suggest the high level of Y duplications and deletions may be mediated by MMEJ biased repair.

      The authors generated a valuable resource for the study of Y-chromosome evolution in Drosophila and describe Y chromosome evolution patterns found in previous Y chromosome sequencing studies, such as newly amplified genes, positive selection, and structural rearrangements. The authors improvements to the Drosophila simulans clade Y chromosomes are commended, as assembly of the highly repetitive Y chromosome sequences is challenging. However, the manuscript is largely descriptive, the claims are largely speculative, and lacks a clear question. There are also a number of concerns with the text and figures (see below concerns). Overall, the manuscript would be significantly improved if the authors focused on a specific question as opposed to a survey of sequence features of the Y chromosome. For example, development of the idea that MMEJ is the primary mechanism for loss of Y chromosome sequence could be nice new twist.

      Our aim is to discover and understand the many different factors and processes that shape the evolution of Y chromosome organization and function. Because these Y chromosomes were largely unassembled, we needed to first generate the sequence assembly before we could ask specific questions. We prefer not to focus the manuscript solely on one specific topic such as MMEJ repair, as our other observations and analyses may be interesting to a wide range of scientists studying topics other than mutation and DNA repair. We are therefore choosing to present the more comprehensive story about Y chromosome evolution that we included in our original manuscript.

      We also respectfully disagree with the comment that our paper is just a descriptive survey of Y chromosomal sequence features. On the contrary, we present thorough evolutionary analyses to test hypotheses about the forces shaping the evolution of Y chromosome organization and Y-linked genes. Specifically, we use molecular evolution and phylogenetic and comparative genomics approaches to show that multi-copy gene families experience rampant gene conversion and positive selection. We posit that one simulans clade-specific Y-linked gene family has undergone subfunctionalization, potentially resolving sexual conflict, and another may be involved in meiotic drive. We also use evolutionary genomic approaches to show that the distribution of Y-linked mutations indeed suggests that Y chromosomes disproportionately use MMEJ and we propose that this unique feature may shape the evolution of Y chromosome structural organization. This is, as far as we know, a novel hypothesis. We think that follow-up studies of either hypothesis merit different papers.

      **Major concerns:**

      1. Title: The authors use "unique structure" in the title, which is a vague point. Are not Y chromosomes, or any chromosome, "unique" in some manner? Also are there not more evolutionary processes governing the rapid divergence of the Y's. Thank you for raising your concern. We believe that we are justified in referring to the Y chromosome as unique among all other chromosomes in its structural properties (e.g. combination of its hemizygosity, abundant tandem repeats, large scale rearrangements, and highly amplified testis-specific genes). Because there are many properties of Y chromosomes that we believe contribute to their rapid divergence, we opted for the general phrase ‘unique structure’ to capture all of these features. Many evolutionary processes likely shape the evolution of that unique structure (e.g. Muller’s Ratchet, background selection, Hill Robertson effects; see Charlesworth and Charlesworth 2000 for a review), and these processes are well-studied, especially on newly evolved sex chromosomes. Here our focus is on evolutionarily old Y chromosomes, which may have comparatively fewer targets of purifying selection and are more likely to be shaped by positive selection (Bachtrog 2008).

      p.2, line 53-56: The authors claim that sexually antagonistic selection and regulatory evolution are causes of recombination suppression. Couldn't this statement be reversed? Recombination suppression via inversions or other rearrangements enable sexually antagonistic selection. This is a chicken or egg question, so it should be revised to have both possibilities be equal.

      Thank you for the suggestion. We think that it is unlikely that recombination suppression itself is beneficial, but for sexually antagonistic selection and regulatory evolution, recombination suppression can have short-term benefits. We rephrased this sentence to be agnostic about the direction (p2 L56).

      p.5, 118-120: Are the assemblies de novo or have they been guided based upon the D. melanogaster Y chromosome assembly? Please clarify how the authors evaluate their methods by comparing their Y-sequence assignments to known chromosomal locations.

      Thank you for the suggestion. We didn’t use D. melanogaster Y chromosome assembly to guide our assemblies. “All assemblies are generated de novo”, and thus we don’t think there is any potential bias. We first assigned Y-linked sequences using the presence of known Y-linked genes, and used this assignment to evaluate our methods. We now make the sentence clear (p5 L112).

      While the gene copy number estimates are accurate, the PacBio-based genome assemblies are still not able to accurately assemble large segmental duplications (see Evan Eichler's laboratories recent primate and human genome assemblies). A statement mentioning the concerns about accuracy of the underlying sequence and genomic architecture shown should be included in the main text. FISH provides support for the location of the contigs, but not for the accuracy of the underlying genomic architecture.

      Thank you for the suggestion. We can’t validate all Y-linked regions. We did validate the larger structural features of the assembly and only discuss the results that we are confident in. We now include sentences to address this concern (p7 L150-152).

      The authors assigned Y-linked sequences based on median male-to-female coverage. Is this method feasible for assigning ampliconic sequence to the Y given the N50 of 0.6-1.2Mb? Are the authors potentially excluding novel Y-linked ampliconic sequence?

      We validated our methods to assign contigs to a chromosome by comparing 10-kb intervals to the contigs with known chromosomal location, including the Y chromosome. Our assignments have high (96, 98, and 99%) sensitivity and low (5, 0, and 3%) false-positive rates in D. mauritiana, D. simulans, and D. sechellia, respectively (see Table S2). Based on these results, we think that this method is reasonable for Y-linked contigs with N50 of 0.6-1.2Mb.

      We might exclude some novel Y-linked sequences since we only assigned ~15Mb out of a total ~40 Mb Y-linked sequences. We acknowledged this possibility, and now include a sentence to address this concern (p31 L554-556).

      Where did the rDNA sequences go in D. simulans and D. sechellia? Can they be detected on another chromosome?

      Please see Fig S5 for detailed results. We found a few copies of rDNA on the contigs of autosomes. We assembled many copies of rDNA that can’t be confidently assigned to Y chromosomes. It’s possible that they might be located on other chromosomes. Based on our FISH data (Fig S4) and previous papers, most of these non-Y-linked rDNA copies should be on the X chromosome. However, in this study, we did not make a concerted effort to assign X-linked contigs.

      Figure 2B is hard to follow and it is unclear what additional value it provides to part A. Why is expression level of specific exons important?

      Exon duplication may be an important contributor to Y-linked gene evolution: most genes have duplications and our figure shows that at least some of these duplicates are expressed. The patterns we see indicate that duplication may play different roles in genes depending on their length. For example, the duplications involving short genes (e.g., ARY) may be functional and influence protein expression, whereas duplications involving large genes (e.g. kl-2) may not influence the overall protein expression level from this gene, although the expressed duplicated exons may play some other role. We revised a sentence in the main text and added a sentence to the figure 2 legend to make this point clearer.

      Figure 3 There are many introns that contain gaps, so it is unclear how confident one can be in intron length when there are gaps.

      Indeed, we are not confident about the length of introns with gaps. Therefore, we separated these introns and showed them in different colors.

      Figure 4: What are the authors using as a common ancestor in this figure to infer duplications in the initial branch?

      We used phylogenies to infer the origin of Y-linked duplicates. Any duplications that happened earlier than the divergence between four species are listed in the branch. We also edited the legend to make this point clearer.

      p.15, paragraph 2: The authors describe a newly amplified gene, CK2Btes-Y, in D. simulans. In the first half of the paragraph the authors state that Y-linked copies are also found in D. melanogaster but have "degenerated and have little or no expression" and call them pseudogenes. Later in the paragraph, the authors state that the D. melanogaster Y-linked copies are Su(Ste), a source of piRNAs that are in conflict with X-linked Stellate. Lastly in the paragraph, the authors discuss Su(ste) as a D. melanogaster homolog of CK2Btes-Y. The logic of defining CK2Btes-Y origins is confusing. Was CK2Btes-Y independently amplified on the D. simulans Y, or were CK2BtesY and Su(Ste) amplified in a common ancestor but independently diverged?

      The amplification of CK2Btes-Y and CK2Btes-like happened in the ancestor of D. melanogaster and D. simulans (Fig S11). However, both CK2Btes-Y and CK2Btes-like became pseudogenes (D. melanogaster CK2Btes-Y is named PCKR in a previous study) in D. melanogaster. On the other hand, Ste and Su(Ste) are only limited to D. melanogaster based on phylogenetic analyses (Fig 5A) and are a chimera of CK2Btes-like and NACBtes. The evolutionary history of this gene family has been detailed in other papers, except for the presence of CK2Btes-Y in the D. simulans complex, which we describe for the first time in this study. We now include a new figure (Figure 5B) a schematic of the inferred evolutionary history of sex-linked Ssl/CK2ßtes paralogs

      Figure 5: Is each FISH signal a different gene copy?

      Yes, based on our assemblies, Lhk-1 and Lhk-2 are mostly located on different contigs. Unfortunately, we are not able to design probes that can separate Lhk-1 from Lhk-2.

      The authors suggest DNA-repair on the Y chromosome is biased towards MMEJ based on indel size and microhomologies. Is there any evidence MMEJ is responsible for variable intron length in the canonical Y-linked genes or the amplification of new gene families? Since MMEJ is error-prone, it's a more tolerable repair mechanism in pseudogenes, so their findings might be biased. Rather than comparing pseudogenes to their parent genes, they should compare chrY pseudogenes to autosomal pseudogenes. Even more would be to track MMEJ on the dot chromosome which is known not recombine and is highly heterchromatic like the Y chromosome.

      We did compare chrY pseudogenes to autosomal pseudogenes in our study. We also add new analyses to address other issues from reviewer 2, which are similar to your concern. We now include data from pericentric heterochromatin and pseudogenes (see Fig 7). Both data types support our conclusion that indel size is only larger on Y chromosomes. This is consistent with a report that the dot chromosome and pericentric heterochromatin have similar indel size distributions (Blumenstiel et al. 2002).

      Reviewer #3 (Significance (Required)):

      While it is a benefit to have much improved Y chromosome assemblies from the three D. simulans clade species, the gap in knowledge this manuscript is trying to address is unclear. The manuscript is almost entirely descriptive and the figures are difficult to follow.

      As stated above, we respectfully disagree with the comment that the manuscript is entirely descriptive, as we present thorough evolutionary analyses to test hypotheses about the forces shaping the evolution of Y chromosome organization and Y-linked genes. We have two guiding hypotheses about the importance of sexual antagonism and DNA repair pathways for Y chromosome evolution, and we conduct sequence analyses that support these hypotheses that sexual antagonism and MMEJ affect Y chromosome evolution.

      References cited in this response:

      Bachtrog D. The temporal dynamics of processes underlying Y chromosome degeneration. Genetics. 2008 Jul;179(3):1513-25. doi: 10.1534/genetics.107.084012. Epub 2008 Jun 18. PMID: 18562655; PMCID: PMC2475751.

      Blumenstiel, J.P., Hartl, D.L, Lozovsky, E.R.. Patterns of Insertion and Deletion in Contrasting Chromatin Domains, Molecular Biology and Evolution, Volume 19, Issue 12, December 2002, Pages 2211–2225, __https://doi.org/10.1093/oxfordjournals.molbev.a004045__

      Chakraborty M, Chang CH, Khost DE, Vedanayagam J, Adrion JR, Liao Y, Montooth KL, Meiklejohn CD, Larracuente AM, Emerson JJ. Evolution of genome structure in the Drosophila simulans species complex. Genome Res. 2021 Mar;31(3):380-396. doi: 10.1101/gr.263442.120. Epub 2021 Feb 9. PMID: 33563718; PMCID: PMC7919458.

      Charlesworth B, Charlesworth D. The degeneration of Y chromosomes. Philos Trans R Soc Lond B Biol Sci. 2000 Nov 29;355(1403):1563-72. doi: 10.1098/rstb.2000.0717. PMID: 11127901; PMCID: PMC1692900.

      Flynn,J, Long, M, Wing, RA, A.G Clark, Evolutionary Dynamics of Abundant 7-bp Satellites in the Genome of Drosophila virilis, Molecular Biology and Evolution, Volume 37, Issue 5, May 2020, Pages 1362–1375, https://doi.org/10.1093/molbev/msaa010

      McKee, Bruce D. et al. “On the Roles of Heterochromatin and Euchromatin in Meiosis in Drosophila: Mapping Chromosomal Pairing Sites and Testing Candidate Mutations for Effects on X–Y Nondisjunction and Meiotic Drive in Male Meiosis.” Genetica 109 (2004): 77-93.

      Tobler R, Nolte V, Schlötterer C. High rate of translocation-based gene birth on the Drosophila Y chromosome. Proc Natl Acad Sci U S A. 2017 Oct 31;114(44):11721-11726. doi: 10.1073/pnas.1706502114. Epub 2017 Oct 19. PMID: 29078298; PMCID: PMC5676891.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript describes a thorough investigation of the Y-chromosomes of three very closely related Drosophila species (D. simulans, D. sechellia, and D. mauritiana) which in turn are closely related to D. melanogaster. The D. melanogaster Y was analysed in a previous paper by the same goup. The authors found an astonishing level of structural rearrangements (gene order, copy number, etc.), specially taking into account the short divergence time among the three species (~250 thousand years). They also suggest an explanation for this fast evolution: Y chromosome is haploid, and hence double-strand breaks cannot be repaired by homologous recombination. Instead, it must use the less precise mechanisms of NHEJ and MMEJ. They also provide circumstantial evidence that MMEJ (which is very prone to generate large rearrangements) is the preferred mechanism of repair. As far as I know this hypothesis is new, and fits nicely on the fast structural evolution described by the authors. Finally, the authors describe two intriguing Y-linked gene families in D. simulans (Lhk and CK2ßtes-Y), one of them similar to the Stellate / Suppressor of Stellate system of D. melanogaster, which seems to be evolving as part of a X-Y meiotic drive arms race. Overall, it is a very nice piece of work. I have four criticisms that, in my opinion, should be addressed before acceptance.

      The suggestion/conclusion that MMEJ is the preferential repair mechanism (over NHEJ) should be better supported and explained. At line 387, the authors stated "The pattern of excess large deletions is shared in the three D. simulans clade species Y chromosomes, but is not obvious in D. melanogaster (Fig 6B). However, because all D. melanogaster Y-linked indels in our analyses are from copies of a single pseudogene (CR43975), it is difficult to compare to the larger samples in the simulans clade species (duplicates from 16 genes). ". Given that D. melanogaster has many Y-linked pseudogenes (described by the authors and by other researchers, and listed in Table S6), there seems to be no reason to use a sample size of 1in this species. Furthermore, given that D. melanogaster is THE model organism, it is the species that most likely will provide information to assess the "preferential MMEJ" hypothesis proposed by the authors. Still on the suggestion/conclusion that MMEJ is the preferential repair mechanism (over NHEJ). Y chromosome in heterochromatic, haploid and non-recombining. In order to ascribe its mutational pattern to the haploid state (and the consequent impossibility of homologous recombination repair), the authors compared it to chromosome IV (the so called "dot chromosome"). This may not be the best choice: while chr IV lacks recombination in wild type flies, it is not typical heterochromatin. E.g., " results from genetic analyses, genomic studies, and biochemical investigations have revealed the dot chromosome to be unique, having a mixture of characteristics of euchromatin and of constitutive heterochromatin". Riddle and Elgin, FlyBook 2018 (https://doi.org/10.1534/genetics.118.301146). Given this, it seems appropriate to also compare the Y-linked pseudogenes with those from typical heterochromatin. In Drosophila, these are the regions around the centromeres ("centric heterochromatin"). There are pseudogenes there; e.g., the gene rolled is known to have partially duplicated exons. In some passages of the ms there seems to be a confusion between new genes and pseudogenes, which should be corrected. For example, in line 261: "Most new Y-linked genes in D. melanogaster and the D. simulans clade have presumed functions in chromatin modification, cell division, and sexual reproduction (Table S7)".. Who are these "new genes"? If they are those listed in Table S6 (as other passages of the text suggest), most if not all of them are pseudogenes. If they are pseudogenes, it is not appropriate to refer to them as "new genes". The same ambiguity is present in line 263: "Y-linked duplicates of genes with these functions may be selectively beneficial, but a duplication bias could also contribute to this enrichment (...) " Pseudogenes can be selectively beneficial, but in very special cases (e.g.. gene regulation). If the authors are suggesting this, they must openly state this, and explain why. Pseudogenes are common in nearly all genomes, and should be clearly separated from genes (the later as a shortcut for functional genes). The bar for "genes" is much higher than simple sequence similarity, including expression, evidences of purifying selecion, etc., as the authors themselves applied for the two gene families they identified in D. simulans (Lhk and CK2ßtes-Y) The authors center their analysis on "11 canonical Y-linked genes conserved across the melanogaster group ". Why did they exclude the CG41561 gene, identified by Mahajan & Bachtrog (2017) in D. melanogaster? Given that most D. melanogaster Y-linked genes were acquired before the split from the D. simulans clade (Koerich et al Nature 2008), the same most likely is true for CG41561 (i.e., it would be Y-linked in the D. simulans clade). Indeed, computational analysis gave a strong signal of Y-linkage in D. yakuba (unpublished; I have not looked in the other species). If CG41561 is Y-linked in the simulans clade, it should be included in the present paper, for the only difference between it and the remaining "canonical genes" was that it was found later. Finally, the proper citation of the "11 canonical Y-linked genes" is Gepner and Hays PNAS 1993 and Carvalho, Koerich and Clark TIG 2009 (or the primary papers), instead of ref #55. Other points/comments/suggestions:

      a) Possible reference mistake: line 88 "For example, 20-40% of D. melanogaster Y-linked regulatory variation (YRV) comes from differences in ribosomal DNA (rDNA) copy numbers [52, 53]." reference #53 is a mouse study, not Drosophila.

      b) Possible reference mistake: line 208 "and the genes/introns that produce Y-loops differs among species [75]". ref #75 is a paper on the D. pseudoobscura Y. Is it what the authors intended?

      c) line 113. "We recovered all known exons of the 11 canonical Y-linked genes conserved across the melanogaster group, including 58 exons missed in previous assemblies (Table S1; [55])." Please show in the Table S1 which exons were missing in the previous assemblies. I guess that most if not all of these missing exons are duplicate exons (and many are likely to be pseudogenes). If they indeed are duplicate exons, the authors should made it clear in the main text, e.g., "We recovered all known exons of the 11 canonical Y-linked genes conserved across the melanogaster group, plus 58 duplicated exons missed in previous assemblies."

      d) line 116 "Based on the median male-to-female coverage [22], we assigned 13.7 to 18.9 Mb of Y-linked sequences per species with N50 ranging from 0.6 to 1.2 Mb." The method (or a very similar one) was developed by Hall et al BMC Genomics 2013, which should be cited in this context. e) line 118: "We evaluated our methods by comparing our assignments for every 10-kb window of assembled sequences to its known chromosomal location. Our assignments have 96, 98, and 99% sensitivity and 5, 0, and 3% false-positive rates in D. mauritiana, D. simulans, and D. sechellia, respectively (Table S2). The procedure is unclear. Why break the contigs in 10kb intervals, instead of treating each as an unity, assignable to Y, X or A? The later is the usual procedure in computational identification of suspect Y-linked contigs (Carvalho and lark Gen Res 2013; Hall et al BMC Genomics 2013). The only reason I can think for analyzing the contigs piecewise is a suspicion of misassemblies. If this is the case, I think it is better to explain.

      f) Fig. 1. It may be interesting to put a version of Fig 1 in the SI containing only the genes and the lines connecting them among species, so we can better see the inversions etc. (like the cover of Genetics , based on the paper by Schaeffer et al 2008).

      g) Table S6 (Y-linked pseudogenes). Several pseudogenes listed as new have been studied in detail before: vig2, Mocs2, Clbn, Bili (Carvalho et al PNAS2015) Pka-R1, CG3618, Mst77F (Russel and Kaiser Genetics 1993; Krsticevic et al G3 2015) . Note also that at least two are functional (the vig2 duplication and some Mst77 duplications).

      h) line 421: "one new satellite, (AAACAT)n, originated from a DM412B transposable element, which has three tandem copies of AAACAT in its long terminal repeats." The birth of satellites from TEs has been observed before, and should be cited here. Dias et al GBE 6: 1302-1313, 2014.

      i) Fig S2 shows that the coverage of PacBio reads is smaller than expected on the Y chromosome. Any explanation? This has been noticed before in D. melanogaster, and tentatively attributed to the CsCl gradient used in the DNA purification (Carvalho et al GenRes 2016). However, it seems that the CsCl DNA purification method was not used in the simulans clade species (is it correct?). Please explain the ms, or in the SI. The issue is relevant because PacBio sequencing is widely believed to be unbiased in relation to DNA sequence composition (e.g., Ross et al Genome Biol 2013).

      j) I may have missed it, but in which public repository have the assemblies been deposited?

      Significance

      see above.

    1. Author Response:

      Reviewer #2 (Public Review):

      Yu et al provide a comprehensive set of experiments to determine that bradyzoites have much slower cytosolic Ca2+ parameters, which impact on gliding motility, a key process of Toxoplasma spread and persistence.

      The only main criticism that I have is the use of the MIC2-GLuc reporter to measure microneme secretion in bradyzoites. Do bradyzoites have any appreciable level of MIC2 and its associated protein M2AP?? This is important that may affect the outcome. If bradyzoites do not, then the MIC2-GLuc reporter might not have appropriate levels of M2AP to correctly traffic to the micronemes. I recommend that the authors quantitate, either by western blot or IFA, the levels of MIC2 and M2AP in bradyzoites versus tachyzoites and also show that M2AP co-localises with MIC2-GLuc to give confidence that MIC2-GLuc is trafficked correctly and thus the low readings of secretion are not just a result of the reporter mistrafficked. It would also be pleasing to see, that 1hr incubation leads to restoration of MIC2-GLuc secretion.

      We acknowledge that the expression and localization of MIC2-Gluc reporter is a potential concern. We performed western blotting (Figure 2C) and IFA (Figure 2 supplement 1A) to confirm that bradyzoites express MIC2-Gluc and M2AP albeit at lower levels compared with tachyzoites. Moreover, MIC2-GLuc and M2AP were properly co-localized to the apical end in bradyzoites, ruling out the possibility of mis-localization of the MIC2-GLuc reporter. Based on these results, we believe that MIC2-GLuc provides a reliable read-out for microneme secretion in in vitro differentiated bradyzoites. Additionally, the conclusion that MIC secretion is dampened in bradyzoites is also supported by the studies using the FNR-Cherry reporter in Figure 2E,F,G.

      Reviewer #3 (Public Review):

      This is a first study that looks in detail at Ca-controlled gliding motility and ATP supply in bradyzoites. A comparison of such different parasite stage by manipulating Ca and ATP metabolism is challenging. Intervention by chemical compounds needs to overcome a prominent cyst wall and the usage of genetic tools needs to consider the broad changes in protein expression between tachyzoites and bradyzoites as well as a heterology between individual bradyzoites. The authors used excysted bradyzoites to exclude the cyst wall as a diffusion barrier as a major factor in the efficacy of different Ca agonists. To address differences in expression levels between tachyzoites and bradyzoite stages the authors developed a ratiometric Ca sensor based upon an autocleaved GCaMP6f-BFP dimer protein.

      Overall the conclusions are well supported but there are methodological questions that need to be addressed.

      Bradyzoites show a heterogenous expression of Bag1 / Sag1 markers as well as heterologous proteins. This is shown in Fig 1A and Fig 2b for example. However, in most time-dependent measurements of Ca-dependent fluorescence (Fig 2G, 3D the authors only average three cells. This appears to be insufficient to represent the bradyzoite population. How is the variance between the three measured cells?

      We have quantified more cells in all figures related to fluorescence measurements. For measurements of single parasites in Figure 5B, 5D, 5E, 6F, 8A, 8B and Figure 7 supplement 1A, we have now quantified 10 parasites for each condition and plotted the data as means ±S.D. to show the variance. For in vitro induced cysts or ex vivo cysts in Figure Fig 2G, 3D, 3E, 4C,4G, 6E, 7B and Figure 4 supplement 1A, we measured 5 cysts or vacuoles per condition. Because these samples contain many parasites within each vacuole or cyst, they represent a greater sample size. The data are also plotted a means ±S.D.

      In addition, the Mic2 promoter driven Gluc-myc protein is not expressed in all bradyzoites. This is perhaps not suprising as Mic2 seems to be downregulated in bradyzoites according to Pittman and Bucholz et al dataset in ToxoDB. If interpreted correctly the lower expression of Gluc in some bradyzoites would favour an underestimation of the RLUs in Fig 2D.

      We acknowledge that the expression and localization of MIC2-Gluc reporter is a potential concern. We performed western blotting (Figure 2C) and IFA (Figure 2 supplement 1A) to confirm that bradyzoites express MIC2-Gluc and M2AP albeit at lower levels compared with tachyzoites. Moreover, MIC2-GLuc and M2AP were properly co-localized to the apical end in bradyzoites, ruling out the possibility of mis-localization of the MIC2-GLuc reporter. Based on these results, we believe that MIC2-GLuc provides a reliable read-out for microneme secretion in in vitro differentiated bradyzoites. Additionally, the conclusion that MIC secretion is dampened in bradyzoites is also supported by the studies using the FNR-Cherry reporter in Figure 2E,F,G.

      The maturation of bradyzoite takes several weeks. This cannot be accomplished with currently available system in vitro and the authors use 1 week matured bradyzoites. To facilitate comparability to data from other manuscripts it would be helpful if the authors could quantify the differentiation stage of the in vitro bradyzoites. This could be done by measuring the fractions of Bag1-positive and Sag1-negative bradyzoites.

      We thank the reviewer for this useful comment. We have quantified the percentage of BAG1-positive SAG1-negative bradyzoites within each cyst induced for 3, 5 or 7 days by IFA and spinning disc confocal microscopy (Figure 3 supplement 1A). This analysis demonstrated that the percentage of BAG1-positive and SAG1-negative bradyzoites reached ~70% at day 7 after induction (Figure 3 supplement 1B). For this reason, we used a 7 day induction treatment for the majority of experiments. Also, where imaging was used in the analysis, we focused on regions of in vitro differentiated cysts that expressed high levels of BAG1-mCherry.

      The mcherry and GCaMP6f signal in fig 3B seem mutually exclusive. This may be due to difference in calcium signalling between Bag1 pos or neg parasites or due to expression differences of GCaMP6f.

      To test the possibility of expression differences in GCaMP6f, we quantified the fluorescence of BAG1-mCherry and GCaMP6f in different bradyzoites within the cyst shown in Figure 3B. At time 0 prior to stimulation, we observed heterogenous expression of BAG1- mCherry while the signal for GCaMP6f expression was relatively constant (Figure 3B supplement 1C and 1D). In contrast, when in vitro differentiated bradyzoites were stimulated with A23187, they showed reduced levels of GCaMP expression in cells that were strongly positive for BAG1-mCherry (Figure 3B). Collectively, these findings are consistent with the difference in GCaMP fluorescence being due to dampened calcium responses in bradyzoites rather than expression differences. This conclusion is supported by studies on GCaMP responses in cells where we normalized for expression level using a dual-expression BFP reporter in Figure 6. Therefore, we do not think that heterogeneity in the expression of GCaMP is responsible for the observed dampened response in bradyzoites.

      The authors use syringe, trypsin-released and FACS sorted bradyzoites in multiple Ca assays. How can it be excluded that this procedure affects (depletes) Ca stores?

      In all the figures except Figure 2C-2D, we did not use FACS to sort bradyzoites. Instead, we scraped cells cultured at pH 8.2, used syringe passage through 25g needle followed by centrifugation. Cyst pellets were resuspended and digested with trypsin to liberate bradyzoites. For tachyzoites, all procedures were similar except that we did not use trypsin digestion. As a control, we have now treated tachyzoites similarly with trypsin and monitored the calcium stores using ionomycin. We found that trypsin digestion did not affect the calcium stores or response as shown in Figure 7 figure supplement 1A.

      In my opinion several experiments in this manuscript would benefit from clarification of this point. For example: In Fig 7A Fu et al measure Ca for 5min during trypsin digestion, however, for gliding assays cysts are digested for 10min. The Ca monitoring should cover the complete 10min off trypsin digest.

      We understand the concern but there were practical reasons for the slightly different times used. In panel A where we are monitoring calcium during trypsin digestion, the majority of cysts are dispersed after 5 min resulting the parasites being out of focus. As such, it is not practical to monitor beyond this time point. In the panel C, we were interested in observing parasites after the cysts where fully digested and hence we used a slightly longer time period to allow complete digestion and for the parasites to settle to the bottom of the dish before further recording. In this instance, similar to the result in A, most parasites remained dormant and did not show elevated calcium levels. In the figure, we are selectively showing a rare example where calcium signaling was observed in order to compare the patterns to what is normally observed with tachyzoites. These combined panels are not meant to be a comparison of kinetics, as this aspect is tested more directly in later experiments. We have modified the text to make the rationale for this experiment clear.

      In Fig 2B Fu et al digest infected monolayers with trypsin to release mcherry from cysts matrices. How can the authors exclude that trypsin is not digesting mCherry protein in this assay?

      I think the reviewer means 2F as in 2B we are using BAG1 mCherry to visualize bradyzoites – but they are not being liberated in this image. In 2F we use a different construct, FnR-mCherry that directs the reporter to be constitutively secreted to either the PV (surrounding tachyzoites) or the cyst matrix (surrounding bradyzoites). When the cysts are disrupted with trypsin, the mCherry is likely to disperse and may also be digested. However, this would not happen if it remains inside the parasite. This control is provided to show that the protein is secreted into the matrix. We have revised the text to clarify the use of this control.

      Fig 7 E,F: the authors measure shorter gliding distances of bradyzoite as compared to tachyzoites. Trails of both parasites however, are detected by visualizing using different antigens that may have different shedding behavior on the FBS-coated glass surface. The Bag1 trail also depends on Bag1 expression, which is shown in numerous images to not be equal among individual bradyzoites. This point is very challenging to address but should at least be discussed.

      BAG1 is used here to discern the bradyzoites, not to detect the trail. Trails are stained with either SAG1 or SRS9 – corresponding to the most abundant surface GPI anchored antigen in each stage. Since these proteins are part of the same C-C fold family and are similarly anchored, we feel they are comparable. We have added the following statement to the results: “These two surface markers are both members of the cysteine rich SRS family that are tethered to the surface membrane by a GPI anchor, thus they represent comparable reporters for each stage.”

      Fig 7E: Bradyzoites are considered to satisfy their ATP needs mostly via glycolysis and the data shown do support this capability. I find the ability of OligomycinA to block glucose-dependent gliding surprising as this suggests a necessary mitochondrial transport chain for ATP-production from glucose. This result should be mentioned clearly in the text and its implications discussed.

      The Discussion has been revised as suggested.

      Figure 8: The authors claim a recovery of bradyzoite ATP and Ca levels after 1hr incubation with carbon sources and Ca, that together enable efficient gliding. However, the elevation of bradyzoite ATP occurs after the parasites spend 2 hours in glucose-free and Ca-free conditions, whereas gliding assays are done after a short 10min trypsin digest. I am not entirely convinced that low ATP levels post-egress are responsible for the low gliding activity. Ideally gliding assays should be done after a similar purification procedure to correlate the two experiments.

      We have repeated the gliding assays using bradyzoites purified in the same manner as for the ATP measurements and found the same result that a combination of exogenous calcium and glucose enhance recovery of gliding motility (Figure 8D, 8F). In addition, we used the same time point to purify bradyzoites for MIC2-Gluc secretion and found exogenous calcium and glucose also led to an increase in MIC2-GLuc secretion, indicative of the recovery of microneme secretion (Figure 8C).

    1. Author Response:

      Reviewer #1 (Public Review):

      Summary

      Moncunill et al set out to investigate a very important question: why are half of children vaccinated three times with RTS,S AS01 protected from clinical malaria - and half not? To do so they isolated PBMCs before vaccination and one month after third vaccination and stimulated them in vitro with DMSO (vehicle control), two malaria antigens (CSP (part of RTS,S) & AMA1) or HBS (hepatitis B antigen - part of RTS,S).

      They then assessed their transcriptional response by blood transcriptional module analysis and correlated those results with previous published data on antibody titers and T cell cytokine production to find associations. To assess risk of clinical malaria, responses were compared between RTS,S vaccinated children who developed clinical malaria in the one year follow-up (cases) and those who received RTS,S or a comparator vaccine and did not (controls). They found that responses after RTS,S vaccination did not predict protection from clinical malaria. Instead a blood transcriptional module signature related to dendritic cells, inflammation, and monocytes before vaccination may be associated with clinical malaria risk.

      Strengths

      Immune correlates of protection are evaluated in African children (who are the RTS,S target population) in a natural transmission setting.

      Excellent set of controls: children (same age) vaccinated with RTS,S or comparator vaccine alongside each other -> retrospectively stratified by whether the did or did not develop clinical malaria : controls for the effect of a developing immune system and would allow to disentangle RTS,S specific and clinical malaria specific response patterns.

      Weaknesses

      RTS,S is composed of CSP & HBS. yet when PBMCs from children vaccinated three time with RTS,S are stimulated with these peptides no transcriptional differences compared to children receiving a rabies or meningitis vaccine were detected (Figure 2). this lack of recall response impacts all downstream conclusions and comparisons made in the paper.

      The fact that bulk transcriptional profiling of Ag-stimulated PBMCs (and specifically to CSP) did not identify large significant differences in BTM expression between the RTS,S vs. comparator group could be due to several factors. First of all, the frequency of antigen-specific CD4+ T cells was very low among CD4+ T cells (Figure 4 of the manuscript shows that CSP-specific CD4+ T cells comprise < 0.004% of all CD4+ T cells). This low frequency of CSP-specific T cells is consistent with other RTS,S studies [e.g. as we state on line 330, we have previously found that CSP-specific T-cells in RTS,S/AS01 vaccines comprise < 0.10% of all CD4+ T cells (1)]. Moreover, CD4+ T cells themselves comprise approximately 45-57% of all PBMCs (2). Thus, finding an expression signal between the RTS,S vs. comparator group would require the signal to be high enough to be detected in only 0.002% of all PBMCs [0.004% (% CSP-specific CD4+ T cells out of total CD4+ T cells) x 51% (average % of CD4+ T cells out of all PBMCs) = 0.002%]. Thus, lack of detectable recall response does not mean lack of recall response. Moreover, as suggested below, we opted not to focus the rest of the manuscript on the Ag-stimulation results.

      Second of all, the PBMCs were stimulated on site for 12h and then cryopreserved. This stimulation time was chosen based on the kinetics of IFN- and IL-2 mRNA response (3), but other responses may have had different kinetics and thus have already resolved or have not yet occurred by the 12-h cryopreservation. We have added text in the manuscript to discuss these caveats (“Another potential reason for why no BTMs were found to associate with the response to RTS,S/AS01 vaccination or with protection when analyzing CSP-stimulated PBMC is that all PBMC were stimulated on site for 12 hours (this stimulation time was chosen based on the kinetics of the IFN-gamma transcriptional response) and then cryopreserved. Thus, we were unable to detect earlier transient responses that had already resolved by 12 hours, as well as more delayed response that had not yet initiated by 12 hours, if such responses occurred”).

      It should be noted that in all our analyses, the stimulated results were adjusted for DMSO to focus on the antigen-specific response only. This would explain why we detect signal in the DMSO samples but not in response to stimulation. We have realized that this was not very well described in the figure captions and the Methods section and have added more details, including the model description in Methods section. As such, we do not believe that these results impact all downstream conclusions. We believe that the unstimulated results provide significant new insights into the immune and molecular mechanisms of RTS,S vaccine efficacy, not necessarily directly related to the RTS,S-specific acquired immune response. Finally, we would like to highlight the fact that we have improved our model specification to directly account for the pairing of some of the samples using a random effect using the limma package. This has slightly increased statistical power, and as such the number of significantly differentially expressed BTMs in response to stimulation is a bit higher (but still much less than that for the DMSO). Originally, we had decided against the use of a random effect due to the computational cost of estimating the random effect.

      Transcriptional responses 1 month after the final RTS,S vaccination do not predict clinical malaria risk (Figure 3) - this is a key finding, which should be central to the conclusion of this paper.

      Considering that Kazmin et al. (4) showed that the transcriptional response to the third RTS,S/AS01 dose peaks at Day 1 post-injection, with some decline by Day 6 and approximately 90% of the response having waned by Day 21 (with the caveat that Kazmin et al.’s study population was malaria-naïve adults), we do not find it surprising that there were only a few BTMs whose 1 month post-final RTS,S dose associated with clinical malaria risk. However, the point is well-taken about the relative merits of the baseline. We have edited the Discussion to include discussion of the Month 3 correlates results:

      “Compared to the 45 BTMs whose baseline levels significantly associated with clinical malaria risk in RTS,S/AS01-vaccinated African children, fewer BTMs (seven) had levels at one month post-final RTS,S/AS01 dose that significantly associated with clinical malaria risk. Moreover, if a more stringent FDR cutoff had been used (i.e. 5%), six of these seven BTMs would not have been identified. Thus it is entirely possible that, at one month post-final RTS,S/AS01 dose, there is no circulating immune transcriptomic signature predictive of risk. Such a conclusion would not be surprising, given that in malaria-naïve adults, the transcriptional response to the third RTS,S/AS01 dose has been shown to peak at Day 1 post-injection, with some decline by Day 6 and approximately 90% of the response having waned by Day 21 (17). Therefore, it is likely that the sampling scheme in this study (one month post-final dose) misses the majority of the transcriptional response to RTS,S/AS01.”

      The take-home message put forward in the title/abstract (that a monocyte and DC related pre-vaccination signature predicts risk of clinical malaria in RTS,S vaccinated children) is not strongly supported by the data. It is based on blood transcriptional modules related to monocytes being picked out when comparing RTS,S vaccinated cases and controls.

      Thank you for giving us the opportunity to provide further rationale for our focus on the 7 monocyte-related and 4 DC-related BTMs shown in Figure 6B (MAL067 column) out of the 45 total BTMs whose baseline expression associated with clinical malaria risk in RTS,S/AS01-vaccinated children. The reviewer implies that these modules were chosen for focus somewhat randomly or without justification (or, even worse, “cherry picked”), which we would agree would be an imperfect method for drawing conclusions.

      First, we have always ensured to mention that the 45 baseline modules that correlated with risk in RTS,S recipients (Fig 6B, MAL067 column) belonged to many functional annotations, including DC cells and monocytes. (Abstract: “In contrast, baseline levels of BTMs associated with dendritic cells and with monocytes (among others) correlated with malaria risk”) (Main text, lines 519-522: “Compared to the results from the month 3 analysis (7 BTMs), the baseline correlates analysis of MAL067 revealed a larger number (45) of BTMs, spanning many functional categories, whose month 0 levels in vehicle-stimulated PBMC nearly all associated with clinical malaria risk in RTS,S/AS01 recipients .”

      The focus on DC cells and monocytes is due to two reasons: 1) the fact that the DC-related modules and the monocyte-related modules were some of the most significant correlations (lines 522-524: “The BTM with the most significant association with risk was “enriched in monocytes (II) (M11.0)” (FDR = 1.80E-14), followed by “inflammatory response (M33)” (FDR = 2.45E-07) and “resting dendritic cell surface signature (S10)” (FDR = 6.03E-07).”

      Second, the baseline association of DC- and monocyte-related modules appeared to generalize across populations: (Abstract: “A cross-study analysis supported generalizability of the baseline dendritic cell- and monocyte-related BTM correlations with malaria risk to healthy, malaria-naïve adults, suggesting that certain monocyte subsets may inhibit protective RTS,S/AS01-induced responses.”; Main text: “BTMs related to dendritic cells and to monocytes were most consistently associated with risk across these three studies [“resting dendritic cell surface signature (S10)”, “DC surface signature (S5)”, “enriched in dendritic cells (M168)”, “enriched in monocytes (I) (M4.15)”, “enriched in monocytes (II) (M11.0)”, “enriched in monocytes (IV) (M118.0)”, and “monocyte surface signature (S4)” significantly correlated with risk in all three studies].”

      The first two sentences of the Discussion (lines 577-580) explain our focus on monocytes and DCs:

      “Our main finding is the identification of a baseline blood transcriptional module (BTM) signature that associates with clinical malaria risk in RTS,S/AS01-vaccinated African children. In a cross-study comparison, much of this baseline risk signature – specifically, dendritic cell- and monocyte-related BTMs – was also recapitulated in two of the three CHMI studies in healthy, malaria-naïve adults.”

      Finally, we note that the title (“A baseline transcriptional signature associates with clinical malaria risk in RTS,S/AS01-vaccinated African children”) does not restrict to DC-related or monocyte-related BTMs, rather, we chose this title based on the larger number of BTMs, and higher correlations with risk, in the baseline analysis compared to the Month 3 analysis.

      We have revised all instances where we have communicated this less clearly, e.g. “for why we identified a baseline monocyte transcriptional signature of risk” has been changed to “for why we identified monocyte-related BTMs in our transcriptional signature of risk”.

      Many other modules are picked out as well e.g. cell cycle (Figure 6B). An in-depth analysis of the genes in these module and what their up and downregulation can tell us about their function is warranted to support the conclusions.

      Thank you for the suggestion to look at the cell cycle module in Figure 6B. You make a good point that this module is the only module to show a significant association with clinical malaria risk across all 4 of the RTS,S studies and should therefore be further examined. First, we have added this to the text:

      “Only one BTM, “cell cycle and transcription (M4.0)”, was significantly associated with risk across all four studies. Of the 335 genes in this module (M4.0), 130 were also present in one or more of the six “monocyte-related” BTMs shown in Figure 6B (297 genes total across all six BTMs), suggesting that the “cell cycle” and “monocyte” results may actually be picking up the same signal.”

      We have done the gene-level analysis as suggested, resulting in 8 new supplemental figures (Figure 6-figure supplements 1-8) and one new supplemental table (S5). We have also made the following revisions to the text:

      In Results: “To gain insight into specific module-member genes that may be involved in the RTS,S/AS01 baseline risk signature, we performed the same analysis on the gene level, i.e. examined associations with clinical malaria risk for each of the constituent genes in the 45 BTMs shown in Figure 6B. Figure 6-figure supplements 1-8 show the gene-level association results within the eight BTMs that were significantly associated with clinical malaria risk in MAL067 and at least two of the three CHMI studies, and had at least one gene in MAL067 that was significantly associated with risk (these eight correspond to M4.0, S10, S5, M168, M4.3, M11.0, M4.15, and S4). Within MAL067, 35 unique genes were shown to significantly associate with malaria risk (Supplementary Table 5); 9 of these genes (CCNF, MK167, KIF18A, NPL, RBM47, CFD, MAFB, IL13RA1, and CCR1) also had significant association with non-protection in one of the CHMI studies. Although no individual gene was significantly associated with risk across >2 studies, many showed consistent effect (direction and magnitude) across 3 studies. This further supports our choice to focus on modules instead of individual genes as GSEA increases power to detect more subtle but coordinated changes in gene expression data that would be missed otherwise. For this same reason, GSEA has been shown to enhance cross-study comparisons (45).”

      In Discussion: “Our gene-level correlates analyses suggest an alternative hypothesis, however. With the caveat that the gene-level analyses were performed post hoc, high baseline expression of STAB1 (which is present in DC-related, monocyte-related, and cell cycle-related modules) was found to positively associate with clinical malaria risk (Figure 6-figure supplements 1, 2, and 6). STAB1 encodes stabilin-1 (also called Clever-1), a transmembrane glycoprotein scavenger receptor that links extracellular signals to intracellular vesicle trafficking pathways (58). Interestingly, stabilin-1high monocytes show downregulation of proinflammatory genes, and T cells co-cultured with stabilin-1high monocytes showed decreased antigen recall, suggesting that monocyte stabilin-1 suppresses T cell activation (56). Thus one possibility is that stabilin-1high immunosuppressive monocytes circulating at baseline could decrease protective RTS,S-induced T-cell responses, or inhibit another aspect of adaptive immunity. Single-cell transcriptomic profiling of PBMC or purified monocyte subsets in future RTS,S trials in African children in malaria-endemic areas could help test this hypothesis.”

      Impact

      This paper will inform future studies looking for correlates of RTS,S induced protection from clinical malaria in a variety of ways:

      It validates the blood transcriptional module approach (as published by Li S, Rouphael N, Duraisingham S, Romero-Steiner S, Presnell S, Davis C, Schmidt DS, Johnson SE, Milton A, Rajam G, et al: Molecular signatures of antibody responses derived from a systems biology study of five human vaccines. Nat Immunol 2014) to find target cell populations which can then be investigated in much more detail.

      It shows that studying PBMC recall responses after peptide stimulation post final vaccination is not the way forward, since no response is detected (Figure 2). future studies can now take an alternative approach. e.g. since unstimulated PBMCs (vehicle control) from RTS,S vaccinated children were different from those who received a comparator vaccine (Figure 2) RTS,S vaccine signatures could be picked up much more easily by whole blood RNAseq.

      It implicates innate immune cells in shaping an individuals response to a vaccine - an exciting basis for future functional and mechanistic studies.

      We are glad the reviewer appreciates the value of the study.

      Reviewer #2 (Public Review):

      This paper reports a sub-study of the RTS,S/AS01 malaria vaccine Phase 3 trial, which aimed to identify groups of genes (blood transcriptional modules, BTMs) for which expression in DMSO or antigen-stimulated PBMCs was associated with clinical malaria during a 12-month follow-up period. Study subjects were infants and children who received either RTS,S/AS01 or comparator vaccines (meningococcal C for infants, rabies vaccine for children), enrolled in the study in Tanzania and Mozambique (with some additional analyses using samples from Gabonese infants).

      Using PBMCs collected at baseline before vaccination and 3 months later (a month after the third vaccine dose), stimulated with DMSO or parasite antigens, the authors used RNA-sequencing to identify BTMs which were different between recipients of RTS,S/AS01 vs comparator vaccines; which were different between baseline and month 3 in RTS,S/AS01 recipients; and which differed between RTS,S/AS01 recipients with a malaria episode and those without a malaria episode during the follow-up period. This combination of analyses might help to distinguish BTMs specifically associated with RTS,S/AS01 vaccine efficacy from those associated with other factors influencing susceptibility to malaria. To further aid mechanistic understanding the authors examined correlations between BTMs and measures of cellular and humoral immune responses. To try to establish generalisability the authors examined whether BTMs identified in African children were also associated with developing malaria in RTS,S/AS01-vaccinated malaria-naïve adults in the United States who underwent controlled human malaria infection (CHMI).

      Strengths of the study include:

      1) The relatively large number of subjects, the large amount of transcriptomic and immunological data which has been generated (and made publicly accessible), and the extensive analysis to evaluate associations between BTMs and numerous immunological variables.

      2) Clear explanation of both the rationale and methods for most of the analyses

      3) The attempt to validate findings in the CHMI studies

      4) Matching of subjects to try to eliminate the confounding effects of age, study site, and time of vaccination

      Weaknesses of the study include:

      1) Despite the relatively large size of the study, it is hard to know whether it had sufficient power to achieve its main objective, and we are not presented with data to demonstrate how successfully the authors managed to match subjects for age, timing of vaccination and follow-up duration

      We have added the following to our “limitations” paragraph in the Discussion: “Fourth, despite the relatively large size of the study, our statistical power was limited by the number of malaria cases with available samples; sampling additional controls would not have increased our statistical power.”

      Moreover, we now also provide the new Supplementary Table 1, which provides complete information on participant match ID, site, age cohort, sex assigned at birth, and time of vaccination.

      2) The comparator group to the RTS,S/AS01 vaccine is not a single vaccine, but two vaccines, but the presentation of the data makes it difficult to identify what effect this may have had on the results

      Indeed, comparators received a rabies vaccine or the meningococcal C conjugate depending on the age cohort. However, we think that the impact on the study results and conclusions is minimal since the main results are based on baseline gene expression and its association with malaria risk within RTS,S vaccinees. Correlates of malaria risk in comparators are done separately. Comparator vaccination may be a confounding factor for age cohort, but we are not analyzing the effect of age cohort on the transcriptional profile. Comparators are only included in the analysis of RTS,S immunogenicity at post-vaccination (RTS,S vs Comparators, Fig 2A, Comparison (1)) and we have adjusted analyses by age cohort and hence by comparator vaccine. The fact that the comparators received different control vaccines only stresses that the BTMs found to be associated with RTS,S vaccination are specific to the RTS,S vaccine.

      Moreover, as an alternative way to identify RTS,S-specific transcriptional responses, we also include Comparison (2), which compares Month 3 to Month 0 transcription levels within RTS,S vaccinees. We include in the text extensive discussion of the merits and drawbacks of each comparison:

      “Two comparisons were done to characterize the transcriptional response to RTS,S/AS01 vaccination: Comparison (1): comparing gene expression in month 3 samples from RTS,S/AS01 vs comparator recipients (month 3 RTS,S/AS01 vs comparator); and Comparison (2): comparing gene expression in month 3 vs month 0 from RTS,S/AS01 recipients (RTS,S/AS01 month 3 vs month 0). Each comparison has its own advantages: Comparison (1) allows the identification of RTS,S/AS01-specific responses while taking into account other environmental factors to which the children are exposed, such as malaria exposure (albeit malaria transmission intensity was low during the study at both sites). Moreover, the very young ages of the trial participants mean that RTS,S/AS01-induced changes may be confounded with normal developmental changes in participant immune systems, further underscoring the value of Comparison (1), as it does not involve comparison across two different time points. On the other side, an advantage of Comparison (2) is that it takes into consideration each participant’s intrinsic baseline gene expression. Comparison (1) uses data from both infants and children, whereas Comparison (2) can only yield insight into RTS,S/AS01 responses in children (as baseline samples were not collected from infants).”

      3) A very "liberal" false-discovery rate (FDR) threshold has been used throughout to define significant associations. An FDR of 0.2 indicates that 20% (or 1 in 5) results which are considered significant will be false-discoveries. This means that the "significant" results must be interpreted with a high degree of caution. Typically researchers use lower FDR thresholds, like 0.05 or 0.01, although one may argue for different thresholds under different circumstances

      While it is not uncommon to use a threshold of 20% for immune correlates studies [e.g. (5-10)], we agree with you that it is important to clearly state the chosen FDR rate and to discuss conclusions in the context of the FDR rate used. We see we could improve our manuscript in this respect. We have added the following:

      Results: “Compared to the 45 BTMs whose baseline levels significantly associated with clinical malaria risk in RTS,S/AS01-vaccinated African children, fewer BTMs (seven) had levels at one month post-final RTS,S/AS01 dose that significantly associated with clinical malaria risk. Moreover, if a more stringent FDR cutoff had been used (i.e. 5%), six of these seven BTMs would not have been identified. Thus it is entirely possible that, at one month post-final RTS,S/AS01 dose, there is no circulating immune transcriptomic signature predictive of risk…”

      Discussion: “Finally, while it is not uncommon to use an FDR cutoff of 20% in high-dimensional immune correlates studies [e.g. (65-70)], our results should be interpreted with the requisite level of caution. However, we do note that many of our significant modules in the baseline risk analysis would have survived even lower FDR cutoffs (in many cases even a 1% cutoff), giving us a fair degree of confidence in our results. For example, of the seven monocyte-related BTMs whose baseline levels associated with risk, all would have survived a 5% FDR cut-off, and three even a 1% cut-off; likewise, of the four dendritic cell-related BTMs whose baseline levels associated with risk, all would have survived a 5% FDR cut-off, and three even a 1% cut-off.”

      Moreover, we have revised Figures 2, 3, and 6 so that it is easy to discern whether a specific BTM correlation would also pass more stringent FDR cutoffs, through the addition of 1, 2, or 3 asterisks where appropriate: “|FDR| < 0.2 (), < 0.05 (), < 0.01 ().” Note that, most central to the key message of the paper, many of the monocyte-related, DC-related, and cell cycle-related BTMs would have passed more stringent FDR cutoffs, with many even passing a 1% FDR cutoff (as discussed above).

      4) A perplexing finding, which is not addressed in detail, is the large number of BTMs which differ between RTS,S and comparator vaccine groups after DMSO stimulation of PBMCs, but these are not seen when PBMCs are stimulated with parasite antigens in DMSO (and a similar finding for month 3 vs month 0 samples from RTS,S recipients). This raises some concern about the stimulation experiments, because one might expect that the DMSO vehicle in the antigen preparations would trigger a similar response to DMSO alone.

      It should be noted that in all our analyses, the stimulated results were adjusted for DMSO to focus on the antigen-specific response only. This would explain why we detect signal in the DMSO samples but not in response to stimulation. We have realized that this was not very well described in the figure captions and the Methods section and have added more details, including the model description in Methods section. As such, we do not believe that these results impact all downstream conclusions. We believe that the unstimulated results provide significant new insights into the immune and molecular mechanisms of RTS,S vaccine efficacy, not necessarily directly related to the RTS,S-specific acquired immune response. We would also like to highlight the fact that we have improved our model specification to directly account for the pairing of some of the samples using a random effect using the limma package. This has slightly increased statistical power, and as such the number of significantly differentially expressed BTMs in response to stimulation is a bit higher (but still much less than that for the DMSO). Originally, we had decided against the use of a random effect due to the computational cost of estimating the random effect.

      The fact that bulk transcriptional profiling of Ag-stimulated PBMCs did not identify almost any significant differences in BTM expression between the RTS,S vs. comparator group could be due to several factors. First of all, the frequency of antigen-specific CD4+ T cells was very low among CD4+ T cells (Figure 4 of the manuscript shows that CSP-specific CD4+ T cells comprise < 0.004% of all CD4+ T cells). This low frequency of CSP-specific T cells is consistent with other RTS,S studies [e.g. as we state on line 330, we have previously found that CSP-specific T-cells in RTS,S/AS01 vaccinees comprise < 0.10% of all CD4+ T cells (1)].

      Moreover, CD4+ T cells themselves comprise approximately 45-57% of all PBMCs (2). Thus, finding an expression signal between the RTS,S vs. comparator group would require the signal to be high enough to be detected in only 0.002% of all PBMCs [0.004% (% CSP-specific CD4+ T cells out of total CD4+ T cells) x 51% (average % of CD4+ T cells out of all PBMCs) = 0.002%]. Thus, lack of detectable recall response does not mean lack of recall response. Moreover, as suggested below, we opted not to focus the rest of the manuscript on the Ag-stimulation results.

      Second of all, the PBMCs were stimulated on site for 12h and then cryopreserved. This stimulation time was chosen based on the kinetics of IFN-g and IL-2 mRNA response (3), but other responses may have had different kinetics and thus have already resolved or have not yet occurred by the 12-h cryopreservation. We have added text in the manuscript to discuss these caveats (“Another potential reason for why no BTMs were found to associate with the response to RTS,S/AS01 vaccination or with protection when analyzing CSP-stimulated PBMC is that all PBMC were stimulated on site for 12 hours (this stimulation time was chosen based on the kinetics of the IFN-g transcriptional response) and then cryopreserved. Thus, we were unable to detect earlier transient responses that had already resolved by 12 hours, as well as more delayed response that had not yet initiated by 12 hours, if such responses occurred.”.

      The authors partly achieved their aims. They identified BTMs differentially expressed between RTS,S/AS01 and the comparator vaccines, and between baseline and month 3 in RTS,S/AS01 recipients. They also identified BTMs at month 3 associated with developing malaria, and BTMs at baseline associated with developing malaria. These latter BTMs were partly replicated in the CHMI study subjects. Higher expression of BTMs associated with monocytes and dendritic cells were most consistently identified across the different analyses and their expression in stimulated baseline samples was most consistently associated with development of clinical malaria in RTS,S/AS01 recipients. However there were inconsistencies in associations between some of the studies, and it is possible that the "consistent" monocyte and dendritic cell BTMs would not be so consistent if a more stringent FDR threshold was used. However the authors conclusions are largely quite measured and for the most part they do not over-interpret the significance of their findings.

      We have added the following to the Discussion: “Finally, while it is not uncommon to use an FDR cutoff of 20% in high-dimensional immune correlates studies [e.g. (65-70)], our results should be interpreted with the requisite level of caution. However, we do note that many of our significant modules in the baseline risk analysis would have survived even lower FDR cutoffs (in many cases even a 1% cutoff), giving us a fair degree of confidence in our results. For example, of the seven monocyte-related BTMs whose baseline levels associated with risk, all would have survived a 5% FDR cut-off, and three even a 1% cut-off; likewise, of the four dendritic cell-related BTMs whose baseline levels associated with risk, all would have survived a 5% FDR cut-off, and three even a 1% cut-off.”

      Overall the work provides some evidence that baseline immunological status, particularly related to monocyte and dendritic cell responses and possibly their role in or response to baseline inflammation, may be a determinant of how well the RTS,S vaccine works to prevent malaria. This provides a basis for further work to optimise the effectiveness of the vaccine. The usefulness of PBMC stimulation to predict an individual's response to vaccination will be limited because this is not a method which can be used at scale in resource limited settings, but the concept that vaccine response could be enhanced by modifying pre-vaccine immunological or inflammatory status is potentially important. The data published with this study will be a valuable resource and will undoubtedly be used by others to address similar questions. Increasing the efficacy of malaria vaccines remains an extremely important goal, and identifying possible mechanisms which restrict the efficacy of RTS,S is important.

      References:

      1. Moncunill G, De Rosa SC, Ayestaran A, Nhabomba AJ, Mpina M, Cohen KW, Jairoce C, Rutishauser T, Campo JJ, Harezlak J, Sanz H, Diez-Padrisa N, Williams NA, Morris D, Aponte JJ, Valim C, Daubenberger C, Dobano C, McElrath MJ. RTS,S/AS01E Malaria Vaccine Induces Memory and Polyfunctional T Cell Responses in a Pediatric African Phase III Trial. Front Immunol. 2017;8:1008.
      2. Kleiveland CR. Peripheral Blood Mononuclear Cells. In: Verhoeckx K, Cotter P, López-Expósito I, Kleiveland C, Lea T, Mackie A, et al., editors. The Impact of Food Bioactives on Health: in vitro and ex vivo models. Cham: Springer International Publishing; 2015. p. 161-7.
      3. Schultz-Thater E, Frey DM, Margelli D, Raafat N, Feder-Mengus C, Spagnoli GC, Zajac P. Whole blood assessment of antigen specific cellular immune response by real time quantitative PCR: a versatile monitoring and discovery tool. J Transl Med. 2008;6:58.
      4. Kazmin D, Nakaya HI, Lee EK, Johnson MJ, van der Most R, van den Berg RA, Ballou WR, Jongert E, Wille-Reece U, Ockenhouse C, Aderem A, Zak DE, Sadoff J, Hendriks J, Wrammert J, Ahmed R, Pulendran B. Systems analysis of protective immune responses to RTS,S malaria vaccination in humans. Proc Natl Acad Sci U S A. 2017;114(9):2425-30.
      5. Liu C, Martins AJ, Lau WW, Rachmaninoff N, Chen J, Imberti L, Mostaghimi D, Fink DL, Burbelo PD, Dobbs K, Delmonte OM, Bansal N, Failla L, Sottini A, Quiros-Roldan E, Han KL, Sellers BA, Cheung F, Sparks R, Chun TW, Moir S, Lionakis MS, Consortium NC, Clinicians C, Rossi C, Su HC, Kuhns DB, Cohen JI, Notarangelo LD, Tsang JS. Time-resolved systems immunology reveals a late juncture linked to fatal COVID-19. Cell. 2021;184(7):1836-57 e22.
      6. Andersen-Nissen E, Fiore-Gartland A, Ballweber Fleming L, Carpp LN, Naidoo AF, Harper MS, Voillet V, Grunenberg N, Laher F, Innes C, Bekker LG, Kublin JG, Huang Y, Ferrari G, Tomaras GD, Gray G, Gilbert PB, McElrath MJ. Innate immune signatures to a partially-efficacious HIV vaccine predict correlates of HIV-1 infection risk. PLoS Pathog. 2021;17(3):e1009363.
      7. Lu P, Guerin DJ, Lin S, Chaudhury S, Ackerman ME, Bolton DL, Wallqvist A. Immunoprofiling Correlates of Protection Against SHIV Infection in Adjuvanted HIV-1 Pox-Protein Vaccinated Rhesus Macaques. Front Immunol. 2021;12:625030.
      8. Haynes BF, Gilbert PB, McElrath MJ, Zolla-Pazner S, Tomaras GD, Alam SM, Evans DT, Montefiori DC, Karnasuta C, Sutthent R, Liao HX, DeVico AL, Lewis GK, Williams C, Pinter A, Fong Y, Janes H, DeCamp A, Huang Y, Rao M, Billings E, Karasavvas N, Robb ML, Ngauy V, de Souza MS, Paris R, Ferrari G, Bailer RT, Soderberg KA, Andrews C, Berman PW, Frahm N, De Rosa SC, Alpert MD, Yates NL, Shen X, Koup RA, Pitisuttithum P, Kaewkungwal J, Nitayaphan S, Rerks-Ngarm S, Michael NL, Kim JH. Immune-correlates analysis of an HIV-1 vaccine efficacy trial. N Engl J Med. 2012;366(14):1275-86.
      9. Fletcher HA, Snowden MA, Landry B, Rida W, Satti I, Harris SA, Matsumiya M, Tanner R, O'Shea MK, Dheenadhayalan V, Bogardus L, Stockdale L, Marsay L, Chomka A, Harrington-Kandt R, Manjaly-Thomas ZR, Naranbhai V, Stylianou E, Darboe F, Penn-Nicholson A, Nemes E, Hatherill M, Hussey G, Mahomed H, Tameris M, McClain JB, Evans TG, Hanekom WA, Scriba TJ, McShane H. T-cell activation is an immune correlate of risk in BCG vaccinated infants. Nat Commun. 2016;7:11290.
      10. Young WC, Carpp LN, Chaudhury S, Regules JA, Bergmann-Leitner ES, Ockenhouse C, Wille-Reece U, deCamp AC, Hughes E, Mahoney C, Pallikkuth S, Pahwa S, Dennison SM, Mudrak SV, Alam SM, Seaton KE, Spreng RL, Fallon J, Michell A, Ulloa-Montoya F, Coccia M, Jongert E, Alter G, Tomaras GD, Gottardo R. Comprehensive Data Integration Approach to Assess Immune Responses and Correlates of RTS,S/AS01-Mediated Protection From Malaria Infection in Controlled Human Malaria Infection Trials. Front Big Data. 2021;4:672460.
    1. Author Response:

      Evaluation Summary:

      The authors studied the neural correlates of planning and execution of single finger presses in a 7T fMRI study focusing on primary somatosensory (S1) and motor (M1) cortices. BOLD patterns of activation/deactivation and finger-specific pattern discriminability indicate that M1 and S1 are involved not only during execution, but also during planning of single finger presses. These results contribute to a developing story that the role of primary somatosensory cortex goes beyond pure processing of tactile information and will be of interest for researchers in the field of motor control and of systems neuroscience.

      We thank all reviewers and the editor for their assessment of our paper. We acknowledge that our description of the methods and some interpretation of the results can be clarified and expanded. We address every comment and proposed suggestion in the following below.

      Reviewer #1 (Public Review):

      This is a very important study for the field, as the involvement of S1 in motor planning has never been described. The paradigm is very elegant, the methods are rigorous and the manuscript is clearly written. However, there are some concerns about the interpretation of the data that could be addressed.

      We thank Reviewer #1 for the positive evaluation of our study. We clarify our methodological choices and interpretation of the data in the following response.

      • The authors claim that planning and execution patterns are scaled version of each other, and that overt movement during planning is prevented by global deactivation. This is an interesting perspective, however the presented data are not fully convincing to support this claim:

      (1) the PCM analysis shows that correlation models ranging from 0.4 to 1 perform similarly to the best correlation model. This correlation range is wide and suggests that the correspondence between execution/planning patterns is only partial.

      The reviewer is correct that the current data leaves us with a specific amount of uncertainty. However, it should be noted that the maximum-likelihood estimates of correlations between noisy patterns are biased, as they are constrained to be smaller or equal to 1. Thus, we cannot test the hypothesis that the correlation is 1 by just comparing correlation estimates to 1 (for details on this, see our recent blog on this topic: http://www.diedrichsenlab.org/BrainDataScience/noisy_correlation/). To test this idea, we therefore use a generative approach (the PCM analysis). We find that no correlation model has a higher log-likelihood than the 1-correlation model, therefore we cannot rule out that the underlying true correlation is actually 1. In other words, we have as much evidence that the correspondence is only partial as we do that the correspondence is perfect. The ambiguity given by the wide correlation range is due to the role of measurement noise in the data and should not be interpreted as if the true correlation was lower than 1. What we can confidently conclude is that activity patterns have a substantial positive correlation between planning and execution. We take this opportunity to clarify this point in the results section.

      (2) in Fig.4 A-B, the distance between execution/planning patterns is much larger than the distance between fingers. How can such a big difference be explained if planning/execution correspond to scaled versions of the same finger-specific patterns? If the scaling is causing this difference, then different normalization steps of the patterns should have very specific effects on the observed results: 1) removing the mean value for each voxel (separately for execution and planning conditions) should nullify the scaling and the planning/execution patterns should perfectly align in a finger-specific way; 2) removing the mean pattern (separately for each finger conditions) should effectively disturb the finger-specific alignment shown in Fig.4C. These analyses would corroborate the authors' conclusion.

      The large distance between planning and execution patterns (compared to the distance between fingers) is caused by the fact that the average activity pattern associated with planning differs substantially from the average activity pattern during execution. Such a large difference is of course expected, given the substantially higher activity during execution. However, here we are testing the hypothesis that the pattern vectors that are related to a specific finger within either planning or execution are scaled version of each other. Visually, this can be seen in Figure 4B (bottom), where the MDS plot is rotated, such the line of sight is in the direction of the mean pattern difference between planning and execution—such that it disappears in the projection. Relative to the baseline mean of the data (cross), you can see that arrangement of the fingers in planning (orange) is a scaled version of the arrangement during execution (blue). The PCM model provides a likelihood-based test for this idea. The model accounts for the overall difference between planning and execution by including (and estimating) model terms related to the mean pattern of planning and execution, respectively, therefore effectively removing the mean activation of planning and execution. We have now explained this better in the results and methods sections, also referring to a Jupyter notebook example of the correlation model used (https://pcm-toolbox-python.readthedocs.io/en/latest/demos/demo_correlation.html).

      Regarding your analysis suggestions, removing the mean pattern for planning and execution across fingers as a fixed effect (suggestion 1) leads to the distance structure shown in Fig 4B (bottom)—showing that the finger-specific patterns during planning are scaled versions of those during execution (also see Fig. R1 below). On the other hand, subtracting the mean finger pattern across planning and execution (suggestion 2) will not fully remove the finger specific activation as the finger-specific patterns are differently scaled in planning and execution. Furthermore, neither of these subtraction analyses allows for a formal test of the hypotheses that the data can be explained by a pure scaling of the finger-specific patterns.

      Figure R1. RDM of left S1 activity patterns evoked by the three fingers (1, 3, 5) during no-go planning (orange) and execution (blue) after removing the mean pattern across fingers (separately for planning and execution). The bottom shows the corresponding multidimensional scaling (MDS) projection of the first two principal components. Black cross denotes mean pattern across conditions.

      • A conceptual concern is related to the task used by the authors. During the planning phase, as a baseline task, participants are asked to maintain a low and constant force for all the fingers. This condition is not trivial and can even be considered a motor task itself. Therefore, the planning/execution of the baseline task might interfere with the planning/execution of the finger press task. Even more controversial, the design of the motor task might be capturing transitions between different motor tasks (force on all finger towards single-finger press) rather than pure planning/execution of a single task. The authors claim that the baseline task was used to control for involuntary movements, however, EMG recordings could have similarly controlled for this aspect, without any confounds.

      Participants received training the day before scanning, which made the “additional” motor task very easy, almost trivial. In fact, the system was calibrated so that the natural weight of the hand on the keys was enough to bring the finger forces within the correct range to be maintained. Thus, very little planning/online control was required by the participants before pressing the keys. As for the concern of capturing transitions between different motor tasks, that it is indeed always the case in natural behavior. Arguably there is no such thing as “pure rest” in the motor system, active effort has to be made even to maintain posture. Furthermore, if the motor system considers the hold phase as a simultaneous movement phase, it should have prevented M1 and S1 to participate in the planning of upcoming movements, as it would be busy with maintaining and controlling the pre-activation. Having found clear planning related signals in M1 and S1 in this situation makes our argument, if anything, stronger.

      Finally, we specifically chose not to do EMG recordings because finger forces are a more sensitive measure of micro movements than EMG. Extensive pilot experiments for our papers studying ipsilateral representations and mirroring (e.g., Diedrichsen et al., 2012; Ejaz et al., 2018) have shown that we can pick up very subtle activations of hand muscles by measuring forces of a pre-activated hand, signals that clearly escape detection when recording EMG in the relaxed state. Based on these results, we actually consider the recording of EMG during the relaxed state as an insufficient control for the absence of cortical-spinal drive onto hand muscles. This is especially a concern when recording EMG during scanning, due to the decreased signal-to-noise ratio.

      • In Fig.2F, the authors show no-planning related information in high-order areas (PMd, aSPL), while such information is found in M1 and S1. This null result from premotor and parietal areas is rather surprising, considering current literature, largely cited by the authors, pointing to high-order motor or parietal areas involved in action planning.

      We agree with the reviewer that, to some extent, the lack of involvement of high-order areas in planning is surprising. However, we believe that task difficulty (i.e., planning demands) plays a role in the amount of observed planning activation. In other words, because participants were only asked to plan repeated movements of one finger, there was little to plan. The fact that this may have contributed to the null result in premotor and parietal areas was further confirmed by the second half of the dataset, which is not reported in the current paper. Here, we investigated the planning of multi-finger sequences, where planning demands are certainly higher. We found that high-order areas such as PMd and SPL were indeed active and involved in the planning of those, as expected. We decided to split the dataset across two publications as the multi-finger sequences have their own complexities, which would have distracted from the main finding of planning related activity in M1 and S1.

      Reviewer #3 (Public Review):

      I found the manuscript to be well written and the study very interesting. There are, however, some analytical concerns that in part arise because of a lack of clarity in describing the analyses.

      1) Some details regarding the methods used and results in the figures were missing or difficult to understand based on the brief description in the Methods section or figure legend.

      We thank Reviewer #3 for pointing out some lack of clarity in our description of the methods. We now expanded both the methods section and the figure captions (Fig. 2-3-4).

      2) I think the manuscript would benefit from a more balanced description on the role of S1. As the authors state, S1 is traditionally thought to process afferent tactile and proprioceptive input. However, in the past years, S1 has been shown to be somatopically activated during touch observation, attempted movements in the absence of afferent tactile inputs, and through attentional shifts (Kikkert et al., 2021; Kuehn et al., 2014; Puckett et al., 2017; Wesselink et al., 2019). Furthermore, S1 is heavily interconnected with M1, so perhaps if such activity patterns are present in M1, they could also be expected in S1?

      To better characterize the role of S1 during movement planning, we now include recent research showing that S1 can be somatotopically recruited even in the absence of tactile inputs.

      3) Related to the previous comment: If attentional shifts on fingers can activate S1 somatotopically, could this potentially explain the results? Perhaps the participants were attending to the fingers that were cued to be moved and this would have led to the observed activity patterns. I don't think the data of the current study allows the authors to tease apart these potential contributions. It is likely that both processes contribute simultaneously.

      We agree that our results could also be explained by attentional shifts on the fingers. It is very likely that, during planning, participants were specifically focusing on the cued finger. However, as the reviewer points out, our current dataset cannot distinguish between planning and attention as voluntary planning requires attention. We expanded the discussion section to include this possibility.

      4) The authors repeatedly interpret the absences of significant differences as indicating that the tested entities are the same. This cannot be concluded based on results of frequentist statistical testing. If the authors would like to make such claims, then they I think they should include Bayesian analysis to investigate the level of support for the null hypothesis.

      We have now clarified the parts in the manuscript that sounded as if we were interpreting the absence of significant difference (null results) as significant absence of differences (equivalence).

    1. Karnofsky suggests that the cost/benefit ratio of how we typically think of reading may not be as simple as we intuitively expect i.e. we think that 'more time' = 'more understanding'.

      If you're simply reading to inform yourself about a topic, it may be worth reading a couple of book reviews, and listening to an interview or two, rather than invest the significant amount of time necessary to really engage with the book.

      A few hours of skimming and reviews/interviews may get you to 25% understanding and retention, which in many cases may be more than enough for your needs of being basically informed on the topic. Compared to the 50 - 100 hours necessary for a deep, analytical engagement with the text, that would only get you to 50% understanding and retention.

      That being said, if your goal is to develop expertise, both Karnofsky and Adler ('How to read a book') suggest that you need a deep engagement with multiple texts.

    1. Author Response:

      Reviewer #1 (Public Review):

      The study aims to investigate the role of A11 neurons in courtship behavior and vocalizations. In particular, the authors determine the inputs/outpus of A11 neurons and uncover that the outputs are both dopamine and glutamate positive. They then lesion A11 cell bodies and terminals in the songbird song-motor nucleus HVC and find that these lesions affect song production, especially, though not exclusively, of courtship song. They also measure the location and movement of lesioned birds and find that birds with lesions of A11 cell bodies show less engagement with a female. Finally, they use fiber photometry to study the activity of A11 terminals in HVC during singing. While this is an interesting question supported by novel data, and I appreciate the diverse and creative approaches employed in this study, the role of A11 in courtship behavior appears complicated and does not easily fit into the framework proposed by the authors. In particular, the authors argue that A11 is important for coordinating innate and learned aspects of courtship, however, their data fall short of supporting this idea.

      Strengths This is an impressive data set with considerable attention to detail.

      The tracing and histology data identify some novel connections not previously described in songbirds as well as the potential of A11 neurons to co-release of glutamate and dopamine.

      Photometry provides real-time monitoring of A11 and HVC neuron activity during singing.

      In principle, targeting both HVC terminals and A11 cell bodies has the potential to lend insight into the role of HVC terminals vs. the role of projections to other areas (see below for caveats).

      We appreciate the reviewer’s efforts and attention in evaluating our manuscript. We are grateful that the reviewer recognizes strengths in our study, which we agree provides novel insights into the brain circuits that enable a fully integrated courtship display comprising learned and innate behaviors.

      Weaknesses 1) While I find the overall question and the data interesting, I am not convinced that they demonstrate that A11 is important for "coordinating innate and learned aspects of courtship". In general, birds with A11 lesions appear less motivated to perform female-directed song, however, it's not clear that this is a consequence of a lack of coordination between innate/learned aspects of behavior. Rather, perhaps A11 neurons are important to instigate or drive courtship behavior, or to relay signals from the POA or other regions important for courtship. Because the lesions abolish behavior, it is difficult to discern the role of these neurons in courtship.

      We agree that discerning the precise role of A11 is tricky. It could be acting to gate a (motivational) drive from another source, providing a primary source of this drive, and/or performing a more intricate role in coordinating the various aspects of the courtship display. The reviewer is correct that the current experiments do not allow us to clearly distinguish between these possibilities, and we have revised the manuscript accordingly, first by replacing “coordinate” with “gate” in the title and introduction and including a more thorough treatment of gating and other possible roles for A11 in lines 258-262 of the discussion. That said, we lean towards the latter possibility - a coordinating role for A11 - because of its location immediately proximal to regions that drive learned (HVC) and innate (ICo, RPgc) aspects of behavior and because A11 neurons can contain synthetic enzymes for a fast acting neurotransmitter (glutamate) in addition to DA. But, ultimately, we acknowledge that future experiments are needed to more completely answer this question.

      In addition, I disagree with the innate vs. learned distinction as recent data indicate that introductory notes, which the authors treat as innate, are actually learned (e.g. Kalra et al., 2021). Further, there is also no quantification of the effects of lesions on female-directed calls and little analysis of the activity during call production. This would seem to further complicate the overall interpretation. Overall, it's difficult to make sense of how A11 activity relates to vocalizations, especially given the innate/learned framework that they focus on.

      We thank the reviewers for drawing our attention to the recent Kalra 2021 paper, which we now cite while also making sure to emphasize that introductory notes may have learned features (lines 194-195 and 278-279). However, even that recent study concluded that males raised without a tutor or tutored on recorded songs that lack introductory notes altogether still developed songs that include introductory notes. Nonetheless, we include citation of this recent study and qualify our characterization of introductory notes as being shaped by innate predispositions and experience. Furthermore, we conducted additional analyses to quantify female-directed calling before and after 6-OHDA lesions in either HVC or A11 (results can be found in lines 164-165 and Figure 4C). In line with the divergent effects of these two types of lesions on the production of introductory notes, lesions in HVC did not affect female-directed calling whereas lesions in A11 largely abolished these vocalizations. While we acknowledge that the fiber photometry data on female-directed calling was limited, it nonetheless reinforces the conclusion that A11 transmits information to HVC about innate vocalizations, and it also transmits information to HVC about introductory notes. Along with the loss of introductory note production following A11 lesions, we do believe that our findings support the idea that A11’s role is essential to female-directed vocalizations generally, regardless of whether they are learned or innate, and of of somehow enabling the transition from production of female-directed calling and introductory notes to motifs. We have done our best to draw out these points in the revised discussion.

      2) The HVC lesions appear to create damage/necrosis (Fig 3-suppl 2) and this raises the question of the degree to which the HVC lesion effects are the result of dopamine/glutamate depletion or local damage. In particular, it is surprising that syllable structure and stereotypy show such a dramatic breakdown with HVC A11 input lesions and effectively no change with lesions of the cell bodies, even though both treatments lead to effectively similar reductions in song production.

      We appreciate that 6-OHDA lesions are not highly specific and can introduce unwanted effects on non-TH+ cells and processes. To further quantify the effects of 6-OHDA lesions on HVC cells, we conducted additional 6-OHDA injections in HVC and TUNEL staining studies in addition to the preliminary efforts we had made in the original manuscript. Quantification of these data confirmed our original impression that 6-OHDA treatment in HVC increased HVC cell death (these data are shown in Figure 3-figure supplement 2J, K). To further address this issue, we also added an analysis of song structure when D1 receptor blockers were dialyzed into HVC. No changes in song morphology were detected, similar to the lack of effects on song morphology following A11 cell body lesions (Figure 3 - figure supplement 3). Taken together, these additional experiments and analyses indicate that the changes in song morphology following 6-OHDA treatment in HVC may arise from local damage to HVC cell bodies. In contrast, the reduction in singing following A11 terminal or cell body lesions is likely to reflect diminished DA signaling from A11. However, as the reviewer notes, our primary finding is the differential effects on female-directed singing, and the distinction between more purely singing-related effects following 6-OHDA treatment in HVC and a broad effect on all courtship behaviors following 6-OHDA treatment in A11.

      3) If the idea is that A11 is important for coordinating innate and learned movements, it seems that a detailed analysis of the movements would be important. As is, the movement data provide further support of a decrease in either the motivation or ability to perform female-directed song, but they do not speak to a more specific role for A11 in coordinating innate and learned movements.

      We maintain that we did provide a detailed analysis of a number of important nonsong behaviors, including changes in head orientation and translational movements that the male makes towards the female, both of which are major appetitive features of courtship in songbirds and other vertebrates. We also appreciate that these analyses do not allow us to say much about precisely how movements are being coordinated during courtship, and we have changed language throughout the manuscript to emphasize a gating rather than coordinating role for A11. Furthermore, in response to the reviewer’s concern, we performed additional analyses of the male’s movements during courtship, including beak wipes, vertical changes in posture (“standing tall”), which are finer components of female-directed displays. Notably, this new analysis reveals that all of these behavioral components are abolished by A11 cell body lesions, but not by A11 terminal lesions in HVC (lines 168-190 and Figure 4I, J). We appreciate the reviewer’s suggestion, as we believe these additional analyses strengthen our core finding, namely that A11 functions as a hub to gate, recruit and possibly coordinate innate and learned movements to generate a complete courtship display. These different roles are more fully considered in the revised discussion (lines 256-262).

      Reviewer #2 (Public Review):

      Ben-Tov et al. investigate function of midbrain region A11 and provide evidence that it plays a role in promoting and coordinating a variety of motor responses to sexually or socially salient stimuli. They show lesions of A11 cell bodies abolish female directed calling, orienting and singing, while lesions of terminals in the song premotor nucleus HVC prevent female directed singing, but leave female directed calling and orienting intact. Together with anatomical data indicating projections from A11 to multiple downstream targets associated with song (HVC), calling (DM/ICO) and locomotion, these data support the authors' idea that A11 forms a 'hub' that drives and 'coordinates' multiple different aspects of behavioral responses to social (here female/sexual) stimuli. The results are intriguing and begin to reveal how a single social context can elicit and coordinate multiple coordinated responses. However, as outlined below, I think that some of the specific stronger claims would benefit from additional data, discussion or moderation.

      The authors also provide compelling support for the idea that A11 plays a differential role in female-directed versus undirected song. This is especially underpinned by the observations that 1) A11 afferent activity in HVC appears to differ between directed and undirected signing, with increases in activity preceding song motifs only during directed song, and 2) lesions of A11 cell bodies or inputs to HVC have a dramatic suppressive effect on directed singing, but can leave undirected song largely unchanged. These observations that A11 differentially contributes to socially elicited versus spontaneous singing seem especially interesting and merit further highlighting and discussion as one of the especially striking aspects of the study that seems distinct from the thesis of a role in coordinating learned and unlearned behaviors.

      We appreciate the reviewer’s efforts and attention in evaluating our manuscript. We are grateful that the reviewer recognizes strengths in our study, which we agree provides novel insights into the differential contribution of A11 to socially elicited versus spontaneous singing. We also agree that this point should be highlighted and we expanded our treatment of this point in the discussion section of the revised manuscript (lines 296-309).

      Specific comments

      A central idea around which the results are discussed is that A11 plays a particular role in coordinating learned versus innate behaviors. I have several questions around this thesis where further guidance from the authors about both technical points and interpretation would be helpful.

      First is the question of how specific are the manipulations and conclusions to A11 itself versus other neighboring midbrain dopaminergic regions within which it is embedded. The authors show histology of lesions, injection sites and retrograde labelling in supplementary figures, but do not provide enough guidance for me to understand the strength of the argument that manipulations are restricted to A11 and/or its afferents. Can the boundaries between A11 and neighboring regions be better demarcated? What are the neighboring regions to which there might have been spillover? For lesions of A11 axons within HVC, wouldn't 6-OHDA also damage any other dopaminergic afferent to HVC, including those coming from regions such as VTA? Some discussion of these and related points regarding the specificity of manipulations to A11 would be helpful, especially in light of the literature that points to potential roles of neighboring dopaminergic regions in contributing to motivated behaviors and song more specifically.

      We appreciate that the definition and boundaries of A11 might be confusing. We demarcated A11 and neighboring regions in the relevant figures to better define A11’s boundaries. The reviewer is correct in surmising that the VTA is fairly close to A11 and hence a reasonable concern is that 6-OHDA treatment in A11 could spill over to the VTA and possibly the SNc. To address the concern that 6-OHDA lesions in HVC might cause cell damage to other DA sources to HVC, we quantified the number of VTA/SNc cells following HVC DA lesions. This additional analysis, provided in Figure 3-figure supplement 1D-F, shows that the number of VTA/SNc cells following 6-OHDA injections into either A11 or HVC is comparable to that of intact birds. These additional analyses support the conclusion that the behavioral deficits that emerge following 6-OHDA treatments reflect damage to A11 or A11 terminals in HVC.

      These points also relate to the general question of what is meant by A11 being a 'hub for coordination of learned and innate courtship behaviors'. Ultimately, it seems likely that many regions must work together to orchestrate these behaviors, and it is not clear from the present results how much I should view A11 as having a more specific role than other neighboring dopaminergic regions (or hypothalamic regions such as POA) that are interconnected and seem likely to also play critical roles. As the authors note, many of the relevant structures, including A11 and song system structures, are recurrently connected, further complicating interpretation of any one area as a hub. In this respect, I am not sure how much the authors are intending to argue that A11 is both necessary and sufficient for driving each of the studied behaviors in a courtship context, and it would be helpful to discuss this more specifically - does 'coordination' as used here imply that A11 is capable of triggering these behaviors - an interesting possibility raised by the current results but that does not yet seem to be demonstrated - or something else?

      As we noted in our response to a similar point made by the first reviewer, we agree that discerning the precise role of A11 is tricky. As we commented in that earlier response, A11 could gating a (motivational) drive from another source, providing a primary source of this drive, and/or performing a more intricate role in coordinating the various aspects of the courtship display. We agree that the current experiments do not allow us to make a clear distinction between these possibilities, and we have revised the manuscript accordingly, including a more thorough treatment of these various roles for A11 in the discussion (lines 256-262). That said, we lean towards the latter possibility - a coordinating role for A11 - because of its location immediately proximal to regions that drive learned (HVC) and innate (ICo, RPgc) aspects of behavior and because A11 neurons can release a fast acting neurotransmitter (glutamate) in addition to DA. But, ultimately, we acknowledge that future experiments are needed to more completely answer this question. In the revised manuscript, we emphasize a gating role for A11 in the title and introduction, and then in the discussion expand to encompass the possibility of a coordinating or timing role for A11.

      One additional question regarding the framework for interpreting the function of A11 as coordinating 'learned and innate' courtship behaviors, is for some further clarification and citations regarding what is learned versus innate, especially as it relates to song. The authors characterize introductory notes as 'innate', but previous work from Rajan and colleagues has demonstrated that aspects of introductory notes including acoustic structure and patterning are influenced by learning, and I am not sure what the literature says about orienting and calling to females.

      We thank the reviewer for drawing our attention to this recent study from the Rajan group which indeed concluded that some aspects of the introductory notes are learned. We also note that this study showed that juvenile males tutored on song playbacks that lacked introductory notes or that were raised without a tutor still produced introductory notes. Nonetheless, we include a citation of this recent study and qualify our characterization of introductory notes as being shaped by innate predispositions as well as through experience and learning (lines 194-195 and 278-279). Furthermore, our original analyses of birds with 6-OHDA treatment in HVC revealed that introductory note morphology was unchanged, whereas syllable morphology was degraded. Therefore, even if certain features of introductory notes are influenced by tutor experience, they apparently do not depend on HVC in the same manner as do the learned syllables in the motif. Lastly, we conducted additional analyses to quantify female-directed calling and other movements, before and after 6-OHDA lesions in either HVC or A11. In line with the divergent effects of 6-OHDA treatment in these two regions on the production of introductory notes, lesions in HVC did not affect female-directed calling, beak wipes or changes in male’s posture, whereas lesions in A11 largely abolished all of these behaviors (Figure 4C, I, J). While we agree with the reviewer that a distinction between innate and learned behaviors may not be straightforward, the more fundamental observation is that we can dissociate different aspects of the courtship display and that A11 is situated in a position to drive, gate or coordinate a unified display that involves a variety of learned and innate vocal and non-vocal movements.

      I also would find it helpful to have some further clarification in this context about what it means to coordinate learned and innate aspects of song. The authors indicate that undirected song is largely unaffected by A11 lesions while directed song is largely eliminated, leaving only innate calls or introductory notes. I think it would be helpful to see here a more complete characterization of the nature of vocalizations that remain following A11 lesions in the female directed context. While I understand that no recognizable 'learned motifs' are produced, it is unclear from the example that is shown how much the residual vocalizations should be construed as 'severely disrupted songs' versus strings of calls that resemble innate calls that were present prior to lesions, versus 'normal' patterns of introductory notes that resemble in acoustic structure what the birds produced prior to lesions, but that never proceed to song motifs, etc. A better understanding of the nature of these residual vocalizations might also help to interpret what A11 is doing. Do these birds seem motivated to 'sing' in terms of their posture? Do the authors think that HVC is engaged or that the same residual vocalizations would be produced in a bird that had HVC lesions? How do the authors interpret these data in terms of how learned and unlearned vocalizations are normally coordinated in the context of directed singing?

      We performed additional analyses of the male’s vocalizations and movements during courtship, including female-directed calls, beak wipes, vertical changes in posture (“standing tall”), all of which are components of female-directed courtship displays. Notably, this new analysis reveals that all of these behavioral components are abolished by A11 cell body lesions, but not by A11 terminal lesions in HVC (lines 168-190 and Figure 4C, I, J). Along with our prior report that males with A11 cell body lesions do not sing female-directed motifs, the additional analysis indicates that these males produce little or no female-directed vocalizations or non-vocal behaviors of any kind.

      We previously reported that males with A11 terminal lesions produced only introductory notes but not motifs but realize that this observation would benefit from more quantification. As noted in the previous response, we previously established that introductory note morphology was unchanged by 6-OHDA treatment in HVC (Figure 4 - figure supplement 1A-D). To extend this analysis further in this revised manuscript, we built on the observation that males with 6-OHDA treatment in HVC produce only introductory notes to females, with no song motif, whereas they produce a series of introductory notes followed by motifs comprising distorted syllables when alone (Figure 3K, Figure 3 - figure supplement 2, Figure 4B). To confirm that the directed introductory notes and undirected syllables were indeed distinct vocalizations, we computed their durations and spectral similarity scores (using Sound Analysis Pro). The introductory notes produced during directed conditions differed markedly in their durations from distorted syllables produced during undirected conditions, and these two types of vocalizations had very low similarity scores, indicating that they were cleanly separable vocal behaviors (Figure 4 - figure supplement E, F). Given that introductory notes are unchanged by 6-OHDA treatment in HVC, these analyses support the idea that males treated in this manner can still produce motifs, albeit distorted ones, when alone but not when in the company of a female.

      These questions relate in part to that of how much is the trigger to sing eliminated by A11 afferent lesions versus the ability to produce the relevant song output? It seems like there may still be a trigger to sing - short latency vocal response to female - but inability to produce motif. One point that may be interesting to note in this regard is that this seems somewhat opposite of observations made in other contexts about the effects of directed versus undirected context on song - for example, juveniles can produce better song when it is directed (Kojima), and deafened birds that are beginning to exhibit song deterioration can exhibit normalization of song structure during directed conditions (Nordeen).

      We agree with the reviewer’s point that birds with 6-OHDA lesions in HVC may still be triggered to sing, but are unable to produce a motif, given that they still produce introductory notes and seem to have the right posture, orientation and proximity to the female. We appreciate the reviewer’s comment regarding changes in song that can be elicited by females in either juvenile males or adult males that are deaf, although these additional contexts fall outside of the current study, which focused on adult male finches with normal hearing.

      Reviewer #3 (Public Review):

      The authors use a combination of quantitative acoustic and other behavioral analyses to evaluate the role of the midbrain dopaminergic area A11 in the production of female-directed song in adult male zebra finches. They show that female-directed courtship displays, which consist of song and the production of female-directed displacement behaviors, are dependent on A11 because targeted chemical lesions of this structure, using 6-hydroxydopamine (6-OHDA), permanently (i.e. for at least several months) eliminate both the vocal and non-vocal elements of this behavior. Destruction of A11 axons that directly target HVC, by administering 6-OHDA into HVC, only eliminates female-directed singing without causing any change in the other observed female-directed behaviors. Because these same lesions only temporarily (5-10 days) abolish undirected song, these findings suggest that A11 is not directly involved in song production but acts instead as a gate for the production of female directed courtship behaviors. The authors follow these lesion studies with fiber photometry-based calcium imaging of A11 axons that target HVC to show that A11 activation patterns precede activity in HVC during female-directed singing and that calcium elevation is primarily elevated during the production of the many introductory notes (a component of song that is primarily observed during female-directed singing) that precede the production of the learned song motif. These findings suggest that A11 inputs to HVC likely play a role in triggering and/or activating HVC to synchronize the production of introductory notes (which are likely produced by midbrain circuits) with the learned song component that immediately follows them. In contrast, activation of A11 axons during undirected song (which contain few to no introductory notes) do not precede HVC activation patterns. Consistent with the rapid transmission of A11 neurons, the authors also confirm, as has been suggested for A11 in mammals, that A11 dopaminergic neurons co-release glutamate.

      The findings of this study are of significant interest to our understanding of the neural mechanisms by which these complex behaviors are synchronized and open up a new way of thinking about how learned behavioral motifs can be synchronized with non-learned (e.g. female displacement behavior) behaviors. The study is rigorous, with many different experimental approaches being used to examine the proposed hypotheses, and the findings are convincing. Particularly impressive is the complete elimination of female-directed courtship behaviors following targeted elimination of A11. The primary weaknesses of the manuscript lie (1) in the way they present their anatomical findings and (2) how the authors discuss their findings in the discussion. In the discussion, which is very short (~750 words), the authors miss the opportunity to draw parallels with similar studies in drosophila (they only provide a cursory statement with a few references). In the discussion, the authors propose a model that seems quite oversimplified and lacks, in fact, many of the anatomical connectivity that they show in the first part of their study (for example A11 is only shown having a unidirectional connection to ICo/DM when in fact the connections are bidirectional). The model is also presented in simple hierarchical fashion with many connections omitted. Perhaps these omissions were made to simplify the model but in my opinion such simplification possibly misrepresents the actual mechanisms involved in the coordinated control of courtship song.

      We thank the reviewer for their careful reading of the manuscript and his supportive and constructive comments. We agree that the loss of all female-directed behaviors (which we now extend to female-directed calling and other non-vocal behaviors, such as beakwipes and postural changes) following A11 cell body lesions is especially intriguing. Further, the different effects of A11 cell body lesions and A11 terminal lesions in HVC, along with the connectivity of A11, indicate that A11 acts via a range of downstream sites to gate these various female-directed behaviors. We have done our best to address the two primary weaknesses identified by the reviewer. First, we have done our best to provide a more detailed accounting of the anatomical findings. Second, we have expanded the discussion to address parallels with other studies, as in the fly, and to provide a more nuanced and complete consideration of how A11 may function to facilitate male courtship behaviors.

    1. Author Response:

      Public Review:

      This manuscript from Pacheco-Moreno et al. compares the microbiome of potato fields with and without irrigation. Irrigation is known to control potato scab caused by Streptomyces scabies and the authors hypothesized that changes in the microbiome may contribute to disease suppression after irrigation. Using 16S rRNA sequencing, they identified a number of taxa, including Pseudomonas that are enriched after irrigation. They went on to isolate and sequence the genomes of many Pseudomonas strains. By correlating the ability of Pseudomonas to suppress Streptomyces growth in vitro with genomic data, the authors identified a novel group of cyclic lipopeptides (CLPs) that can inhibit Streptomyces in vitro and in planta.

      This work provides a substantial contribution that advances our understanding of disease suppressive soil mechanisms. It is novel in scope in that it focuses on suppression of a bacterial pathogen, while many prior studies focus on suppression of fungal pathogens. Additionally, the large-scaled comparative genomics is a useful resource, and the identification of CLPs that inhibit Streptomyces is novel. Importantly, the authors provide in planta data to show role a for CLPs in disease suppression in vivo. The manuscript is well written and the data are well presented. The analyses are quite thorough and I appreciate the extensive use of genetics and metabolomics to support the genomic predictions. The main weakness is a lack of data the conclusively links the change in microbiome function to disease suppression after irrigation in the field. However, I think the data they've presented, combined with those in the drought literature, might suggest that an increase in total Pseudomonas (and the corresponding disease-suppressive genes) in well-watered soil might contribute to suppression, rather than a change in function of Pseudomonas.

      While the reviewer is correct that we cannot conclusively link disease suppression to a change in microbiome function after irrigation, we are confident that our results demonstrate a real and repeatable phenomenon that must be considered in future studies of soil scab suppression. Independent field experiments conducted two years apart both show a decrease in the proportion of suppressive pseudomonads associated with potato roots. The first experiment (Figures 1 & 2) contained too few sequenced isolates to draw statistically robust conclusions, therefore we designed the second experiment (Figure 8) to investigate this phenomenon further. This experiment showed highly significant differences in the proportion of suppressive isolates on irrigated and non-irrigated roots. The alternative hypothesis presented by the reviewer; that relative Pseudomonas and Streptomyces abundance are affected by irrigation and this may be a factor in scab suppression, is also a valid possibility, although relatively small abundance changes were observed in the data reported in Figure 1. We have amended the discussion to include this as an alternative explanation for our results.

    1. Author Response:

      Reviewer #1:

      For this manuscript, I focused on the metabolite analysis. The data is presented as supporting a common response based on shared selective histories if I'm understanding properly. However, primary metabolite data is hard to interpret in the same fashion as genetic data. This arises because of the high degree of pleiotropy wherein it is very hard to find a mutant or variant that doesn't alter primary metabolism. As such, it is possible that there is a common response less because of shared history and more because there is constraining selection that shapes what is the optimal primary metabolite response to cold in photosynthetic organisms. For example, in Arabidopsis, it has been found that accessions tend to have a highly similar primary metabolism but when they are crossed, the progeny have a vastly wider array of primary metabolism phenotypes, suggesting that the similarity in accessions is not shared genetics but constraining selection that forces compensatory variants. I don't think this detracts from the utility of including the primary metabolism but it would help to have more clarity in the strengths and weaknesses in using metabolite data to track theories and arguments that are largely genetic based.

      We fully agree with the reviewer. The idea of constraining selection is at least as interesting as our explanation, and should be in the forefront. Given this interesting idea of compensatory mutations that are private to each accession (or ‘lineage’ or ‘line’), in principle this idea also hints towards the parallel/convergent evolution (‘constraining selection’ in the reviewer’s words) of this important trait or trait complex. We re-phrased this within the manuscript and considered this comment seriously throughout. We also incorporate into our manuscript this interesting compensatory variant notion and metabolic network pleiotropy.

      One difference we would like to highlight still is that in our study (compared to Arabidopsis thaliana studies) we are comparing across many different species, ploidal levels, and varying species-level evolutionary histories. This makes our experiment different from Arabidopsis thaliana ecotype experiments and crossings; but indeed the reviewer is fully right that our results may also follow a similar evolutionary path as for Arabidopsis thaliana.

      Reviewer #2:

      Cochlearia, and other species that have rapidly evolved new ecological niches, represent excellent systems to study adaptation to past, present, future and changing environments. Furthermore, reticulate evolution within such groups offers a natural experiment to test hypotheses about the roles of hybridization, introgression, etc. on evolutionary dynamics, including pre-adaptation. However, there are also several significant challenges to using such systems, most crucially separating adaptation as the causal mechanism from the wide array of non-adaptive processes that could also cause the observed patterns. Overall, Wolf and colleagues do a nice job describing this complex taxonomic system and provide multiple lines of inquiry into how observed patterns may align with various adaptive scenarios. Despite the strong descriptive framework, I had trouble understanding exactly how causality could be assigned. Thus, the interpretation and discussion of the results felt speculative.

      Thank you for the encouraging comments. Yes, we agree: the points towards an important aspect of this kind of phylogenetic-systematic-evolutionary research, namely demonstrating causality. Honestly speaking, in such studies we are not able to show causality in its strict sense, and we think that the reviewer wants to claim this without using quite so strong wording. We considered this while re-phrasing respective paragraphs and also town down some speculative conclusion.

      Reviewer #3:

      There has been intense interest in how plants have responded during periods of rapid climate change in the past. Understanding these responses can increase our understanding of how plants might respond to rapidly accelerating anthropogenic climate shifts and help set conservation priorities. Many paleoecological studies have provided insight on how plants have migrated and persisted in suitable climate refugia (i.e. pockets of suitable habitat that exist even if regional climate is unfavorable for the persistence of a species) throughout glacial cycles, however there has been considerably less work that details the evolutionary dynamics of plants during these periods. This piece provides timely and valuable analyses illustrating the potential influence of pronounced climate change on the evolutionary dynamics of the genus Cochlearia.

      Thank you for the encouraging comments.

      The authors' use of cytogenetic analyses, organellar phylogenies, and demographic modeling allows for insights into the geographic patterns of diversity, speciation rates, and postglacial expansion scenarios of Cochlearia. Drawing unique conclusions from these different lines of evidence provides new understandings into the putative role of Pleistocene glacial cycles in driving evolutionary processes such as speciation. The study also aims to provide insight into the origins of the stated putative cold tolerance exhibited by Cochlearia by using a metabolomics approach; however, the framing and use of a single related outgroup (sister genus Ionopsidium) obfuscate the link between the results and stated conclusions.

      We appreciate this point, but indeed there is no other outgroup to be used. In this study we included all (both) genera with most of its species of tribe Cochlearieae. Within a family- wide phylogenetic context this tribe is placed along a polytomy (together with not well resolved other tribes) and stem group age of Cochlearieae is of appr. 18.9 million years ago (Walden et al., 2020). Therefore, for our research question additional outgroups from other tribes will not contribute any further information, because more basal splits are then nearly 20 million years ago (Early Miocene) with no biogeographic and environmentally defined scenarios that can be compared. 16-23 million years ago most tribes of evolutionary lineage II underwent an early radiation with highest net diversification rates (Walden et al. 2020) during this time. We included some of this information into the introduction.

      Specifically, regarding the approach that resulted in figure 4 which encompassed the metabolomics and related analyses, the initial climate groupings into 'climate ecotypes' would benefit from clarification and consideration of assignment methods. Typically, using the term ecotype invokes the idea of distinct forms of a species with phenotypic differences adapted to local conditions rather than groupings to those under broad climate regimes. While grouping populations according to climate origin can be useful, it is not clear how the final 9 WorldClim bioclimatic variables were selected (e.g. it is not apparent how importance of or correlations between climate variables, etc. were considered). Consequently, knowing this information would help understand the patterns in figure 4b, which seems to indicate that geographically distant populations experience very similar climate conditions (understanding that similarities can exist but variable selection can greatly influence these patterns).

      Thanks for this reminder to explaining selection and analyses of BioClim variables.

      As for the term “ecotype”: In plant taxonomy ecotypes are often referred to on subspecies level, in particular if environmental conditions are extremely different (e.g. heavy metal contaminated versus not-contaminated soils) and often these subspecies do not significantly differ in morphology (Noccaea caerulescens, Minuartia verna). In Cochlearia morphology is at best a morphospace which is more or less shared between all species in different ways. Species definition and taxonomy is based on a combination of largely overlapping morphospace, cytotype, ecotype and habitat types (bedrock; arctic, lowland to alpine; soil type and salt, life cycle) and distribution – often sole morphology is a bad species predictor (morphologically cryptic species – this is well-known also for some other arctic species such from the genus Draba). But the reviewer is fully right, that using the term ecotype here is somehow misleading. Our idea was to highlight that groups of taxa are combined by bioclimatic variables (and biomes or habitat types) while spanning the entire species/ecotype space of the genus – and this grouping follows also evolutionary meaningful cluster. We clarified this.

      As for selection of BioClim variables: we agree, indeed selection might have appeared arbitrary to the reader. Our original selection followed our field and cultivation experiences. However, structuring into four clusters as originally shown with the first submission is robust also when including all 19 BioClim variables. The same four cluster are retained in PCA, when temperature related BioClim variables are used only.

      Therefore, we added a Principal Component Analysis as starting point for Bioclim variable selection, secondly we added a PCA using temperature related BioClim variables 1-11 only. Built upon this we added a sentence why our nine selected variables were used to highlight the four groups in Fig. 4. The two PCA scree plots (including vector data) plus the correlation matrix and the results of a KMO test (Kaiser-Meyer-Olkin test: testing significant difference between the correlation matrix of variables and an identity matrix) are additionally provided with the Suppl. Material.

      The other concern is in regards to the framing and interpretation of these results. For instance, in the results (lines 329-330) and discussion (lines 419-423), the impression is given that experimental results here match those found in plants belonging to a different genus (i.e. Arabidopsis). However, rather than attributing this to more generally conserved mechanisms in response to considerable cold stress, the authors relate this to the unique history of Cochlearia (and its relationship to the drought adapted sister genus). The authors also note that surprisingly there was no demarcation of cold responses between the climate-defined groupings. Detailing why this is surprising given some of the other conclusion statements would be helpful. Some targeted revision to strengthen this link would be useful to bolster the inference of about the origins of cold tolerance in Cochlearia, rather than making it seem like this result could be expected in other taxa.

      Thank you for this. We agree that we did not explain our reasoning as well as we could and we now have reworked this. Original lines 329-330 simply refers to the (expected) and obvious general response to cold – some explanatory text has been added, e.g. such as at the end of the discussion and directly with the above-mentioned lines.

      Lastly, another area that would benefit from some clarification and tightening is revisiting the connection between the results and stated conclusions. For instance, some of the statements from the introduction and conclusions indicate the reader might expect explicit niche exploration analyses and more detailed genomic approaches. It is not abundantly clear for a general audience how these results definitively demonstrate how genetic diversity was rescued in reticulate and polyploid gene pools or species barriers were torn down. These are very specific, strong claims that do not appear to be explicitly discussed outside of the introduction/discussion or directly related to the results presented in this manuscript.

      Thank you for pointing out how this could be read in this way. We have revised this to indicate that agree: we do not think our data ‘definitively demonstrate’ (in the reviewer’s words, not our) this. We modify the text to avoid such interpretation.

      This is no way diminishes the considerable effort of the authors to conduct the informative array of presented analyses, but more closely aligning the conclusions within the scope of presented results (or providing direct links on how the results provide these insights) would help increase the effectiveness of this manuscript.

      Many thanks for this very encouraging note. We have worked to incorporate these thoughtful comments.

    1. When absent in teacher education programs and national policies, it is little wonder that many English teachers may be both stymied and fearful about addressing the civic, healing needs of classrooms

      This is very interesting because during my teacher preparation program here at UIC, we often were covering SEL and trauma-informed lessons, but I don't think anything can truly prepare teachers for the way our society is built on trauma of BIPOC folx. For example, many of our trauma-informed pedagogies are centered around student wellness and trauma they experience on a personal level, and while we may beat around the bush that the real problem are social oppressive systems and their impact on students, it feels very rare that we are examining how trauma is ingrained in every aspect of our society for Black and Brown children. For example, none of us were prepared to support students through the trauma of Covid-19, but the trauma that Black and Brown students are experiencing from Covid-19 is very different from the experience of many white families, and that is, again, because of the oppressive systems that we know of but don't fully examine as being the reason why students have so much trauma.

    1.  So what gets in the way of our pursuit of it? I think we most often resist going through the process of mastery  for two reasons: it can be deeply uncomfortable along the way and we doubt our ability to become expert.

      I think that this statement is 100% true. When people first start to get into new things and new skills then it probably seems very weird at first. For example if it's a sport then you're not going to be used to it after. It may feel uncomfortable because it's something that is new for you and your body/mind has to get the chance to adapt to it. As time goes on you gradually start getting better and better at this skill and it becomes a hobby or like the title states...something you're really good at.