10,000 Matching Annotations
  1. Last 7 days
    1. Reviewer #2 (Public review):

      I have completed a thorough review of this paper, which seeks to use the large datasets of species occurrences available through GBIF to estimate variation in how large numbers of plant and animal species are associated with urbanization throughout the world, describing what they call the "species urbanness distribution" or SUD. They explore how these SUDs differ between regions and different taxonomic levels. They then calculate a measure of urban tolerance and seek to explore whether organism size predicts variation in tolerance among species and across regions.

      The study is impressive in many respects. Over the course of several papers, Callaghan and coauthors have been leaders in using "big [biodiversity] data" to create metrics of how species' occurrence data are associated with urban environments, and in describing variation in urban tolerance among taxa and regions. This work has been creative, novel, and it has pushed the boundaries of understanding how urbanization affects a wide diversity of taxa. The current paper takes this to a new level by performing analyses on over 94000 observations from >30,000 species of plants and animals, across more than 370 plant and animal taxonomic families. All of these analyses were focused on answering two main questions:<br /> (1) What is the shape of species' urban tolerance distributions within regional communities?<br /> (2) Does body size consistently correlate with species' urban tolerance across taxonomic groups and biogeographic contexts?

      Overall, I think the questions are interesting and important, the size and scope of the data and analyses are impressive, and this paper has a potentially large contribution to make in pushing forward urban macroecology specifically and urban ecology and evolution more generally.

      Despite my enthusiasm for this paper and its potential impact, there are aspects that could be improved, and I believe the paper requires major revision.

      Some of these revisions ideally involve being clearer about the methodology or arguments being made. In other cases, I think their metrics of urban tolerance are flawed and need to be rethought and recalculated, and some of the conclusions are inaccurate. I hope the authors will address these comments carefully and thoroughly. I recognize that there is no obligation for authors to make revisions. However, revising the paper along the lines of the comments made below would increase the impact of the paper and its clarity to a broad readership.

      Major Comments:

      (1) Subrealms

      Where does the concept of "subrealms" come from? No citation is given, and it could be said that this sounds like an idea straight out of Middle Earth. How do subrealms relate to known bioclimatic designations like Koppen Climate classifications, which would arguably be more appropriate? Or are subrealms more socio-ecologically oriented? From what I can tell, each subrealm lumps together climatically diverse areas. It might be better and more tractable to break things in terms of continents, as the rationale for subrealms is unclear, and it makes the analyses and results more confusing. The authors rationalized the use of subrealms to account for potential intraspecific differences in species' response to urbanization, but that is never a core part of the questions or interpretation in the paper, and averaging across subrealms also accounts for intraspecific variation. Another issue with using the subrealm approach is that the authors only included a species if it had 100 observations in a given subrealm, leading to a focus on only the most common species, which may be biased in their SUD distribution. How many more species would be included if they did their analysis at the continental or global scale, and would this change the shape of SUDs?

      (2) Methods - urban score

      The authors describe their "urban score" as being calculated as "the mean of the distribution of VIIRS values as a relative species-specific measure of a response to urban land cover."

      I don't understand how this is a "relative species-specific measure". What is it relative to? Figures S4 and S5 show the mean distribution of VIIRS for various taxa, and this mean looks to be an absolute measure. Mean VIIRS for a given species would be fine and appropriate as an "urban score", but the authors then state in the next sentence: "this urban score represents the relative ranking of that species to other species in response to urban land cover".

      That doesn't follow from the description of how this is calculated. Something is missing here. Please clarify and add an explicit equation for how the urban score is calculated because the text is unclear and confusing.

      (3) Methods - urban tolerance

      How the authors are defining and calculating tolerance is unclear, confusing, and flawed in my opinion.

      Tolerance is a common concept in ecology, evolution, and physiology, typically defined as the ability for an organism to maintain some measure of performance (e.g., fitness, growth, physiological homeostasis) in the presence versus absence of some stressor. As one example, in the herbivory literature, tolerance is often measured as the absolute or relative difference in fitness of plants that are damaged versus undamaged (e.g., https://academic.oup.com/evolut/article/62/9/2429/6853425?login=true).

      On line 309, after describing the calculation of urban scores across subrealms, they write: "Therefore, a species could be represented across multiple subrealms with differing measures of urban tolerance (Fig. S4). Importantly, this continuous metric of urban tolerance is a relative measure of a species' preference, or affinity, to urban areas: it should be interpreted only within each subrealm".

      This is problematic on several fronts. First, the authors never define what they mean by the term "tolerance". Second, they refer to urban tolerance throughout the paper, but don't describe the calculation until lines 315-319, where they write (text in [ ] is from the reviewer):

      "Within each subrealm, we further accounted for the potential of different levels of urbanization by scaling each species' urban score by subtracting the mean VIIRS of all observations in the subrealm (this value is hereafter referred to as urban tolerance). This 'urban tolerance' (Fig. S5) value can be negative - when species under-occupy urban areas [relative to the average across all species] suggesting they actively avoid them-or positive-when species over-occupy urban areas [relative to the average across all species] suggesting they prefer them (i.e., ranging from urban avoiders to urban exploiters, respectively).<br /> They are taking a relativized urban score and then subtracting the mean VIIRS of all observations across species in a subrealm. How exactly one interprets the magnitude isn't clear and they admit this metric is "not interpretative across subrealms".

      This is not a true measure of tolerance, at least not in the conventional sense of how tolerance is typically defined. The problem is that a species distribution isn't being compared to some metric of urbanness, but instead it is relative to other species' urban scores, where species may, on average, be highly urban or highly nonurban in their distribution, and this may vary from subrealm to subrealm. A measure of urban tolerance should be independent of how other species are responding, and should be interpretable across subrealms, continents, and the globe.

      I propose the authors use one of two metrics of urban tolerance:

      (i) Absolute Urban Tolerance = Mean VIIRS of species_i - Mean VIIRS of city centers<br /> Here, the mean VIIRS of city centers could be taken from the center of multiple cities throughout a subrealm, across a continent, or across the world. Here, the units are in the original VIIRS units where 0 would correspond to species being centered on the most extreme urban habitats, and the most extreme negative values would correspond to species that occupy the most non-urban habitats (i.e., no artificial light at night). In essence, this measure of tolerance would quantify how far a species' distribution is shifted relative to the most highly urbanized habitat available.

      (ii) % Urban Tolerance = (Mean VIIRS of species_i - Mean VIIRS of city centers)/MeanVIIRS of city centers * 100%<br /> This metric provides a % change in species mean VIIRS distribution relative to the most urban habitats. This value could theoretically be negative or positive, but will typically be negative, with -100% being completely non-urban, and 0% being completely urban tolerant.

      Both of these metrics can be compared across the world, as it would provide either absolute (equation 1) or relative (equation 2) metrics of urban tolerance that are comparable and easily interpretable in any region.

      In summary, the definition of tolerance should be clear, the metric should be a true measure of tolerance that is comparable across regions, and an equation should be given.

      (4) Figure 1: The figure does not stand alone. For example, what is the hypothesis for thermophily or the temperature-size rule? The authors should expand the legend slightly to make the hypotheses being illustrated clearer.

      (5) SUDs: I don't agree with the conclusion given on line 83 ("pattern was consistent across subrealms and several taxonomic levels") or in the legend of Figure 2 ("there were consistent patterns for kingdoms, classes, and orders, as shown by generally similar density histograms shapes for each of these").

      The shapes of the curves are quite different, especially for the two Kingdoms and the different classes. I agree they are relatively consistent for the different taxonomic Orders of insects.

      Comments on revised version:

      I believe their response is thorough and thoughtful. I still disagree with them on some fundamental points of their methodology. However, I would prefer to let my review and their response stand as is. This will allow engaged readers to see both sides of the arguments and judge for themselves whether they believe the revisions are sufficient and if my concerns are valid.

    1. Reviewer #1 (Public review):

      Summary:

      This study demonstrates, through a series of EEG and MEG experiments, that the human brain automatically categorizes words from alphabetic and non-alphabetic languages, and it unpacks the neural mechanisms of this process from multiple angles. The work examines not only univariate repetition-suppression (RS) effects, but also how repeating or alternating languages influences the representational similarity of words within and across language categories.

      Strengths:

      The univariate RS effects across multiple experiments lend support to some of the main conclusions.

      Comments on revised version.

      The authors have made appropriate revisions and supplements in response to the issues I raised, which has largely resolved my concerns.

    2. Reviewer #2 (Public review):

      Summary:

      This study investigates how the human brain categorizes visual words from distinct writing systems (alphabetic vs. non-alphabetic). Using a repetition suppression paradigm combined with electroencephalography and magnetoencephalography, the authors conducted nine experiments with independent participants to identify the neural network underlying language-based categorization, characterize its temporal dynamics, and test whether this process operates independently of linguistic properties such as semantic meaning and pronunciation.

      Strengths:

      The study employs a well-validated design with clear control conditions and systematically manipulates key variables including writing system, language familiarity, and native language background. The use of nine experiments with independent participant samples strengthens the reliability and replicability of the results. The work combines EEG and MEG, cross-validating findings across imaging modalities to support the reported neural effects. A combination of univariate, multivariate, and connectivity analyses is used to characterize neural responses and network interactions. Results are consistent across multiple language groups and for both familiar and unfamiliar languages, supporting the generalizability of the identified neural mechanism beyond specific languages or prior experience.

      Comments on revised version.

      Earlier versions of the manuscript framed these findings as more directly reflecting the social-categorization function of language. In the revised manuscript, the authors now more carefully distinguish language-based word categorization from broader claims regarding social categorization and explicitly acknowledge that the current experiments do not directly test social evaluation or intergroup processes. These revisions improve the conceptual precision of the work and address my major concern from the previous review.

      The additional methodological clarifications and supplementary analyses also strengthen the manuscript. Overall, I believe the revised version provides solid evidence for rapid language-based categorization of visual words across different writing systems.

    1. Reviewer #1 (Public review):

      Summary

      This manuscript addresses an important question in auditory neuroscience and neuroprosthetics: whether cortical responses to cochlear implant stimulation resemble those evoked by natural acoustic stimulation, or whether electrical stimulation engages a distinct cortical representation. The authors use high-density intracranial EEG recordings in rats to compare responses to pure tones in normal-hearing animals with responses to single-channel cochlear implant stimulation in deafened animals. They combine analyses of event-related potentials, high-gamma activity, trial-by-trial variability, PCA/TCA-based dimensionality reduction, and decoder-based measures of stimulus information.

      Strengths

      A major strength of the study is the question it addresses. Understanding how electrical cochlear stimulation is represented centrally is highly relevant for cochlear implant design, fitting strategies, and rehabilitation. The comparison between acoustic and electrical stimulation, including within-animal comparisons in a subset of cases, is valuable because it directly addresses whether implant-evoked activity can be interpreted within the framework of normal acoustic tonotopy.

      The methodological approach is also a strength. Dense cortical surface recordings provide simultaneous access to spatial and temporal features of auditory cortical responses. The combination of PCA, TCA, and decoder analyses gives complementary views of the data, and the information-transfer analysis provides an interesting way to ask whether representations learned from acoustic stimulation generalize to electrical stimulation.

      Weaknesses:

      The main weakness is that the evidence for spatial organization remains difficult to interpret. In Figure 2, the authors argue that both tone-evoked and cochlear implant-evoked responses are spatially organized, but the slope analyses are not significant for the cochlear implant condition. The revised vector-strength analysis supports the presence of non-random spatial structure, but this is not the same as demonstrating a clear graded cochleotopic organization. The manuscript would be strongest if it consistently distinguished between non-random spatial structure, coarse topography, and true graded tonotopy or cochleotopy.

      A related issue is that some figure titles and interpretive statements still appear stronger than the data justify. For example, the TCA results in Figure 7 are described as revealing topographically organized latent spatial factors, but the statistical support appears strongest for normal-hearing high-gamma responses, with weaker or non-significant results in other conditions. These data remain interesting, but they would be better framed as evidence for weak or coarse spatial structure rather than robust topographic organization across all modalities.

      The decoder analyses are improved, especially with the added tone-to-tone control. This control supports the conclusion that poor acoustic-to-CI transfer is not simply a failure of the TCA/LDA pipeline. However, the analysis remains model-dependent, and the absolute information transfer values are low. It would be helpful either to include an analogous analysis using raw ERP/high-gamma features or to explain more explicitly why the TCA-based approach is the appropriate primary test. The data support poor generalization between acoustic and implant-evoked cortical responses, but claims about perceptual qualities should remain speculative because perception is not directly measured in these experiments.

      Finally, although methodological reporting is much improved, some verification remains indirect. The authors provide useful implantation criteria and cite prior validation of their deafening approach, but the manuscript would be clearer if it explicitly distinguished between validation performed in the present animals and validation based on previous cohorts. This distinction is important because surgical variability, implantation efficacy, and deafening completeness can influence the interpretation of cochlear implant experiments.

      Comments on revised version.

      The revised manuscript is considerably improved. The authors have clarified several methodological details, added a statistical framework that better accommodates both paired and unpaired animals, provided a clearer account of animal cohorts, added peripheral ECAP/forward-masking data to support the cochlear specificity of implant stimulation, and included a useful positive control for the cross-modal decoder analysis. These additions make the manuscript stronger and help readers interpret the main findings more confidently.

      The results support the conclusion that acoustic and cochlear implant stimulation evoke cortical responses with different properties. In particular, acoustic responses support better single-trial stimulus decoding than cochlear implant responses, and decoders trained on acoustic responses transfer poorly to implant-evoked responses. The evidence for spatial organization is more nuanced. The cochlear implant condition shows evidence of non-random spatial structure, but not a clear graded cochleotopic map. The normal-hearing condition is also less visually clear than might be expected from prior tonotopy studies, although the added analyses and comparisons to previous work help contextualize this result. Overall, the study makes a valuable contribution, provided that the claims about spatial organization and perceptual interpretation remain appropriately cautious.

      The revision addresses several important concerns from the original version. The use of mixed-effects models better matches the partially paired experimental design. The expanded Methods improve reproducibility. The new cohort schematic helps clarify which animals contributed to behavioral and neural datasets. The ECAP forward-masking measurements add useful peripheral validation, and the within-modality decoder control strengthens the interpretation of the poor cross-modal transfer result. Together, these changes substantially improve the manuscript.

      The work is likely to be of interest to auditory neuroscientists, cochlear implant researchers, and neuroengineers. Even where some conclusions require cautious wording, the dataset and analytical framework may be useful for future studies aiming to relate cortical responses to implant programming, perceptual learning, or closed-loop neuroprosthetic approaches.

      Overall, the revised manuscript is stronger and addresses an important problem with useful methods and analyses. The results most convincingly show that acoustic responses support better single-trial decoding than acute cochlear implant responses, and that acoustic-trained decoders generalize poorly to implant-evoked activity. The evidence for robust spatial organization, especially in the cochlear implant condition, is more limited and should be presented with appropriate caution.

    2. Reviewer #2 (Public review):

      Summary:

      This article reports measurements of iEEG signals on the rat auditory cortex during cochlear implant or sound stimulation in separate groups of rats. The observations indicate some spatial organization of cochlear implant stimuli, but that is very different from cochlear implants.

      Strengths:

      The study includes interesting analyses of the sound and cochlear implant representation structure based on decoders.

      Weaknesses:

      The observation that responses to cochlear implant stimulation (stimulation) is spatially organized is not new (e.g. Adenis et al. 2024)

      The claim that spatial and temporal dimensions contribute information about the sound is also not new there is a large literature on this topic.

      The analyses supporting the claim that there is a mismatch between cochlear implant and sound representation are still unclear, particularly in Fig. 8.

    3. Reviewer #3 (Public review):

      Summary:

      Through micro-electroencephalography, Hight and colleagues studied how the auditory cortex in its ensemble respond to cochlear implant stimulation compared to the classic pure tones. Taking advantage of a double implanted rat model (Micro-ECoG and Cochlear Implant), they tracked and analyzed changes happening in the temporal and spatial aspects of the cortical evoked responses in both normal hearing and cochlear-implanted animals. After establishing that single trial responses were sufficient to encode the stimuli properties, the authors then explored several decoder architectures to study the cortex ability to encode each stimuli modality in a similar or different manner. They conclude that a) intracranial EEG evoked responses can be accurately recorded and did not differed between normal hearing and cochlear-implanted rats; b) Although coarsely spatially organized, CI-evoked responses had higher trial-by-trial variability than pure tones; c) Stimulus identity is independently represented by temporal and spatial aspect of cortical representations and can be accurately decoded by various means from single trials; d) and that Pure tones trained decoder can't decode CI-stimulus identity accurately.

      Strength:

      The model combining micro-eCoG and cochlear implantation and the methodology to extract both the Event Related Potentials (ERPs) and High-Gammas (HGs) is very well designed and appropriately analyzed. Likewise, the PCA-LDA and TCA-LDA are powerful tools that take full advantage of the information provided by the cortical ensembles.

      The overall structure of the paper, with a paced and exhaustive progress through each step and evolution of the decoder is very appreciable and easy to follow. The exploration of single trial encoding and stimulus identity through temporal and spatial domains is providing new avenues to characterize the cortical responses CI stimulations and their central representation. The fact that single trials suffice to decode the stimulus identity regardless of their modality is of great interest and noteworthy. Although the authors confirm that iEEG remains difficult to transpose in clinic, the insights provided by the study confirm the potential benefit of using central decoders to help in clinic settings.

      Weakness:

      The conclusion of the paper, especially the concept of distinct cortical encoding for each modality, is unfortunately partially supported by the results as the authors ignored fundamental limitations of CI related stimulation.

      First, the authors stimulated in a Monopolar mode which, albeit being clinically relevant, notoriously generates a high current spread in rodent models. Comparing the averaged BF maps for iEEG (Fig-2A, C), BFs ranged from 4 to 16kHz with a predominance of 4kHz BFs. The lack of BFs at higher frequencies might reveal a potential location mismatch between the frequency range sampled at the level of the cortex (low to medium frequencies) and the frequency range covered by the CI inserted mostly in the first turn-and-a-half of the cochlea (high to medium frequencies). Looking at Fig-2F (and to some extend 2A) most of CI electrodes elicited responses around the 4kHz regions and averaged maps show a predominance of CI-3-4 across cortex (Fig-2C, H and Sup Fig. 3) from areas with 4kHz BF to areas with 16kHz BF. It is doubtful that CI-3-4 are located near the 4kHz region based on Müller's work (1991) on the frequency representation in the rat cochlea. Moreover, Supplemental figure 3 shows that only a couple of CI electrodes are predominately represented at the level of the cortex. Thus, it seems possible that current spread ended stimulating indistinctly higher turns of the cochlea or even the modiolus in a non-specific manner, greatly reducing (or smearing) the place-coding/frequency resolution of each electrode, which in turn could explain the coarse topographic (or coarsely tonotopic according to the manuscript) organization of the cortical responses.

      Second, although the authors acknowledge that post-lingual CI users always have an adaptation period, their conclusion is based on measurements that are relatively "early" in the CI-use timeline so to speak since iEEG were collected a) acutely right after mono-aural implantation and stimulation, b) under anesthesia, c) using unmodulated pulse train fixed at 900pps regardless of the electrode used and thus lacking any temporal information shifts in relationship to electrode cochleotopic placement. Basically, all CI electrodes had the same rate whereas you would expect basal CI electrodes to be amplitude modulated at higher frequencies than apical electrodes.

      As much as the reviewer likes the overall approach with the use of PCA-LDA and TCA, and agrees that information transfer seems inexistant at time of measurement, authors should be more careful in their strong conclusion that two distinct encoding exist. The non-overlapping between sound and electric stimulation representations might exist only transiently and this should be acknowledged a bit more in the discussion. Without repetition of iEEG measurement at later period with chronic use of the CI, it is not possible to definitively claim that two distinct, non-overlapping coding co-exist at all times.

      Nevertheless, the reviewer wants to reiterate that the study proposed by Hight et al. is well constructed, relevant to the field and that the overall proposal of improving patient performances and help their adaptation in the first months of CI use by studying central responses should be pursued as it might help establish new guidelines or create new clinical tools.

    1. Reviewer #1 (Public review):

      Summary:

      This study investigates the impact of Pink1 loss on glial function and neuronal health in a Drosophila model, highlighting the role of mitochondria-organelle contacts and key genes such as Ccz1, Vps13, Mon1, and Rab7. The work provides insights into cellular processes underlying neurodegenerative diseases, with a focus on glia-neuron interactions.

      Comments on revised version:

      I have reviewed the revised manuscript and the authors' responses to previous comments. The authors have addressed the key concerns raised by the reviewers, including validation of the Mz-GAL4 line and additional control experiments. The remaining issues caused by experimental constraints are understandable in this study.

      However, several concerns remain. Notably, some key results were removed due to the use of inadequately characterized fly lines, and the lack of follow-up experiments to address these issues raises concerns regarding the validity and reliability of the findings. Furthermore, the absence of experiments examining Rab7-mediated membrane trafficking or the interactions between mitochondria and lysosomes in the Pink1 mutant presents a limitation. These missing elements reduce the clarity and interpretability of Figure 5 for readers.

      On a positive note, the data showing that reducing Vps35/Vps13 enhances neuronal function and rescues Pink1 mutant phenotypes in ensheathing glia contributes meaningfully to the overall narrative.

      Despite these limitations, this research addresses an important question in neuroscience using the Drosophila model. It provides a novel perspective on Parkinson's disease and neurodegeneration by exploring mechanisms underlying Pink1 loss and suggesting a role for mitochondria-organelle interactions in ensheathing glia, potentially regulated via Vps35/Vps13-mediated pathways.

      Overall, the current version presents a clear and meaningful contribution to the field.

    2. Reviewer #2 (Public review):

      Summary:

      This study proposes a novel role for ensheathing glia (EG) in a Pink1-model of Parkinson's disease and shows that this cell population exhibits the highest number of DEG in a pre-symptomatic stage. In the olfactory system, there seems to be morphological changes in this cell-type that resembles an 'activated' state and the authors further show that the neuronal loss of Pink1 is responsible for this defect. The authors go on to show that manipulation of Pink1 in EG also leads to some defects in the visual system and in the dopaminergic neurons (DAN) that innervate the mushroom body (MB), and performed a screen based on the 'on-transient' defect of the ERG to identify potential genes that may modulate the function of EG in synaptic regulation. They focus on several genes related to vesicle trafficking including Vps13, and Vps35 and performed some additional experiments in the visual system and MB to propose the role of vesicle/lipid trafficking in EG as an important factor for PD pathogenesis.

      Strengths:

      The study proposes functional and mechanistic connections between several genes that have been linked to PD (PINK1, VPS35 and VPS13A/C). I feel that the data presented in Figure 1-Figure 3C are performed with rigor and are convincing/novel. The selection of Drosophila to study the questions is also a strength and the lab has extensive experiences in this field and model organism.

      Weaknesses:

      In this revised manuscript, a number of concerns raised by this and the other reviewer was addressed. The authors now admitted that some of the genetic reagents used in their screen and follow up assays were inappropriately utilized, and changed the latter half of the paper (Fig 3D-F4) quite significantly (e.g. now only 1 gene is considered as a hit in Fig3D, analysis of several genes in Fig4 have been removed and replaced by some experiments performed on Vps35). The transition between Figure 3D and Figure 4 is quite abrupt, and they don't seem to follow up on the CG17660 (the single hit from their screen, which is not further validated so it is not clear whether this genetic reagent is clean or not) and the effect of Vps35 RNAi in synaptic phenotype. Therefore, there is still a weakness in Figure 3D-Figure 4, which weakens the paper, especially since the new model diagram the authors provided in Figure 5 is not really investigated at the molecular level.

    1. Reviewer #1 (Public review):

      Summary:

      In this study entitled "Linking Germline Telomere Removal to Global Programmed DNA Elimination in Tetrahymena Genome Differentiation" Nagao and colleagues examine the fate of germline chromosome ends during somatic genome differentiation in the ciliate Tetrahymena thermophila. During sexual reproduction, a new somatic genome is created from a zygotic, germline-derived genome by extensive programmed DNA elimination events. It has been known for some time that the terminii of the germline chromosomes are eliminated, but the exact process and kinetics of the elimination events has not been thoroughly investigated. The authors first use germline-specific telomere probes to show that the loss of these chromosome ends occurs with similar timing as other DNA elimination events. By comparative analysis of the assembled germline and somatic genomes, the authors find the ends of each of the germline chromosomes are composed of few hundred kilobases of micronuclear limited sequences (MLS) that are removed starting around 14 hours after the start of conjugation, which initiates sexual development. They then develop an in-situ hybridization assay to track the fate of one end of chromosome 4 while simultaneously following the adjacent macronuclear destined sequence (MDS) retained in the new somatic genome. This allows the authors to more clearly show that these adjacent chromosomal segments are initially amplified in the developing genome before the terminal MLS is eliminated. Finally, they mutate the chromosome breakage sequence (CBS) that normally separates the MLS terminus from the adjacent MDS region as show that strains that develop with only one mutant chromosome can produce viable sexual progeny, but it appears that both the MLS and the MDS from the mutant chromosome are lost. If both chromosome copies have the CBS mutation, the cells arrest during development and do not eliminate many germline limited sequences and fail to produce viable progeny. Overall, this study provides many new insights into the fate of germline chromosome ends during somatic genome remodeling and suggests extensive coordination of different DNA elimination events in Tetrahymena.

      Strengths:

      Overall, the experiments were well executed with appropriate controls. The findings are generally robust. Importantly, the study provides several novel findings. First, the authors provide a fairly comprehensive characterization of the size of the MLS at the end of each germline chromosome. They also report on the highly repetitive composition of these chromosome terminii. Second, the authors develop a novel method to study the fate of chromosome terminii during development and use it conclusively track the elimination of these terminii. Third, the authors show that the elimination of these terminii appears to occur concurrently with most other DNA elimination events during somatic genome differentiation. And fourth, the authors show that failure to separate these eliminated sequences from the normally retained chromosome alters the fate of these adjacent MDS and loss of the cells ability to produce viable progeny. The authors initially hypothesized that DNA elimination may be blocked due to inappropriate silencing of genes in the MDS region when the CBS is mutant, but gene expression analysis showed that this is not the case.

      Weaknesses:

      After revising the manuscript based on the initial reviewers' critique, most weaknesses have been addressed. On weakness remaining is that since the authors only mutated the end of one germline chromosome, it is not clear whether the elimination of the MDS adjacent to the terminal MLS on chromosome 4 when the CBS is mutated is a general phenomenon, i.e. would happen at all chromosome ends, or is unique to the situation at Chromosome 4R. Knowing whether it is a general phenomenon or not would provide important insight into the authors findings. The authors did attempt to look at other chromosome ends, but technical limitations currently stymie this effort.

      The other weakness is that it remains unclear how failure to carry out DNA elimination appears to induce a checkpoint during development, but this open question is not unique to this study.

      Comments on revised version.

      The authors have significantly improved the study. The addition of the RNA-seq analysis allowed these researchers to show that their initial hypothesis - that loss of a CBS leads to inappropriate gene silencing in the neighboring MDS region - appears not to be the case. I do not have further suggestions for the authors.

    2. Reviewer #2 (Public review):

      Summary:

      Mochizuki and colleagues investigated how the germline (MIC) telomere was removed during programmed genome rearrangement in the developing somatic nucleus (MAC). Using an optimized oligo-FISH procedure, the authors demonstrated that MIC telomeres were co-eliminated with a large region of MIC-limited sequences (MLS) demarcated on the opposite side by a sub-telomeric chromosome breakage site (CBS). This conclusion was corroborated by the latest assembly of the Tetrahymena MIC genome. They further employed CRISPR-Cas9 mutagenesis to disrupt a specific sub-telomeric CBS (4R-CBS). In the uniparental progeny (mutant X WT), DNA elimination of the sub-telomeric MLS was not affected, but the adjacent MAC-destined sequence (MDS) may be co-eliminated. However, in the biparental progeny (mutant X mutant), global DNA elimination was arrested, revealing previously unrecognized connections between chromosome breakage and DNA elimination. It also paves the way for future studies into the underlying molecular mechanisms. The work is rigorous, well-controlled, and offers important insights into how eukaryotic genomes demarcate genic regions (retained DNA) and regions derived from transposable element (TE; eliminated DNA) during differentiation. The identification of chromosome breakage sequences as a critical architectural element of the genome separating TE-derived regions from functional genes is a key conceptual contribution.

      Strengths:

      New method development: Oligo-FISH in Tetrahymena. This allows high-resolution visualization of critical genome rearrangement events during MIC-to-MAC differentiation. This method will be a very powerful tool in this area of study.

      The conclusion is strongly supported by integrated analyses of PCR-based assays, as well as cytological, genomic, and transcriptomic data.

      Rigorous genetic analysis of the role played by 4R-CBS in separating the fate of sub-telomeric MLS (elimination) and MDS (retention).

    3. Reviewer #3 (Public review):

      Programmed DNA elimination (PDE) is a process that removes a substantial amount of genomic DNA during development. While it contradicts the genome constancy rule, an increasing number of organisms have been found to undergo PDE, indicating its potential biological function. Single-cell ciliates have been used as a prominent model system for studying PDE, providing important mechanistic insights into this process. Many of those studies have focused on the excision of internally eliminated sequences (IES) and the subsequent repair using non-homologous end joining (NHEJ). These studies have led to the identification of small RNAs that mark retained or eliminated regions and the transposons that generate double-strand breaks.

      In this manuscript, Nagao and Mochizuki examined the other type of breaks in ciliates that are healed with telomere addition. They specifically focused on the sequences at the ends of the germline (MIC) chromosomes, which have received relatively less attention due to the technical challenges associated with the highly repetitive nature of the sequences. The authors used the Tetrahymena model and developed a set of new tools. They used a novel FISH strategy that enables the distinction between germline and somatic telomeres, as well as the retained and eliminated DNA near the chromosome ends. This allows them to track these sequences at the cellular level throughout the development process, where PDE occurs. They also analyzed the more comprehensive germline and somatic genomes and determined at the sequence level the loss of subtelomeric and telomere sequences at all chromosome ends. Their result is reminiscent of the PDE observed in nematodes, where all germline chromosome ends are removed and remodeled. Thus, the finding connects two independent PDE systems, a protozoan and a metazoan, and suggests the convergent evolution of chromosome end removal and remodeling in PDE.

      The majority of sites (8/10) at the junctions of retained and eliminated DNA at the chromosome ends contain a chromosome breakage sequence (CBS). The authors created a set of mutants that modify the CBS at the ends of chromosome 4R. CBS regions are challenging for CRISPR due to their AT-rich sequences, making the creation of the 4R-CBS mutants a significant breakthrough. They used the FISH assay to determine if PDE still occurs in these mutant strains with compromised CBS. Surprisingly, they found that instead of blocking PDE, its adjacent retained DNA is now eliminated, suggesting a co-elimination event when the breakage is impaired. Furthermore, in biparental mutant crosses, no PDE occurred, and no viable progeny were produced, indicating that the removal of chromosome ends is crucial for proper PDE and sexual progeny development. Overall, the work demonstrates a critical role for 4R-CBS in separating retained and eliminated DNA.

    1. Reviewer #1 (Public review):

      [Editor's note: This version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed all concerns raised by the reviewers; no further changes are required at this point.]

      Summary:

      The manuscript by Yang et al. investigates the relationship between multi-unit activity in the locus coeruleus, putatively noradrenergic locus coeruleus, hippocampus (HP) sharp-wave ripples (SWR) and spindles using multi-site electrophysiology in freely behaving male rats. The study focuses on SWR during quiet wake and non-REM sleep, and their relation to cortical states (identified using EEG recordings in frontal areas) and LC units.

      The manuscript highlights differential modulation of LC units as a function of HP-cortical communication during wake and sleep. They establish that ripples and LC units are inversely correlated to levels of arousal: wake, i.e. higher arousal correlates with higher LC unit activity and lower ripple rates. The authors show that LC neuron activity is strongly inhibited just before SWR detected during wake. During non-REM sleep, they distinguish "isolated" ripples from SWR coupled to spindles and show that inhibition of LC neuron activity is absent before spindle-coupled ripples but not before isolated ripples, suggesting a mechanism where noradrenaline (NA) tone is modulated by HP-cortical coupling. This result has interesting implications for the roles of noradrenaline in the modulation of sleep-dependent memory consolidation, as ripple-spindle coupling is a mechanism favoring consolidation. The authors further show that NA neuronal activity is downregulated before spindles.

      Strengths:

      In continuity with previous work from the laboratory, this work expands our understanding of the activity of neuromodulatory systems in relation to vigilance states and brain oscillations, an area of research that is timely and impactful. The manuscript presents strong results suggesting that NA tone varies differentially depending on coupling of HP SWR with cortical spindles. The authors place their findings back in the context of identified roles of HP ripples and coupling to cortical oscillations for memory formation in a very interesting discussion. The distinction of LC neuron activity between awake, ripple-spindle coupled events and isolated ripples is an exciting result and its relation to arousal and memory opens fascinating lines of research.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, authors studied the synchrony between ripple events in Hippocampus, cortical spindles and Locus Coeruleus spiking. The results in this study together with the established literature on the relationship of hippocampal ripples with widespread thalamic and cortical waves, guided authors to propose a role for Locus Coeruleus spiking patterns in memory consolidation. The findings provided here, i.e. correlations between LC spiking activity and Hippocampal ripples, could provide basis for future studies probing the directional flow or the necessity of these correlations in the memory consolidation process. Hence, the paper provides enough scientific advance to highlight the elusive yet important role of Norepinephrine circuitry in the memory processes.

      Strengths:

      Authors were able to demonstrate correlations of Locus Coeruleus spikes with hippocampal ripples as well as with cortical spindles. Specific strength of the paper is in the demonstration that the spindles that activate with the ripples are comparatively different in their correlations with Locus Coeruleus than those which do not.

    3. Reviewer #3 (Public review):

      This manuscript examines how locus coeruleus (LC) activity relates to hippocampal ripple events across behavioral states in freely moving rats. Using multi-site electrophysiological recordings, the authors report that LC activity is suppressed prior to ripple events, with the magnitude of suppression depending on ripple subtype. Suppression is stronger during wakefulness than during NREM sleep and least pronounced for ripples coupled to spindles.

      The study is technically sound and addresses a timely and important question regarding how LC activity interacts with hippocampal and thalamocortical network events across vigilance states. While the findings are interesting, they remain observational in nature.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      Summary:

      In this study, the authors propose that HSV-1 infection degrades the class I histone deacetylases HDAC1 and HDAC2. The MDM2 E3 ubiquitin ligase from the DNA damage response pathway is responsible for ubiquitinating these HDACs that are subsequently degraded via proteasomes. The authors hypothesize that HDAC degradation will cause hyperacetylation of viral chromatin and enable viral gene transcription.

      Strengths:

      The ubiquitination of HDAC1 & HDAC2 by Mdm2 and the mapping studies are clear.

    2. Reviewer #2 (Public review):

      Summary:

      The authors discovered that HDAC1/2 are degraded in HSV-1 and PRV infections. They attempted to establish a new mechanism by which HDAC1/2 are translocated to the cytoplasm to be degraded in HSV-1 infection, and the degradation causes changes in histone acetylation to affect the DDR pathway.

      Strengths:

      (1) Interesting findings of HDAC1/2 degradation during HSV-1 and PRV infection, and it may impact more than the virology field.

      (2) Significant work to identify the ubiquitin site in HDAC1/2 and K63 linkage.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without an additional round of formal review from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      Summary:

      In the manuscript "Pathogen-Phage Geomapping to Overcome Resistance," Do et al. present an impressive demonstration of using geographical sampling and metagenomics to guide sample choice for enrichment in human-associated microbes and the pathogen of interest to increase the chances of success for isolating phages active against highly resistant bacterial strains. The authors document many notable successes (17!) with highly resistant bacterial isolates and share a thoughtfully structured phage discovery effort, potentially opening the door to similar geomapping efforts across the field. While the work is methodologically strong and valuable for the community, there are a few areas where additional clarification and analysis could better align the claims with the data presented.

      Strengths:

      (1) The manuscript describes a well-executed and transparent example of overcoming a major obstacle in therapeutic virus identification, providing a practical success story that will resonate with researchers in microbiology and medicine.

      (2) Many phage researchers have anecdotally experienced a similar phenomenon, that a particular wastewater treatment plant always seems to have the pathogens you need. Quantifying this with metagenomics modernizes and adds evidence to this phenomenon in a way that could help researchers reproduce this success in a methodical way.

      (3) The methodology of combining environmental sampling, viral screening, and host-range analysis is clearly articulated and reproducible, offering a valuable blueprint for others in the field.

      (4) The data are presented with appropriate analytical rigor, and the results include robust sequencing and metagenomic profiling that deepen understanding of local viral communities.

      (5) The 17 successes yielding 35 phages have a lot of phylogenetic novelty beyond what the Tailor labs have typically found with previous methods.

      (6) The work highlights a practical and innovative solution to an increasingly important clinical problem, supporting the development of personalized antiviral strategies.

    2. Reviewer #2 (Public review):

      Summary:

      The manuscript by Do and colleagues aims to develop a workflow for isolating and identifying bacteriophages with potential applications in phage therapy against antibiotic-resistant pathogens. The workflow integrates geΦmapping as a strategy to identify potential phage sources, ΦHD as a device for phage concentration, and RΦ as a phage library constructed from the initial sampling, resulting in the discovery of 36 new phages. The paper is overall interesting, and the proposed method appears robust and effective.

      Strengths:

      The methods proposed combined state-of-the-art strategies to solve an ever-increasing problem of antibiotic resistance. The methods are robust, and the controls are appropriate. The integration of environmental sampling, concentration strategies, and downstream genomic characterization is a clear strength and provides a potentially scalable framework for identifying candidate therapeutic phages. The manuscript is clearly written overall, and the results support the main conclusions.

      Comments on revised version:

      The manuscript has been adequately improved and adjusted according to the comments. There are minor points such as Table S10 is labelled in the top of the page as Table S11. Also, is a little unconventional to cite result figures and tables in the introduction.

      For the question 10, regarding why some of the most abundant vOTUs in the 5L sample were not detected in the concentrate. The answer does not satisfy, as it focuses on why very low abundant vOTUs will not be detected, but the question is why some of the most abundant vOTUs were not detected. This does not affect the results or interpretation made.

    1. Reviewer #1 (Public review):

      Summary:

      Gurnani et al. explore how dynamical properties of neural networks influence capacity for and mechanisms of learning. Specifically, they focus on Brain Computer Interface (BCI) learning, in which manipulations are applied to a decoder that maps neural activity onto computer cursors. This paradigm was introduced by Sadtler et al. 2014, and has become an influential part of the neuroscience motor learning literature. A particularly fascinating outcome of that body of work is the observation that "within-manifold" perturbations (WMPs), which preserve covariance structure in the neural population, are easier to learn than "outside-manifold" perturbations (OMPs), which break this. Since deep network parameter access is challenging (to say the least) in monkey experiments, the intuition for this split in learnability is ripe for modeling and theory work. Indeed, the authors here introduce a feedback-driven recurrent neural network model whose output drives a simulation of a neural decoder commonly used in BCI studies like the Sadtler paper. While there have now been several modeling studies exploring how neural networks could solve this task, the feedback control perspective gives the authors' new model an interesting niche. Overall, this is a thoroughly done and well-written modeling study, and a solid contribution to the literature on within- and outside-manifold perturbations.

      Strengths:

      Reframing the OMP and WMP learning from a feedback-driven dynamical systems perspective, not just a geometric one, is an interesting take. The controllability analysis (along with the clear difference in input-driven and recurrence-driven learning) is quite a cool result that helps better frame what might be happening in the primate brain during similar tasks.

      Weaknesses:

      Some of the more interesting aspects, especially the controllability) and the differences between input-driven and recurrence-driven learning could be further developed, either by showing more analyses or running more comparisons. A few sections could benefit from some additional clarity on the strength and significance of results.

    2. Reviewer #2 (Public review):

      Summary:

      The constraints on learning in the brain remain elusive. Using BCIs, Sadtler et al. demonstrated that the brain can rapidly learn new decoders that lie within the intrinsic neural manifold (short-term adaptation), while showing substantial difficulty learning decoders that lie outside the manifold. This finding suggests that neural manifolds impose constraints on learning. However, even among within-manifold decoders, there was considerable variability in learning rates that could not be explained solely by geometric factors.

      Here, Gurnani et al propose that, in addition to manifold structure, neural dynamics (i.e., the flow field across states) impose critical constraints on learning. To test this idea, the authors trained RNNs that received real-time feedback (e.g., position error signals) during a BCI task in which the network controlled a cursor. The authors showed that short-term adaptation to a new decoder is facilitated by plasticity in sensory inputs, and that pre-existing dynamics influence the speed of adaptation across different decoders. These findings may explain previously unresolved constraints observed in BCI learning and suggest an important role for neural dynamics in constraining sensorimotor learning in the brain.

      Strengths:

      Overall, the work is highly impactful and is likely to motivate a new generation of BCI and learning experiments combining large-scale neural recordings with latent dynamical systems analyses. The paper is clearly written, and I only have minor comments, primarily for clarification.

      Weaknesses:

      There are no major weaknesses. Please see below for minor comments.

      (1) If I understand correctly, most analyses do not distinguish between the preparatory phase and the movement phase. Given that the preparatory phase is largely controlled by feedforward input, I suspect that most of the dynamical constraints underlying learning variability arise during the movement phase. Is this correct? If so, could the authors clarify or directly test this distinction?

      (2) P4: Position vs. velocity decoders: It would be helpful to describe whether and how the choice of velocity versus position decoders influences whether perturbations are learnable, and whether input-driven constraints arising in this task are similar.

      (3) The variance criteria used to screen decoder perturbations may themselves covary with learning rate, behavioral asymmetry, and overlap with controllable subspaces. A quantification of this relationship would help contextualize the findings and inform the design of future BCI experiments.

      (4) To support the comparison between Figures 3 and 7, and the conclusion that Figure 3 better matches the experimental data, which is an important point of the manuscript, could the authors provide quantitative values from the experimental data (e.g., how large is the change in variance within oPCs, etc)?

      (5) Figure 8h: Is the variability in learning rates in models with different controller networks explained by the same dynamical constraints described in Figure 6? Demonstrating consistent dynamical constraints across model architectures would strengthen the paper's central conclusion.

      (6) Figure 8f: Why does feedforward controllability differ between conditions? This is mentioned in the text, but no explanation is provided.

    1. Reviewer #1 (Public review):

      Summary:

      This study investigates the role of the medial prefrontal cortex (mPFC) in generating goal-directed actions under threat, using a progressive behavioral paradigm, neural recordings, and optogenetic inhibition in mice. The authors demonstrate that while mPFC GABAergic neurons strongly encode cues, actions, and errors, particularly under high cognitive demand, this neural activity is not causally required for executing avoidance behaviors. By rigorously controlling for movement and arousal, the researchers found that much of the observed mPFC signaling actually reflects baseline behavioral states rather than the generation of the actions themselves. This dissociation between encoding and causality challenges traditional views of mPFC as an executive controller of action and provides a nuanced understanding of its role in evaluative and contextual processing.

      Strengths:

      The behavioral paradigm employed in this study is one of its greatest strengths, offering a rigorous, progressive, and well-controlled framework to dissect the neural mechanisms underlying avoidance under threat. This three-phase task design is particularly well-suited to tease apart the contributions of learning, discrimination, and cognitive load to both behavior and neural activity.

      By tracking movement (speed, rotations) and including it as a covariate in statistical models, the authors also underscore the need to control for movement and baseline activity when interpreting cortical signals, which is relevant for all studies of brain-behavior relationships, ensuring that behavioral changes are not due to general arousal or motor activity.

      Finally, the study combines multiple advanced techniques-fiber photometry, single-cell calcium imaging (miniscopes), and two distinct optogenetic inhibition methods-to provide a comprehensive look at both neural encoding and causal necessity.

      Comments on revised version.

      The authors adequately addressed all of the reviewers' comments and made great improvements to the manuscript, particularly enhancing the methods and figures to significantly improve clarity and readability.

    2. Reviewer #2 (Public review):

      Summary:

      The manuscript by Sajid et al. describes a comprehensive behavioral, imaging and optogenetic dataset investigating the role of the mPFC in avoidance and escape behaviors. Although many movement- and task-related variables are encoded by mPFC GABAergic neurons, the main conclusion is that they are unlikely to control behavioral output.

      Strengths:

      The manuscript is generally well executed and plausible in its conclusions. It provides an alternative viewpoint to many articles describing the involvement of mPFC to behavior, based on a complex multi-stage behavioral paradigm acquired and analyzed in an unbiased way.

      Weaknesses:

      This reviewer sees two weaknesses.

      (1) In some cases, the explained variance, marginal and conditional, is low, suggesting the models only modestly capture the complexity in the data.

      (2) The manuscript is challenging to read due to the comprehensive and unbiased presentation style.

      Comments on revised version.

      The authors did a good job at addressing the reviewers' comments. One minor additional suggestion is to add references for the statement in the last paragraph of the discussion for the mPFC lesion studies.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      Summary:

      Morgan et al. studied how paternal dietary alteration influenced testicular phenotype, placental and fetal growth using a mouse model of paternal low protein diet (LPD) or Western Diet (WD) feeding, with or without supplementation of methyl-donors and carriers (MD). They found diet- and sex-specific effects of paternal diet alteration. All experimental diets decreased paternal body weight and the number of spermatogonial stem cells, while fertility was unaffected. WD males (irrespective of MD) showed signs of adiposity and metabolic dysfunction, abnormal seminiferous tubules and dysregulation of testicular genes related to chromatin homeostasis. Conversely, LPD induced abnormalities in the early placental cone, fetal growth restriction and placental insufficiency, which was partly ameliorated by MD. The paternal diets changed placental transcriptome in a sex-specific manner and led to a loss of sexual dimorphism in the placental transcriptome. These data provide a novel insight on how paternal health can affect the outcome of pregnancies, which is often overlooked in prenatal care.

      Strengths:

      The authors have performed a well-designed study using commonly used mouse models of paternal underfeeding (low protein) and overfeeding (Western diet). They performed comprehensive phenotyping at multiple timepoints including of the fathers, the early placenta and late gestation feto-placental unit. The inclusion of both testicular and placental morphological and transcriptomic analysis is a powerful non-biased tool for such exploratory observational studies. The authors describe changes in testicular gene expression revolving around histone (methylation) pathways that are linked to altered offspring development (H3.3 and H3K4), which is in line with hypothesised paternal contributions to offspring health. The authors report sex differences in control placentas that mimic those in humans, providing potential for translatability of the findings. The exploration of sexual dimorphism (often overlooked) and its absence in response to dietary modification is novel and contributes to the evidence-base for the inclusion of both sexes in developmental studies.

      Comments on revised version:

      The authors have done a great job addressing my concerns. The description of the data analysis and the figures are now much clearer. The inclusion of the potential links between the microbiome and male reproductive fitness is informative and improves the flow of the discussion.

    2. Reviewer #2 (Public review):

      Summary:

      The authors investigated the effects of a low-protein diet (LPD) and a high sugar- and fat-rich diet (Western diet, WD) on paternal metabolic and reproductive parameters and feto-placental development and gene expression. They did not observe significant effects on fertility; however, they reported gut microbiota dysbiosis, alterations in testicular morphology, and severe detrimental effects on spermatogenesis. In addition, they examined whether the adverse effects of these diets could be prevented by supplementation with methyl donors. Although LPD and WD showed limited negative effects on paternal reproductive health (with no impairment of reproductive success), the consequences on fetal and placental development were evident and, as reported in many previous studies, were sex-dependent.

      Strengths:

      This study is of high quality and addresses a research question of great global relevance, particularly in light of the growing concern regarding the exponential increase in metabolic disorders, such as obesity and diabetes, worldwide. The work highlights the importance of a balanced paternal diet in regulating the expression of metabolic genes in the offspring at both fetal and placental levels. The identification of genes involved in metabolic pathways that may influence offspring health after birth is highly valuable, strengthening the manuscript and emphasizing the need to further investigate long-term outcomes in adult offspring.

      The histological analyses performed on paternal testes clearly demonstrate diet-induced damage. Moreover, although placental morphometric analyses and detailed histological assessments of the different placental zones did not reveal significant differences between groups, their inclusion is important. These results indicate that even in the absence of overt placental phenotypic changes, placental function may still be altered, with potential consequences for fetal programming.

      Comments on revised version:

      The authors have adequately addressed all my previous comments.

    1. Joint Public Review:

      [Editor's Note: The previous reviewers comments were felt to be addressed by the reviewers and myself and have improved the work.]

      In this study, the authors suggest that DuoHexaBody-CD37, a biparatopic CD37-targeting antibody, can induce direct cytotoxicity in diffuse large B-cell lymphoma (DLBCL) cells through antibody clustering and SHP-1 activation, independent of complement. They further propose that DuoHexaBody-CD37 inhibits cytokine-mediated pro-survival signalling, suggesting a broader role for CD37-directed therapy in disrupting tumour supportive signalling networks.

      A strength of the study is the systematic in vitro characterisation of signalling responses to DuoHexaBody-CD37 across both malignant and normal B-cells. The inclusion of phosphoproteomic profiling and mutant constructs provides mechanistic detail, and the findings may be of interest to researchers working on antibody therapeutics in lymphoma.

      However, the evidence supporting key mechanistic processes - particularly the specific subtype requirement for Fc receptor crosslinking - is incomplete and would benefit from further functional validation. While CD37 has been explored previously as a therapeutic target, this study does add mechanistic insight into direct cytotoxicity and cytokine modulation. Nevertheless, the exclusive reliance on in vitro systems makes the translational relevance unclear.

      Overall, the study provides valuable insight into CD37-mediated signalling in lymphoma cells, but the evidence remains incomplete to support broader conclusions about therapeutic impact. The additional mechanistic data included during revision are informative, but the precise basis of the observed cytotoxic effects remains incompletely defined.

    1. Reviewer #1 (Public review):

      The manuscript "Heterozygote advantage cannot explain MHC diversity, but MHC diversity can explain heterozygote advantage" explores two topics. First, it is claimed that the recently published by Mattias Siljestam and Claus Rueffler conclusion (in the following referred to as [SR] for brevity) that heterozygote advantage explains MHC diversity does not withstand an even very slight change in ecological parameters. Second, a modified model that allows an expansion of MHC gene family shows that homozygotes outperform heterozygotes. This is an important topic and could be of potential interest to the readership of eLife if the conclusions are valid and non-trivial.

      The resubmitted manuscript addresses several questions from my previous review. In particular, there is a more detailed description of how the code of Siljestam and Rueffler ([SR]) was used for the simulations and the calculation of the factor 2.7 x 10^43 that is the key to the alleged breakdown of the numerical reasoning presented by in [SR].

      Yet I think that important aspects of my critique of the first statement of the manuscript about the flaws of [SR] model remain unanswered. I guess the discussion becomes rather general about the universality and robustness of various types of models to parameter changes. My point is that none of the models is totally universal. The model in [SR] is not phenomenological as none of the parameters or functional forms were derived empirically. Instead, it is a proof of principle demonstration that inevitably grossly simplifies the actual immune response. The choice of constants and functions used in Eqs. (1-5) is dictated by the mathematical convenience and works in a limited range of parameter values. It is shown in [SR] that for 3 pathogens and reasonable "virulence " \nu, the alleles branch. These conclusions are supported by the analytically derived Adaptive Dynamics branching criteria (7), which, contrary to the statement is the cover letter (" It is clear from Fig. 4 of Siljestam and Rueffler that the branching condition is far from sufficient for high MHC diversity.") is perfectly confirmed by the simulation data shown in Fig. 4.

      The mathematical simplicity of the [SR] model generates various artifacts, such as the mentioned by the Author reduction of the "condition" by an enormous factor 2.7 x 10^43 and the resulting decrease in the "survival" induced by the addition of a new pathogen. This occurs at the very large value of \nu=20, whose effect is enormous due to the Gaussian form of (1), which, once again, was chosen for the mathematical convenience. In reality, a new pathogen cannot reduce the "survival" by such a factor as it would wipe out any resident population. So to compensate for such an artifact, the additional factor c_max was introduced to buffer such an excess. There is no reason to fix c_max once for an arbitrary number of pathogens, because varying c_max basically reflects the observation that a well-adapted individual must have a reasonable survival probability. At the same time, there are many ways in which the numerical simulation may break down when the survival rates become of the order of 10^(-43) instead of one, so it comes to no surprise that the diversification, predicted by the adaptive dynamics, does not readily occur in the scenario with an addition or removal of the 8th pathogen with a very high virulence \nu=20.

      I have doubts that the reported breakdown of the [SR] model with fixed c_max remains observable with less extreme values of m and \nu (say, for \nu=7 and m=3 plus or minus 1 used in Fig. 3 in the manuscript).

      So I still find the claim that " the phenomenon that leads to high diversity in the simulations of Siljestam and Rueffler depends on finely tuned parameter values" is not well substantiated.

    2. Reviewer #2 (Public review):

      Summary:

      This study addresses the population genetic underpinnings of the extraordinary diversity of genes in the MHC, which is widespread among jawed vertebrates. This topic has been widely discussed and studied, and several hypotheses have been suggested to explain this diversity. One of them is based on the idea that heterozygote genotypes have an advantage over homozygotes. While this hypothesis lost early on support, a reason study claimed that there is good support for this idea. The current study highlights an important aspect that allows us to see results presented in the earlier published paper in a different light, changing strongly the conclusions of the earlier study, i.e., there is no support for a heterozygote advantage. This is a very important contribution to the field. Furthermore, this new study presents an alternative hypothesis to explain the maintenance of MHC diversity, which is based on the idea that gene duplications can create diversity without heterozygosity being important. This is an interesting idea, but not entirely new.

      Strength:

      (1) A careful re-evaluation of a published model, questioning a major assumption made by a previous study.

      (2) A convincing reanalysis of a model that, in the light of the re-analysis-loses all support.

      (3) A convincing suggestion for an alternative hypothesis.

      Weakness:

      (1) The title of the study is catchy, but it is explained only in the very end of the paper.

    1. Reviewer #1 (Public review):

      In this study, the authors set out to determine how two classes of kinase inhibitors, which stabilise a disease-relevant enzyme in either an active (Type I) or inactive state (Type II), influence its organisation and interactions with microtubule filaments in cells. Using the state-of-the-art in-cell structural imaging approaches, they examine how these compounds affect the formation of protein filaments and their association with microtubules, and succeed in defining the underlying structural basis for these differences.

      A major strength of the work is the application of in-cell cryo-electron tomography combined with correlative imaging, which enables direct visualisation of protein organisation in a near-native cellular context. The data convincingly demonstrate that the Type I inhibitor compound stabilising the active state promotes extensive LRRK2 filament formation and microtubule bundling, whereas compounds stabilising the inactive state markedly reduce these interactions. The structural analysis further provides insight into how conformational states relate to filament organisation, including modelling of previously unresolved regions of the protein.

      These findings are internally consistent and align well with prior biochemical and structural studies, many of which were performed by the same team.

      There are, however, some limitations that should be noted. The experiments rely on overexpression of the I2020T mutant form of the LRRK2 protein, which is a rare variant, in a single cell type (293T cells), which may not fully reflect endogenous behaviour or wild-type LRRK2 in a physiological context. In addition, while the imaging data are compelling, the functional consequences of the observed filament formation and microtubule association remain unclear.

      The study therefore provides strong descriptive and structural insight, but more limited evidence linking these observations to cellular or disease-relevant outcomes.

      Overall, the authors largely achieve their aims, and the results support their central conclusion that different classes of kinase inhibitors have distinct effects on protein organisation in cells. The work represents an important advance in understanding how small molecules can reshape protein architecture in a cellular environment, with potential implications for therapeutic strategies. The methodological approach will also be of broad interest to the field, as it highlights the power of in-cell structural biology to study dynamic protein assemblies that are difficult to capture using traditional approaches.

    2. Reviewer #2 (Public review):

      Summary:

      Mutations in Leucine-Rich Repeat Kinase 2 (LRRK2) are a major cause of Parkinson's disease. LRRK2 PD-related mutations all result in increased kinase activity. Therefore, LRRK2 has been the focus of the development of kinase inhibitors. So far, two classes of kinase inhibitors have been identified: type 1 LRRK2-specific inhibitors that stabilize LRRK2 in a closed active-like conformation and broad-range type 2 inhibitors that stabilize LRRK2 in an open inactive-like conformation. Basiashvili et al. used here in cell structural biology to study the effect of both type 1 and type 2 inhibitors on the localization and structural conformation of LRRK2-I2020T.

      Strengths:

      They showed that Type 1 and not Type 2 inhibitors induce LRRK2 filament/ on microtubules. Furthermore, they were able to build a structural map of full-length LRRK2 I2020T bound to a Type 1 inhibitor in a closed kinase confirmation. Together, this work thus confirms the data of previous studies that showed that LRRK2 Type 1 and 2 inhibitors differently affect filament formation.

      Weaknesses:

      All conclusions are fully supported by the provided data. However, as the authors indicated themselves, the physiological relevance of LRRK2 microtubule binding is questionable. Furthermore, although the authors used a full-length LRRK2 protein, like in previously published structures, the resolution of the N-terminal domains is rather poor. Therefore, it also remains unclear what we learn from this structure compared to the previously published structures.

    3. Reviewer #3 (Public review):

      Summary:

      This paper describes new insights into the effects of type-I and type-II LRRK2 inhibitors on HEK293T cells that over-express GFP-labeled LRRK2-I2020T. Using correlative light microscopy and cryo-electron tomography, a type-I inhibitor leads to the extensive decoration of microtubules with LRRK2, which is not seen for a type-II inhibitor. Subtomogram averaging reveals that LRRK2 binds to the microtubules in a closed-kinase conformation, with density for the N-terminal arms.

      Strengths:

      The paper is well written; the CLEM and cryo-ET appear to be done to a high standard. Consequently, I have only minor comments.

      Weaknesses:

      The resolution of the subtomogram averages is somewhat limited, but the authors have adequately limited the number of degrees of freedom in the fitting of their atomic models by only allowing rigid-body transformations of separate parts of LRRK2.

      The authors should include FSC curves between the rigid-body fitted atomic models and the various sub-tomogram average maps.

    1. Reviewer #1 (Public review):

      The authors of this study developed a method to quantify calvarial bone marrow from MRI head scans, enabling the study of its composition in large datasets of adults, usually collected to study the brain. Bone marrow intensity can be semi-quantitatively measured in T1-weighted MRI scans due to the greater signal intensity of fat than watery red marrow. This is an ingenious use of the MRI-produced information for other important phenotypes, such as bone structure and marrow content. Different head types were tested for complying with the model, which is notable.

      The model was also successfully validated using several publicly available MRI resources - real data - in (1) a dataset consisting of 30 individuals that were scanned 10 times each at 3-day intervals, and (2) the monozygotic (MZ) twin data from the Human Connectome Project cohort. Then the authors applied this validated method to head-MRI scans from the UK Biobank (n=33,042) to extract information on the spatial distribution of bone marrow adiposity (BMA) in the calvaria, allowing a GWAS to identify associated genes.

      The authors revealed high heritability and identified 41 genetic loci significantly associated with the BMA trait, including six sex-specific loci. Of note, statistics estimate that 99% of BMA trait-influencing variants are shared with BMD (497 of 500 variants), which may mean these results demonstrate the biological relevance to bone health. Some of the BMA genes were found related to the Wnt pathway, including WNT16, WNT4, NXN; this is a "positive control", since the Wnt/β-catenin signaling pathway was suggested as an important determinant of BMA. Also, associations in genes (BMP4, DLX5, LGR4, LRP4, SFRP4) that are known to specifically influence adiposity, are encouraging. Integrating mapped genes with bone marrow single-cell RNA-seq data revealed patterns of adipogenic lineage differentiation and lipid loading.

      The study also investigated the genetic overlap between BMA and twelve (or 13) "brain and body" traits and identified significant genetic correlations with BMI, cognitive ability, and Parkinson's disease.

      In sum, since MRI head scans present a hitherto unexplored opportunity to address unresolved aspects of bone marrow biology, this study is both timely and innovative.

      There are, however, some assumptions, findings, and their interpretation, which require more critical focus.

      Sex-specificity is well described and studied here. Men have higher BMA than women, but post-menopausal women catch up in the BMA values. The authors believe that calvarial marrow has a number of features that make it particularly well-suited to the study of BMA process - which is clinically important in other bone sites. It has a simple "sandwiched" structure that they are able to model. This is true only to some extent: a condition called "Hyperostosis frontalis interna", of unknown etiology (described by Smith & Hemphill in 1956) - is characterized by irregular overgrowth of the inner table of the frontal bone (symmetric/bilateral). Although not of clinical significance, typically benign, studies report a prevalence of 12%; However, it's most common in postmenopausal women - where prevalences up to 49% in women over the age of 65 - have been reported. Thus, sexual dimorphism is obvious and the effect of estrogen is likely shared with whichever bone - and marrow - age-related pathology. So, for women not using HRT, this new layer of the bone might interfere with the calvarial BMA readings and in turn, affect the BMA-related analyses. The authors suspect that the effect of BMA on BMD may be biased in women; they should comment on those "with low BMD and high BMA" given that hyperostosis frontalis might be an issue. A strong effect of SNPs in the ESR1 chromosomal region might be akin to the above concern.

      Then, there is a perfect overlap of the BMA SNPs that are shared with BMD (497 of 500 variants), which may prove a "face validity" of the MRI-derived BMA. However, the BMD in the study was heel-derived eBMD - which is a good proxy for osteoporosis and is mostly driven by trabecular bone. Thus, there might be a concern that the BMA metrics capture some trabecular BMD.

      Next, integrating mapped genes with existing bone marrow single-cell RNA-sequencing data revealed patterns of adipogenic lineage differentiation and lipid loading. The problem here is that the scRNAseq studies of the Bone Marrow niche are overwhelmingly mouse. The authors might wish to justify why they are relevant to humans (in the absence of the human-specific scRNAseq).

      For genetic correlation analysis, the authors selected 7 body and 6 brain traits. The latter traits reflect cognition (general cognitive ability and educational attainment) and brain-related disorders. This selection might seem arbitrary. The interpretation of genetic correlation with cognitive ability, education, and Parkinson's disease was attributed to the recently discovered vascular channels that link calvarial bone marrow to the meninges. This is a fascinating hypothesis, which requires functional proof. However, there might be simpler explanations. Thus, the diploe and the inner table of the calvarium are drained by the same veins as the dura. From the anatomy textbook, we know that diploic veins connect the pericranial and endocranial venous system through the skull.

    2. Reviewer #2 (Public review):

      Summary:

      This study develops a new artificial intelligence method for high-throughput analysis of skull bone marrow from MRI data, which may be useful for large-scale biological analyses. Using this method, the authors then attempt to estimate skull bone marrow adiposity (BMA) using T1-weighted signal intensity from MRI scans of ~33,000 people, followed by genome-wide association analysis; however, the approach is inadequate because T1-weighted signal intensity is not validated for measurement of bone marrow adiposity. If it could be validated, the study would be an important advance in understanding of bone marrow adiposity and skeletal biology.

      Strengths:

      This paper is well-written, and the figures are nicely presented. The neural network method used for analysing skull bone marrow is innovative, and the authors validate this through several approaches. Therefore, the authors have achieved the aim of developing a method for large-scale analysis of skull bone marrow from MRI data.

      The GWAS is reasonably well-powered and addresses potential ethnicity differences, with one GWAS done across white males and females, and a separate GWAS in non-white participants. The methodology also conforms to common GWAS standards, including for mapping genetic variants to candidate genes. Moreover, the study further investigates the biological roles of these genes by analysing their expression in single-cell RNA sequencing data.

      Weaknesses:

      The fundamental weakness is that T1-weighted MRI signal intensity (T1W) is used as an estimate of BMA, but it has never been validated for this. The authors show that this T1W parameter measures something that is heritable and can be compared between subjects, but they don't show that it actually measures (or even estimates) calvarial BMA. There is an attempt to do so by comparing the T1W parameter with data from quantitative T1 images: the authors show a reasonable correlation with some of the quantitative T1 image data. However, this still does not show that the parameter is measuring BMA; it could be measuring some other biological characteristic, but this remains unclear. So, there is a need to validate the T1W parameter against an established measure of BMA, such as the bone marrow fat-fraction or proton density fat fraction measured from multi-echo MRI analysis.

      Without validating this BMA measurement method, it is not possible to interpret the GWAS or other findings reported in the study.

      A less critical weakness is that the GWAS has been done only on a single cohort, without replicating the findings in a follow-up cohort. For example, the authors could repeat their analysis on the remaining ~50,000 UK Biobank imaging participants for whom MRI data is now available. However, this would be pointless without knowing what biological characteristic(s) the T1W parameter is actually reflecting.

      [UPDATE, June 2026: since writing this review in September 2024, the reviewer has changed their opinion and now has confidence in the reliability of the T1W method used to estimate BMA. The reviewer would like to explain that their original critiques were based largely on previous discussions with a colleague with expertise in magnetic resonance and medical physics, who was extremely negative about use of T1W signal intensity to estimate BMA; this colleague’s criticisms may not have been objective, and clouded the reviewer’s overall impression of the present study. The reviewer and others have since completed BMA analysis using dual-echo MRI data in the UK Biobank; the findings of these studies, both for genetic and pathophysiological associations, are largely consistent with the findings of the present study, underscoring the reliability of the T1W-based BMA estimates.]

    3. Reviewer #3 (Public review):

      Summary:

      This manuscript, "Estimating bone marrow adiposity from head MRI and identifying its genetic 2 architecture", brings together the groups of Drs. Kaufmann and Hughes in a tour de force work to develop an artificial neural network that localizes calvaria bone marrow in T1-weighted MRI head scans, with the goal of studying its composition in several large MRI datasets, and to model sex-dimorphic age trajectories, including the effect of menopause.

      Strengths:

      Bone marrow adiposity is a very active tissue with far-reaching implications for tissue crosstalk and human health than we had initially recognized. Although MRI has been used to measure BM, studies such as the one by these two groups are still lacking whereas very large datasets are analyzed using advanced AI machine learning tools coupled with genetic studies and a specific pathology. The groups had to develop new methods and new AI machine-learning tools for the imaging analyses.

      Weaknesses:

      Some aspects of the work that authors could add additional clarification.

      (1) Imaging Limitations: The authors provide an excellent overview and references supporting the use of MRI as a method for assessing marrow fat, particularly with some specific modifications. However, MRI images can be affected by various factors, including the presence of other tissues as well as specific MRI settings, which are much harder to precisely control when using different datasets.

      (2) The specific density of cranial bones as it relates to the types of bone marrow: Cranial bones are extremely dense structures, which naturally interfere with MRI imaging. While it is thought that cranial bones have mostly "red bone marrow", this is only true for a short time in humans. How sensitive is their system in differentiating between red and yellow BM?

      (3) Both items above are further complicated by aging, but aging is not a linear event as we have learned. There are specific bursts of aging in humans around the age of 45 and early 60s. How do the system and model predict or incorporate these peaks of aging? It seems from the data shown that aging is reflected more as a linear phenomenon. Is this because additional aging datasets are needed?

      (4) The authors describe in richness of detail their AI learning programming and how it extracted the data from datasets. The authors also show some important correlations with specific genes, SNPs. What is not clear is how conditions such as anemia for example. An expected finding would be that patients with chronic anemia have lower bone marrow (BM) signal intensity on MRI scans than healthy people. This is because the signal intensity of BM depends on the fat-to-cell ratio in the tissue. Furthermore, patients with a host of musculoskeletal disorders ranging from osteopenia to osteoporosis, sarcopenia, and osteosarcopenia will also have altered MRI scans. When using such large datasets how did the authors control or exclude these pathological conditions, or were all these conditions likely present?

      (5) Some of the genes and SNPs although significant showed very small correlations. What is their likely physiological significance?

      (6) The authors could use this excellent manuscript to expand their discussion to include the need for studies like theirs to be also complemented by multi-OMICS studies that will include proteomics and lipidomics of BM, bones, and muscles.

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, the authors investigate ubiquitylation of RPS27A/eS31 by the E3 ligase RNF25 in response to translational stress. Previous studies have identified RPS27A/eS31 ubiquitylation at Lys113 under conditions where translation factors are trapped in the ribosomal A-site. Here, the authors extend this work by testing whether additional translational stress conditions, including amino acid deprivation, induce RPS27A/eS31 ubiquitylation. They further show that GCN1 is required and explore a possible competition between RNF25 and GCN2 for GCN1.

      Strengths:

      This study expands on the range of stress conditions leading to RPS27A/eS31 ubiquitylation, reporting that it occurs in a variety of conditions associated with ribosome stalling, including amino acid deprivation. These observations are useful because they suggest that the RNF25 pathway may not require translation factors trapped in the ribosomal A-site, but may instead respond more broadly to translational perturbations associated with ribosome collisions.

      Weaknesses:

      The evidence supporting several of the major claims is incomplete, and additional controls and orthogonal approaches would greatly strengthen the evidence presented. In particular:

      (1) It is unclear whether the different conditions used to induce translational stress lead to ribosome stalling or collisions. The model presented by the authors seems to rely on ribosomal collisions, but this is not shown. In addition, further investigating amino acid deprivation beyond the removal of Arg or Lys would strengthen the paper.

      (2) Ubiquitylation of RPS27A/eS31 by RNF25 is used throughout the paper as a readout of RNF25 activity and is assumed to be on Lys113 based on previous work, but is not formally shown here.

      (3) Rescue experiments of the different mutants used in this study with wild-type and different domain deletions (i.e., ΔRWD for RNF25, ΔRWD-binding for GCN1) would help confirm specificity and strengthen the mechanistic claims.

      (4) The conclusion that RPS27A/eS31 ubiquitylation supports translation (Figure 4) is based entirely on polysome/monosome ratios, which are difficult to interpret without additional assays of translation output, elongation, or collision.

      (5) The idea that RNF25 competes with GCN2 for GCN1 binding is interesting, and related models have recently been proposed in RNA damage. The effect of GCN2 KO on RNF25-dependent ubiquitylation appears modest, and the data would be strengthened by rescue experiments with wild-type GCN2 and GCN2 mutants defective in GCN1 binding. The authors propose: "that the RNF25 pathway acts as a first line of defence to resolve ribosome collisions, outcompeted by GCN2 binding to GCN1 under acute stress." This model would suggest a further increase in RPS27A/eS31 ubiquitylation upon Arg/Lys deprivation in GCN2 KO cells, since this is the condition in which GCN2 is expected to be activated and engaged with GCN1 (i.e., when it would be competing with RNF25), but no further increase in RPS27A ubiquitylation is observed. It is therefore not clear that these data support the proposed model. Contributing to this may be the fact that many of these assays are performed in a USP16 KO background, which may make it difficult to assess changes in RPS27A/eS31 ubiquitylation.

      (6) Given that several RWD domain proteins can interact with GCN1, and that DRG2 KO appears to affect RPS27A/eS31 ubiquitylation (Figure S5), the data do not support the GCN2-specific title. The results are more consistent with a broader, incompletely characterized network of GCN1-associated RWD domain-containing proteins that seems to affect RNF25-dependent ubiquitylation rather than with a demonstrated RNF25-GCN2 competition mechanism. Further characterization of GCN2-dependent ISR activation (p-eIF2a and ATF4 WB) in the absence of RNF25 in Arg/Lys starvation will help shed light on the RNF25-GCN2 competition. The authors use K113R, but this is not shown to prevent RNF25 engagement with GCN1, so a RNF25 KO should be used.

      Overall, the study contains useful observations, but the mechanistic claims are not yet fully supported.

    2. Reviewer #2 (Public review):

      Summary:

      The authors show that deprivation of Arginine and Lysine induces a ~50% increase in the ratio of ubi-RPS27A to RPS27A, and this induction requires E3 ubiquitin ligase RNF25. The authors show ZAKalpha and EDF1 are not required for steady state or ribosome stalling-induced ubi-RPS27A, while GCN1 is required. The ratio of polysomes to monosomes is increased in RNF25 knockdown cells or when translation is activated by ISRIB in a RPS27A K113R mutant cell line. GCN2 KO cells indicate elevated levels of ubi-RPS27A, and overexpression of the GCN2 RWD domain reduces levels of ubi-RPS27A.

      Strengths:

      (1) The authors identified a novel pathway to sense amino acid deprivation, indicated by ubi-RPS27A, previously implicated in ribosome stalling.

      (2) The authors find antagonism between two proteins known to act downstream of GCN1, giving insight into how signaling occurs from an upstream sensor of ribosome stalling to multiple downstream pathways.

      Weaknesses:

      (1) The authors suggest that, based on increased Polysome/Monosome ratios, there is more disome stalling in RNF25 KD cells and RPS27A K113R cells treated with ISRIB, but this readout is very indirect and could be driven by other changes in the cell other than ribosome stalling.

      (2) While the authors propose that GCN2 and RNF25 compete for binding to GCN1, no evidence was shown that RNF25 binds to GCN1 in cells, nor that the interaction increases when GCN2 is absent.

      (3) The use of USP16 to enhance the detection of ubi-RPS27A in many experiments brings the question of whether USP16 KO may alter the protein levels of any known regulators of ribosome collisions? (i.e. ZNF598, GCN1, EDF1, ZAKalpha, etc.) If USP16 KO causes changes in other important regulators of collisions, the authors could be identifying genetic interactions with USP16 in their experiments throughout the paper.

      (4) In Figure 5E, the expression level of the GCN2 3K RWD domain looks to be lower than the WT RWD domain; perhaps this could be what is driving the smaller decrease of ubi-RPS27A seen with GCN2 3K vs WT.

    3. Reviewer #3 (Public review):

      Summary:

      This study examines the role of RNF25 in translational quality control. Previous work indicated that RNF25 is activated by ribosomes stalled with defective elongation or termination factors bound in the A-site. Here, the authors provide evidence that RNF25 is activated by other treatments that evoke ribosome stalling, including amino acid starvation, where the A-site may be empty, leading to ubiquitination of RPS27A in a manner requiring the ISR collision sensor Gcn1, but not EDF1 and ZAKα, involved in the RQC and RSR surveillance pathways. They present some evidence from polysome profiling that RNF25 and its ubiquitination of RPS7A help resolve ribosome collisions and support translation elongation in basal conditions. They further show that KO of Gcn2 increases RPS27A ubiquitination in basal conditions, but not in amino acid-starved cells, and that RPS27A ubiquitination was reduced on overexpressing the WT RWD domain of Gcn2 but not a variant harboring substitutions of residues predicted to bind Gcn1. Based on these findings, they propose a model that, in response to ribosome stalling induced by various stresses, Gcn1 recruits RNF25 via the latter's RWD domain to ubiquitinate RPS27A and thereby resolve ribosome stalling and promote continued elongation. If collisions increase even further, GCN1 recruits GCN2 instead of RNF25 to elicit the ISR.

      Strengths:

      The data is convincing that a variety of triggers leading to diverse stalled ribosomal states, including amino acid limitation, can activate RNF25, suggesting that activation of this pathway does not require the presence of trapped protein factors in the ribosomal A-site but is a more general response to ribosome collisions. It is also convincing that Gcn1 is required for RNF25 activation under all of these conditions, which is consistent with previous findings that Gcn1 is required for RNF25 function in the presence of trapped elongation or termination factors. The finding that EDF1 and ZAK are not needed for RNF25 activation in amino acid starvation conditions is of interest for EDF1, given the recent claim that it is required for full ISR activation.

      Weaknesses:

      The evidence presented from polysome profiling that RNF25 helps resolve naturally occurring ribosome collisions in basal conditions is not compelling, as eliminating RNF25 could be increasing the rate of initiation rather than increasing stalled ribosomes as the means of increasing the P/M ratio. The Rps27A-K113R mutation could have the same effect of increasing initiation, which could have been obscured by inhibiting the ISR with ISRIB.

      The evidence that RNF25 competes with Gcn2 for Gcn1 binding is also not compelling. While it's convincing that Rps27A-Ubi is elevated in basal conditions on eliminating Gcn2, loss of GCN2 would be expected to increase ribosome loading on mRNAs, potentially elevating the frequency of collisions and thereby stimulating RNF25 activity indirectly.

      It's also quite puzzling and left unexplained why they observed no further increase in Rps27A-Ubi on -Arg/-Lys starvation in the cells lacking Gcn2. Why wouldn't -Arg/-Lys starvation lead to further stalling and RNF25 activation in the absence of Gcn2? (Since Gcn2 KO increases Rps27A-Ubi in the presence +Arg/+Lys conditions, it can't be that Gcn2 is required for RNF25 function.) The same puzzling and unresolved observation was made in the cells lacking DRG2. One possible explanation for this conundrum is that low-level RNF25 abundance limits further activation.

      The quantitative effects of overexpressing the Gcn2 RWD domain on Rps27A-Ubi, constituting their other evidence presented to support the competition model, are quite small in magnitude.

    1. Reviewer #1 (Public review):

      Summary:

      This study identifies mutations in alpha-tubulin that suppress Tau-induced neurodegeneration using the C. elegans model of Tauopathy, suggesting a potentially interesting role for microtubule properties in modulating Tau toxicity. These missense mutations cluster in the C-terminal Tau-interacting helix 12 region of alpha-tubulin genes (tba-1, tba-2, and mec-12). Further analysis, particularly using the strongest suppressor tba-2, shows that it rescues Tau-induced behavioral deficits and neuronal loss without significantly altering bulk tau-phosphorylation, aggregation, or binding to soluble tubulin. The authors suggest that altered microtubule properties underlie the neuroprotective effects, and manipulating microtubule properties may have therapeutic potential.

      Strengths:

      The study is conceptually interesting as it shows that Tau-induced neurotoxicity can, in this model, be partially uncoupled from canonical pathological hallmarks such as Tau-hyperphosphorylation and aggregation. The identification of multiple independent mutations in the same structural region of three alpha-tubulin genes provides support for the functional relevance of helix 12 in modulating Tau-induced toxicity. The authors demonstrate significant rescue of behavioral deficits (using motility and manual thrashing assays) and neuronal loss in both WT-tau and FTLD-associated TauV337M in combination with mutant alpha-tubulins, suggesting a general mechanism for tubulin-regulated modulation of Tau-toxicity. Moreover, the correlation between mutant tubulin expression levels and the extent of rescue supports a causal relationship.

      Weaknesses:

      One of the major claims of this manuscript is that altered microtubule properties suppress Tau toxicity. The only supporting evidence in this context provided by the authors is reduced taxol-stabilized microtubule mass, which does not fully explain neuronal loss or the rescue of behavioral deficits. What remains unclear is whether these mutations alter microtubule dynamics, catastrophe, lattice stability, or axonal transport.

      The authors show that mutant tba-2 reduces total tau levels by ~45%. This level of reduction is likely significant but underexplored in the manuscript. Why are the Tau levels reduced? How is Tau getting cleared- is there enhanced autophagy or ubiquitin-proteasome pathway getting upregulated in tba-2 + Tau animals? Or one or more of the Tau species not detectable by the antibodies used in this study? The observation that the mec-12 mutant rescues Tau-induced phenotypes without altering Tau levels suggests that suppression can occur through Tau-independent mechanisms. This raises an important unresolved question regarding the extent to which suppression is Tau-dependent vs Tau-independent across different mutant alpha-tubulin genes, complicating the interpretation of the rescue phenotypes.

      Given that Tau primarily associates with the microtubule lattice in vivo, measuring interactions with soluble tubulin may not fully capture biologically relevant binding dynamics and therefore does not exclude the possibility that these mutations alter tau-microtubule interactions at the lattice level or may affect the binding of other MAPs/regulators, thereby altering stability or trafficking.

      A large body of conclusions is drawn from behavioral rescue and biochemical assays. This limits the understanding of how molecular changes in tubulin might affect cellular mechanisms of neuroprotection. Are there changes in the neuronal microtubule organization, Tau localization, or its redistribution in the mutant alpha-tubulin background? Are there differences in soluble vs oligomeric vs insoluble Tau in mutant tba-2 and mec-12 animals?

      The suppression of behavior in the co-pathology model is interesting but mechanistically insufficient, mainly because the underlying basis of suppression is not examined in these models. Moreover, it remains unclear whether tubulin-Tau genetically interacts with Aβ or TDP-43, and what cellular mechanisms account for the partial rescue observed in these co-pathology models.

    2. Reviewer #2 (Public review):

      Summary:

      The manuscript by Benbow et al. identifies, through a genetic screen, key tubulin mutants that, with high confidence, rescue tau-mediated ND phenotypes. This manuscript is well written, and the experimental results strongly support the authors' claims that these tubulin mutants can rescue ND-linked phenotypes in C. elegans while having little to no direct effect on Tau aggregation.

      Strengths:

      Benbow et al. use a relatively unbiased forward genetic screen to identify mutations associated with phenotypes that suppress tauopathy-related defects. The authors then logically focus on the various α-tubulin missense mutations identified in H12, which are known to localize to the external face of microtubules. The authors also carefully compare their established tauopathy-associated phenotypes in the WT TauH model, with and without specific α-tubulin mutations, using appropriate controls throughout. Lastly, the authors provide partial mechanistic insight into the α-tubulin mutant-mediated rescue, showing that these effects are independent of tau aggregation and tau phosphorylation, and instead suggest that the α-tubulin mutations may confer altered microtubule assembly properties based on the sedimentation assays.

      Weaknesses:

      While the claims are largely supported by the experimental outcomes, the authors at times do not provide enough detail in the text for readers to interpret the data sets independently. In addition, some claims appear to be slightly overstated relative to the data or the degree of error associated with those data.

    1. Reviewer #1 (Public review):

      Summary:

      The protein DELE1 is a critical component to signal mitochondrial stress to the cytosol: under stress conditions, a truncated form of DELE1, termed DELE1(CTD) accumulates in the cytosol as an oligomer, binds the HRI kinase, which triggers the integrated stress response.

      Leveraging the structural knowledge of the DELE1(CTD) oligomer, this study attempts to interfere with the oligomerization process, using an AI-designed protein that binds to the DELE1(CTD) oligomerization interface. The starting hypothesis is that such a binder shall selectively inhibit the DELE1-signalled mitochondrial stress response. The authors use established AI pipelines (RFdiffusion) to make a series of such binders, characterize them with biochemical methods and a crystal structure of the binder in its free state. When over-expressing the binders in HEK293T cells, the authors report that mitochondrial stress - induced with a drug - does indeed not lead to triggering the stress response, confirming their starting hypothesis.

      The work is an elegant demonstration of how AI-designed proteins can specifically interfere with cellular mechanisms.

      The conclusions of the work are mostly well supported by data; there are some mechanistic gaps, however, about the interaction mechanisms.

      Strengths:

      The study is a nice combination of (i) a clear structure-derived hypothesis on how to interfere with a signalling mechanism, (ii) state-of-the-art protein design tools, (iii) a mostly robust biochemical characterization, and (iv) cellular experiments to demonstrate the effects of the binders.

      Weaknesses:

      The crystal structure of the binder5, while confirming its AlphaFold model, does not provide direct evidence of the binding mode to DELE1. Direct structure determination, using crystallography (which may require cleaving the MBP domain) would make their mechanistic arguments stronger.

      The demonstration that the binders do not inhibit the DELE1-HRI interaction is interesting; however, the underlying mechanism, in particular where the DELE1-HRI binding occurs, is not explored.

      While this study opens perspectives on how to interfere with DELE1-signalling, it is unlikely that these binders are actually useful for medical applications (compared to small-molecule drugs), as acknowledged in the manuscript.

    2. Reviewer #2 (Public review):

      Summary:

      Previous structural analyses of DELE1 by the authors revealed that the first α-helix within the TPR repeat domain provides the oligomeric interface of DELE1, and that DELE1 octamer formation is required for maximal ISR activation. Based on these findings, the authors designed peptides intended to bind this oligomeric interface and showed that these peptides interfere with DELE1 oligomerization in vitro and attenuate ISR activation in cultured cells.

      Strengths:

      The series of in-vitro data sets showing direct binding of the designed peptides to DELE1 and inhibitory effects on its oligomerization are convincing.

      Weaknesses:

      The physiological (or experimental) significance of inhibiting the DELE1-HRI-ISR pathway using these peptides has not been clearly demonstrated, particularly given that the very limited cell biological outcomes are tested in the current manuscript.

    3. Reviewer #3 (Public review):

      Significance of the findings and the strength of evidence:

      The article presented by Yang et al. describes the development of protein binders targeting the C-terminal domain of the protein DELE1, which is involved in the mitochondrial integrated stress response (mitoISR) pathway. It was shown earlier that DELE1 is imported into the mitochondria and cleaved by the inner mitochondrial membrane protease OMA1, resulting in an N-terminal and C-terminal domain, the latter being transported back into the cytosol, where it interacts and activates the kinase HRI. HRI, in turn, phosphorylates eIF2α, resulting in selective translation of mRNAs encoding proteins involved in stress signalling, such as the transcription factor ATF4. ATF4 activates expression of genes involved in amino acid balance, redox homeostasis and proteostasis. The C-terminal domain of DELE1 (DELE1CTD) was structurally and functionally characterized by earlier by cryo-EM by Jie Yang and co-workers. These studies suggest that it forms an octamer with D4 symmetry consisting of two tetramers arranged in a tail-to-tail arrangement. In this octamers two interfaces were identified, one between the monomers in the tetramers and one connecting the tetramers to form the octamer. In this earlier work, it was also shown by mutational studies that interrupting the first interface has an impact on the OMA1-DELE1-HRI-eIF2α-ATF4 pathway upon mitochondrial stress in human cells. To this end, the authors concluded in the current manuscript that it might be interesting and also of therapeutic interest to develop a protein binder that binds DELE1 and disrupts oligomer formation. The authors set up a de novo protein design approach using RFdiffusion to design a protein scaffold and ProteinMPNN to design the side chains to create protein binders targeting the α-helix α1 in DELE1CTD that is directly involved in the formation of the first interface forming the tetramer. As I am not an expert in protein design, I cannot judge the quality of this data. The candidates were evaluated by AlphaFold3 to confirm complexes formed between the designs and DELE1CTD. In the end, 12 designed protein binders were selected for further analyses. These proteins were recombinantly produced in E. coli and purified. The proteins DELE1 full-length (DELE1fl) and DELE1CTD were produced as MBP-fusion proteins to improve solubility and stability. Co-expression studies with mbp-delet1CTD revealed that 11 out of the 12 binders co-eluted with MBP-DELE1CTD from a size-exclusion chromatography column, indicating complex formation. Without the presence of the binders, MBP-DELE1CTD elutes as a higher oligomer, suggesting that the binders interfere with oligomerisation. Further analyses included the impact of the presence of selected binders on stress-induced ISR. The authors found that different binders had a slightly different impact on the outcome upon treatment with stressors, and also compared two different stressors. This was concluded by assessing the ATP4 protein level by immunoblotting. The interaction of selected binders with DELE1CTD was subsequently confirmed by co-immunoprecipitation experiments. To evaluate whether the impact of the binders is restricted to mitochondrial stress studies, eliciting endoplasmic reticulum stress showed no effect on ATF4 levels. The presence of the binders furthermore impaired recovery of tubulated mitochondria following mitochondrial stress induction, resulting in more fragmented mitochondria. The authors determined a crystal structure of one binder at a resolution of 2.6 Å and performed AlphaFold3 predictions to model the complex between binders and DELE1CTD. The interface is characterized by many hydrophobic residues. From this data, they concluded some interface mutants and tested those concerning their impact on the interaction. Indeed, mutation of these hydrophobic side chains to charged residues interfered with complex formation. Finally, the authors show that binder binding to DELE1CTD does not interfere with the binding of HRI kinase. Overall, the methodology applied is state-of-the-art, and the manuscript is well-written. The design of protein binders targeting DELE1 involved in mitochondrial stress signalling is interesting for basic science to study stress signalling, but also therapeutically. However, as ISR has a positive impact on disease development and ageing, but also a negative one, depending on the degree of activated ISR, a therapeutic use would need to be precisely applied. The study has some weaknesses, and particularly the structural data seems to have severe issues.

    1. Reviewer #1 (Public review):

      Summary:

      This study investigates epigenetic and three-dimensional chromatin alterations associated with primary trastuzumab resistance in HER2-positive breast cancer using integrated CUT&Tag, RNA-seq, and Micro-C analyses in JIMT1 (resistant) and SKBR3 (sensitive) cell models. The authors identify widespread remodeling of histone modification landscapes, chromatin compartment organization, and promoter-enhancer looping, highlighting SGK1 as a candidate epigenetically activated mediator associated with intrinsic resistance. The manuscript provides a technically solid and extensive multi-omic resource for the study of HER2-positive breast cancer resistance states.

      Strengths:

      The study integrates multiple state-of-the-art epigenomic and chromatin conformation approaches, including CUT&Tag, RNA-seq, and Micro-C, generating a comprehensive dataset that will likely be valuable to the field. The analyses are generally technically rigorous and well executed, and the manuscript is overall clearly written. The integration of chromatin architecture, enhancer activity, transcriptional regulation, and histone modification profiling provides an informative overview of large-scale epigenomic remodeling associated with resistant versus sensitive HER2-positive breast cancer states. The identification of SGK1-associated chromatin activation and enhancer rewiring is particularly interesting and supported by multiple orthogonal datasets.

      The inclusion of both intrinsic and acquired trastuzumab resistance models also strengthens the study conceptually, even if the biological interpretation remains somewhat complex.

      Weaknesses:

      The major limitation of the study is that many of the central mechanistic conclusions remain largely correlative. Although coordinated changes in chromatin architecture, histone modifications, enhancer activity, and SGK1 expression are observed, direct evidence demonstrating that these epigenetic alterations causally drive SGK1 activation or trastuzumab resistance is currently lacking.

      In addition, the interpretation of SGK1 as a broader trastuzumab-resistance driver is somewhat weakened by the analyses in the acquired resistant SKBR3_HR model, where SGK1-associated chromatin and transcriptional changes appear largely absent. This raises the possibility that SGK1 dependency may reflect a lineage- or model-specific vulnerability intrinsic to JIMT1 cells rather than a generalizable resistance mechanism.

      The study also remains descriptive in several sections. Numerous chromatin interactions and compartment changes are cataloged without sufficient biological contextualization or mechanistic integration. As a result, parts of the manuscript currently read more as a comprehensive epigenomic profiling resource than a fully mechanistic study of resistance biology.

      Finally, the translational impact is limited by the lack of patient-level validation linking SGK1 activation to trastuzumab response or clinical outcome in HER2-positive breast cancer cohorts.

    2. Reviewer #2 (Public review):

      Summary:

      Duan, Hua et al. used CUT&Tag and Micro-C to investigate that in primary trastuzumab-resistant HER2+ breast cancer cells, promoter H3K4me3 rather than H3K27me3 is strongly correlated with transcriptional activity. Resistant cells also exhibited more abundant promoter-enhancer loops and enriched cohesin at loop anchors, accompanied by shifts in A/B compartment status. Through multi-omics integration, the authors identified SGK1 as a key gene showing elevated promoter H3K4me3 levels, enhancer activation, strengthened chromatin loops, and upregulated transcription in resistant cells, and validated SGK1 as a potential therapeutic target. These findings reveal the coordinated interplay between three-dimensional chromatin architecture and epigenetic modifications, offering important insights into trastuzumab resistance in HER2+ breast cancer.

      Strengths:

      Previous investigations into trastuzumab resistance have largely focused on genetic mutations or individual epigenetic modifications. In contrast, this study moves beyond genetic or single epigenetic views by integrating histone modifications and 3D chromatin architecture into a unified framework, proposing a synergistic model of promoter H3K4me3, enhancer activation, and chromatin looping that underlies non-genetic resistance. It provides a new conceptual basis for understanding non-genetic resistance mechanisms. Secondly, using high-resolution epigenomic and conformational mapping together with bidirectional in vitro and in vivo functional validation, it establishes a solid link between epigenetic changes and phenotypes, and demonstrates that SGK1 inhibition suppresses tumor growth in a xenograft model, revealing clear translational potential.

      Weaknesses:

      (1) All findings are based on a single pair of cell lines, JIMT1 and SKBR3, which does not allow exclusion of cell line‑specific effects. The authors did not examine SGK1 expression levels, promoter H3K4me3 status, or relevant chromatin loops in tumor tissues from patients with clinical trastuzumab resistance. Consequently, whether the conclusions can be extrapolated to actual patient populations remains unclear, which limits the clinical relevance of the findings. It is recommended that the authors directly validate the key findings using tumor samples from patients with clinical trastuzumab resistance or analyze the correlation between SGK1 expression levels and disease-free survival or pathological complete response using data from public databases for HER2+ breast cancer patients, which would help address the current limitation of lacking clinical sample validation and the uncertainty regarding the association of SGK1 with patient prognosis and treatment response.

      (2) In the Discussion, the authors propose that SGK1 may assume the role of AKT to sustain mTOR activation, thereby bypassing the dependence on HER2 signaling following trastuzumab inhibition. Although this hypothesis is supported by published literature, the present study provides no direct signaling evidence, such as examining phosphorylation changes of SGK1, AKT, mTOR, or their downstream effectors.

    1. Reviewer #1 (Public review):

      Summary:

      The authors used a large dataset evaluating gut carriage of Enterobacterales and ESBL organisms from children aged 6-24 months as the basis for a modeling study to investigate what factors are most important for determining the prevalence of ESBL resistance. The modeling incorporated travel, a simple model of carriage duration (short and long), fitness cost of resistance on transmission and clearance, and antibiotic use. They found that antibiotic use is the primary driver of resistance prevalence, with transmissibility of resistant strains also important for setting the prevalence. Travel, while important when prevalence is very low, plays less of a role in maintaining prevalence once it is established (in keeping with other recent work). They estimated the fitness cost of resistance (terming a reduction of 14% on the rate of transmission and an increase of 23% on the rate of clearance as "low"). While the extent of assumptions and simplifications makes me skeptical of the quantitative conclusions, the qualitative ones seem reasonable and reinforce the long-held principles of the field--reducing antibiotic pressure and interrupting transmission--and highlight the importance of understanding the biological factors that shape the duration of carriage and the likelihood of colonization.

      Strengths:

      This study incorporates many of the factors that might influence the carriage prevalence of ESBL Enterobacterales. This builds on the work led by this group, both in primary data collection and in theory. Overall, it's such a tough problem that I commend the authors for trying to tackle it. The authors take a thoughtful, rigorous approach, acknowledging simplifications and assumptions where they need to, so as to evaluate the various factors shaping ESBL prevalence.

      Weaknesses:

      Part of the reason it's such a tough problem is that we have limited data to structure and parameterize a complex model.

      (1) The data are not sufficiently described.

      The primary data source for this modeling exercise comes from a study of 6-24-month-old children who underwent rectal swabs and evaluation of the carriage prevalence of Enterobacterales, and then whether these Enterobacterales were ESBL; moreover, the study included data on travel and on antibiotic use. Could the authors please direct us to these primary data? Could the authors also justify the parameters in their models from these data--for example, could they please provide the distribution of antibiotic use and the associated timing? Could they also explain why they decided to treat all Enterobacterales as if they were E. coli (line 307)? Is there evidence that all Enterobacterales occupy the same niche and compete with each other?

      (2) The model should be more fully described and the limitations explored/explained.

      - The authors should point to the code and the ODEs.<br /> - I understand the focus on the pediatric population; the authors argue that this is reasonable because ESBL colonization is similar across age groups. But presumably, antibiotic use differs across age groups, and there is colonization pressure from within households.<br /> - The authors only consider resistance to extended-spectrum beta-lactams and use of beta-lactam antibiotics, but ESBL Enterobacterales are often resistant to other antibiotics as well. How much does the use of other antibiotics also select for Enterbacterales that happen to carry ESBL resistance? "One bug/one drug" modeling, as done here, neglects the complexities of the actual patterns of resistance and range of antibiotic use.<br /> - Do the data support the T3 or S3 compartments, which, if I understand correctly, means no exposure to antibiotics can happen during three months after either treatment or travel? What do the data say about the patterns of antibiotic use? I'd imagine that the likelihood of antibiotic use is not homogenous, but instead, there are some who use repeated rounds of antibiotics.<br /> - Why do the authors exclude individuals who used antibiotics in the prior 7 days? What justifies that cutoff? The authors speculate that the impact of excluding these individuals is likely to be minimal; why exclude them, then? Did the authors evaluate the results if they were included?<br /> - What is the basis of "niche differentiation", as described starting on line 221? Why should clearance of one strain be slower when the strain co-occurs in a host with a strain of another type?

    2. Reviewer #2 (Public review):

      Overview:

      This study integrates several datasets into a unified modeling framework that incorporates several mechanisms thought to impact the spread of ESBL-resistant bacterial strains. The model accounts for tradeoffs between persistor and colonizer strains, travel rates, antibiotic treatment and strain clearance, direct competitive interactions, and, most importantly, a series of distinct costs associated with the carriage of ESBL resistance. The resulting 75-compartment model is internally consistent and structurally neutral. However, the parameter estimation is flawed in many ways, compromising the interpretations of the model.

      On the usage of the Swedish infant data set to estimate colonization and persistence:

      First, while other papers have taken similar approaches, the Swedish infant data set is fundamentally inadequate to estimate colonization and persistence rates. This is because very few colonies were typed per sampling event (2 to 6 colonies per event). The original authors themselves argued that strains of indistinguishable morphology would not be able to be differentiated by this method. They also provided data showing that strain identity was not directly related to colony morphology (same strain often displaying distinct morphologies).

      The consequence of this is that strains present in low abundance would be missed with a high likelihood. However, if they were to be stochastically sampled, this would count as a "colonization" event, and if they were missed in subsequent samplings, this would count as a "loss" event. In other words, the statistical methods described conflate within-host dynamics (which might lead to distinct within-host abundances) with between-host dynamics (colonization and loss).

      Beyond this conceptual issue, some technical aspects aren't particularly sound. The mean of the inferred posterior for the lambda and mu parameters are then used to calculate the beta, gamma, d, and epsilon parameters through a linear regression. The more technically correct way of doing this would be to directly infer these parameters from the data and obtain a full posterior for these parameters.

      This highlights another issue: these parameters are passed down to the next statistical model as point estimates, with no associated uncertainty. This artificially inflates the (already low) confidence of the estimates for the cost parameters.

      Finally, when this procedure generated parameters that were inconsistent with their expectations (clearance is too high to explain prevalence in France), they adjusted the parameters by discarding and recalculating their beta parameters to artificially enforce neutrality between their strains and enforce the expected prevalence. This is problematic because beta and gamma were jointly estimated, and there is no particular reason why some of them should be discarded. The more natural interpretation would be that parameters inferred from Swedish infants do not translate well to French adults, which should preclude their usage in this context.

      On the estimation of costs of ESBL resistance:

      The core of the second statistical model is to use prevalence data, travel data, and treatment data in conjunction with the previously inferred colonization and loss parameters to infer the costs of carrying antibiotic resistance. Therefore, the accuracy of this section is contingent on an accurate estimation of the previous parameters. However, these colonization and loss parameters are inherited with no uncertainty (just point estimates are passed down), which, as previously mentioned, generates an artificially precise posterior distribution for the resistance parameters.

      However, the most severe issue with the statistics lies in the choice of priors for the cost parameters. All of them are uniform in a positive range that implies a positive cost. Importantly, the average over a positive range will always be positive; therefore, this method will ALWAYS estimate a positive mean for the costs. Note that the posterior distribution of some cost parameters seems to peak around zero and abruptly decays with no mass to the left of zero. This is caused by the choice of prior. Had delta been allowed to be negative (i.e., antibiotic resistance carried a benefit, having the prior be uniform between -1 and 1), the posterior distribution would likely be much more symmetrical, and the confidence interval would have included 0.

      Restating, because the prior is a continuous function between 0 and 1, it contains infinitely more mass in the region that represents there being a cost (delta>0) than in the region representing no cost (delta=0). This means that it is a mathematical impossibility for this model to infer the absence of a cost.

      Therefore, the main finding of the paper ("We found that resistance is costly") is a mathematical artifact of the prior choice and of the model structure.

    3. Reviewer #3 (Public review):

      Cotto and colleagues integrated data analysis with mathematical modeling to examine extended-spectrum beta-lactamase (ESBL)-producing E. coli in France. While ESBL prevalence has risen globally, it has stabilized at approximately 6-8% across Europe. Established risk factors for ESBL carriage include prior antibiotic exposure and travel to high-prevalence regions, most notably South-East Asia. The dataset incorporated information on ESBL-producing E. coli and travel history in young children, and the model was calibrated to ECDC surveillance data on ESBL across Europe, supplemented by literature-derived parameters on antibiotic use, E. coli biology, and transmission dynamics. The authors report that ESBL-carrying strains exhibit a 14% fitness cost in community transmission relative to susceptible bacteria, yet are cleared 23% less frequently. ESBL carriage was strongly associated with factors that prolong gut colonization. Both antibiotic treatment rates and transmission efficiency were identified as key determinants of community-level ESBL prevalence.

      Strengths:

      The study addresses a clinically and epidemiologically important topic. The integrated modeling approach is methodologically sound and well-suited to disentangling the relative contributions of transmission and antibiotic selection pressure.

      Weaknesses:

      Several concerns regarding the data used in this study warrant consideration. First, model calibration relied on ECDC surveillance data pooled across multiple European countries, several of which have substantially lower antibiotic consumption than France (ECDC ESAC-Net Annual Epidemiological Report, 2024). Given that antibiotic use is a primary driver of ESBL selection, ESBL prevalence is likely to be heterogeneous across these settings. Calibrating to a geographically diverse dataset risks introducing systematic bias into parameter estimates that may not be representative of the French context. The authors should repeat the analysis using France-specific data, or, where this is not feasible, restrict the calibration dataset to countries with comparable antibiotic consumption profiles. Second, the travel exposure data may be insufficient to adequately capture importation dynamics from South-East Asia, as the cohort consisted exclusively of young children, a demographic less likely to travel to high-prevalence regions than older age groups. This may result in an underestimation of travel-associated importation as a contributor to community ESBL prevalence, and the generalizability of these findings to the broader population should be interpreted with caution.

    1. Reviewer #1 (Public review):

      The study by Epp et al. has indeed gotten a lot of attention. As so often in the fMRI literature, some voices had taken the results out of proportion as if this result would suggest that we cannot trust fMRI. This is so, while informed researchers are aware of the capabilities and challenges of BOLD as a measure of neural activity. The paper was discussed and criticized on many aspects from various angles. E.g. with respect to unestablished models of estimating CMRO2, the 40% figure is being overestimated by the mask definition, and expected neuronal and vascular effects underlying the discordance.

      The first publications of these discussions are being shared now. E.g. Chen et al. https://doi.org/10.1038/s41593-026-02288-y. The manuscript at hand augments this discussion. Specifically, the manuscript provides a direct statistical refutation of the recently proposed widespread physiological sign reversal between BOLD and CMRO2.

      By reanalyzing a high-profile dataset, the authors demonstrate that the previously reported 40% discordance rate is an artifact of statistical uncertainty rather than a genuine physiological phenomenon. This critical re-evaluation restores some confidence in the canonical interpretation of BOLD signals that was recently challenged. It highlights the necessity of rigorous statistical validation in quantitative fMRI.

      The following points should be addressed:

      (1) Absence of evidence is taken as evidence of absence

      The group-level significance analysis, summarized in the horizontal bar chart and cortical surface maps, labels non-significant voxels as 'CMRO2 not reliable', and the discussion concludes that positive BOLD responses are predominantly concordant with metabolism.

      The paper treats voxels with non-significant CMRO2 effects as 'statistically uncertain' rather than as potentially reflecting genuine null metabolic changes, conflating absence of evidence with evidence of absence. Because the 77.2% of voxels shown as light orange could reflect either real null metabolism or insufficient power, the paper cannot distinguish between these. This ambiguity matters because a genuine null metabolic response to positive BOLD would itself be physiologically interesting and would not straightforwardly support 'predominant concordance'.

      (2) Contextualization in other current literature

      I feel that the introduction of the paper could also consider the embedding of the current literature about biophysical processes in the negative areas.

      The negative responses have partly been discussed in the literature on quantitative physiology: e.g., Bohraus et al have been able to pinpoint the source of negative CMRO2 in positively activated voxels to large veins (https://doi.org/10.1016/j.celrep.2023.113341). Huber et al. have found that the neurovascular coupling (arterial venous weighting) is different in positively and negatively activated brain areas, making the interpretation of derived parameters on physiology hard.

      (3) Stylistic comments.

      In places, the tone of the language could be revised to ensure that it is perceived as making a constructive contribution to the discussion.

    2. Reviewer #2 (Public review):

      Summary:

      The rebuttal aims to provide a statistical re-evaluation of Epp et al. to investigate the effects of CMRO2 uncertainty on concordance/discordance analysis between BOLD signal responses and CMRO2 change estimates based on an R2 framework. The authors observe markedly higher variance in CMRO2 compared to BOLD, which raises concerns about sign classification purely based on group means/medians.

      Strengths:

      The study is well motivated, and the analytical pipeline is rigorous and has been provided. Overall, the manuscript provides several thoughtful and rigorous analyses that contribute meaningfully to the ongoing discussion surrounding neurovascular coupling and CMRO₂ estimation.

      Weaknesses:

      Some aspects of the analytical framework could be improved, as well as the discussion of the caveats of the methods of this and the original paper.

      (1) The binomial framework discussed on line 110 and described on line 321 reduces continuous ΔBOLD and ΔCMRO2 measurements to binary concordant/discordant labels, which may overemphasize unstable sign flips near zero effect sizes while discarding potentially meaningful magnitude information. The authors acknowledge that this overly strict approach yields very few meaningful voxels. A better justification or explanation of what we are meant to take away from this, other than the variability in the measurement, which is also explored elsewhere, would be helpful to the reader.

      (2) In the methods, in the section entitled: Voxel Selection: BOLD Activation Mask, the authors describe their more traditional univariate statistical method as compared to the PLS approach used in the Epp paper. While I appreciate why the authors chose this approach, which simplifies interpretation, is it possible that this led to a lower number of discordant voxels? If yes, then I would suggest this be also added in the discussion of how the original Epp paper's methodological choices led to the very large percentage of discordant voxels.

      (3) In the original paper, it looks to me like the discordant voxels have low CBF change and low rOEF. The gadolinium-based CBV measurement used to calculate OEF is a measure of total blood volume, while the blood volume that contributes to BOLD resides predominantly in veins and capillaries. Given the long PLD of the ASL acquisition and the total blood volume measurement, it seems to me that it is possible that discordant voxels may have high arterial blood volume, leading to overly large CBV measurement and an underestimation of CBF at this PLD (especially given their young age, for which I would expect ATT to be closer to 1-1.5s based on recent literature). While this is not currently discussed in this paper, it might be relevant to discuss how acquisition choices could bias some voxels towards erroneous CMRO2 estimates, which in turn would lead to these voxels being identified as discordant.

      (4) In the methods, on line 267, the authors describe how they calculated ΔCMRO2 and how it differs from the original paper. A short discussion of how this choice is likely to affect the variance estimates would be warranted, given that the original paper seems to have chosen their method for the explicit purpose of decreasing error propagation. Especially, I wonder if this difference could account for the observation that "77.2% of voxels showed no statistically significant group-level ΔCMRO₂ effect".

    1. Reviewer #1 (Public review):

      Summary:

      In this study, the authors perform longitudinal mesoscale calcium imaging of visual and other cortical areas following binocular enucleation (blinding through the removal of the eyes) in adult mice. The study is observational and exploratory, and analyzes changes in the frequency distribution of calcium signals during locomotion and quiescence as a function of time after enucleation. They also analyze correlations between calcium signals in different brain regions to ask how apparent connectivity between regions changes over time. The main conclusions are (1) that there are multiple timescales of plasticity; (2) that the coupling between locomotion and activity in visual areas flips sign after enucleation, and (3) that correlations between brain areas are modulated by this long-lasting plasticity. Overall, the data are likely to be useful to researchers studying the impact of injury and catastrophic loss of sensory inputs on brain reorganization, but it is hard to draw firm conclusions from the observations provided beyond the very general conclusions listed above.

      Strengths:

      (1) The longitudinal imaging of multiple brain areas simultaneously allows the investigators to follow plastic changes in the same animals over time, to address questions about how apparent connectivity and brain state modulation unfold after injury.

      (2) The data suggesting a flip in sign of the coupling between movement and "activity" in visual areas is interesting and potentially novel.

      Weaknesses:

      (1) The mesoscale imaging has limitations. In particular, the authors use words/phrases such as "activity" and "functional connectivity" without ever discussing what the measures they provide with this approach (frequency distribution of summed calcium fluctuations, and the correlation between this measure across brain areas) actually mean, or how they approximate spike-based measures or cellular-resolution Ca signals. The manuscript would benefit from an in-depth discussion of these limitations.

      (2) In general, the figures are difficult to follow. In many cases, what is being plotted is hard to extract without a lot of work, and metrics are not well-justified. For example, they calculate the R value between movement power and spectral power of the Ca signal to quantify changes across time in the coupling between movement and activity (Figure 2). But from the example given, this does not look like a continuous relationship, and though R values are significant its not clear that this correlation is a good way of quantifying the change in sign they attempt to document. Figure 7 is impossible to read, and areas quantified are not indicated. The reader should not have to work this hard to figure out what they are plotting.

      (3) It would be reassuring to rule out an effect of repeated imaging on the metrics they describe here. Longitudinal imaging of the same duration without enucleation would be the best control. Alternatively, they do have multiple baseline measurements that they collapse into one value in most of their plots.

      (4) The discussion is very long. They spend a lot of time trying to relate their findings to the larger literature on visual deprivation, but because of differences in paradigms (enucleation, laser ablation, visual deprivation, binocular vs monocular) and differences in measures (see point 1), it's hard to draw conclusions. In my view, the manuscript would benefit from less speculation about plasticity mechanisms and more discussion of the strengths and weaknesses of their approach.

    2. Reviewer #2 (Public review):

      Summary:

      This study uses cortex-wide mesoscopic calcium imaging to investigate how adult vision loss induced by bilateral enucleation alters spontaneous cortical activity across behavioral states, including quiescence, locomotion, and anesthesia. The authors perform longitudinal imaging over two time scales, spanning days to weeks and weeks to months after enucleation, enabling them to track the changes of cortical reorganization.

      The main findings are that oscillatory activity in V1 undergoes a strong reversal in its relationship to behavioral state. Before enucleation, V1 activity is positively correlated with locomotion and negatively correlated with quiescence, whereas after vision loss, this pattern reverses. State-transition dynamics are similarly altered: locomotion onset shows reduced V1 activation, while cessation of locomotion is associated with increased activity after enucleation, while it caused suppression during baseline. In addition, the authors report an increase in slow-wave (0.1-4 Hz) activity in V1 after enucleation, starting in the first week and lasting over many weeks. Although these effects show partial recovery over time, many abnormalities persist for weeks to months.

      At the network level, the study reveals altered large-scale cortical organization, including reduced functional connectivity involving V1 that appears to remain impaired.

      Strengths:

      Overall, the work provides a thorough characterization of how adult vision loss reshapes cortical dynamics, particularly with respect to behavioral-state modulation.

      Weaknesses:

      However, there is also a lack of clarity due to the way the data are presented. Moreover, the study remains largely descriptive, as it does not address the mechanisms underlying these changes or their functional significance, making it difficult to interpret the broader implications of the observed cortical reorganization.

    3. Reviewer #3 (Public review):

      Summary:

      The authors track cortical activity across the dorsal cortex of head-fixed mice for up to ten weeks following bilateral eye removal, asking how the cortex reorganizes over an extended period after vision loss. They report a rapid and long-lasting reversal of the normal relationship between movement and visual cortex activity, together with a delayed, weeks-long window of enhanced slow-wave activity during rest and a persistent reorganization of large-scale cortical correlations.

      Strengths:

      The longitudinal scope is the work's strength. Tracking the same animals over a ten-week window after sensory loss is technically demanding and rarely done, and it yields a temporal picture that short studies cannot provide. The observation that the movement-related activation of the visual cortex inverts within a day and only partially recovers over weeks is striking and has not been documented at this timescale. The analysis is internally consistent across two protocols (short- and long-term) and frames the changes by behavioral state, focusing on rest versus movement. This is a useful analysis that the field has not systematically applied to studies of deprivation.

      Weaknesses:

      The manipulation is unusually severe: removing both eyes eliminates patterned vision, non-image-forming light input, and all residual retinal signals abruptly and irreversibly, in contrast to the milder and often reversible manipulations the discussion draws on. Without a sham-surgery control, the early effects cannot be cleanly separated from the surgery itself.

      The language of "plasticity" runs ahead of what the data actually measure, since the study quantifies spontaneous activity and pairwise correlations but does not assess receptive fields, evoked responses, synaptic changes, or the causal manipulation of any candidate circuit. The discussion nevertheless attributes findings to specific interneuron circuits, molecular pathways, and thalamocortical reorganization, none of which are tested in this study.

      The imaging method also constrains what can be claimed: widefield calcium signals are dominated by superficial-layer and excitatory output and cannot resolve the cell-type-specific mechanisms invoked in the discussion. Because the key findings lie in the low-frequency band where vascular contamination is greatest, the hemodynamic correction, particularly in the deprived state, where vascular tone itself may be altered, deserves more validation than it currently receives.

      Finally, the presentation relies heavily on group-level heatmaps in the main figures, with raw traces, spectrograms, and per-animal trajectories at the key inflection points (day 1, week 1, week 10) largely absent. This makes it difficult to judge whether the reported patterns are coherent across animals.

    1. Reviewer #1 (Public review):

      Summary:

      This study uses an encoding model approach to compare a range of different deep learning models in predicting functional MRI data, collected while participants played the game "Super Mario Bros" inside the scanner. The fMRI data is rich, within-subject data, with around 15 hours of gameplay for each of five participants who took part in the study. A range of models are compared, including deep RL models (PPO), behaviour cloning (imitation learning), supervised visual models (ResNet), and untrained but structurally equivalent models. The main metric of model comparison is brain prediction (i.e., cross-validated R^2, and within-subject generalisation to out-of-distribution gameplay), rather than focussing on which model features are being encoded.

      The core results are:

      (1) The deep RL and imitation learning models show a modest improvement in prediction accuracy relative to the untrained and visual models (around a 1-2% increase in R^2). Notably, this is against a background in which the untrained model - essentially random projections of the gameplay pixels - can explain around 6 or 7% of the variance in fMRI data (Figure 2). So, the improvement in model fit is a small (but significant) one, and a major driver of prediction scores appears to be low-level visual stimulation as opposed to gameplay prediction.

      (2) There is little variation across layers in prediction accuracy in the trained models. In the untrained model, prediction accuracy drops across layers. This suggests that the prediction accuracy in this untrained model results from its (early-layer) representations being closer to what is presented on screen - as the random weights move the untrained model's representation away from sensory features, it becomes less predictive of the brain. In a trained model, meaningful representations are maintained in deeper layers - and interestingly, there is no clear correspondence between layers of the model and layers of the visual pathway.

      (iii) There is a noticeable improvement in brain prediction by both the deep RL and imitation models with model training. In other words, the 1-2% increase in R^2 mentioned in point (i) is a result of the training, rather than any other factor.

      (iv) None of the models, including the untrained model, perform well in generalising to out-of-distribution data held out from the training/evaluation. This leads to the claim that the brain's encoding representations are 'brittle'.

      Strengths:

      (1) A major strength of the dataset is that it contains rich, extended naturalistic gameplay data within individual subjects. This mirrors some of the advantages seen in other naturalistic datasets (e.g., natural scenes dataset, storybook listening, video watching) - but there are very few examples of such data where the subject is controlling or generating the behaviour in the naturalistic task. This allows potentially new questions to be asked about how these representations are learned across time, within individual participants.

      (2) A further strength of the manuscript is the clarity with which the aims and hypotheses are articulated in the introduction, and evaluated/discussed throughout the paper. This provides a clear set of objective criteria against which to evaluate the performance of the resulting models; the paper is also written in a very clear and honest way, in that some of the a priori hypotheses are not supported - this makes for a more transparent report than one written in an a posteriori manner.

      (3) Finally, although the results in comparing different models are perhaps not as impressive as one might have hoped, the authors have been quite careful in making the models comparable in terms of their architecture and number of parameters, etc. This means that any variation in prediction is likely attributable to the different objective functions used to train the models, rather than other features of the model architecture.

      Weaknesses:

      (1) The work is currently framed as "training neural networks from scratch...leads to brittle brain encoding" - but I'm not sure that the results fully support this. First, the brittleness is still present in the untrained network (i.e., random projections of pixels), as shown in Figure 5b. This implies that the brittleness may not be a consequence of the network training, but of overfitting to the encoding (ridge regression) model of the fMRI data (as the authors acknowledge when presenting these results). I would instead encourage the authors to shift the emphasis slightly towards the (modest) improvement in prediction using the RL/imitation objectives, and/or the (similarly modest) improvement in prediction with training, rather than foregrounding the brittleness of the encoding.

      (2) While the analyses of how model prediction improves with training are nice, it is a shame that there is no consideration of how prediction improves (or otherwise) across the training of the participants. Do participants improve across the 15 hours of gameplay - or do they, for instance, become more predictable by the imitation learning model? Is this more true in the naïve participants than those with extensive past experience of Mario? And does this in any way lead to better alignment with model predictions across sessions? These all seemed like natural questions that could benefit from the unique longitudinal nature of this dataset, and it seemed a shame that they were not touched upon at all.

      (3) While there is little variation between the models in terms of predictive performance, it is currently a little unclear whether this is simply due to fitting a set of highly parameterised models to the data, or because the models are themselves fundamentally similar in their representations. One way to address the latter point might be to perform some kind of RSA or CKA (Kornblith et al, arXiv 2019; Williams et al, bioRxiv 2024) across the layer representations within-model, and between-models, to ask how similar (or different) the learned representations are between the different models used for fMRI prediction.

    2. Reviewer #2 (Public review):

      Summary:

      This paper aims to test whether training models to play video games from visual inputs through reinforcement learning leads to better matches to human visual encoding during gameplay, compared to models with the same architecture and training images but with different training objectives. The authors find a slight advantage for the RL model, but encoding performance and generalization overall are weak and variable.

      Strengths:

      This was a reasonable hypothesis to test, and the model comparisons adequately represent other possibilities for training a model of the given architecture. The ResNet proxy is a particularly interesting way to benefit from a larger model's pre-training while still using the same constrained architecture and training set.

      Weaknesses:

      I always prefer to see learning curves for models on the tasks they were trained on, just to contextualize their performance on the brain encoding results, but they are not shown here.

      The paper misses some of the relevant literature that has performed similar comparisons across learning objectives for visual encoding models, such as https://arxiv.org/abs/2112.02027 and https://pmc.ncbi.nlm.nih.gov/articles/PMC10569538/

      The authors end up advocating for the idea that large-scale pre-training is needed in order to build good visual encoders for matching human data. In many ways, this was already known (given that brain encoding scores scale with imagenet performance, which requires at least a moderate amount of general-purpose image training to achieve). However, they also note that "the brain encoding performance of the ResNet model was not significantly different from that of the Untrained model." I would assume that an ImageNet-trained ResNet would be in the direction of the type of large-scale pre-trained model the authors advocate for (even when not trained for action generation), yet their results don't support this direction being the solution. Are their results about Resnet not surpassing an untrained model consistent with prior work, and if not, why not? How do they view this in light of their argument for the use of larger models?

    3. Reviewer #3 (Public review):

      Summary

      In this paper, the authors have 5 human subjects learn to play Super Mario Bros while undergoing fMRI for 15 hrs each. They compare a reinforcement learning (RL) model (PPO), an imitation learning (IL) model, and a vision model (ResNet) in their ability to play the game, match human behavior, and, critically, explain human brain activity.

      The key findings can be summarized as follows:

      (1) RL, IL, and vision models explain similar amounts of variance in the BOLD signal (Fig 2a), with a significant but small trend of RL > IL > ResNet (Tab 1).

      (2) Untrained models with the same architecture explain a smaller but very similar amount of variance (Figure 2a, Table 1).

      (3) The brain maps across all models (and layers) are strikingly similar, with the strongest effects in visual, parietal, and motor regions (Figures 2b, 2d; Supplementary Material II).

      (4) Behavioral and neural performance are correlated across model checkpoints (but not levels), such that later checkpoints in training have better behavioral and neural encoding performance (Figures 3 & 4), although the neural effect plateaus pretty quickly.

      (5) Out-of-distribution performance is quite poor, both behaviorally (Figure 5a) and neurally (Figure 5b).

      I believe this work will be of interest to neuroscientists, cognitive scientists, and AI researchers alike. There has been a growing trend in neuroscience to adopt AI models as cognitive models of complex perception and action, while at the same time, AI researchers are increasingly looking at the brain for inspiration. The key finding of this paper -- that these models fail to generalize to out-of-distribution levels -- questions the core assumptions of this whole enterprise.

      Strengths:

      Unlike previous studies applying machine learning to naturalistic game-play, the authors take great care to make sure their models are evaluated on an equal footing, using equivalent or similar architectures/number of parameters and training data.

      While the number of subjects (5) is relatively small, the amount of data per subject (15 hours) is impressive, which is important for fitting the imitation learning & ResNet models and for obtaining reliable encoding performance for each individual subject. The authors employed a train/val/test split and held out sets, the gold standard in the literature.

      Overall, the paper was well-written and easy to follow. The figures clearly illustrate the main findings.

      Weaknesses:

      (1) Missing statistical tests

      I think the main weakness of the paper is that many of the claims are qualitative in nature and lack appropriate statistical tests, for example:

      - "The conv3 layer has the highest brain encoding score";<br /> - "Robust association between task performance and brain encoding" ;<br /> - "Level patterns strongly predict brain encoding";<br /> - "Brain encoding performance was severely degraded";<br /> - "Effect of training on brain encoding was apparent".

      While these effects are indeed qualitatively visible in the figures, it is unclear which of these differences are significant (with the notable exception of Table 1). I believe the paper would benefit substantially if these effects were quantified and every claim were supported by the appropriate statistical tests. As an example, with the exception of Table 1 and the corresponding paragraph, I could not find any p-values in the results section.

      (2) Missing model performance and human-likeness

      Also absent from the results is an assessment of model performance on the task and similarity to human performance/behavior. From Figures 3 and 4, we can see that the game score of PPO is around 500-1000 - how does that compare to the humans? We can also see that the imitation scores for IL are around 0.4-0.7, but what does that mean? Such results would be crucial to assess if the models have indeed learned to play the games and/or imitate the humans, and therefore, whether they would be good candidates as cognitive models (before even looking at brain activity). At minimum, plotting the human versus model game scores (see e.g. Tomov et al. 2023 Neuron, Figure 2) would be helpful; or, if you'd like to dig deeper, showing that human actions are more valuable or more likely under those models (see e.g. Cross et al. 2022 Neuron, Figure 2). It might also be helpful to look at imitation scores for the RL model and game performance of the imitation model -- I suspect they will both be bad, but they can at least serve as informative baselines for their counterparts.

      (3) Possible undertraining

      Relatedly, one possible explanation for why the Untrained model does so well is that all the models may be effectively undertrained. For example, while there are no training curves in the paper, it seems from the spacing of the checkpoint game scores (x-axis on Figure 3c) that the RL model may not have converged yet (it would be helpful if those were somehow colored by training epoch). Showing training curves would be helpful (i.e., something similar to Figure 3a, except with performance on the y-axis).

      Additionally, it would be great to provide more details regarding the PPO training protocol. How many episodes? How many steps per episode? How many steps for all of the training? Similarly, for the imitation learning model: batch size, number of epochs, optimizer, scheduler, etc.

      (4) Mysterious poor encoding performance of Untrained and ResNet models on the held-out set

      Critically, and related to that, I'm a little confused about the Untrained model results on the held-out set (Figure 5b, top row on the right). Why should those be any different from the test set results with the Untrained model (Figure 2a, right, fourth row from the top)? It makes sense why the other models are worse on the held-out set -- they have never been trained on any frames from those levels. However, the untrained model has not been trained on *any* frames from *any* levels, including the test set and the held-out set.

      The same is true for the ResNet model, which is pre-trained on a completely separate data set and yet similarly shows worse performance on the held-out set compared to the test set.

      This cannot be explained by the ridge regression, which has no parameters or hyperparameters fitted on either the test set or the held-out set.

      The big discrepancy in the untrained model & ResNet results between the test and the held-out set makes think that there is something substantially different about the levels in that held-out set; that they are truly out of distribution compared to the other 20 levels (e.g., maybe they're the last 2 hardest levels and look completely differently? e.g. ResNet proxy in Fig 5c shows worse performance than the mean, which is indicative of an anti-correlation). Alternatively, it may be some issue with the analysis pipeline. The poor generalization results are central to the claims of the paper, so I believe this should be clarified.

      (4) Brittleness conclusion rationale

      I'm not quite on board with the author's rationale that "[poor model performance on the out-of-distribution levels] demonstrates that the models we tested are limited in scope and may not provide a valid inference of brain-like processing, as human behavior remains robust and generalizable across levels".

      For one, unlike the models, humans were actually trained on those levels, so it would not be surprising if they perform just as well on them as on the other levels (but do they? Again, it would be great to see some behavioral data from the humans and the models).

      Second, as the authors themselves show, task performance and human-likeness do not really correlate with neural encoding across levels (Fig 4a & b, respectively), so even if model performance remained "robust and generalizable" on the held-out levels, that will not necessarily translate to good neural encoding.

      Thirdly, and perhaps most importantly, unless the test set and held-out set were sampled exclusively from the practice phase when the subjects have mastered all the levels (that doesn't seem to be the case, but the authors should clarify), then the humans are continuously learning, which means that their own internal representations of the game are evolving. That's not the case for the models, which I assume are in "inference mode" when their representations are extracted for neural encoding. That is, their weights are frozen. So there's a fundamental mismatch between the mode in which humans are operating (continuously learning and executing) and the mode in which the models are operating (just executing). While this is true for all the levels, it may partially account for the discrepancy in the held-out set specifically.

    1. Reviewer #1 (Public review):

      This study adds important data identifying how ocular motor neurons are transcriptionally specified and identifies additional genes important in ocular motor neuron function. The evidence supporting the claims is convincing, with bulk and single-cell RNA sequencing as well as functional testing of the vestibulo-ocular reflex. This work will be of interest to developmental biologists and eye movement specialists.

      Gershowitz, Hamling, et al investigate genes that specify specific cell populations within cranial motor nuclei III and IV, which control eye movements, by bulk and single-cell RNA sequencing, confirmatory in situ hybridization, and functional studies of vestibulo-ocular reflex in knock-out animals. They take advantage of the timing difference in the generation of dorsal versus ventral cells to selectively mark early-born (dorsal) vs late-born (ventral) cells using the Kaede photolabile protein. They used bulk RNASeq to identify differentially expressed genes between the two populations (which innervate different extraocular muscles). They next used single-cell RNASeq to further identify specific subpopulations of motor neurons and identify 3 main clusters, which broadly map to dorsal CNIII, CNIV, and ventral CNIII. They show that the differentially expressed genes identify subpopulations of neurons, rather than reflecting temporal changes related to cell age via a series of in situ hybridizations across ages. Finally, they show that knock-out of Sim1a, which is unregulated in dorsal nIII neurons, leads to decreased vestibulo-ocular reflex, despite a normal number of neurons in nIII. They tested the knock-out of two other differentially expressed genes, nav2a and onecut1, but found both normal cell number and normal vestibulo-ocular reflex.

      The conclusions of this paper are well supported by the data. As the authors acknowledge, additional experiments would add to the interpretation. Since the Sim1a mutants have normal cell numbers, the authors hypothesize that axon guidance may be disrupted, leading to the phenotype. This could be relatively easily assessed using the Isl1-GFP transgenic line and examining innervation patterns in the extraocular muscles. Additionally, testing horizontal eye movements and eye movements in response to visual, rather than vestibular, inputs would further refine the phenotypes and perhaps identify eye movement abnormalities in the mutant fish with normal VOR.

      More information on why these specific genes were prioritized for functional testing would be helpful, as it is unclear why these three genes were the top candidates.

      The authors should also include a discussion of other subtypes of oculomotor neurons, beyond which muscle they innervate. For example, there are oculomotor neurons that form single neuromuscular junctions on fast, singly-innervated fibers, and there is a separate pool of motor neurons that innervate the slow, multiply-innervated fibers. It would be interesting to note if there were any gene expression differences within the clusters that might represent this subdivision of neurons.

      This data is likely to be of great use to the field in further studies of cranial motor neuron biology.

    2. Reviewer #2 (Public review):

      Summary:

      The goal of the work is to identify genes that are uniquely expressed in subsets of eye muscle-innervating motor neurons, as a way to identify candidate genes for strabismus, a congenital vision disorder in humans. The author's previous work identified birth-order differences that correlate with the positions of neurons in the oculomotor (cranial nerve III) motor nucleus. Here, they use Kaede photoconversion to distinguish early- from late-born neurons and identified transcriptional differences between them by bulk RNA sequencing of FACS-sorted cells. Separately, they used single-cell RNA-Seq to sequence the transcriptomes of 89 extraocular motor neurons. They find signatures of early-born mIII, late-born mIII, and mIV neurons. While there is some overlap in gene expression, some of the differentially expressed genes are confirmed by HCR as being unique to one of these three populations of extraocular motor neurons.

      The authors test the functions of three differentially expressed genes in the vestibulo-ocular reflex by measuring the speed of rotation of the eye in response to the larval fish being tilted 15° from horizontal. One mutant, in the sim1a transcription factor, has markedly slowed responses. Although this is a global knock-out, the authors argue that this defect in the vestibulo-ocular reflex is due to a loss of sim1a function specifically in dorsal mIII neurons because sim1a is not expressed in the two upstream neurons in the vestibulo-ocular reflex circuit.

      Strengths:

      (1) This is the first time that transcriptional differences between and within extraocular muscle-innervating neurons have been described during development. In identifying differentially expressed genes that correspond with anatomical, functional, and temporal subdivisions of these neurons, they support the idea that gene expression programs established early in development underlie the functional differences amongst these neurons.

      (2) The combination of bulk RNA-Seq and single-cell RNA-Seq strengthens the identification of sim1a-expressing early-born mIII neuron subtype.

      (3) The work identifies candidate genes for strabismus.

      Weaknesses:

      (1) The authors show that sim1a is only expressed in mIII neurons and no other cells in the vestibulo-ocular reflex, as evidence that the phenotype in sim1a mutants is due to loss of its expression specifically in mIII neurons. However, as the authors note in the discussion, sim1a has other functions in zebrafish, including global calcium homeostasis via specification of the corpuscles of Stannius. The loss of this, or of some other sim1a function, could be indirectly responsible for the slow vestibulo-ocular response in sim1a mutants.

      (2) The authors perform the vestibulo-ocular response test in sim1a mutants at 7 dpf, which is within a day of when the mutants die, raising the concern that the slowed response is due to a dire systemic condition. The argument that nav2 mutants also die at 7 dpf but have a normal response is weak, since death does not always take a single course.

      (3) The evaluation of the sim1a mutant phenotype is limited to the vestibulo-ocular reflex. The authors do not explore whether the oculomotor neuron innervation of target extraocular muscles is affected in sim1a mutants.

    1. Reviewer #1 (Public review):

      Summary:

      In this article, the authors couple a 3d vertex model to the extracellular matrix and include activity through contractile springs at the edge. They study, sequentially, the distribution of shear stresses in liquid and solid spheroids, the correlation between stress and cell shape, and the spatial distribution of stresses. The authors find that stresses are higher in solid spheroids (somewhat unsurprisingly), but that the stress distributions are wider in the fluid spheroids. Moreover, stress and shape are not correlated with each other in solids (that seems to be due to vertex model peculiarities), but they are for liquids. In contrast, for solids, the stresses are concentrated at the interface.

      The authors attribute a lot of the phenomenology to strain-stiffening properties of vertex models as being akin to a network model (correctly in my opinion). Then they strain individual cells and confirm this link, though I missed any explanation of how they did this. Would it have to be within a medium for computational consistency?

      Finally, they generate an extended vertex model, where they replace the single face linking cells with a double face and mechanoresponsive springs. This allows for stronger coupling of individual cell motion to eventual movement out of the spheroid.

      Strengths:

      Coupling a three-dimensional vertex model to the extracellular matrix, modelled as a crosslinked fiber model, is a computational tour-de-force. Adding activity through fluctuations at the interface is also of the correct symmetry (stresses), instead of the self-propulsion which has been used by other authors, and which is not compatible with Newton's 3rd law. This also allows for accurate back-and-forth mechanical coupling between the cells and the ECM.

      I would like to highlight that deriving vertex model stress tensors in full three dimensions is an open problem due to the complex topology. Any progress is valuable, and decomposing things into tetrahedra like here will allow for connections with, in particular, finite element approaches. Therefore, adding some of these results (eq. 13) to the main text would strengthen the paper in my opinion.

      Adding the nonlinear springs to the VM in the 3rd act is a good idea, and a first step to mechanical feedback. One might argue that at this point, removing the vertex model part would even be an option.

      Weaknesses:

      The paper is written in a very qualitative manner, with all of the model equations and analysis hidden in the supplementary information. I do not understand this choice, as it makes things fuzzy and hard to read. The conclusion is also very long and simply reiterates the previous points.

      At the same time, this paper is rather thin on new results and reads more like a handful of new simulations carried out using the method established in [10] (from largely the same authors). Moving some of the actual results to the main text would help, in particular, the 3d stress formulation and the definitions of different measures.

      Vertex models also have a very clear limitation: They cannot model the transition from a confluent to a non-confluent tissue, and individual cells or groups of cells leaving the spheroid. Even having a surface and having significant deformations of the surface are numerically dicey, so the current model is at the edge of what is feasible. The model as written can only do "invasion" by a single cell moving outward, and then another following it a bit (or not).

      I strongly suspect that further progress on 3d cell models will need particle-based models or models where cells are fully meshed surfaces (some of which are in development currently).

      However, none of these problems is mentioned anywhere in the text. The authors also do not review the increasingly broad zoology of other models.

    2. Reviewer #2 (Public review):

      Summary:

      The manuscript concerns the mechanisms by which cells in a spheroid embedded in the extracellular matrix can escape, either as single or multiple cells.

      Strengths:

      Overall, the manuscript is well written and easy to follow. The claims are mostly justified by the data. Some data can be better analyzed and presented to strengthen the conclusion.

      Weaknesses:

      (1) The description around Figure 2c is not exactly well supported by their results. While values close to 0 for sigma3 dot g3 for solid-like spheroids indicate little correlation between the direction of maximum stress and maximum elongation, this analysis alone does not imply that highly stressed cells are necessarily less globular. The dot product combines the magnitudes of the two vectors and the angle between them. For the distribution graph, it would be useful to have the cumulative frequency equal 1.

      (2) One of the central claims of the paper is that morphology alone is not a reliable indicator of mechanical state. Since the authors compute cellular stresses and cellular shape in their simulation (i.e., Figure 3a and b), can the authors directly plot these two quantities for individual cells in solid-like and fluid-like spheroids?

      (3) There is experimental evidence showing the solid stress inside a spheroid is higher than at the periphery (e.g., https://www.nature.com/articles/ncomms14056). How does this cellular stress relate to these experimental measurements, since they are opposite to what is simulated here (i.e., the authors find max shear stress is lowest in the center and increases towards the boundary, which is opposite to what is measured?

      (4) It's worth pointing out that stress fibers aren't really prominent in cells in 3D spheroids. Nonetheless, cells moving on collagen fibers would have stress fibers and utilize contractile actomyosin bundles to generate traction forces.

      (5) In section 2D, it talks about the result that as the kcc associated with the boundary cell is decreased 10-fold for every 5 percent strain decrease in the fiber target spring length, can this result be shown? I have a hard time seeing where this came from.

      (6) The results of single-cell vs. two-cell breakouts shown in Figure 5 b and c are very qualitative and should be accompanied by some quantitative comparison.

    3. Reviewer #3 (Public review):

      Summary:

      The authors describe a mathematical and computational approach used to compute stresses and cellular deformations in a multicellular spheroid embedded in a fiber network. This approach is then used to predict stress and cellular anisotropy distributions in "solid-like" and "fluid-like" spheroids. Simulations show that shear stresses in solid-like spheroids are large and concentrated at the boundary of the spheroid, yet cells do not align with the direction of the largest shear. Conversely, shear stresses in fluid-like spheroids are smaller and uniformly distributed in the spheroid. In this case, cellular elongation is more likely to be aligned with the direction of the largest shear stress. The model and simulations also predict a nonlinear stress-strain relationship that is indicative of strain stiffening. This strain-stiffening is more pronounced in fluid-like spheroids. In an extension of the preliminary polyhedral vertex model, in which cellular interfaces are shared, the authors incorporate mechanical cell-cell interactions via adhesion springs between neighboring vertices. Using this extension, they show that cell breakout is more likely to occur in fluid-like spheroids, where cells are more likely to elongate and stiffen, allowing for larger forces to be exerted on the surrounding fiber network. Furthermore, the authors state that anisotropic cell-cell adhesion is required for multicell streaming during breakout.

      Strengths:

      The modeling and computational approach used in this research is this work's biggest strength. Treating the embedded spheroid as a set of polyhedra, where each polyhedron represents a single cell, is a mechanically robust, yet still tractable way to model multicellular spheroids in three dimensions. Starting with expressions for constraining cell volume and surface area as well as a surface energy term, the authors derive an expression for an averaged stress tensor for each polyhedron. This allows the authors to approximate the stress in each polyhedral cell that is caused by cellular deformations during mechanical interactions with the extracellular fiber matrix. This is a clever and robust approach that is based on fundamental mechanical principles that allow one to make reasonable predications about the mechanical state of the spheroid under a variety of conditions.

      Weaknesses:

      The weakness of the manuscript is the exposition. There are significant pieces of critical information missing from the manuscript that would make the presented work significantly more understandable and better support the authors' claims. Most importantly, many necessary details of the model are missing. I was able to get a better understanding of some of these details by reading the authors' earlier work (ref [10] in the submitted manuscript), and for this reason, I do feel that this work has value. However, several descriptions must be added for the paper to be more readily understandable. These include (1) a better explanation of what drives motion, in particular in the case where no external fiber network is present. (2) What physically distinguishes fluid-like spheroids from solid-like spheroids? Simply stating the value of the parameters s0 with no explanation is not sufficient. (3) An explanation of how histograms in Figure 2 are calculated is necessary. Are these histograms based on one simulation or several simulations? (4) The experimental results are briefly mentioned, but significantly more connection between these results and the numerical results of the cell breakout model is needed. (5) The description of the model that incorporates variable cell-cell attachments and cell breakout is very terse and needs more detail. Moreover, while the description of the results of this model is strong, the figure that illustrates cell breakout (Figure 5) is difficult to interpret. Addressing these and other issues will make the current manuscript, which presents an interesting model and result, much stronger and easier to read.

    1. Reviewer #1 (Public review):

      Summary:

      In this paper, the authors conduct both experiments and modeling of human cytomegalovirus (HCMV) infection in vitro to study how the infectivity of virus (measured by cell infection) scales with the viral concentration in the inoculum. A naïve thought would be that this is linear in the sense that doubling the virus concentration (and thus the total virus) in the inoculum would lead to double the fraction of infected cells. However, the authors show convincingly that this is not the case for HCMV, using multiple strains, two different target cells, and repeated experiments. In fact, they find that for some regimens (inoculum concentration) infected cells increase faster than the concentration of the inoculum, which they term "apparent cooperativity". The authors then provided possible explanations for this phenomenon and construct mathematical models and simulations to implement these explanations. They show that these ideas do help explain the cooperativity, but can't be conclusive as to what is the correct explanation. In any case, this advances our knowledge of the system and it is very important when quantitative experiments involving MOI are performed.

      Strengths:

      Careful experiments using state-of-the-art methodologies and advancing multiple competing models to explain the data.

      Weaknesses:

      Minor weaknesses in explaining the implementation of the model. However, some specific assumptions, which to this reviewer were unclear, could have substantial impact on the results. For example, whether cell infection is independent or not. This is expanded below.

      In the revised version, the authors address almost all of these minor weaknesses, strengthening the paper and its reproducibility.

      Suggestions to clarify the study:

      In the revised version, the authors carefully consider these suggestions and provide further details, clarifications and even some new results. Regarding the question of how infection of a cell with one virus could lead to lower probability for a secondary infection, I think that it is possible that infected cells activate antiviral programs that lead, for example, to lower expression of surface receptors. This has been considered at least in hepatitis C virus infection. However, this is a minor point.

      Overall, I think the revised version provides a sound study with relevant conclusions, and I thank the authors for their thoughtful consideration of my previous comments.

    2. Reviewer #2 (Public review):

      In their article, Peterson et al. wanted to show to what extent the classical "single hit" model of virion infection, where always the same quantity of virion is required to infect a cell, does not match with empirical observations based on human cytomegalovirus in vitro infection model, and how this would have practical impacts in experimental protocols.

      Strengths:

      - The use of a very simple and robust experimental assay, where they infected cells with serially diluted virions and measured the proportion of infected cells with flow cytometry. This convincingly showed how the proportion of infected cells differed from a "single hit" model which they simulated using a simple mathematical model ("power-law model"), and better fitted a model where virions need to cooperate to infect cells.

      - The use of different cell types and virus strains, which allows to draw some generalizations.

      - The exploration of the mechanisms that could explain this apparent cooperation, using biologically plausible simulations.

      - The practical consequences that this phenomenon has for lab virologists as well as modelers.

      Weaknesses:

      - The impossibility to discriminate between biological mechanisms is an important limitation of this study and calls for developing experimental designs able to further understand this question.

      - The outcome of the virion clumping remains highly sensitive to the choice of the clumps size distribution, which is itself very complicated to estimate, especially at high dilution.

      - The impossibility to directly fit the mathematical models to the data limit them to a qualitative discussion.

      Overall, this work is very valuable as it raises the general question of how the estimate of infectivity can be biased if extrapolated from a single virus titer assay. The observation that HCMV virions often cooperate and that this cooperation varies between context seems robust. The putative biological explanations would require further exploration.

      This topic is very well known in the case of segmented viruses and the semi-infectious particles, leading to the idea of studying "sociovirology", but to my knowledge this is the first time that it was explored for a non-segmented virus, and in the context of MOI estimation.

    1. Reviewer #1 (Public Review):

      This study by Charendoff et al provides interesting observations related to global histone hypermethylation in host cells, during Chlamydia trachomatis infections. The core observation they report is that the host histones are highly hypermethylated during infection, and this appears to be an amplifying effect due to continuous inhibition of demethylases, in part due to a metabolic shift in the host where succinate amounts (which inhibit demethylases) increases. The authors claim specifically due to the bacteria, since antibiotic treatment prevents histone hypermethylation (but leaves you wondering about cause/consequence correlations).

      The core observation of hyper methylation is very interesting, and well documented. There are a number of points to consider though in order to fully substantiate the findings, and close out loose ends. My comments are broad - and built around the interpretations (vs the data presented).

      (1) Related to observations coming Fig 1C etc, and connecting to Fig 3 - the hyper methylation appears to be across different protein arg/lys residues - and is not histone specific. So, is it just a consequence of high SAM pools and flux in infected cells? i.e. the bacterial infection increases SAM pools in cells, and provides an increase in substrate pools for the methyltransferases, leading to protein hyper methylation. The approach used here only measures steady-state SAM amounts (and not SAM flux or utilisation). For example, reduced SAM amounts in nuclei could be due to increased utilisation of SAM. The experiments done with the demethylase does not actually answer this question - if you decrease demethylase activity, you will get an increase in net methylation. The authors see an increase in net methylation in the infected cells - this would suggest that in addition (or perhaps primarily) to reduced demethylase activity, there could be much higher SAM utilisation/flux. Again, the over expression of JMJ proteins does not resolve this problem.

      (2) Adding to this - what happens to SAM pools in the cells treated with the inhibitors? This actually may not look like the slightly reduced SAM pool observed in infected cell nuclei. Also, what is the SAM/SAH ratio (a very useful indicator of methylation activity).

      (3) There is a correlation/implication issue here in Fig 2 - cells with C. trachoma's infection show hyper methylation. But these are the only cells with high C. trachomatis. So it is a bit ingenious to say that histone hyper methylation correlates with bacterial proliferation. The cells without bacteria don't have hyper methylation - and that does not have anything to do with the bacterial proliferation.

      (4) The claim that demethylase activity is down in infected cells again comes primarily from the increased succinate (2-fold) amounts in infected nuclei - and then correlated with experiments where succinate, (permeable) a-KG are supplemented in excess. While I personally like the hypothesis that the hypermethylation might be a result of an imbalance in cofactors (succinate vs a-KG) in infected cells, the data presented is very premature to make that conclusion. Again, steady state measurements of only succinate cannot provide a clear answer to that question. For example, is there a clear allocation/flux difference (between a-KG, and leading out to glutamate/glutamine, vs flux through the TCA and increased succinate accumulation? Is there a bottleneck/build-up of succinate in cells that might lead to the increase in nuclei? This also opens another direction of possible regulation - increased histone succinylation. When you see a large increase in succinate in the nucleus, before looking at demethylase activity - it becomes obvious if succinate itself increases histone succinylation (through HATs).

      (5) What might the authors hypothesise about why this hyper methylation happens? It appears in some ways that hyper methylation happens - potentially due to a metabolic bottleneck that the bacteria triggers (and there is a build-up of SAM and/or succinate, and altered flux out of a-kg). The methylation is just a visible outcome - but may not be central to pathogenesis or viability.

    2. Reviewer #2 (Public Review):

      Strengths:

      (1) Because the study compares genuinely infected cells with uninfected cells within the same infected cell population, it enables a clearer and more rigorous comparison.

      (2) By using multiple Chlamydia species and cells from multiple host species (human and mouse), and obtaining consistent findings across these systems, the study demonstrates the generality of bacterium-induced epigenomic alterations.

      (3) The study shows that the epigenomic changes are caused by reduced activity of JMJC domain-containing lysine demethylases, demonstrating through multiple complementary approaches-including the use of a demethylase inhibitor, overexpression of target-specific demethylases, and analysis from the perspective of cofactors required for JMJC domain-containing demethylases-that decreased lysine demethylase activity constitutes the molecular mechanism underlying the increased H3 methylation levels induced by Chlamydia infection.

      (4) By performing ChIP-seq analyses of H3K4me3 and H3K9me3, the study clearly delineates, on a genome-wide scale, how infection leads to increased levels of these epigenomic marks.

      Weakness:

      (1) Reduction of cofactors such as Fe2+ or a-KG decreases the activity of JMJC-domain-containing lysine demethylases (thereby directly affecting histone H3 lysine methylation). However, these cofactors are also involved in the activities of other epigenetic regulators, such as TET enzymes that contribute to DNA demethylation and SIRT family proteins that mediate histone deacetylation. Therefore, it cannot be excluded that modulation of these factors indirectly leads to the changes in H3 lysine methylation dynamics targeted in this study.

      (2) Related to point 1, although overexpression of JMJC-type demethylases has been shown to reduce the Chlamydia infection-induced increase in H3 lysine methylation, it is well known that over production of these enzymes, while target-specific, also leads to a genome-wide reduction of lysine methylation. Thus, a decrease in lysine methylation upon expression of these demethylases does not necessarily demonstrate that the infection-induced increase in H3 lysine methylation is caused by impaired JMJC-type demethylase activity.

    3. Reviewer #3 (Public Review):

      In this manuscript, the authors explore a molecular basis for hypermethylation of histones in epithelial cells infected with the obligate intracellular bacterial pathogen Chlamydia trachomatis. This is of particular interest given that Chlamydia is known to drastically alter host cell gene transcription, and histone hypermethylation would suggest a new way by which Chlamydia interferes with gene expression of its host. Histone methylation was previously implicated in the introduction of dsDNA breaks in infected cells, and the chlamydial effector NUE was reported to methylate histones, but the role of this modification in dictating host cell gene transcription has been unexplored. The authors use a suite of tools to approach this question, including various -omics techniques, genetic approaches, and biochemical assays. Overall, the manuscript provides many interesting pieces of data, though some of them are difficult to reconcile, which may reflect methodological hurdles that are not fully addressed in the current version of the manuscript. My major concerns regard the rationale/interpretation for various mechanistic experiments and that the heterogeneity of the histone hypermethylation phenotype is not addressed which I believe may explain some apparent inconsistencies in the results.

      Using an immunofluorescent approach, the authors show that a subpopulation of the nuclei in Chlamydia-infected cells (~10-20%) exhibit high amounts of methylated histone species. This occurs during the late stages of infection, near the time when Chlamydia would lyse the host cell and positively correlates with bacterial burden. Accordingly, halting chlamydial growth blocks the onset of histone hypermethylation. Exogenously supplying cofactors for histone demethylases, the low activity of which is implicated in the histone hypermethylation phenotype, reduces histone hypermethylation. In general, these data are compelling and raise interesting questions about the role of histone methylation in governing chlamydial egress from infected cells. Interestingly, these behaviors seem to arise independently of NUE, the secreted chlamydial histone methyltransferase, supporting the notion that a metabolic reprogramming may underlie the hypermethylation phenomenon.

      As noted above, the authors propose that hypermethylation arises due to decreased demethylase activity in infected cells. However, the data do not conclusively support this interpretation. For example, the approaches used to probe demethylase activity rely on (i) a direct biochemical measure of demethylase activity, (ii), pharmacological inhibition of demethylase, and (iii) heterologous expression of a specific demethylase. With the exception of (i), these approaches would be expected to alter histone methylation regardless of the source. That is, inhibition of demethylases should increase histone methylation regardless of whether the source of methylation is increased methylase or decreased demethylase activity. Similarly, overexpression of a demethylase would be expected to reduce cognate histone methylation arising either from increased methylase or decreased demethylase activity.

      Moreover, the authors report that the effect of the demethylase inhibitor on histone hypermethylation is significantly potentiated by infection, suggesting that infected cells have greater methylase activity than uninfected cells, because the latter barely respond to the presence of demethylase inhibitor. In other words, a dramatic increase in histone methylation in the presence of demethylase inhibitor is most parsimoniously explained by increased methylation (no longer being removed by demethylase), not decreased demethylation (which would be analogous to treatment with demethylase inhibitor). The authors do not directly assay methylase activity. These concerns extend to the rationale used to justify experiments with infected mice, which the authors treat with the demethylase inhibitor.

      The authors perform experiments to characterize the consequence of hypermethylation genome-wide. Because the authors do not enrich for those cells which exhibit histone hypermethylation, the results reflect the mixed population, and therefore presumably dilute out important signal related to the phenomena under investigation. For example, the proteomic analysis of post-translational modifications identifies only one methylated histone species, whereas the immunofluorescent approach shows consistent effects across five different methylated histone species. Moreover, the chromatin immunoprecipitation analysis indicates that there is unexpectedly a lower density of methylated histones at regions which are also enriched in uninfected cells. The authors argue that this suggests increased methylation is happening "outside" of these histone-dense regions, but direct evidence in support of this claim is lacking.

      In sum, this paper provides compelling evidence in support of the notion that histones are hypermethylated at various residues late in chlamydial infection, that this process is modulated by known cofactors of demethylases, and is the result of high levels of bacterial replication in the cell. That histone hypermethylation governs host gene transcription during chlamydial infection suggests a relatively novel mechanism by which Chlamydia subverts the host cell to establish a replicative niche or egress to infect a new cell. The information obtained regarding the methylation status of host proteins and host gene transcription controlled by a metabolic cofactor during infection will be a useful resource for other researchers. However, in the current version of the manuscript, the mechanistic basis for these behaviors is relatively unclear.

    1. Reviewer #1 (Public review):

      Summary:

      This study addresses an important question in reinforcement learning and metacognition by distinguishing value confidence from decision confidence and testing how each is computationally represented. The findings are significant because they suggest that value confidence is well captured by Bayesian uncertainty, whereas decision confidence reflects a hybrid computation combining probability correct with broader value certainty. The evidence is promising, supported by multiple datasets and model comparisons.

      Strength.

      (1) A major strength of the study is that the authors test their hypotheses across multiple datasets, including previously published datasets and newly collected data. This broad empirical approach increases the generality of the findings.

      (2) The Bayesian model of value confidence has a clear theoretical basis. The proposed hybrid model of decision confidence is also intuitive. It appears to capture important aspects of the decision confidence data.

      (3) The paper provides a useful framework for linking how certainty about value estimates guides the subsequent choice and the corresponding decision confidence.

      Weakness

      (1) The conceptual link between value confidence and decision confidence is not yet fully established. The manuscript argues that overall value certainty contributes to decision confidence, but this conclusion is based largely on the latent variable that the model infers from the decision confidence experiment alone. A more direct test would require measuring value confidence and decision confidence within the same participants and task, and analysing how these two types of confidence interact.

      (2) The individual-difference analyses in Figure 5 are methodologically challenging. The predictors used in these analyses are derived from model fits to the behavioural data and are then correlated to behaviour in the same task. This creates a risk that correlations inevitably arise. Thus, it does not assure that correlations are cognitively meaningful.

      (3) The model recovery results suggest that some candidate models are not clearly distinguishable.

      (4) The manuscript would benefit from clearer explanations of why specific models capture particular behavioural patterns.

      (5) The claim that value confidence modulates the exploration-exploitation trade-off should be interpreted carefully, because the model uses global uncertainty across both options, not option-specific value confidence.

    2. Reviewer #2 (Public review):

      Summary:

      In this work, the authors propose a common value-estimation framework based on Bayesian inference and show that it can account for both participants' confidence in their value estimates ("value confidence") and for their confidence in their final choices ("decision confidence").

      Strengths:

      The study extends several established findings in the confidence and reinforcement-learning literature. In particular, the authors not only examine decision confidence but also directly model value confidence, and they replicate the idea that decision confidence reflects a combination of multiple computations, previously described for categorical decisions (Navajas et al., 2017), in the context of continuous value-based decisions. I therefore consider the work a useful contribution to the field.

      Weaknesses:

      However, I believe that the scope of the conclusions is overstated relative to the results that are actually presented.

      (1) Interaction between value confidence and decision confidence

      The abstract and introduction frame the study as addressing a major gap in the literature, namely, the lack of direct investigation of the interaction between value confidence and decision confidence. Yet the manuscript never directly tests the interaction between these two quantities. Instead, the authors show that the reported decision confidence depends not only on the probability of being correct, but also on the precision of the decision variable DV, which is related to the precision of the value estimates underlying value confidence. While this is related to the proposed research question, it is not a direct analysis of the interaction between value confidence and decision confidence themselves.

      (2) Unified computational framework

      Similarly, the claim that the study provides a "unified computational framework" appears somewhat overstated. The proposed models build on standard and well-established Bayesian frameworks and extend them specifically to account for decision confidence. While this demonstrates that both forms of confidence can be expressed within a common Bayesian formalism, the manuscript does not establish a direct computational interaction or shared mechanism between them beyond their dependence on the same underlying uncertainty estimates.

      (3) "Phenotypes" interpretation

      The interpretation of the observed individual differences as distinct "behavioural phenotypes" also appears overstated. The reported analyses primarily show continuous variability across participants in the relative weighting of different components contributing to confidence reports, rather than evidence for qualitatively distinct categories or computational subtypes of decision-makers.

      (4) Decision confidence terminology

      I also found some conceptual ambiguity in the terminology used throughout the manuscript. Early in the paper, decision confidence is defined normatively as the subjective probability of having made the correct choice, corresponding to P(DV>0). Later, however, the authors show that participants' confidence reports are better explained by a combination of this probability and the precision of the decision-variable distribution. Despite this distinction, the manuscript continues referring to the reported quantity simply as "decision confidence." Clarifying the distinction between the theoretical construct and the empirical reports (for example, by referring to "reported decision confidence") would improve conceptual clarity.

    3. Reviewer #3 (Public review):

      Summary:

      Comay, Solovey, and Barttfeld aim to provide a unified computational account of confidence in reinforcement learning by distinguishing value confidence-the certainty associated with latent value estimates-from decision confidence-the confidence that a particular choice is correct. Across new experiments and reanalyses of previously published datasets, they argue that value confidence is best described by Bayesian posterior precision, that this form of confidence adaptively reduces decision noise as learning progresses, and that decision confidence is better captured by a hybrid model combining Bayesian probability correct with a more global estimate of value certainty. They further propose that individual differences in the relative weighting of these components define "confidence phenotypes" that predict task performance, exploration-exploitation behavior, and metacognitive accuracy.

      Strengths:

      A major strength of the study is that it addresses an important conceptual distinction that is often blurred in the confidence literature. The paper usefully separates uncertainty about latent environmental states from confidence in an action derived from those latent beliefs. This distinction is especially important in reinforcement learning, where uncertainty is not merely a retrospective judgment about accuracy but can directly shape future sampling, learning, and action selection. The manuscript is therefore well positioned to bridge work on Bayesian confidence in perceptual decision-making with work on uncertainty-guided learning and exploration.

      A second strength is the authors' use of multiple datasets and model comparisons. The claim that value confidence tracks Bayesian uncertainty is supported across tasks in which participants explicitly report confidence in value estimates, including datasets where reward variance is manipulated. The latter manipulation is particularly useful because it helps distinguish a Bayesian uncertainty account from simpler models based only on the number of observations. The finding that value confidence modulates the softmax slope and thereby promotes more exploitative choices as uncertainty decreases is also theoretically coherent and supported across several datasets, including a preregistered replication.

      The manuscript's most interesting and potentially impactful contribution is the hybrid model of decision confidence. The authors show that a model based only on Bayesian probability correct captures confidence on correct trials better than on incorrect trials, whereas adding an "overall value confidence" term improves the fit. This is a useful result because it suggests that confidence reports in reinforcement learning may not be a pure readout of decision-level discriminability, but instead may combine decision-specific evidence with more global latent-state uncertainty. This could help explain why human confidence often deviates from ideal Bayesian predictions, especially on error trials.

      Weaknesses:

      However, the interpretation of the hybrid model remains the main weakness of the paper. The second term, overall value confidence, is not equivalent to the precision of the decision variable. It can dissociate from decision difficulty: two options can be far apart but individually uncertain, or nearly identical but individually well estimated. The authors appear to recognize this issue and have reframed the term as "overall value confidence" rather than decision-variable precision. This is a useful clarification, but the conceptual role of the term still requires sharper treatment. In its current form, it is sometimes described as part of a unified confidence computation, but it may be more accurately understood as a biasing or contextual signal that modulates reported confidence without necessarily improving decision calibration.

      A related concern is model identifiability. In many reinforcement-learning tasks, probability correct and overall value confidence both change systematically over the course of learning. As a result, the hybrid model may gain predictive power partly because it captures generic time-on-task or learning-progress effects, rather than because participants explicitly combine two separable uncertainty signals. The manuscript would be stronger if it more clearly demonstrated that the two latent variables are distinguishable in the behavioral data, for example, through model recovery, parameter recovery, cross-validated prediction, and analyses of the correlation between latent regressors across task conditions and individuals.

      The link between the decision rule and confidence model also deserves more scrutiny. The authors use value confidence to modulate decision noise in the choice model, and then use a related global value-confidence term in the confidence-report model. This creates an appealing unified architecture, but it also raises the possibility that the same latent variable is doing multiple kinds of explanatory work. The paper would benefit from a clearer separation between uncertainty as a driver of choices, uncertainty as a determinant of confidence reports, and uncertainty as an inferred latent variable extracted from the same behavioral data.

      From a computational neuroscience perspective, the manuscript would also benefit from a more explicit discussion of how these confidence quantities might be represented neurally. The current model treats value confidence, probability correct, and overall value confidence as scalar latent variables available to the observer. Yet uncertainty-related computations may be represented nonlinearly in neural population activity rather than as explicit scalar readouts. Work on nonlinear neural decoding and population codes has shown that task-relevant variables can be carried by nonlinear statistics of neural activity, especially when nuisance variables obscure mean tuning, and that behavioral choices can reveal whether such nonlinear information is efficiently decoded. This literature provides a useful framework for connecting the present behavioral model to possible neural implementations of value and decision confidence.

      Overall, the authors largely achieve their goal of demonstrating that value confidence and decision confidence are computationally dissociable in reinforcement learning. The evidence for Bayesian value confidence is strong, and the evidence that confidence-guided exploitation improves the account of choice behavior is convincing. The evidence for the hybrid account of decision confidence is promising but would be strengthened by additional analyses clarifying model identifiability, the interpretation of the overall value-confidence term, and the conditions under which the model makes distinct predictions from simpler time-, value-, or evidence-based alternatives. The paper is likely to be useful for researchers interested in computational models of confidence, metacognition, and adaptive behavior under uncertainty.

    1. Reviewer #1 (Public review):

      This study provides evidence that the apicoplast-locaized isoform of acyl-carrier protein (ACP) has acquired important non-enzymatic functions in the malaria parasite. Previous studies have shown that the apicoplast-located FASII-dependent pathway of fatty acid synthesis is not essential in Plasmodium blood stages. In contrast, genome-wide knockout studies suggested that ACP, a key protein in this pathway, is essential in these stages, indicating that it may have additional non-canonical functions. In this study, the authors confirm that ACP is essential in Pf blood stages (using both apicoplast IPP rescue and conditional knockdown); show that this essential function requires modification with 4-phosphopantetheine and use proximity biotinylation and complementary immunoprecipitation pull-down approaches to provide compelling evidence that ACP binds to and stabilizes the apicoplast-located isoform of pyruvate kinase II. Notably, these interactions appear to differ from those associated with the binding of mitochondrial isoforms of ACP to proteins involved in Fe-S biosynthesis. Loss of ACP was shown to lead to a decrease in PKII levels and apicoplast DNA/RNA synthesis, consistent with loss of NTP synthesis in this organelle. The data are clear and very well described, and the findings represent a significant advance in our understanding of metabolic regulatory mechanisms in apicomplexan apicoplast studies.

      Strengths:

      The study uses a variety of complementary genetic approaches to demonstrate the essentiality of ACP and the enzyme involved in its activation with 4-PP in Pf blood stages, demonstrating that the ascribed non-enzymatic function is mediated by holo-ACP. Similarly, a number of complementary biochemical approaches, including proximity biotinylation, immunoprecipitation, and co-expression of PfACP and PK-II in a heterologous bacterial expression system, are used to confirm the physiological significance of the PfACP and PK-II interaction. The study also reports additional findings, such as the independence of P. faciparum blood stages on exogenous (media) fatty acids, indicating that intracellular stages can salvage all of their requirements from the red blood cell.

      Weaknesses:

      Overall, this is a very strong study. While questions remain around the function of other apicoplast ACP-interacting proteins detected in this study, I don't have any suggestions for significant improvements.

    2. Reviewer #2 (Public review):

      This study focuses on revealing the essential divergent function of the Acyl Carrier protein (ACP) in the deadliest human malaria parasite, Plasmodium falciparum. More precisely, using inducible KO, cellular and biochemical approaches, the authors determined that instead of a canonical role for ACP allowing the de novo synthesis of fatty acids in the apicoplast (essential relict plastid) of the parasite, the enzyme couples with pyruvate kinase II to generate nucleoside triphosphate to maintain parasite survival during blood stages. The study is novel, well-designed, providing interesting new data on Plasmodium and apicomplexa biology. The results convincingly support the major claim of the study. However, it is currently incomplete to support some claims on the essentiality of some apicoplast pathways.

      In this study, Geher et al. focused on deciphering the role of the Acyl Carrier Protein (ACP) present in the relict non-photosynthetic plastid, i.e. the apicoplast of the most lethal human malaria parasite, Plasmodium falciparum. More particularly, they determined an essential function of ACP independent of its usual/typical function as the central protein for the normal function of the apicoplast Type II fatty acid synthesis (FASII) pathway. Rather, the protein seems to associate with the apicoplast Pyruvate Kinase II, together generating an essential nucleoside triphosphate (NTPs) source to fuel the apicoplast and parasite survival instead.

      By generating a TetR-DOZY-based inducible KD line for ACP, they confirmed that the protein is indeed essential to maintain apicoplast integrity and parasite survival during asexual blood stages, as previously predicted and experimentally shown. They showed that ACP requires a biochemical modification, typically activating the protein for its function in the FASII pathway, i.e. binding of the 4-PP group by holoACP synthase. Then, they showed that the other enzymes of the FASII pathway are likely dispensable during the blood stage, as they were able to generate a KO line of the first enzyme of the pathway, FabD (which was predicted to be essential in P. falciparum). Based on a cell culture approach in a controlled culture medium, they further claimed that, unlike current evidence-based hypotheses, the FASII pathway (and thus a potentially FASII-linked ACP) has no role/activity during blood stages. Using a proximity biotinylation approach, they determined that ACP associates with the apicoplast pyruvate Kinase II (PKII), previously shown to generate NTPs in the apicoplast for energy and DNA/RNA maintenance (Xia et al. 2019), and not to fuel the FASII pathway as its main function in blood stages. Finally, they showed that the disruption of ACP induces the reduction of the presence/content in PKII in the parasite, as well as the drastic reduction of the apicoplast DNA and RNA content. Together, they concluded that the main function of ACP is indeed the NTP formation via its association with PKII, rather than its canonical role for the generation of fatty acids in the apicoplast.

      This study is novel and focuses on a topic of particular interest in malaria biology, but also for most of the apicomplexa-related diseases, and beyond for plastid bearing orgnaisms and this unusual role for ACP. The study is well thought out with proper biochemical approaches that convincingly point to this association of ACP with PKII for NTP synthesis as a major function during P. falciparum blood stages. However, there are currently some important experimental issues/flaws, missing experiments that induced wrong interpretations and thus do not support some important claims of the study, notably for the role of FASII and the interaction between ACP and PKII.

      Therefore, at this point, the study is only partial and would require major additions and/or important text edits/revisions before being considered for acceptance.

      Major points:

      From the graph of P. falciparum growth, we can see that in the lipid-rich condition, where both FabH KO and ACP KO can survive, the addition of mevalonate was essential for the growth of ACP KO. Along with the other evidence (PKII association, DNA levels...), we therefore agree that PfACP is involved in the mevalonate pathway. The authors claim that the FASII pathway is inactive/not essential in the P. falciparum blood stage. However, the authors have not shown any evidence on whether ACP is or not involved in the FASII pathway during the asexual blood stage. As currently designed, the experiments presented cannot conclude on that point for several reasons. Indeed, it was previously shown that (i) the expression of the protein from the FASII pathway are all present in blood stages and are significantly upregulated in patients that are under under "nutrient starvation" (Daily et al. Nature 2007), (ii) that, growing parasites under similar low lipid conditions in vitro induces an activation/upregulation of FASII, which can be measured by stable isotope precursor labelling and lipidomics (Botté et al. 2013), (iii) that growing the PfFabI KO line under deprived lipid conditions leads to parasite death (Amiar et al. 2020), indicating that the FASII pathway can become critical, if not essential, depending on the host nutritionnal content together correlating patients' data and metabolic adaptation for the same reasons in the related parastie Toxoplasma gondii (Amiar et al. 2020, Krishnan et al. 2020, Liang et al. 2020, Primo et al. 2021, Charital et al. 2024, Dass et al. 2024, Bitew et al. 2025).

      Here, the authors are expecting to show that FabH (and thus the FASII pathway) is not essential in an experiment that is not designed to be in low lipid conditions but rather in lipid rich conditions: Such high lipid conditions of culture in this study is granted by daily feedings with high fatty acid supplement (30-90 uM palmitic acid and 30-60 uM oleic acid). These fatty acid concentrations were used previously by Mitamura et al. (2005) and Mi-ichi et al.(2007) to replace non-determined supplements such as Serum or Albumax supplement to grant similar growth by a completely controlled culture medium.

      This means the concentrations above do not represent limited fatty acid concentrations, especially not with daily feeding (representing an excess supplied amount of lipids, unlike regular 48h feedings) that allowed the authors to easily reach very high non-physiological parasitaemia of more than 20%!! Amiar et al. previously showed essentiality of FabI in P. falciparum in the limited fatty acid culture at a lower concentration (<30uM 16:0, <45um 18:1), than the Mi-Ichi et al. controlled medium with regular 48 h culture feeding. Therefore, with the current experimental settings, the FAH KO is placed in high lipid conditions, thus preventing any conclusion on its essentiality under low lipid conditions.

      Furthermore, it is too uncertain to conclude that ACP is only essential for the mevalonate pathway. This would be a similar discussion to the Yeh et al. 2011 and the Swift et al., where induced Apicoplast knockout caused parasites to require IPP to survive, but there were always remnant apicoplast vesicles and thus the putative presence of an active FASII in the parasite, where de novo fatty acid synthesis could be maintained. Amiar et al. (2020) and Krishnan et al. (2020) showed that disruption of FASII and absence of de novo FA synthesis in T. gondii could be compensated by the exogenous supplementation of myristic acid, C14:0. Here, high fatty acid supplementation using commercially available fatty acids may include unexpected fatty acid species such as myristic acid in palmitic acid or oleic acid, since all commercially available fatty acids guarantee only >99% but not 100%. If P. falciparum requires a very, very low amount of myristic acid to survive, the amount of possible contamination, like 1 nM, may be sufficient to maintain their survival. Thus, ACP and FabH might be very important to generate de novo fatty acids within parasites, but this was not shown by the authors.

      Therefore, the manuscript currently contains incorrect conclusions on the potential essentiality/use of FASII, against current experimental evidence.

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Duss et al. use several complementary and state-of-the-art strategies to characterize the effects of norepinephrine release from LC axons on post-synaptic cell types in the hippocampus. While a large body of research supports an important role for NE signaling in hippocampal function, the precise role by which NE promotes these effects remains poorly elucidated, in large part due to the complexity that adrenergic subtypes can be expressed in a variety of cell types and promote a variety of responses. Towards assessing this, the authors first establish an optogenetic strategy by which their delivery stimuli mimic endogenous activation of LC in 'moderate' and 'high' acute stress events, using NE sensors to titer stimulation patterns to similar levels of NE release. They then conduct a series of 2P imaging experiments in mice and compare response properties of various cell types in the hippocampus (excitatory and inhibitory neurons, and astrocytes) when the animal is 'naturally' or optogenetically aroused (via activation of the LC). The results are surprising. Whereas natural arousal causes activation of astrocytes, pyramidal cells, and interneurons, optogenetic activation of the LC does almost the opposite, with only astrocytes responding positively. Another important finding from the study is that astrocytes seem to be the most responsive cell type in the hippocampus to NE release, suggesting they could be key components for downstream functional effects of NE release in this brain region.

      Strengths:

      (1) The study was methodically done with respect to the characterization of how optogenetic parameters related to levels of NE release. Also, the analysis of their calcium imaging of various cell types in the hippocampus was very comprehensive.

      (2) Related, their discovery that cell types in the hippocampus respond differently to NE release, while not a completely unexpected finding, is something that has not been addressed experimentally in such a direct way before (to my knowledge).

      (3) Their finding that optogenetic stimulation of the LC produces opposing results to when these cells are naturally activated has wide implications for the LC field and potentially beyond.

      Weaknesses:

      I was surprised that no efforts were made to further assess what might be causing this discrepancy in hippocampal responses to optogenetic vs. natural activation of the LC. Some experiments that I felt were missing:

      (1) The authors go to great lengths to measure NE release in a variety of arousing conditions (tail lift, foot shock, 5Hz LC opto, 20Hz LC opto), but then in their 2P imaging, they're comparing the opto results to a 'natural' arousal state defined as when the mice were in motion. Maybe I missed it, but I wasn't sure that they ever checked the level of hippocampal NE release in this running state, similar to what they did in the other arousal conditions. Thus, it wasn't clear to me how comparable this state was to the optogenetic stimulation.

      (2) The authors do a nice experiment to show that increases in the hippocampal NE sensors are dependent on LC activity via optogenetic inhibition of the LC (Figure 1, Supplement 3). It seems like a missed opportunity to include a similar strategy in their 2P testing, to assess whether the differing responses of pyramidal cells, interneurons, and astrocytes are truly due to NE release. I could imagine it might be difficult to precisely time LC inhibition with periods of movement, but I imagine that mice would still run even if the LC is inhibited.

    2. Reviewer #2 (Public review):

      Summary:

      The manuscript aims to determine the extent to which LC-mediated NA release in the CA1 region of the hippocampus (at both population and cellular levels) contributes to physiological arousal responses associated with innate behaviors (stress, locomotion). The manuscript is divided into two parts in which the authors compare time-locked responses in astrocytes, interneurons (pan-targeting), and pyramidal (CaMKIIa-driven targeting) cells.

      In the first part of the manuscript, the authors perform bulk recordings of either NA release or calcium activity locked onto either 'natural arousal' events (tail lift, foot shock, force swim) or direct optogenetic activation of LC somas. A first aim is to identify an optogenetic stimulation frequency that would mimic NE release in the target area by low- and high-intensity stressors. In the second aim, they compared evoked responses across cell types and concluded that stressors and direct LC activation trigger similar responses in astrocytes but not in interneurons or pyramidal cells.

      In the second - and most extended - part of the manuscript, the authors performed 2-photon cellular recordings of these different cell populations and compared responses evoked by the onset of locomotion vs. direct activation of the LC. Doing so, they observed a great degree of heterogeneity across these two conditions and across cell types. They conclude that NA effects on the hippocampus are primarily mediated by astrocytes and that LC-NA neuromodulation alone does not recapitulate the full breadth of 'natural arousal' modulations. They conclude that other neuromodulators likely contribute to how the hippocampus responds to high arousal levels.

      Strengths:

      Overall, the manuscript is well written and the figures are particularly clear.

      Optogenetics is a very successful technique in contemporary neuroscience, yet one important identified limitation is that it operates largely in a non-physiological regime, driving spike rates in regions rarely visited under normal physiological operations. This has raised valid concerns about the physiological relevance of findings obtained from studies using this technique. Here, the authors aimed at calibrating optogenetic manipulations of the LC so as to match the physiological release of NA observed in specific behavioral contexts. This is a valuable endeavor that could bring the field towards more reproducible and broadly valid findings.

      Another important open question is how different cell types coordinate to support global network activity and adaptive behavior. By recording distinct cell populations from the same region (CA1) and in response to the same category of endogenous versus exogenous events (locomotion or LC activation), it becomes possible to unravel important and specific operation modes, here also linked to a specific category of neuromodulation signaling.

      Weaknesses:

      This manuscript was difficult to review. There is clearly a lot of work and effort that went into it, and the multiple techniques seem well implemented, often with appropriate controls. Yet, the general framing, the links between experiments and interpretations, unfortunately, look questionable in my opinion. Below, I unpack what I think are the 4 main weakness points.

      (1) Incomplete calibration of optogenetic manipulations to physiological regimes

      While mapping optogenetic stimulation protocols to physiological variations is valuable, the proposed approach suffers from major limitations. First, the only parameter that is calibrated is the peak of NE release (as estimated from GRAB-NE fluorescence). Thus, it excludes other important aspects of the response, including trial-to-trial variability and the temporal dynamics of the response. Furthermore, stressor and LC activation conditions are simply non-comparable in terms of the duration of the stimulation (e.g., 3 min swim test versus 10s optogenetic stimulation), likely involving neuromodulation at different timescales (phasic vs. tonic). Albeit not explicitly mentioned, the number of trials and inter-trial interval between successive stimulations are also likely unmatched. On another note, the identification of the best stimulation frequency seems based on a grid of predefined values, while a more precise, continuous assessment could have easily been used. Finally, even though phasic NE release is known to depend on baseline tonic NE levels (especially with a sensor that reports a sublinear function of NE concentration), this dimension is ignored.

      (2) Weak links between imposed stressors and spontaneous locomotion

      The general approach is surprising: authors calibrated the optogenetic stimulation protocol on a range of stress-related behaviors and applied this to locomotion behavior. Indeed, while the first part of the manuscript uses different stressors in freely moving contexts to 'naturally' elevate arousal, the second part uses spontaneous locomotion bouts in a head-fixed situation as proxies for heightened 'natural' arousal. These two parts are very difficult to relate, and it is entirely unclear how NE regimes observed in the first context generalize to the second. Yet, on several occasions, the authors directly relate the first (fiber photometry, Fig.1) and second (2-photon, Fig. 2-6) parts of the manuscript. For instance, they conclude in favor of a "weak alignment between astrocytic responses to arousal and to LC stimulation on a cellular basis, despite the similarity of the bulk response." It remains unclear why closer preparations weren't used in the two parts, such as time-locked change in GRAB-NE2m fluorescence according to either locomotion onset or in a fear conditioning assay, both using fiber photometry in a head-fixed setting.

      (3) LC optogenetics and spontaneous locomotion differ by more than the origin of the arousal drive

      By directly comparing spontaneous locomotion and LC activation, the authors imply that the only difference between these two conditions is the origin of arousal: endogenous vs. exogenous, respectively. Furthermore, they interpret LC activation as triggering a pure NA effect while locomotion would reflect the conglomerate modulation from multiple neuromodulatory systems. On the one hand, LC activation likely results in the recruitment of other arousal centers (the raphe serotonin system, for instance, see 10.1101/2025.03.26.644382). On the other hand, differences between these conditions span well beyond specific arousal centers (see the massive motor-related activity in cortical dynamics: 10.1038/s41593-019-0502-4). Another, more methodological concern is the larger instability of the field of view during locomotion by comparison to optogenetic activation. While I am sure the authors corrected for movement-related translation in x and y directions, there might still be residual motion artefacts in the z direction that could account for some of the differences between the two conditions.

      (4) Loose equivalence between locomotion and natural arousal

      On many occasions, the authors draw a direct equivalence between spontaneous locomotion and 'natural arousal'. Arousal is a multifaceted concept that relates to far more behavioral readouts and network states than just locomotion. For instance, imagine a freezing mouse in response to a threat: locomotion would be absent, but the animal would still be quite aroused. It is ok to leave aside a particular readout and focus on other one(s) (especially thus in the case of arousal, which has many aspects). However, in that case, a single readout cannot be equated with 'natural arousal' as a whole. Instead, terms like 'locomotion' or 'locomotion-linked arousal' should be preferred. Indeed, in the particular case of locomotion, what is being readout is the upper part of the arousal continuum, whereas pupil size or whisker pad movements can also provide a more complete readout, including the lower and intermediate parts of that same continuum. While it is not necessary to include other arousal readouts (once claims are appropriately modified), the motivation for leaving out available readouts (lines 187-201) feels like a post-hoc rationalization.

      In sum, these 4 points call in my opinion for a profound change in how results are presented and interpreted. If agreed, a solution could be to leave aside the first part of the manuscript, to provide a more accurate picture of the differences between optogenetic activation and spontaneous locomotion, and to better flag the limitations of the approach (a part that I believe is entirely missing in the current version).

    3. Reviewer #3 (Public review):

      Summary:

      In this study, the authors focused on the CA1 region of the hippocampus to compare Ca2+ dynamics in astrocytes, pyramidal neurons, and interneurons in response to optogenetic stimulation of locus coeruleus-triggered noradrenaline (NA) release, or movement (natural arousal)-triggered NA release. The most striking finding is that all studied cell types responded differently to LC stimulation compared to natural arousal. The description of these findings is important as a resource for further mechanistic studies on how multiple neuromodulator systems may interact or for predicting the consequences of the selective impairment of the noradrenergic system.

      Strengths:

      The technical design and conduct of the experiments, analysis including statistics, as well as the presentation of the results, are timely and very solid.

      Weaknesses:

      The identity and localization of NA receptors responsible for effects on neurons are less clear, and therefore, the difference between LC stimulation and natural arousal is less surprising. However, the presented data are consistent with the established finding that astrocytes directly sense NA mainly through α1 adrenergic receptors, yet in this study, astrocytes that responded strongest to LC stimulation did not respond strongest to natural arousal, and vice versa for other astrocytes.

      The authors seem to favor diversity of astrocyte responsiveness as an explanation, but also mention differences in LC activation pattern and distance of individual astrocytes to NAergic nerve terminals. Therefore, this warrants a careful consideration of a critical aspect of the experimental design. The authors delivered Ca2+/NA sensors as well as the optogenetic tools via AAV. While Figure 1 Supplement 3 suggests that most LC neurons were transduced, AAV transduction will almost certainly lead to a diversity in copy numbers per cell. On the receptor side, this can lead to an artificial diversity in Ca2+ response detection sensitivity among individual cells, but more importantly, for the LC, this could account for a different pattern of activation by optogenetic stimulation compared to activation by natural arousal. Such a problem would remain unnoticed with the currently presented matching of optogenetic and natural arousal stimulations of LC using population NA sensor signals (Figure 1, fiber photometry).

      Major suggestion:

      A critical experiment to test for this caveat would be to ideally express the NA sensor in astrocytes (due to their space-filling process arborizations and direct response to NA; but expression in neurons, as present, would work as well) and study the spatial pattern of NA release using two-photon microscopy, comparing multiple days and LC stimulation by optogenetics versus natural arousal. In case these experiments revealed nonuniform NA signal patterns, stable over days, but different when caused by optogenetic stimulation versus natural arousal, it would possibly shift the interpretation of the astrocyte response patterns towards depending mainly on NA release rather than diversity in NA responsiveness. Such a finding would be consistent with studies that compared arousal-mediated Ca2+ dynamics in NAergic terminals and Bergmann glia in the cerebellum (PMID: 36790089). On the other hand, in case these added experiments revealed similar NA release patterns in response to optogenetic stimulation versus natural arousal, then the presented findings would convincingly represent a biological phenomenon.

      Minor suggestion:

      Using "movement" as a proxy for arousal is very appropriate. To avoid the misunderstanding that different phenomena have been studied, it may be useful to acknowledge that early studies of noradrenergic signaling to astrocytes have found that speed of locomotion does not correlate well with astrocyte Ca2+ responses, and electromyographic signals have been used as a "proxy for movement" (PMID: 24945771).

    1. Reviewer #2 (Public review):

      The study by Chen, Deng et al. aims to develop an efficient viral transneuronal tracing method that enables retrograde tracing in larval zebrafish. The authors utilize pseudotyped rabies virus that can be targeted to specific cell types using the EnvA-TvA system.

      Pseudotyped rabies virus has been used extensively in rodent models and, in recent years, has begun to be developed for use in adult zebrafish. However, compared to rodents, the efficiency of spread in adult zebrafish is very low (~one upstream neuron labeled per starter cell). Additionally, there is limited evidence of retrograde tracing with pseudotyped rabies in the larval stage, which is when most functional neural imaging studies are conducted in the field. In this study, the authors systematically optimized several parameters for rabies tracing, including rabies virus strains, glycoprotein types, temperatures, expression construct designs, and the elimination of glial labeling. The optimal configurations developed by the authors are up to 5-10-fold higher than more commonly used configurations.

      The results are compelling and support the conclusions.

    1. Reviewer #1 (Public review):

      Summary

      The authors apply dynamic representational similarity analysis (dRSA), a method introduced in de Vries and Wurm 2023, to source-reconstructed MEG data from 40 participants who viewed ballet dancing sequences under three conditions: normal viewing, up-down inversion, and temporal piecewise scrambling. In normal viewing, they replicate their previous finding of a hierarchical pattern of leading-edge neural representations, with view-invariant body motion represented earliest in time (around 500 ms before the corresponding stimulus state), followed by view-dependent body motion (around 200 ms) and pixelwise motion (around 150 ms). Inversion selectively attenuates the leading-edge representation of view-invariant body motion while enhancing view-dependent body motion. Scrambling abolishes all leading-edge motion representations and instead increases post-stimulus representations of body posture. The authors interpret these findings as evidence that biological motion perception relies on a hierarchy of priors operating within a predictive-processing framework, with inversion specifically disrupting holistic priors and scrambling disrupting kinematics priors.

      Strengths

      The empirical work is careful and technically ambitious. The dRSA framework introduced in the 2023 paper is a useful methodological contribution to the study of dynamic neural representations, and the present manuscript extends it in well-motivated directions. The dataset is substantial: 40 participants, source-reconstructed MEG, three within-subject conditions. The replication of the 2023 normal-condition findings in an independent 40-subject sample is solid, which is increasingly rare and welcome in the field. The inversion and scrambling manipulations are well-motivated, and the conditions are matched on stimulus identity. Principal component regression is used appropriately to handle the genuine challenge of correlated and autocorrelated stimulus features, and the authors validate this choice through simulations. Eye position is included as a covariate and successfully regressed out, addressing a common confound in MEG decoding work. Behavioral catch trials demonstrate that participants attended to the stimuli across conditions. Both frequentist and Bayesian statistics are reported with appropriate corrections for multiple comparisons. The inversion result, in particular, is striking, and the asymmetry between view-invariant and view-dependent representations is informative.

      Weaknesses

      The central interpretive step in the manuscript treats a negative-lag dRSA peak as direct evidence for active hierarchical predictive inference. The data are equally consistent with at least three other accounts that the manuscript does not engage with, and the conclusion is therefore stronger than the data support.

      First, the leading-edge dRSA signature is a natural consequence of nonlinear temporal integration of autocorrelated stimulus features. A long line of work from the Winawer and Grill-Spector labs (Zhou et al. 2018, Zhou et al. 2019, Stigliani et al. 2017, Kim et al. 2024) has established that the human visual cortex implements compressive temporal summation with delayed divisive normalization and that temporal integration windows progressively increase from early to higher visual areas. A nonlinear-summation response to an autocorrelated feature encodes deviations from the recent baseline. For smooth trajectories, this is essentially a local derivative, and the derivative inherits the trajectory's leading edge as a free consequence - no predictive machinery required. The integration-window hierarchy that Kim et al. (2024) recovered from voxelwise spatiotemporal pRFs maps onto the 150 / 200 / 500 ms hierarchy reported here almost one-for-one. That alignment is unlikely to be coincidental and deserves explicit treatment.

      Second, the experimental design places participants firmly in the regime where Dayan's successor representation (SR) predicts that the brain holds a precompiled associative cache of trajectory structure. Each unique sequence is presented approximately 47 times across the experiment. An SR in Dayan's original formulation is a precompiled lookup table, not an online inference engine - querying it during familiar trajectories produces leading-edge representations through passive associative retrieval, mechanistically distinct from active prediction despite producing similar signatures. The senior author's own lab has demonstrated SR-like representations in V1 (Ekman, Kusch, de Lange 2023 eLife), but this paper is not cited or engaged with in the present manuscript despite its direct relevance.

      Third, the canonical computational model of biological motion perception (Giese and Poggio 2003 Nat Rev Neurosci) is a fully feedforward template-matching architecture that predates the predictive-coding framing of biological motion. It accommodates the inversion effect (templates tuned to upright statistics), the hierarchy of timescales (graded leaky integrator time constants), and the scrambling effect (broken sequence-neuron activation) without invoking generative models or prediction errors. The manuscript cites Giese-tradition work for the inversion-effect literature but does not engage with the model itself, even though it is the field standard.

      The inversion result, while empirically striking, has a simpler interpretation than the one offered. Inversion makes viewpoint-invariant body computation fail because the underlying machinery is tuned to upright body statistics. A weaker representation produces a weaker dRSA signature at every lag, including the leading edge - no appeal to priors in the active-inference sense is required. The view-dependent enhancement under inversion fits this reading naturally: when viewpoint abstraction fails, processing falls back to viewpoint-specific representations that remain extractable. The manuscript implicitly acknowledges this when it states that "predictions were channeled to the level at which prediction was still possible," but does not notice that this concession softens the strong predictive-coding inference.

      The scrambling result is internally awkward on the predictive-coding framing. The paper acknowledges that pixelwise motion prediction should, in principle, survive 200-500 ms scrambled segments (typical latency around 150 ms) but reports that it does not. The proposed save - that segments are "too short to start up prediction" - undercuts the framework, since by the same logic, most of normal viewing would also be pre-prediction. A cleaner reading is that scrambling destroys the temporal autocorrelation of stimulus features, which is the prerequisite both for nonlinear-summation neural responses to produce leading-edge representations and for SR-style associative retrieval to operate.

      A further concern is that the experimental design and analysis pipeline are structurally biased toward producing the cleanest possible predictive signature. The 14 stimuli are repeated extensively, and trials are averaged across repetitions before dRSA is computed, filtering out exactly the variability that would distinguish online prediction from amortized retrieval. The 2023 paper reports a control comparing the first and last thirds of the experiment, but this test is in the post-saturation regime for any plausible associative-learning rate and does not actually adjudicate the question. A first-exposure or first-run analysis would be diagnostic. Finally, the behavioral task changed between the 2023 paper and the present manuscript. The earlier paradigm asked participants to recognize the current motion ("arms moving up?"), while the present paradigm asks participants to judge whether an occluded video continues correctly. The latter explicitly demands prediction. This change transforms the experimental context from naturalistic viewing into one that actively incentivizes predictive engagement, potentially inflating the very signatures the paper interprets as spontaneous prediction.

      The 2023 Nature Communications paper actually navigated these interpretive questions more carefully than the present manuscript does, explicitly stating that the approach "does not provide conclusive evidence for predictive processing/coding theory but leaves the door open for related theories such as adaptive resonance or Bayesian inference without predictive coding." The current manuscript would benefit from restoring that epistemic discipline. The data and methods are valuable; the interpretive frame is overstated relative to what the evidence supports.

      Impact and utility

      The dataset and dRSA framework are useful contributions to the study of neural representation of dynamic stimuli, and the inversion and scrambling conditions open productive lines of inquiry. The interpretive over-commitment to predictive processing risks limiting the paper's reach into adjacent literatures - temporal integration, successor representations, template-matching biological motion models, encoding-model approaches - where the findings could land productively. With a more pluralistic interpretive frame, this work would speak to a substantially broader audience and connect more naturally with existing mechanistic accounts of dynamic visual processing.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, de Vries and colleagues apply successful probabilistic inference and predictive coding frameworks to the question of biological motion perception. In contrast to most studies of predictive processing in humans, which rely on the presentation of discrete events, they instead aimed to track continuous predictions in the context of more naturalistic inputs such as biological motion. In these settings, the authors have previously demonstrated an inverted temporal hierarchy of prediction whereby high-level movement features (e.g., view-invariant body motion) are predicted earlier than lower-level ones (e.g., pixelwise motion). The specific question they set out to address in this manuscript is whether these predictions derive from prior beliefs about the biological and physical organization of biological movements versus the local extrapolation of motion from past observations.

      The authors used anatomical MRI-driven source reconstruction of MEG activity recorded from human participants watching either normal, vertically-mirrored, or temporally scrambled movies. They then aimed to correlate activity in preselected ROIs with summary representations of these movies based on different visual features at 3 different hierarchical levels using RSA. Doing so, they could confirm that predictive processes could be identified prior to the change in the stimulus and organized anatomically along the visual cortical hierarchy. Critically, they report that mirrored movies selectively disrupted the highest processing level while the lowest level remained largely unaffected. Interestingly, the predictions at the intermediate level were boosted in mirrored movies, suggesting a possible channeling of predictions at this level when highest-level predictions are unavailable. Finally, disrupting all predictive aspects with the scrambled movies entirely abolished predictions at all levels, with signals mainly reflecting reactive bottom-up processing of inputs.

      In sum, biological motion perception relies on a tight coordination of multi-level predictions based on both motion-related holistic and kinematics priors.

      Strengths:

      Overall, this is a very strong manuscript, with the text being clearly written. I liked the fact that the authors not only compared responses to normal videos against the same videos flipped upside-down, but also to temporal piecewise scrambling of that same video, allowing to identify the respective roles of holistic motion priors vs. temporal predictions. Of course, more work is needed to tease apart what key quantities are represented in these holistic priors. For now, the authors argue that they likely combine prior beliefs about the biological organization of bodies, such as the likely angle of joint movements, and about the physics of reality, such as gravity. Further work teasing apart these aspects would be interesting to read!

      All analyses seem well executed and, while some aspects of the presentation of results could be slightly improved (see below), the manuscript is very clear and the conclusions are supported by the data. Finally, I liked the words of caution the authors added to the discussion. For instance, while they largely used negative vs. positive latency as a proxy for top-down vs. bottom-up processing respectively throughout the manuscript, they also accurately acknowledge that predictive computations could also modulate processes at positive lags, through, for instance, latency modulation.

      Weaknesses:

      The main aspect of the work I was left to struggle with is this idea that priors can be read out directly from large patterns of activity rates as measured with MEG. While some past experimental work does support this view, theoretical proposals also suggest that one benefit of predictive coding lies in its computational and energy-efficient properties, whereby only novel, unpredicted aspects are encoded in the rate of neural activity. Some other research lines, for instance, focusing on silent working memory, also report the brain's ability to store important computations in ways that are not reflected in costly increases in overall activity. The authors do not really unpack why they expect to see predictions to be encoded in such a way in the first place. They also do not discuss what that implies in terms of neural organization and whether other aspects of neural activity (e.g., oscillations, synaptic weights) could subtend predictive processing in this context. At the end of the day, this activity change is clearly there in the data, so that's totally fine to interpret that; it just would be helpful to unpack what such an implementation of prior beliefs would imply in terms of neural organization.

      The other weakness point I see is the little consideration for behavior throughout the paper. Behavior is indeed mostly treated as a negative control, ensuring that differences between conditions at the neural level do not follow from different behavioral strategies or other peripheral factors. Critically, task design nicely incorporates two types of tasks: one that is related to motion (occlusion of movement) and one that's independent of it (color change of fixation cross). Yet, these conditions are not directly compared at the neural level. It would be useful to see whether the neural signatures of prediction are largely independent from the ongoing task or whether behavior gates the types of priors and prediction processes that are applied to incoming sensory inputs. Moreover, the text says that "neither in accuracy nor in reaction time was there a significant difference between conditions", yet significance stars in Figure 1d seem to suggest there is a difference in the fixation cross task. What am I missing? If there is indeed a difference in overall performance, can the results (esp. the reduced dRSA correlation strength in normal < inverted < scrambled movie) be interpreted in terms of a multi-tasking cognitive cost?

      I also have some other minor questions and comments:

      (1) In this task situation, prediction does not only come in the continuous domain but also relies on a mental simulation model, in particular in the occlusion task. However, corresponding literature, notably the work by Shepard & Metzler (1971) on mental rotation (as well as follow-ups), is not mentioned here, I believe. Could the authors perhaps mention this if they think that's relevant (if not, feel free to ignore).

      (2) I'm concerned that the novelty of dynamic RSA as explained at lines 56-64 might appear slightly exaggerated. After all, isn't it just a generalization of matrix correlation in model and brain time domains? (Again, feel free to ignore if I misunderstood.)

      (3) How do authors explain that high-level motion prediction is still significantly larger than zeros (correct?) in the inverted movie condition? Shouldn't it be entirely abolished?

    3. Reviewer #3 (Public review):

      Summary:

      The authors investigate whether the brain's predictive representation of observed biological motion depends on holistic priors about body structure or on kinematic priors about motion continuity. The manuscript applies dynamic representational similarity analysis to MEG data from a large number of participants viewing ballet sequences under three conditions: normal, upside-down inverted, and temporally scrambled into short epochs.

      Strengths:

      The study reports that inversion selectively attenuates predictions of view-invariant body motion and enhances predictions of view-dependent body motion, while leaving low-level pixel-wise motion prediction unaffected. Further, scrambling eliminates predictive motion representations at every level and instead produces stronger post-stimulus representations of body posture, with view-invariant posture also delayed. The pattern across the two manipulations is internally consistent, holds across both peak magnitude and peak latency measures, and is also supported by a neural-to-neural dynamic representational similarity analysis (dRSA) analysis between normal and inverted conditions. The principal component regression pipeline is validated through simulations showing that it recovers the model of interest while suppressing covarying models. In particular, the inversion result provides strong evidence that high-level predictions of biological motion depend on holistic priors while predictions at lower levels do not, and the finding that disruption at the top of the hierarchy does not propagate down is informative for predictive processing accounts that assume a more cascading architecture.

      Weaknesses:

      The interpretation of the scrambling result is the main caveat of the manuscript. The claim that low-level motion prediction depends on kinematic continuity rests on the absence of pixelwise motion prediction in the scrambled condition, but the 200 to 500-ms segments may not be sufficient for prediction to develop, as the authors also point out. Without a parametric manipulation of segment length, it is difficult to distinguish a genuine dependence on kinematic priors from a floor. The interpretation of increased post-stimulus posture representations as prediction errors is also somewhat indirect, since a positive latency does not rule out potential top-down modulation/factor.

    1. Reviewer #1 (Public review):

      Summary:

      Dhillon and Lewis present an optical approach to record single CRAC channel activity, overcoming the long-standing barrier imposed by the channel's extremely small unitary conductance. By fusing HaloTag to Orai1, labeling with JF646-BAPTA, and combining TIRF microscopy with whole-cell voltage clamp (Patch-TIRF), the authors achieve genuine single-channel resolution. A central contribution is the recognition that JF646-BAPTA undergoes reversible photophysical blinking that can be readily mistaken for gating events. The authors exploit the multi-dye labeling of hexameric Orai1, combined with voltage-clamped definition of open and closed fluorescence levels, to distinguish true gating transitions from blinks. The result is the first kinetic characterization of single CRAC channel openings activated by STIM1, reporting multiple open and closed states with durations from about 0.1 s to tens of seconds, predominantly high open probabilities ({greater than or equal to} 0.7), and an unexpected population of "silent" channels that co-localize with STIM1 but show no detectable activity over the observation window.

      Strengths:

      The work is technically rigorous, and the controls are appropriate. The integration of patch-clamp voltage control with TIRF imaging is a thoughtful methodological choice that defines the open- and closed-channel fluorescence reference levels with precision, providing a quantitative framework that the field has lacked. The use of the non-conducting Orai1-E106A mutant as a specificity control (Figure 4C) is exactly the right experiment, and the demonstration that JF646-BAPTA signals require Ca²⁺ flux through Orai1 itself anchors the entire approach. The identification and characterization of JF646-BAPTA blinking (Figures 2 and 3) is a significant contribution in its own right. The authors show clearly that the dye exhibits long-lived dark states and that transitions to zero fluorescence, rather than to a finite calcium-free baseline, are diagnostic of blinking rather than channel closure. This caveat has immediate implications for the interpretation of recent work using the same dye on other calcium-permeable channels, and will recalibrate the broader field of HaloTag-based single-channel optical recording. The kinetic analysis itself reveals something that was previously inaccessible: seconds-long open times, multi-state gating behavior, and a population of channels that co-localize with STIM1 yet remain electrically silent. These findings are physiologically meaningful and would not have been detectable by macroscopic electrophysiology. Overall, an outstanding study.

      Weaknesses:

      The manuscript would benefit from a small number of additional analyses of the existing data and modest refinements to the presentation. The discrete-channel interpretation of the intensity histogram in Figure 6C, the open probability distribution in Figure 8C, and the assignment of the "silent" channel population are all interesting and likely correct, but each rests on assumptions that the authors are well positioned to test directly using data already in hand. Brief additional discussion of the dynamic range of JF646-BAPTA in situ and of how the temporal resolution of the recordings shapes the inferred kinetic model would also help readers calibrate the findings.

      None of these points challenges the central claims of the paper, and none requires new experiments.

    2. Reviewer #2 (Public review):

      Summary:

      Dhillon and Lewis use the enhanced brightness of the new calcium indicator dye JF646-BAPTA attached to Orai1-bound HaloTag to identify single CRAC channel events detected as [Ca2+]i fluctuations rather than currents. This enables them to detect Orai1single channel kinetics of permeation, overcoming the currently unmeasurable single channel CRAC conductances (~ 20-40 fS). TIRF microscopy narrows the z-section and improves calcium event localization.

      JF646-BAPTA reversibly blinks between fluorescent and non-fluorescent states, complicating single-channel detection. Blinking occurs both in permeabilized cells with saturating Ca2+ and in intact cells at physiological [Ca2+]i. Using voltage clamp and TIRF imaging, CRAC gating events were distinguished from blinking by analyzing fluorescence responses to voltage changes.

      Hyperpolarization (-100 mV) increases fluorescence, indicating channel opening. Responses blocked by La3+ confirm specificity for Orai1, while minimum fluorescence at +30 mV corresponds to closed channels. Dynamic range and response kinetics help differentiate genuine gating from blinking artifacts. Long channel openings (seconds to tens of seconds) are observed, with most open times around 1.2 seconds. Longer openings (tens of seconds) are present but difficult to sample. Silent channels constitute 11% of puncta.

      The paper carefully examines a new method to sample CRAC kinetics, which should enable further mechanistic studies of STIM control of ORAI and modulation by other signaling components such as calcineurin. Development of bright nonblinking dyes or dyes whose blink rates are directly correlated with a calcium-binding site will enhance this route of investigation.

      Comments:

      This is an excellent methodological study, rigorous and thorough. I wondered whether La3+ alone could alter JF646-BAPTA blinking, but the authors show that JF646-BAPTA exhibits reversible transitions to a non-fluorescent state (blinking) under both Ca2+-saturated and physiological conditions, independent of channel activity or the presence of La3+.

      Strengths:

      A novel method providing additional tools to study store-depletion induced Ca currents mediated by Stim-Orai family members.

      Weaknesses:

      Limited by blinking dyes, the only ones currently sensitive enough to measure the calcium fluxes through single channels.

    3. Reviewer #3 (Public review):

      Summary:

      Previous work from the Cahalan lab used fluorescent Genetically Encoded Ca2+ Indicators (GECI), like GCaMP6f, tethered to the N- or C- terminus of Orai1 to monitor CRAC channel optical signals (Dynes et al., PNAS 2016 PMID: 26712003; J Gen Physiol 2020 PMID: 32589186; PNAS 2023 PMID: 37729200). In this study from the Lewis lab, the HaloTag system enables C-terminal labeling of Orai1 with a reactive JF646-BAPTA loaded into cells. The article raises two key issues with the Ca2+ indicator probe that may limit potential applications: probe loading conditions and blinking.

      Making Sense of Probe Probe-lems:

      This is a three-component system: the hexameric Orai1 channel, the Halo tag, and the Ca2+ indicator (four components if you count the GFP- or mCherry-tagged STIM1 in the endoplasmic reticulum membrane that activates the plasma membrane Orai1 channel). The Orai1 channel, tagged with the Halo protein, appears to function normally, judging from the characteristic inwardly rectifying Ca2+ current first observed in T lymphocytes (Lewis and Cahalan, Cell Regulation 1989 PMID: 2519622). One problem is to find a condition for indicator dye loading that results in complete and uniform labeling with the covalently linked JF646 indicator. JF646-BAPTA is a far-red fluorescent indicator related to BAPTA, with a Kd of ~150 nM. The esterified form can be loaded into cells, as is routinely done for Ca2+ indicators like fura-2 or fluo-4. Ideally, to monitor local Ca2+ in the cytosolic nanodomain of the Orai1 channel, the indicator should react with each and every Halo tag of the hexameric channel. The authors assessed published methods by varying the exposure time to the JF646-BAPTA-esterified probe. The authors then used green JF552 labeling following red JF646-BAPTA loading to assess the completeness of labeling. Even overnight incubation of Halo-tagged cells was not sufficient. The addition of Pluronic treatment for 1 hr improved labeling, and a standard condition was adopted. Under this condition, no additional labeling with the green JF552 was seen, implying complete labeling with JF646-BAPTA. However, even with complete labeling, several additional effects might reduce the effective signal-to-noise, which is lower in these studies than expected from in vitro measurements - for example, if the JF646-BAPTA molecules are incompletely de-esterified, or if there is quenching between the closely spaced probes attached to the channel hexamer.

      A second, more serious problem analyzed by this article is that the JF646-BAPTA probe blinks on and off spontaneously, making it problematic to monitor true single-channel events in which the channel open state is assessed by the fluorescent probe. The authors distinguish blinking from channel-gating events by carefully noting the residual level of fluorescence in the absence of Ca2+ influx. Blinking events occur in bursts that reduce fluorescence transiently to zero, whereas the closed channel labeled with JF646-BAPTA retains a low level of fluorescence (~20%). To circumvent the blinking issue, the authors use whole-cell patch recording, in conjunction with optical recording (Patch-TIRF). This allows channel-gating events to be identified by step-wise changes in fluorescence due to Ca2+ entry upon hyperpolarization to -100 mV, above a baseline level of fluorescence at +30 mV, which the authors presume represents the closed channel level of fluorescence. Irreversible photobleaching is an additional issue, limiting the recording times to less than 1 minute.

      Visualizing Orai1 Single-Channels:

      With the blinking problem circumvented, at least in part, the authors uncovered a wide variety of single-channel events. Cells with low expression levels of Orai1 revealed 0-3 active Orai1 channels per STIM1 puncta. The range of gating behavior at the single-channel level is one of the revelations in this study. A substantial fraction (11%) of puncta contained "silent" channels that did not open (detected by the non-zero level of baseline fluorescence for closed channels). At the other extreme, some channels remained open for tens of seconds. On average, channels that opened and closed stochastically exhibited a bi-exponential distribution of bright states (open channels), with a major component of fast events (92 ms) and a minor component of slower ones (1190 ms), as well a single-exponential distribution of dark states (closed channels), and open probabilities >0.7. Channel open/closed times and the high open probability of active Orai1 channels seen here reinforce previous work based on analysis of CRAC current fluctuations in whole-cell recording, and optical single-channel recording using a different genetically encoded Ca2+ indicator, G-GECO1, tethered to Orai1 (Prakriya and Lewis, J Gen Physiol 2006 PMID: 16940559; Dynes et al., PNAS 2016 PMID: 26712003).

      Expression levels for single-channel optical recording must be low; accordingly, puncta contained only 0-3 active channels. However, under conditions of high STIM1 and Orai1 expression, conventionally used to investigate channel function, as in Figure 1, cells with large currents express many thousands of active channels. The number of active channels per cell can be calculated by dividing the peak current (~-100 pA) by the voltage (-100 mV); this corresponds to a whole-cell conductance (G) of ~1 nS (conductance is measured in Siemens). The single channel conductance (gamma, too low to detect electrically) is estimated by noise analysis to be 20-40 fS. Thus, the number of active channels is given by G / gamma corresponding to a range of > 25,000 - 50,000 open channels per cell. Under similar conditions of high STIM1/Orai1 co-expression in HEK cells, individual Orai1 channels were visualized at high density in puncta by freeze-fracture electron microscopy (Perni et al., PNAS 2015 PMID: 26351694), revealing puncta packed with Orai1 particles corresponding to hundreds to >1000 channels per punctum. Measuring the center-to-center distances between particles in puncta revealed two peaks in a distribution of inter-particle lengths: 9 nm (consistent with the approximate width of the Orai1 channel hexamer) and 15 nm (possibly due to two adjacent Orai1 channels held together by intervening STIM1 dimers).

      Strengths:

      The authors do an excellent job of analyzing and discussing probe artifacts that can confound measurements at the single-channel level. On the technical side, we thank the authors for including a photon 'budget' for their imaging experiments by including: the conversion factor from camera intensity units (c.u.) to photoelectrons, cell background fluorescence levels, and nominally Ca2+ free single channel fluorescence levels. One parameter missing from the list is the size of the region of interest used for channel recording. We expect the intensity measurements provided in the channel traces to correspond to mean ROI intensity levels. Upon knowing the ROI size in pixels, the magnitude of fluorescent signals could then be calculated in photons. Taken together, these values will aid comparisons to previous work and help guide subsequent researchers doing their own optical recording.

      The most important finding of this study is the ability to analyze single-channel properties of active Orai1 channels using the HaloTag approach. By direct measurement, the authors confirm previous work that there are at least two open states and that the CRAC channel open probability is greater than 0.7.

      Like any good study, this work suggests opportunities for further work. At the chemistry level, one focus should be the development of new probes that don't blink and have lower affinity for Ca2+ to circumvent unwanted responses to global Ca2+ signaling. Far-red probes like JF646-BAPTA have the advantage of reduced scattering for in vivo imaging applications. At the level of channel molecular function, the results pave the way for unraveling mechanisms of channel gating, such as the requirement for STIM1 binding to activate sub-states of Orai1, and how the channel undergoes Ca2+-dependent inactivation. At the cellular physiology level, localized Ca2+ probes should help to clarify mechanisms that couple to changes in gene expression and reveal Ca2+ signaling in subcellular structures, including dendritic spines. As a nice proof of principle, Halo-tagging enabled Ca2+ signals to be measured in primary cilia (Deo et al., J Am Chem Soc 2019 PMID: 31430138). Future users of HaloTag and GECI Ca2+ indicators will need to confront the issues (probe-lems) at the single-channel level that are carefully raised and analyzed in this article.

      Weaknesses:

      The major confounding issue identified here is probe blinking. The authors find a way to circumvent the issue, but not to prevent it. Is it triggered by high laser light intensity? Do the six JF646-BAPTA molecules tagging a single Orai1 channel exhibit quenching or correlated blinking?

      Which type of probe is better for understanding more about the CRAC channel function? It is difficult to evaluate the pros and cons of the HaloTag and GECI approaches without a side-by-side comparison under identical conditions (except for the probe, obviously). With respect to Ca2+ affinities, higher Kd values (lower affinity) are probably better. JF646-BAPTA has a relatively low Kd value (150 nm) compared to Orai1-GCaMP6f (620 nM in situ), which may account for the saturation of optical signals at potentials more negative than -75 mV in this study. In contrast, saturation did not occur at negative potentials with Orai1-GCaMP6f in the study by Dynes et al., 2020. Lower affinity also makes the probe more resistant to unwanted signals from global increases in Ca2+. With respect to response kinetics, the finding that JF646-BAPTA has faster Ca2+ binding and unbinding kinetics than GECIs in Deo et al., 2019, occurred before publication of the jGCaMP8 series indicators in Y. Zhang et al., Nature 2023. Kinetic measurement of Orai1-jGCaMP8f fusions was reported in Dynes et al., PNAS 2023, and these measurements were performed using the same patch-TIRF approach as the present manuscript. While photoinactivation of jGCaMP8f fused to Orai1 interfered with kinetic measurements, Orai1-jGCaMP8f V203Y (a mutant with greatly reduced photoinactivation) exhibited a tauon of 10 ms and tauoff of 15 ms, roughly twice as fast as the values reported for Orai1-HaloTag-JF646-BAPTA in the present manuscript. The manuscript text comparing Halo-Tag kinetics with GECI should be revised accordingly.

      The authors suggest that single-channel events reported previously for Piezo1 channels (Bertaccini et al., Nat Comm 2025 PMID: 40593468) may be due to probe blinking. However, that study included two critical controls that demonstrate that signals reflect bona fide channel activity rather than blinking artifacts. Notably: (1) treatment with channel activator Yoda1 increased bright-state occupancy (Figure 3C - 3G), and (2) increasing channel open probability by administering a mechanical stimulus increased bright-state occupancy (Supplementary Figure 13).

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript deals with the ability to identify material hardness from the vibrations induced by single light taps on that surface. Psychophysical tests of human perception under varying conditions of modified fingertip compliance and/or externally imposed vibrations demonstrated that total spectral energy was the main determinant of perceived hardness and that perception of increased hardness can be induced by adding external vibration at the time of contact.

      Strengths:

      The experiments are well-reported and the data potentially useful, but much narrower than is implied by the (provisional) title and abstract. Their potential application to tactile perception in virtual reality seems promising, but the largely unexplored need for synchronization with physical contact and modulation with velocity and force of that contact seems likely to complicate proposed applications to prosthetics and telerobots.

      Weaknesses:

      (1) The authors have confused discriminability with perception. The sense of touch is derived from several different types of mechanoreceptors and processed into several dimensions of haptic perception. The fact that subjects can rank surface material hardness correctly when requested to focus on that alone does not mean that they rely on total spectral energy normally or that total spectral energy is normally perceived as surface material hardness, as opposed to other aspects of materials, such as their surface texture. They have not considered the effects of more complex features of most surfaces, such as curvature, lamination or other exploratory movement strategies besides light taps.

      (2) Discussion section. Lines 262-264 are overstated. Dynamic spectral energy can be used to modify perceived hardness when exploratory movements are limited to taps that are unlikely to generate any other useful cues, such as skin deformation or proprioception. The authors have not explored what happens if there actually are conflicting cues in non-vibratory modalities. There are many different examples from sensory psychophysics of percepts that arise from taking the mean of conflicting cues (e.g. stereophonic sound localization) and others that arise from a dominant modality (e.g. self-motion perception from visual flow fields, vestibular signals and proprioception).

      The authors have ignored the substantial literature on artificial tactile sensors and their ability to identify texture, hardness and other haptic properties of materials. These have emphasized the importance of the many types and parameters of exploratory movements, which were loosely specified and not quantified in these studies.

      See:

      Li, Q., Kroemer, O., Su, Z., Veiga, F. F., Kaboli, M., & Ritter, H. J. (2020). A Review of Tactile Information: Perception and Action Through Touch. Ieee Transactions on Robotics, 36(6), 1619-1634. doi:10.1109/tro.2020.3003230.

      Fishel, J. A., & Loeb, G. E. (2012). Bayesian exploration for intelligent identification of textures. Frontiers in Neurorobotics, 6(4). doi:10.3389/fnbot.2012.00004

      Fishel, J. A., & Loeb, G. E. (2012). Sensing Tactile Microvibrations with the BioTac - Comparison with Human Sensitivity. Paper presented at the IEEE/RAS-EMBS International Conference on Biomedical Robotics and Biomechatronics, Rome.

      (3) Introduction (lines 23-31) and Discussion (lines 296-298). The notion that tactile receptors are "frequency tuned" is something of a straw man. Different receptor types are preferentially sensitive to different broad spectral bands, but it has long been known that they can be driven by larger stimuli outside those bands and that humans have very limited ability to discriminate actual frequency of tactile vibration (as opposed to auditory pitch), particularly for frequencies greater than the maximal one-to-one firing rate of neurons (~200-300 Hz). Conversely, fine onset timing of spikes in tactile afferents appears to be available from brief contact taps to identify features other than hardness; see:

      Johansson, R. S., & Flanagan, J. R. (2009). Coding and use of tactile signals from the fingertips in object manipulation tasks. Nature Reviews Neuroscience, 10, 345-359.

      Pruszynski, J. A., Flanagan, J. R., & Johansson, R. S. (2018). Fast and accurate edge orientation processing during object manipulation. eLife, 7, e31200.

      (4) Methods section. The Lofelt L5 actuator used to apply vibrations to the fingernail is rather large for use on multiple fingers of a haptic display. Do the authors know of any more compact technology with the requisite power and frequency response? One of the most useful contributions of this paper is to suggest that those details matter relatively little, which opens up more compact technologies such as piezoelectric actuators.

      (5) Methods section. It is good that headphones were used to block and mask audible tapping sounds, which are known to be capable of generating tactile illusions (Jousmäki, Veikko, and Riitta Hari. "Parchment-skin illusion: sound-biased touch." Current biology 8.6 (1998): R190-R191). But that suggests that hardness might be signalled by precisely timed acoustic stimuli, which would be much easier to deliver than fingertip vibration.

    2. Reviewer #2 (Public review):

      This paper aimed to demonstrate that total spectral energy alone is sufficient to drive hardness perception and material identification. Through five user studies, they tested materials ranging in stiffness and with covered fingers to support their claim. Using a spectral energy compensation framework, they concluded that total spectral energy alone, regardless of frequency content, was sufficient to support material hardness percepts. However, it should be noted that all experiments used a tapping procedure, which is not the standard exploratory procedure when judging material hardness. A tapping method also selectively enhances vibratory feedback while limiting others. This fundamentally limits the scope of their work, and assessing their claim on generalizability would require further experimentation.

      Some additional clarification and extension on the experiments are also suggested:

      (1) According to Lederman and Klatzky (1987), pressure, and not tapping, is the exploratory procedure humans use to judge hardness. And during tapping instead (as used in all experiments), it is expected that the dominant cue available to the user comes from vibrations, as other mechanical cues, such as skin stretch, are limited. These vibrations could serve as a proxy for hardness, as claimed by the authors, but it is unclear if the participants are basing their evaluations on perceived hardness or vibration intensity. A more fundamental question that needs to be answered to support the paper's claim is whether a single tap is sufficient for conveying a material's hardness. To better support their claim, I recommend that the authors include an experiment using participants' bare fingers with materials of the same modulus but different damping coefficients. These materials would produce different vibration signals when tapped, but are equivalent in hardness.

      (2) The setup text for experiment 4 does not match the results. Results suggest that a finger covered with a bubble and touching a soft material was used (i.e. dual compliance), but the setup describes otherwise. The authors should clarify this and confirm that this is different from experiment 2.

      (3) As silicone, foam, and rubber can have very similar or different hardness depending on the specific material used, please report the hardness of each material tested (Shore or Young's modulus) to better understand the range of stiffness tested.

      (4) In the "materials grouping and selection" section, it states that a pilot study suggested hard materials tended to be perceptually similar while softer materials were easily distinguishable. However, this contradicts the results in experiment 1. The authors should expand on the details of the pilot study and address the inconsistency between its findings and experiment 1.

      (5) The methods section suggests that individual recordings for each material were performed before the experiment. Please clarify if this is correct, or if a single signal for each texture was used across all participants. Additionally, were the participants' tap pressure controlled during either the recordings or in the experiments? If not, how do the authors account for the difference in intensity that would be generated due to different tapping pressures across participants and trials?

    1. Reviewer #1 (Public review):

      Summary:

      This study develops a novel theory to account for various aspects of dopamine signals, particularly dopamine ramps. They propose that dopamine reward prediction error (RPE) signals are generated by a dual-process learning system in which values inferred by a model-based system enter the RPE asymmetrically into the update target but not the prediction (equation 6). The work offers specific, mechanistic explanations of Krausz et al. (2023) and Guru et al. (2020), Kim et al. (2020) by maintaining an RPE interpretation, and presents an alternative to the state-uncertainty account in Mikhael et al. (2022) that doesn't require the asymmetric uncertainty assumption Mikhael needs, using Campbell et al. (2025) in a thoughtful way. The asymmetric-RPE idea is clean and well presented. Overall, this study makes an important contribution to the field.

      Strengths:

      The theory is relatively simple and intuitive. It addresses a long-standing controversy or mystery in the field of dopamine.

      Weaknesses:

      (1) The biggest outstanding question is what V_TD does - letting V_MB drive everything would seem to produce much of the same outcomes in the settings discussed here. The discussion suggests that in situations where there is little contribution of the model-based system, the backpropagating bump is a feature (e.g. Amo et al.). It would be interesting to see if this is a true outcome of the model, potentially by varying the arbitration parameter k. This is an interesting alternative account from eligibility trace explanations of the lack of backpropagating bump in some experimental settings.

      (2) The model-based accounts are quite simplistic, and this should probably be acknowledged - it does help delineate their contribution, but in the model, only the goal-reward value is updated; everything else is a known computation. Perhaps engage more deeply with Sagiv et al?

      (3) The application of Campbell et al. (2025) to push back on Mikhael (lines 253-259) is interesting: if striatum to VTA implements TD via synaptic delays such that V(s_t) is a delayed copy of V(s_{t+1}), then state uncertainty is necessarily shared between the two terms in the RPE, defeating Mikhael's required asymmetry.

      But the same circuit logic creates tension for the dual-process model. It seems they are proposing that the frontal cortex projects V_MB into VTA dopamine neurons (as proposed in 3.1 and the Discussion) and adds to the prediction error derived from the biphasic filtering of value. But the biphasic idea (and data of Campbell et al.) implies that the V(t+1) and -V(t) come from the same source and are proportional. Adding the V_MB term is akin to adding a positive bias, breaking the optimality of the TD error for predicting value and predicting over-learning of cached value. It is worth considering whether V_MB passes through a similar filter - I am not sure if it is fatal if V_MB contributes somewhat to the negative term of the update error.

      (4) A few places where the predicate of the conclusion needs more care. The "normative" framing throughout 3.2 and the Discussion is normative conditional on the architecture already including a separate cached system that needs to converge to the true value function and on a system in which the model based is learnt much faster - see comments about learning rate parameter later.

      (5) Kim et al. is cited heavily as a data source for Figure 4, but is never engaged with as a theoretical alternative, even though Kim et al. explicitly argued that an appropriate state representation makes standard TD compatible with ramps and the teleport responses. That is, Kim et al. is already a TD account of these phenomena, and doesn't require a second learning system. The introduction and Mikhael discussion treat the field as if the choice were between "dopamine = value" (Hamid, Howe, Mohebi) and dopamine = RPE-with-special-conditions (Mikhael, Kato-Morita), but Kim et al.'s framework is also dopamine = RPE. Two specific places this matters: (i) Figure 4 currently demonstrates that the dual-process model reproduces the Kim teleport results, but Kim et al.'s framework also reproduces them - the figure doesn't distinguish the two, and I am not sure the figure gives this message cleanly. (ii) Kim et al. report that ramps develop with training over days; the manuscript should address whether the dual-process model has an alternative explanation for this, especially given the contrast with the Guru result (ramps diminishing with training over a longer timescale).

      (6) The arbitration parameter k is fixed at 0.5 throughout, and the paper acknowledges this is for simplicity, but a supplementary panel sweeping k ∈ {0, 0.2, 0.5, 0.8, 1.0} on the key figures (Figure 1B convergence, Figure 2D ramp dynamics, Figure 3D Krausz updating) would be informative. At k = 0, the model reduces to standard TD; at k = 1, it's effectively V_MB-driven. I think these would be easy to add and help clarify the work this assumption is doing.

      (7) Learning-rate asymmetry needs justification. The story relies on α_MB >> α_TD throughout (α_MB = 0.50, α_TD = 0.01 - a 50× ratio). With α_MB = 0.5, a single rewarded trial moves R[goal] halfway to the new value, which would predict strong dependence of dopamine ramp amplitude on the previous trial's outcome. This is testable in existing data (Krausz et al. should have enough trials to fit the exponential decay constant for trial-history dependence; Guru's swap-session data likewise), and the paper would be strengthened by explicitly deriving and checking that prediction.

      (8) α_MB is dropped to 0.10 specifically for the Krausz simulation without justification in the text - Why? Either the value should be the same as elsewhere, or the paper should explain why Krausz's task requires slower MB learning. It would be good to check the robustness of the Krausz simulation - the test phase is a single set of three trials (t-2 = omission, t-1 = reward, then t = 50% rewarded) after training on a single set of 500 simulated trials (believe only one random seed is used - given the high alpha, varying this set of simulated trials seems important). Also, do they get the other result in Krausz (t-2 = reward, t-1 = omission, t = 50% rewarded)?

      (9) It might be possible to fit the alpha to the Guru and Krausz simulations - this might be informative to show the range over which it varies.

      (10) The Kato and Morita account is cited in the introduction but never really discussed again - it would be good to engage with this a bit more in the discussion. The rejection of the value-based accounts seems to rely primarily on Kim et al., where the value and TDRPE accounts differ, but this could be directly acknowledged, rather than absorbing credit for this into their model.

    2. Reviewer #2 (Public review):

      Summary:

      This paper offers a novel theoretical account of dopamine ramps. The key idea is that the reward prediction error (putatively signaled by dopamine) uses a partially model-based estimate for future value (the prediction target). Because the model-based value estimate emerges more rapidly than the model-free estimate, it inflates the RPE, and this inflation increases with reward proximity - hence ramps. The authors show that this account can explain many aspects of existing data on dopamine ramps across several different studies.

      Strengths:

      Overall, I liked this paper. The idea is interesting and plausible. The paper is well-written and clearly argued. The modeling has been done rigorously.

      Weaknesses:

      My major comments are: (1) it's not always clear which phenomena are uniquely well-explained by this new account vs. earlier accounts; and (2) the limitations of the account are not entirely transparent.

      (1) The paper models some of the studies reported by Kim et al (2020). As was already shown in that paper, a standard TD error could explain the results (although a major limitation of that treatment was that it did not model the recursive effect of RPEs on learning, as discussed in the Mikhael paper). It's not clear if there's additional explanatory value provided by this new account, though, of course, it's good to know that those results are captured by the new account. Likewise, Mikhael et al (2022) already offered an account of their data (somewhat more complex than the standard TD model). Again, it's not clear if there's additional explanatory value provided by the new account (and again, it's nice to see that the model can capture these results). Finally, I found myself wondering whether the Guru et al (2020) result couldn't be explained by a more standard TD model (assuming the value function is sufficiently convex). I don't think it's essential that the new account provides additional explanatory value in every case, but I think it's important to convey to readers what's new and what's not, as well as what aspects of the data require particular kinds of mechanisms to explain. It would be really helpful to see the predictions of alternative TD models in order to make this clearer.

      (2) The Mikhael model was motivated by the puzzle that ramping is observed in navigation tasks (with sensory cues) but typically not in classical conditioning tasks lacking sensory cues. The correction term, derived from normative considerations, explained this discrepancy. It's not clear to me if/how the new account can explain the discrepancy.

    3. Reviewer #3 (Public review):

      Summary:

      This work presents a new hypothesis for why dopamine signals have sometimes been observed to "ramp up" in spatial tasks as rodents approach a location associated with reward. In essence, the hypothesis is that value estimates (i.e., predictions about future rewards) from a model-based system, which may be able to more quickly form such estimates via an inference-like process, can be used to speed up the (relatively slow) learning of such estimates by a model-free system. This is suggested to occur by including the model-based estimate as part of the target towards which model-free estimates are updated in the course of temporal-difference (TD) learning. The early discrepancy between these estimates can be expected to give rise to systematic TD errors - putatively represented in dopaminergic activity - that give rise to dopamine ramps, which are expected to diminish over time as the estimates of both systems converge. The authors show that a model that implements this idea makes predictions about dopamine activity that are a good qualitative match to data from a number of recent experimental studies.

      Strengths:

      The work suggests a normative account for a phenomenon that has persistently troubled the canonical theory of dopamine function. The account is appealing in its elegance and simplicity, and the authors present compelling evidence that it can capture the empirical observations of key recent papers. Another strength of the account is that it readily suggests avenues for future theory development and experimental test, including what the 'best' target estimate should be at any given time, how rapidly one might expect ramps to develop or diminish, and the neural implementation of the proposed algorithm. This is likely to stimulate further theoretical and experimental work in the field.

      Weaknesses:

      One aspect of dopamine "ramps" that was troubling from a theoretical standpoint was their apparent persistence over time. Given the authors' prediction that these would disappear over time in a stable environment and the supporting evidence they cite (from Guru et al., 2000), the reader might be left confused about the state of evidence about whether dopamine ramps persist or not. Perhaps relatedly, the issue of how the activity of dopamine cells and dopamine release are related is not discussed, which may be relevant given that early studies (e.g., Howe et al., 2013) used voltammetry to measure extracellular dopamine concentrations.

    1. Reviewer #1 (Public review):

      Summary:

      The authors develop alignment methods for layer-specific widefield calcium imaging in the mouse cortex. Under the assumption that the majority of the widefield signal originates at the level of the cell bodies, different cortical layers will appear at different locations in a top-down view as a function of the curvature of the mouse cortex. The authors develop software tools to correct for this, as well as depth-dependent source blurring. Finally, they apply these tools to investigate functional connectivity differences of different neuron types and find only subtle differences.

      Strengths:

      The work is technically strong, the experiments well executed, and the presentation clear.

      Weaknesses:

      One concern I have is that the central assumption underlying the rationale for the depth correction, namely that the source of the majority of the widefield signal is the cell body, may be incorrect. Layer 5 neurons have a dense axo-dendritic plexus very close to the surface of the cortex. Given the attenuation length of visible light in tissue, as well as our own measurements (https://elifesciences.org/articles/71476#fig6s1), I suspect that the majority of the widefield calcium signal originates in the superficial axo-dendritic plexus. The authors acknowledge this possibility, but there are a few simple measurements they could make to address this more directly. If indeed, as I suspect, the majority of the calcium signal originates in the first 50 um of tissue (even when imaging layer 5 neurons), the curvature correction is counterproductive, of course. The authors could test the effect of adding brain slices of varying thicknesses on top of e.g., a layer 2/3 widefield recording. If the authors are correct, and most of the signal is from cell bodies, this should, at most, attenuate the layer 2/3 recording to the level of a layer 5 recording. Anecdotally, while doing the measurements for the figure referenced above, we have done this experiment with a 100 um thick slice, and no quantifiable calcium responses remained.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript by Lorenzo and colleagues presents wide-field cortical imaging data obtained from experiments conducted with three triple-transgenic mouse lines that specifically express the calcium sensor GCaMP6f in neurons of layers 2/3, 5, and 6 of the neocortex, respectively.

      It first includes a methodological contribution aimed at optimizing the analysis of the acquired signals, taking into account both the geometry of the neocortex and photon scattering in the cortical tissue, which affect fluorescence signals differentially depending upon their cortical depth of origin.

      In particular, they built upon the work previously published in eLife by Waters in 2024, which, based on a simulation of photon scattering using a Monte Carlo random-walk model, provided an estimate of the tissue volumes contributing to the fluorescence signals measured from the surface in several mouse lines expressing Gcamp in a layer-specific manner.

      The authors here additionally performed empirical measurements of the point spread function at different cortical depths to determine spatial kernels to be used to deconvolve wide-field imaging data acquired from their three-layer-specific GCaMP6f-expressing mouse lines. They assess the added value of this deconvolution approach based on recordings of the cortical responses evoked by whisker stimulation in the barrel cortex, using lightly anesthetized, layer 2/3 and layer 5 GCaMP6f-expressing mice.

      Altogether, these proposed methods aim at optimizing the registration of recorded signals on a common reference frame, allowing to compare cortical spatiotemporal dynamics recorded from distinct layer-specific GCaMP-expressing mice.

      The manuscript further contains a more neurophysiological contribution, directly utilizing the proposed methods to perform a comparative layer-specific functional connectivity analysis from data collected with the 3 different mouse lines, while the mice were head-fixed below the macroscope.

      Strengths:

      Wide-field 1-photon functional optical imaging, which allows recording cortical spatiotemporal dynamics over a large portion of the dorsal neocortex in mice, has become a tool of choice to study how activity over a wide range of cortical areas is orchestrated in various behavioral contexts. The ever-increasing availability of transgenic mice exhibiting pan-cortical calcium- or voltage-dependent sensors within specific neuronal populations is generating a growing interest in these approaches among the neuroscientific community.

      Nowadays, it is possible to image specifically the activity of excitatory neurons whose cell bodies are located in given cortical layers. However, interpreting fluorescence signals recorded from the surface while originating from deep layers proves difficult due to photon scattering, which reduces image definition, as previously established by Waters et al. (2024).

      The ability to correct for this blurring effect and to place the recorded signals within a common frame of reference is therefore essential not only for comparing activity across layers but also for integrating findings across studies, thereby advancing our collective understanding of neocortical physiology.

      In this sense, this work by Lorenzo and colleagues is definitely both timely and valuable.

      Overall, the manuscript is clearly structured and well-written, and the figures are of excellent graphic quality.

      The proposed approach to correct the blurring of the fluorescent signals, which increases with depth, by means of empirical measurements of point spread functions and deconvolution, seems pertinent and efficient.

      Finally, the authors have collected evoked and spontaneous dynamics of calcium signals from 3 different layer-specific GCaMP mice, which in itself represents a substantial experimental effort, not least because of the need to generate the animals. Out of these data, they provide a unique comparative analysis of layer-specific functional connectivity.

      Weaknesses:

      To fully benefit a large community, some aspects of the proposed methodological advances need to be more detailed in the manuscript and potentially refined. For instance, it is very difficult to evaluate, given the tiny confocal images provided in Figure 1, the potential contribution of GCaMP signal from apical dendrites of layer V neurons in Rbp4-GCaMP6f mice. It is also difficult for the reader to assess the added value of the layer-specific reference maps, given that functional image registration relies on nonlinear transformations and limited detail is provided regarding the procedure used to realign the functional data with these maps (lines 465-467). It is not really clear how the illustrated "composite maps" and the "five functional spots" used for the registration are computed. In addition, one could question the choice of the large time windows used to generate these composite maps/functional landmarks. Since the early component of the evoked responses is more likely to reflect the location of the initial thalamocortical inputs, restricting the analysis to the early phase of the responses might improve the accuracy of primary cortical area identification. This concern regarding the time window used to define specific cortical representation areas may also be relevant to Figure 4, which illustrates the results of the proposed deconvolution approach used to correct for photon scattering (although the time windows used for these analyses are not specified).

      With regard to Figure 4, the reader might wonder why the results are not illustrated similarly for the layer 6 mice. It would therefore be useful to clearly indicate whether these data are not shown because they were not collected, or because it proved impossible to identify single whisker representations, despite the proposed deconvolution procedure.

      Regarding the analysis of layer specificity in terms of functional connectivity, the authors extensively use the term "resting-state" to describe the behavioral context of data collection, given that the animals were not engaged in a goal-directed task. However, because the mice were experiencing head fixation beneath a functional epifluorescence macroscope for only the second time, it is questionable whether this state can truly be classified as "resting." As indicated by the global quantification of body movements, the animals most likely alternated between quiet wakefulness and more active phases.

      To allow the reader to accurately interpret the reported functional connectivity differences, the authors should at least provide a quantification of the time animals spent in the quiet versus active states, and assess whether these proportions were comparable between the different mouse lines. Another way to address this issue would be to perform functional connectivity analyses after splitting the data according to these two states based on body movement quantification, although it is difficult to assess the feasibility of this approach without knowing the temporal distribution of these states within the dataset.

      This seems particularly important since differences in neural cross-regional correlation patterns have been linked to arousal levels, with a comparable optical imaging approach, by Shahsavarani and colleagues (Cell Reports, 2023), who compared initial and prolonged resting periods. In addition, the authors report here that layer differences in functional connectivity are more pronounced in regions associated with the default mode network, whose activity is likely to differ between quiet and active wakefulness.

      Finally, given the richness of the dataset, it would be very interesting to assess how the proposed deconvolution approach affects PCA-ICA-based functional parcellation of spontaneous cortical activity (Reidl et al., NeuroImage, 2007; Makino et al., Neuron, 2017) and whether it enables cross-layer comparisons of independent cortical modules. Such supplementary analyses would substantially increase the impact of this work.

    3. Reviewer #3 (Public review):

      This paper provides valuable technical and theoretical validation of layer-specific wide-field imaging. Here, the authors use specific transgenic lines that provide layer-specific cell body expression (and some superficial dendrites). They then use deconvolution approaches and potentially more accurate atlases based on depth-dependent features to register and resolve what are layer-specific functional GCaMP signals.

      In general, the work is extremely well done, and I have little specific criticism. I think the author should be commended for their creative solutions, including using the light source at different depths to measure apparent scattering and blurring, allowing them to incorporate the deconvolution approach.

      Throughout the manuscript, they refer to the signals as layer-specific and, for the most part, conclude similar functional connectivity as in different layers with some noted exceptions. This is an outstanding resource for the community.

      Major Comment:

      I think they should add some caveats that the lines that they employ do contain dendrites that are in more superficial cortices. Could they make some estimates of signal contribution from these, say, layer 6 neuron superficial dendrites versus the deep somata? This clarification should be included in the abstract; maybe they could call these apparent somatic signals? Another way of doing this would be a Soma-targeted deep indicator, but this is probably beyond the scope of the paper.

      Alternatively, how much of the layer 5 signal would be expected to be recovered?

    1. Reviewer #1 (Public review):

      Summary:

      The current manuscript characterizes in detail the macrophages in the thymus. The authors identify two distinct populations of thymic macrophages and describe their surface marker expression and transcriptional signatures. They also explore their ontology and kinetics of settling and persistence in the thymus and find that the TIMD4+ macrophages are derived from embryonic progenitors and self-maintain in the thymus, while the TIMD4- macrophages are derived from monocytes. Most importantly, the authors test the functional importance of thymic macrophages for T cell development using an in vitro depletion system, from which they conclude that macrophages are important for one of the earliest selection steps in T cell development - the beta selection.

      Strengths:

      The authors use state-of-the-art techniques, such as multiple genetically modified mice, multi-color flow cytometry, single-cell RNA sequencing, genetic fate mapping, and fetal thymic organ culture (FTOC) combined with depletion. Their work is in good agreement with prior published studies on the subject, such as Tacke et al. (PMID: 26091486) and Zhou et al. (PMID: 36449334). In addition to reproducing prior knowledge, the authors uncover novel and unexpected facets of thymic macrophage biology, such as their SpiC independence and the fact that TIMD4- thymic macrophages depend on CCR2 (Tacke et al. have shown that the overall thymic macrophage compartment is normal in CCR2-/- mice). Most surprisingly, the authors claim that thymic macrophages control an early checkpoint in T cell development, the beta selection. This has not been reported before, as beta selection is usually considered a cell-autonomous process in thymocytes that does not require input from other cells.

      Weaknesses:

      The thymic macrophage depletion experiments are not well controlled, and the authors' interpretation of the results is a stretch. First, the treatment depletes other cell types, most notably dendritic cells (DCs), which have well-known roles in thymic selection (though not specifically in beta selection). The authors' reasoning that macrophages are abundant in the cortex, where beta selection occurs, while DCs are enriched in the medulla, seems questionable, as the embryonic thymus typically lacks (or has very small) medulla. A second salient point is that the authors haven't ruled out direct toxicity of the dimerizer drug AP20187 on thymocytes (specifically DN cells) in MAFIA mice.

      Altogether, this is a solid manuscript that largely confirms the previously established ontogeny and heterogeneity of thymic macrophages. However, the participation of thymic macrophages in beta selection needs stronger evidence.

    2. Reviewer #2 (Public review):

      This manuscript from Zuniga-Pflucker laboratory describes that thymic macrophages are heterogeneous in flow cytometric and transcriptomic profiles, containing two major populations characterized by TIMD4 and CX3CR1 expression. These macrophage populations are both parenchymal in the thymus but are unequal in developmental ontogeny, Flt3 expression history, and CCR2 dependency. The manuscript further reports the interesting findings that the depletion of thymic macrophages impairs thymocyte development at the DN3 beta-selection checkpoint. These results provide an important advance for further understanding of thymus biology, especially in view of the contribution of heterogenous thymic macrophage subpopulations.

      However, Zhou et al. previously reported essentially similar heterogeneity in thymic macrophages. It was demonstrated that TIMD4+ macrophages and CX3CR1+ macrophages have distinct origins and are different in developmental characteristics (27). The authors should better clarify what was previously demonstrated and what is newly described in this study. Zhou, et al. also demonstrated that TIMD4+ macrophages are localized in the cortex whereas CX3CR1+ macrophages distribute in the medullary region. Whether or not these previous findings are reproduced and supported in the present study is important in view of the new finding that thymic macrophages are important for beta-selection, which is presumed to occur in the thymic cortex. The authors may be able to suggest more strongly that TIMD4+ macrophages regulate beta-selection in the thymic cortex through phagocytic efferocytosis. (Indeed, the Figure 1 legend states that frozen thymic sections were used for immunofluorescent staining to identify the localization of thymic macrophages, without showing the results.)

  2. Jun 2026
    1. Reviewer #1 (Public review):

      Summary:

      The authors use Dyngo-4a, a known Dynamin inhibitor to test its influence on caveolar assembly and surface mobility. They investigate whether it incorporates into membranes with Quartz-Crystal Microbalance, they investigate how it is organized in membranes using simulations. Finally, they use lipid-packing sensitive dyes to investigate lipid packing in the presence of Dyngo-4a, membrane stiffness using AFM and membrane undulation using fluorescence microscopy. They also use a measure they call "caveola duration time" to claim that something happens to caveolae after Dyngo-4a addition and using this parameter, they do indeed see an increase in it in response to Dyngo-4a, which is reduced back to the baseline after addition of cholesterol.

      Overall, the authors claim: 1) Dyngo-4a inserts into the membrane and this 2) results in "a dramatic dynamin-independent inhibition of caveola scission". 3) Dyngo-4a was inserted and positioned at the level of cholesterol in the bilayer and 4) Dyngo-4a-treatment resulted in decreased lipid packing in the outer leaflet of the plasma membrane 5) but Dyngo-4a did not affect caveola morphology, caveolae-associated proteins, or the overall membrane stiffness 6) acute addition of cholesterol counteracts the block in caveola scission caused by Dyngo-4a.

      Overall, in this reviewers opinion, after the additional experiments in the review process, all claims are now well-supported by the presented data from electron and live cell microscopy, QCM-D and AFM.

      Significance:

      A number of small molecule inhibitors for the GTPase dynamics exist, that are commonly used tools in the investigation of endocytosis. This goes as far that the use of some of these inhibitors alone is considered in some publications as sufficient to declare a process to be dynamin-dependent. However, this is not always correct, as there are considerable off-target effects, including the inhibition of caveolar internalization by a dynamin-independent mechanism. This is important, as for example the influence of dynamin small molecule inhibitors on chemotherapy resistance is currently investigated (see for example Tremblay et al., Nature Communications, 2020).

      The investigation of the true effect of small molecules discovered as and used as specific inhibitors and their offside effects is extremely important and this reviewer applauds the effort. It is important that inhibitors are not used alone, but other means of targeting a mechanism are exploited as well in functional studies. The audience here thus is besides membrane biophysicists interested in the immediate effect of the small molecule Dyngo-4a also cell biologists and everyone using dynamic inhibitors to investigate cellular function.

      Comments on revised version.

      Overall, in this reviewer's opinion, after the additional experiments in the review process, all claims are now well-supported by the presented data from electron and live cell microscopy, QCM-D and AFM.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors probe the mechanisms by which Dyngo-4a, a dynamin inhibitor used to block endocytosis, impact caveolae dynamics. They provide compelling evidence that Dyngo-4a inhibits caveolae dynamics and endocytosis (as well as several other aspects of plasma membrane dynamics) by a dynamin-independent mechanism. They also provide strong computational and experimental data showing that Dyngo-4a inserts into membranes and decreases lipid packing in the outer leaflet of the plasma membrane. Finally, they demonstrate that the addition of excess cholesterol to cells reverses the effects of Dyngo-4a on caveolae dynamics, presumably by reversing lipid packing defects. Based on these findings they conclude that lipid packing regulates caveolae dynamics and endocytosis in a cholesterol-dependent manner.

      This work should be of value to cell biologists interested in plasma membrane remodeling and membrane trafficking, biophysicists that study small molecule/membrane interactions and membrane remodeling processes, and chemists interested in designing drugs to target membrane trafficking machinery and pathways.

      Strengths and weaknesses:

      This work addresses the important topic of how a widely used endocytic inhibitor actually works. In the process of addressing this question, the authors uncover unexpected connections between how lipids are packed in cell membranes and membrane dynamics. The methods are appropriate and many of the claims made in this work are well supported by data.

      The authors have also been responsive to comments raised during review by including additional experimental evidence that Dyngo-4a inhibits caveolae endocytosis as well as documenting the effects of Dyngo-4a on caveolae morphology.

      The work also raises some interesting questions for the future. As one example, the authors note that in addition to inhibiting caveolar dynamics, Dyngo-4a inhibits generalized plasma membrane mobility, transferrin uptake, and fusion of fusogenic liposomes to the plasma membrane. More work will be required to determine whether these events are mediated by a common, lipid packing-dependent mechanism.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      Summary:

      This manuscript offers a careful and technically impressive dissection of how subpopulations within the subthalamic nucleus (STN) support reward-biased perceptual decision-making. The authors recorded STN neurons in monkeys performing an asymmetric-reward visual motion discrimination task, then combined single-unit analyses, regression modeling, and drift-diffusion model (DDM) fitting to identify functionally distinct neuronal clusters. Each subpopulation shows unique relationships to computational decision variables - evidence accumulation rate, decision bound, and non-decision time - as well as to post-decision evaluative signals including choice accuracy and reward expectation. The revised manuscript substantially strengthens the original submission by improving both the objectivity of neuron selection and the robustness of the clustering solution.

      Strengths:

      The asymmetric-reward paradigm cleanly separates perceptual and motivational contributions to STN activity, allowing the authors to characterize how neurons blend these distinct sources of information. The dataset is extensive and well-controlled, and the behavioral and neural analyses are tightly integrated. Relating cluster-specific activity to DDM parameters provides an interpretable computational link between population signals and behavior. The clustering solution is now validated across two algorithms, two monkeys, and subsets of trials - establishing that the three-cluster structure is robust. The new Figure 9 offers a conceptually useful, if necessarily speculative, synthesis connecting the identified subpopulations to distinct basal-ganglia pathways (hyperdirect versus indirect). The new Figure 8 documenting the anatomical intermingling of subpopulations is also important, as it directly informs the interpretation of prior and future STN stimulation studies.

      Weaknesses:

      The inferred relationships between neural clusters and DDM parameters remain correlational - the authors now appropriately flag this throughout, and the causal inference gap is acknowledged in the Discussion with concrete proposals for future targeted perturbation strategies. While a generative multi-cluster model would further strengthen mechanistic interpretation, the conceptual framework in Figure 9 provides a reasonable intermediate step given the scope of the study and the absence of simultaneous population recordings, which preclude direct inter-cluster covariation analyses. These remaining limitations are inherent to the experimental design rather than analytical oversights.

      Comments on the previous version:

      The authors have responded thoroughly and constructively to all of my concerns. The revised clustering pipeline - incorporating finer temporal resolution, objective neuron selection, outlier removal, a second clustering algorithm, cross-monkey validation (Rand indices of 0.94 and 1.0 for the two monkeys), and trial-subset stability analysis - substantially increases confidence in the three-cluster solution. The correlational nature of the DDM-activity relationships is now clearly stated, and the Discussion appropriately contextualizes the causal inference gap while suggesting feasible future directions. The new Figure 9 provides the conceptual synthesis I had hoped for, within the realistic scope of the present study. I am satisfied with the authors' responses and have no further requests.

    2. Reviewer #2 (Public review):

      This study uses monkey single-unit recordings to examine the role of the STN in combining noisy sensory information with reward bias during decision-making between saccade directions. Using multiple linear regressions and clustering approaches, the authors overall show that a highly heterogeneous activity in the STN reflects almost all aspects of the task, including choice direction, stimulus coherence, reward context and expectation, choice evaluation, and their interactions. The authors report in particular how three classes of neurons map to different decision processes evaluated via the fitting of a drift-diffusion model. Overall, the study provides evidence for functionally diverse and anatomically intermingled populations of STN neurons, supporting multiple roles in perceptual and reward-based decision-making.

      This study follows up on work conducted in previous years by the same team and complements it. Extracellular recordings in monkeys trained to perform a complex decision-making task remain a remarkable achievement, particularly in brain structures that are difficult to target, such as the sub-thalamic nucleus. The authors conducted numerous analyses of STN activities, using sophisticated statistical approaches and functional computational modeling.

    1. Reviewer #1 (Public review):

      Summary:

      In the paper, the authors propose a new RNA velocity method, TSvelo, which predicts the transcription rate linearly based on the expression of RNA levels of transcription factors. This framework is an extension of its recent work TFvelo by including unspliced reads and designing a coherent neuralODE framework. Improved performance was demonstrated in six diverse datasets.

      Strengths:

      Overall, this method introduces innovative solutions to link cell differentiation and gene regulation, with a balance between model complexity (neuralODE) and interpretability (raw gene space).

      Comments on revised version:

      The authors have added comprehensive analyses in this revision, and all of my concerns have been very well addressed. Here, I just want to re-emphasize the original points 1 and 3.

      (1) The analysis and clarification are very helpful - thanks! I found that Fig. R1 and R2 are very insightful, as DoRothEA-only returns much worse performance. Please consider adding these two figures to the supp figure and possibly highlighting your setting for edge pruning (down-weights); therefore, the model is more likely to be affected by false negatives than false positives in the TF-target prior.

      (3) Please consider adding some discussion on the challenges in capturing cell cycle transitions.

    2. Reviewer #3 (Public review):

      Despite the abundance of RNA velocity tools, there are still major limitations, and there is strong skepticism about the results these methods lead to. In this paper, the authors try to address some limitations of current RNA velocity approaches by proposing a unified framework to jointly infer transcriptional and splicing dynamics. The method is then benchmarked on 6 real datasets against the most popular RNA velocity tools.

      Comments on revised version.

      The Authors addressed all my comments suitably. I'd like to thank them for the time they spent addressing them: the revised paper is much more convincing.

      I have 2 very minor follow-up concerns:

      (1) I appreciated the simulation study, however, no null simulation is present.<br /> We know RNA velocity tools are inclined to provide false positives: trajectories even when the data doesn't have any.<br /> I'd be helpful to add null simulations where the data has no trajectories and see if methods erroneously identify any.

      (2) Several of the novel analyses are only reported in the Supplementary material and only references in the main text (e.g., "A validation of TSvelo on simulated data is provided in Fig. S1 and Fig. S2 in the Supplementary Information."). This is pity!

      If allowed, I'd add some comments about the new analyses (simulations, computational benchmarks, etc...) also in the main text.

    1. Reviewer #1 (Public review):

      The authors have conducted substantial additional analyses to address the reviewers' comments. However, several key points still require attention. I was unable to see the correspondence between the model predictions and the data in the added quantitative analysis. In the rebuttal letter, the delta peak speed time displays values in the range of [20, 30] ms, whereas the data were negative for the 45{degree sign} direction. Should the reader directly compare panel B of Figure 6 with Figure 1E? The correspondence between the model and the data should be made more apparent in Figure 6. Furthermore, the rebuttal states that a quantitative prediction was not expected, yet it subsequently argues that there was a quantitative match. Overall, this response remains unclear.

      A follow-up question concerns the argument about strategic slowing. The authors argue that this explanation can be rejected because the timing of peak speed should be delayed, contrary to the data. However, there appears to be a sign difference between the model and the data for the 45{degree sign} direction, which means that it was delayed in this case. Did I understand correctly? In that regard, I believe that the hypothesis of strategic slowing cannot yet be firmly rejected and the discussion should more clearly indicate that this argument is based on some, but not all, directions. I agree with the authors on the importance of the mass underestimation hypothesis, and I am not particularly committed to the strategic slowing explanation, but I do not see a strong argument against it. If the conclusion relies on the sign of the delta peak speed, then the authors' claims are not valid across all directions, and greater caution in the interpretation and discussion is warranted. Regarding the peak acceleration time, I would be hesitant to draw firm conclusions based on differences smaller than 10 ms (Figures R3 and 6D).

      The authors state in the rebuttal that the two hypotheses are competing. This is not accurate, as they are not mutually exclusive and could even vary as a function of movement direction. The abstract also claims that the data "refutes" strategic slowing, which I believe is too strong. The main issue is that, based on the authors' revised manuscript, the lack of quantitative agreement between the model and the data for the mass underestimation hypothesis is considered acceptable because a precise quantitative match is not expected, and the predictions overall agree for some (though not all) directions and phases (excluding post-in). That is reasonable, but by the same logic, the small differences between the model prediction and the strategic slowing hypothesis should not be taken as firm evidence against it, as the authors seem to suggest. In practice, I recommend a more transparent and cautious interpretation to avoid giving readers the false impression that the evidence is decisive. The mass underestimation hypothesis is clearly supported, but the remaining aspects are less clear, and several features of the data remain unexplained.

      Comments on revised version.

      The authors have reworked the sections of the text where the narrative was too strong or binary wrt alternative interpretations. The result is well balanced. No further recommendation.

    2. Reviewer #3 (Public review):

      Summary:

      The authors describe an interesting study of arm movements carried out in weightlessness after a prolonged exposure to the so-called microgravity conditions of orbital spaceflight. Subjects performed radial point-to-point motions of the fingertip on a touch pad. The authors note a reduction in movement speed in weightlessness, which they hypothesize could be due to either an overall strategy of lowering movement speed to better accommodate the instability of the body in weightlessness or an underestimation of body mass. They conclude for the latter, mainly based on two effects. One, slowing in weightlessness is greater for movement directions with higher effective mass at the end effector of the arm. Two, they present evidence for increased number of corrective sub movements in weightlessness. They contend that this provides conclusive evidence to accept the hypothesis of an underestimation of body mass.

      Strengths:

      In my opinion, the study provides a valuable contribution, the theoretical aspects are well presented through simulations, the statistical analyses are meticulous, the applicable literature is comprehensively considered and cited and the manuscript is well written.

      Weaknesses:

      I nevertheless am of the opinion that the interpretation of the observations leaves room for other possible explanations of the observed phenomenon, thus weakening the strength of the arguments.

      I raised the following points in my original review, but I find that the authors have judiciously addressed these points through their various revisions.

      I believe that the article constitutes a valuable contribution and that the results and conclusions are certainly worthy of consideration by the human motor control community.

      (1) The authors model the movement control through equations that derive the input control variable in terms of the force acting on the hand and treating the arm as a second-order low pass filter (Eq. 13). Underestimation of the mass in the computation of a feedforward command would lead to a lower-than-expected displacement to that command. But it is not clear if and how the authors account for a potential modification of the time constants of the 2nd order system. The CNS does not effectuate movements with pure torque generators. Muscles have elastic properties that depend on their tonic excitation level, reflex feedback and other parameters. Indeed, Fisk et al.* showed variations of movement characteristics consistent with lower muscle tone, lower bandwidth and lower damping ratio in 0g compared to 1g. Could the variations in the response to the initial feedforward command be explained by a misrepresentation of the limbs damping and natural frequency, leading to greater uncertainty to the consequences of the initial command. This would still be an argument for un-adapted feedforward control of the movement, leading to the need for more corrective movements. But it would not necessarily reflect an underestimation of body mass.

      *Fisk, J. O. H. N., Lackner, J. R., & DiZio, P. A. U. L. (1993). Gravitoinertial force level influences arm movement control. Journal of neurophysiology, 69(2), 504-511.

      While the authors attempt to differentiate their study from previous studies where limb neuromechanical impedance was shown to be modified in weightlessness by emphasizing that in the current study the movements were rapid and the initial movement is "feedforward". But this incorrectly implies that the limb's mechanical response to the motor command is determined only by active feedback mechanisms. In fact:

      (a) All commands to the muscle pass through the motor neurons. These neurons receive descending activations related not only to the volitional movement, but also to the dynamic state of the body and the influence of other sensory inputs, including the vestibular system. A decrease in descending influences from the vestibular organs will lower the background sensitivity to all other neural influences on the motor neuron. Thus, the motor neuron may be less sensitive to the other volitional and reflexive synaptic inputs that it may receive.

      (b) Muscle tone plays a significant role in determining the force and the time course of the muscle contraction. In a weightless environment, where tonic muscle activity is likely to be reduced, there is the distinct possibility that muscles will react more slowly and with lower amplitude to an otherwise equivalent descending motor command, particularly in the initial moments before spinal reflexes come into play. These, and other neuronal mechanisms could lead to the "under-actuation" effect observed in the current study, without necessarily being reflective of an underestimation of mass per se.

      (2) The subject's body in weightless is much more sensitive to reaction forces in interactions with the environment in the absence of the anchoring effect of gravity pushing the body into the floor and in the absence of anticipatory postural adjustments that typically accompany upper-limb motions in Earth gravity in order to maintain an upright posture. The authors dismiss this possibility because the taikonauts were asked to stabilize their bodies with the contralateral hand. But the authors present no evidence that this was sufficient to maintain the shoulder and trunk at a strictly constant position, as is supposed by the simplified biomechanical model used in their optimal control framework. Indeed, a small backward motion of the shoulder would result in a smaller acceleration of the fingertip and a smaller extent of the initial ballistic motion of the hand with respect to the measurement device (the tablet), consistent with the observations reported in the study. Note that stability of the base might explain why 45º movements were apparently less affected in weightlessness, according to many of the reported analyses, including those related to corrective movements (Fig. 5 B, C, F; Fig. 6D), than the other two directions. If the trunk is being stabilized by the left arm, the same reaction forces on the trunk due to the acceleration of the hand will result in less effective torque on the trunk, given that the reaction forces act with a much smaller moment arm with respect to the left shoulder (the hand movement axis passes approximately through the left shoulder for the 45º target) compared to either the forward or rightward motions of the hand.

      (3) The above is exacerbated by potential changes in the frictional forces between the fingertip and the tablet. The movements were measured by having the subjects slide their finger on the surface of a touch screen. In weightlessness, the implications of this contact can be expected to be quite different than on the ground. While these forces may be low on Earth, the fact is that we do not know what forces the taikonauts used on orbit. In weightlessness, the taikonauts would need to actively press downward to maintain contact with the screen, while on Earth gravity will do the work. The tangential forces that resist movement due to friction might therefore be different in 0g. . Indeed, given the increased instability of the body and the increased uncertainty of movement direction of the hand, taikonauts may have been induced to apply greater forces against the tablet in order to maintain contact in weightlessness, which would in turn slow the motion of the finger on the table and increase the reaction forces acting on the trunk. This could be particularly relevant given that the effect of friction would interact with the limb in a direction-dependent fashion, given the anisotropy of the equivalent mass at the fingertip evoked by the authors.

      I feel that the authors have done an admirable job of exploring the how to explain the modifications to movement kinematics that they observed on orbit within the constraints of the optimal control theory applied to a simplified model of the human motor system. While I fully appreciate the value of such models to provide insights into question of human sensorimotor behaviour, to draw firm conclusions on what humans are actually experiencing based only on manipulations of the computational model, without testing the model's implicit assumptions and without considering the actual neurophysiological and biomechanical mechanisms, can be misleading. One way to do this could be to examine these questions through extensions to the model used in the simulations (changing activation dynamics of the torque generators, allowing for potential motion backward motion of the shoulder and trunk, etc.). A better solution would be to emulate the physiological and biomechanical conditions on Earth (supporting the arm against gravity to reduce muscle tone, placing the subject on a moveable base that requires that the body be stabilized with the other hand) in order to distinguish the hypothesis of an underestimation of mass vs. other potential sources of under-actuation and other potential effects of weightlessness on the body.

      In sum, my opinion is that the authors are relying too much on a theoretical model as a ground truth and thus overstate their conclusions. But to provide a convincing argument that humans truly underestimate mass in weightlessness, they should consider more judiciously the neurophysiology and biomechanics that fall outside the purview of the simplified model that they have chosen. If a more thorough assessment of this nature is not possible, then I would argue that a more measured conclusion of the paper should be 1) that the authors observed modifications to movement kinematics in weightlessness consistent with an under-actuation for the intended motion, 2) that a simplified model of human physiology and biomechanics that incorporates principles of optimal control suggest that the source of this under-actuation might be an underestimation of mass in the computation of an appropriate feedforward motor command, and 3) that other potential neurophysiological or biomechanical effects cannot be excluded due to limitations of the computational model.

    1. Reviewer #1 (Public review):

      Summary:

      The objective of this study was to infer the population dynamics (rates of differentiation, division and loss) and lineage relationships of NK cell subsets during an acute immune response and under homeostatic conditions.

      Strengths:

      A rich dataset and a detailed analysis of a particular class of stochastic models.

      Weaknesses: (relating to initial submission)

      The stochastic models used are quite simple; each population is considered homogeneous with first-order rates of division, death, and differentiation. In Markov process models such as these there is no dependence of cellular behavior on its history of divisions. In recent years models of clonal expansion and diversification, in the settings of T and B cells, have progressed beyond this picture. So I was a little surprised that there was no mention of the literature exploring the role of replicative history in differentiation (e.g. Bresser Nat Imm 2022), nor of the notion of family 'division destinies' (either in division number, or the time spent proliferating, as described by the Cyton and Cyton2 models developed by Hodgkin and collaborators; e.g. Heinzel Nat Imm 2017). The emerging view is that variability in clone (family) size arises may arise predominantly from the signals delivered at activation, which dictate each precursor's subsequent degree of expansion, rather than from the fluctuations deriving from division and death modeled as Poisson processes.

      As you pointed out, the Gerlach and Buchholz Science papers showed evidence for highly skewed distributions of family sizes, and correlations between family size and phenotypic composition. Is it possible that your observed correlations could arise if the propensity for immature CD27+ cells to differentiate into mature CD27- cells increases with division number? The relative frequency of the two populations would then also be impacted by differences in the division rates of each subset - one would need to explore this. But depending on the dependence of the differentiation rate on division number, there may be parameter regimes (and timepoints) at which the more differentiated cells can predominate within large clones even if they divide more slowly than their immature precursors. One might not then be able to rule out the two-state model. I would like to see a discussion or rebuttal of these issues.

      Comments on revised version.

      I am happy with the latest revisions that the authors have made.

    1. Reviewer #1 (Public review):

      Summary:

      Kashiwagi et al. undertook a population analysis of dendritic spine nanostructure applied to the objective grouping of 8 mouse models of neuropsychiatric disorders. They report that spine morphology in cultured hippocampal neurons shows a higher similarity among schizophrenia mouse models (compared with autism spectrum disorder (ASD) mouse models) and identify an effect of Ecrg4 (encoding small secretory peptides) on spine dynamics and shape in these models.

      Strengths:

      The study developed a method for objectively comparing spine properties in primary hippocampal neuron cultures from 8 mouse models of psychiatric disorders at the population level using high-resolution structured illumination microscopy (SIM) imaging. This novel technique identified two distinct groups of mouse models according to the population-level spine properties: those with ASD-related gene mutations and those with schizophrenia-related gene mutations. Functional studies, including gene knockdown and overexpression experiments, identified an effect of Ecrg4 on the spine phenotype of the schizophrenia model mice.

      Weaknesses:

      The main weakness is that the study is wholly in vitro, using cultured hippocampal neurons. The authors present this as an advantage, however, arguing that spine morphology as measured in a reduced culture system can demonstrate direct effects of gene mutations on neuronal phenotypes in the absence of indirect influences from nonneuronal cells or specific environments.

    2. Reviewer #2 (Public review):

      Okabe and colleagues build on a super-resolution-based technique they have previously developed in cultured hippocampal neurons, improving the pipeline and using it to analyze spine nanostructure differences across 8 different mouse lines with mutations in autism or schizophrenia (Sz) risk genes/pathways. It is a worthy goal to try to use multiple models to examine potential convergent (or not) phenotypes, and the authors have made a good selection of models. They identify some key differences between the autism versus the Sz risk gene models, primarily that dendritic spines are smaller in Sz models and (mostly) larger in autism risk gene models. They then focus on three models (2 Sz - 22q11.2 deletion, Setd1a; 1 ASD - Nlgn3) for timelapse imaging of spine dynamics, and together with computational modelling provide a mechanistic rationale for the smaller spines in Sz risk models. Bulk RNA sequencing of all 8 model cultures identifies several differentially expressed genes which they go on to test in cultures, finding that ecgr4 is upregulated in several Sz models and its misexpression recapitulates spine dynamics changes seen in the Sz mutants, while knockdown rescues spine dynamics changes in the Sz mutants. Overall, these have the potential to be very interesting findings and useful for the field. My major concerns from the initial manuscript, especially regarding cherry picking and circularity have been addressed with revised analytical approaches. I have some remaining minor comments.

      (1) The comparison between two wild-type samples versus wild-type-mutant samples is helpful - I think this could be added to the manuscript.

      (2) For results of timelapse imaging - please spell out in the results section the direction of change (lines 270 - 277).

      (3) Using linear mixed effect models for statistical analysis is a significant improvement. While a sample size (n) of mice = 3 is not ideal, I think given the multiple different mouse lines used and intensity of analysis, this is probably the best that can be done, although further validation in larger samples eventually is to be hoped for.

      (4) The revised text is much improved, but I still think the authors should be upfront somewhere in the text that the schizophrenia-associated genes can only confer biased risk for schizophrenia (and that the clinical phenotype can also include autism). As I said before, I think this is the best we can do and I agree with their choices, but it is important not to overstate the link. The differences they see make it clear that these are still relevant distinctions.

    1. Reviewer #1 (Public review):

      Summary:

      Roseby and colleagues report on a body region-specific sensory control of the fly larval righting response, a body contortion performed by fly larvae to correct their posture from an inverted (dorsal side down) position. This is an important topic because of the general need for animals to locomote in the correct orientation and the clever and broadly useful methodologies used in this paper to uncover the sensory triggers for the behavior, including a body region-specific optogenetic approach along different axial positions of the larva, region-specific manipulation of surface contacts with the substrate, and a 'water unlocking' technique to initiate righting behaviors, all strengths of the manuscript. The authors found that multidendritic neurons, particularly the daIV neurons, are necessary for righting behavior. The contribution of daIV neurons had been shown by the authors in a prior paper (Klann et al, 2021), but that study had used constitutive neuronal silencing. Here the authors used acute inactivation to confirm this finding. Additionally, the authors describe an important role for anterior sensory neurons. They move on to test the genetic basis for righting behavior and, consistent with the regional specificity they observe, implicate sensory neuron expression of Hox genes Antennapedia and Abdominal-b in self-righting.

      Strengths:

      Strengths of this paper include the important question addressed and the elegant and innovative combination of methods, which led to clear insights into the sensory biology of self-righting and links between body plan and nervous system function that will be useful for others in the field. The manuscript is very clearly written and couched in interesting biology.

      Limitations:

      There are several important questions for future study that, left unresolved, do not diminish the significance of this manuscript. These include the cellular and developmental basis for Hox gene action, the contributions of dorsal and ventral regions of the animal in righting, and the regional contributions of other sensory cell types in the righting response.

      Comments on revised version.

      The authors have addressed my major concerns.

    2. Reviewer #2 (Public review):

      Summary

      This work explores the relationship between body structure and behavior by studying self-righting in Drosophila larvae, a conserved behavior that restores proper orientation when turned upside-down. The authors first introduce a novel "water unlocking" approach to induce self-righting behavior in a controlled manner. Then, they develop a method for region-specific inhibition of sensory neurons revealing that anterior, but not posterior, sensory neurons are essential for proper self-righting. Deep-learning-based behavioral analysis shows that anterior inhibition prolongs self-righting by shifting head movement patterns, indicating a behavioral switch rather than a mere delay. Additional genetic and molecular experiments demonstrate that specific Hox genes are necessary in sensory neurons, underscoring how developmental patterning genes shape region-specific sensory mechanisms that enable adaptive motor behaviors.

      Strengths

      The work by Roseby et al. is notable for its elegant experimental design, the development of innovative methods that are likely to benefit the fly behavior community, and the strong experimental support for its conclusions. The manuscript is clearly written, well structured, and presents thoughtfully designed experiments that have been further improved in the revised version. This updated manuscript includes a comprehensive set of behavioral experiments using an additional Gal4 line (ppk-Gal4), which yields confirmatory results and strengthens support for the original hypothesis. It also incorporates quantification of Gal4 line strength, improvements to existing figures, the addition of new figures, and overall refinement of the text.

      Weakness:

      A remaining limitation of this manuscript is the lack of a cellular and mechanistic analysis explaining how Hox genes give rise to the observed behavioral phenotypes. The authors note that this question is being addressed in an ongoing follow-up study, which will expand the project to examine the roles of all Hox genes across the sensory system and to characterize their expression patterns within each of its subcomponents, with the aim of providing mechanistic insight. I look forward to seeing this work in a future manuscript.

      Comments on revised version.

      I have no further recommendations for the authors; most of my comments and questions have been satisfactorily addressed.

    1. Reviewer #2 (Public review):

      Summary:

      The authors sought to characterize the somatic mutation landscape and gene expression profiles of Kenyan breast cancer patients. By comparing Whole Exome Sequencing (WES) and RNA-seq data from 23 paired tumor-normal samples against The Cancer Genome Atlas (TCGA) cohorts, the study specifically aimed to highlight the role of the ZNF gene family.

      Strengths:

      The study addresses a critical gap in genomic research by focusing on an underrepresented African population, which is essential for achieving global health equity in oncology.

      Weaknesses:

      The cohort is relatively small for definitive landscape characterization. The study fails to explore the mechanistic link between identified somatic mutations and observed aberrant gene expression.

      Impact and Utility:

      The impact of this work is currently limited. While the data adds to the growing repository of African genomic samples, the lack of novelty and mechanistic insight reduces its utility for the broader scientific community. To be clinically valuable, the study would need to offer more robust, unbiased profiling that could eventually inform population-specific diagnostics or therapies.

      Additional Context:

      Breast cancer in African populations often presents with different clinical trajectories compared to Western cohorts. While any data from these regions is vital, "landscape" studies require high statistical power and unbiased analysis to differentiate true population-specific drivers from noise or small-sample variance. Without a clear regulatory mechanism linking mutations to phenotypes, the findings remain preliminary observations.

    2. Reviewer #3 (Public review):

      Summary:

      This revised study analyzes the somatic mutational profiles and transcriptomic expression of three zinc-finger genes (ZNF217, ZNF703, ZNF750) in 23 Kenyan women with breast cancer, using whole-exome sequencing and RNA-sequencing of paired tumor-normal tissues. A total of 358 somatic mutations were detected, and all three genes were significantly upregulated in tumors compared to normal tissues (ZNF217 showing the most prominent difference). The findings provide preliminary evidence for the idenfication of diagnostic/prognostic biomarkers or therapeutic targets in sub-Saharan African populations.

      Strengths:

      The study's key strengths lie in its focus on an underrepresented Kenyan cohort, addressing a critical gap in sub-Saharan African breast cancer genomic research. It integrates DNA-level mutation analysis with RNA-level expression data, leveraging standardized bioinformatics pipelines and rigorous quality control to deliver detailed insights into mutation types, functional impacts, and amino acid changes.

      Comments on revised version:

      After careful revision by the authors, the manuscript has become more rigorous. The limitations including small sample size and lack of functional validation are properly acknowledged, and conclusions are prudently presented as hypothesis‑generating rather than causal claims. Meanwhile, strengthened multi‑omics analyses, TCGA validation, logical reorganization of results and improved figure presentation further enhance the reliability of this work.

    1. Reviewer #1 (Public review):

      Summary:

      This important study performs a theoretical analysis of the evolutionary dynamics of strains under a classical resource competition model to understand how clonal interference and diversification of resource preferences interact to structure microbial population genetic structure. They find that in large asexual populations evolving in relevant parameter regimes, where evolutionary and ecological time scales overlap, populations are characterized by a small number of ecotypes, which are groups of strains that share a given resource preference, whose dynamics in the long run are dominated by priority effects.

      Strengths:

      The manuscript constitutes a novel and sound contribution to theory in ecology and evolution, under relevant parameter regimes which have been previously overlooked due to the complexities they bring, i.e. when the weak mutation regime breaks down. Here, the authors make a considerable step forward by taking advantage of analytical advances in the population genetics theory of clonal interference in recent years (travel fitness wave moving at a constant average speed v), which they apply to resource competition models typically studied in ecology.

      The main insights in the derivations shown in the supplementary text are clearly summarized in Figure 2 of the main manuscript, where the different phases of the somewhat counterintuitive dynamics of the strategic mutations in the model are quantified.

      Weaknesses:

      Despite its many merits, I believe the manuscript can profit from a few clarifications as I point out below:

      (1) I think the authors should make explicit in the abstract of the paper that they study a stair to heaven fitness landscape and that the rate of beneficial mutations does not slow down.

      (2) Evolution is elegantly incorporated in the resource consumption model by assuming two classes of mutations: strategic mutations and constitutively beneficial mutations. I believe that the biological meaning of these different types should be better explained. Specifically, on pages 3 and 4, the authors state that strategy mutations "alter resource uptake strategy and potentially its overall magnitude as well", whereas the other type is "only tangentially related to resource consumption (e.g. eliminating a pathway that is not necessary in the current environment)." I find this a bit strange since this is a model of resource competition, and I would assume that the latter type of mutations would be neutral. Maybe I am not reading this well, and the meaning of the mutations, as well as their assumed rates, could be clarified with some examples as the authors state that these mutations are routinely observed in microbial evolution experiments.

      (3) The authors discuss the theoretical results obtained in the light of the famous Lenski experiment, where ecotype formation is observed in some populations. However, in the mentioned example, cross-feeding was the mechanism involved. Since in their model, unlike in other models, cross-feeding is not considered, I found this example to be misplaced. In addition, in the Lenski experiment, a single (and essential) resource is present in the environment, so the assumptions of the model do not appear to apply. On the other hand, in Herron and Doebeli's experiments, two resources (substitutable) were present, so a comparison with their experimental results would be more appropriate.

      (4) The paper should also discuss deleterious mutations, which I did not see mentioned anywhere.

    2. Reviewer #2 (Public review):

      Summary:

      In "Ecological diversification in rapidly evolving populations", the authors use a consumer-resource model with competition for 2 different resources to study diversification for cases in which ecology and evolution are separated (weak-mutation limit) and when they overlap. They find the potential for the timing of a mutation (and not just its associated fitness) to confer an advantage against fitter strains (which they call "priority effect"), and the aggregation of dominant trait values that lead to the definition of "ecotypes" that discretize and structure the community.

      Strengths:

      The authors introduce detailed analytical calculations in the limit of overlapping ecology and evolution, which is a case that typically eludes analysis. The work also pays particular attention to the timing of "invasion" by a mutation, whereas most approaches focus on the long-term outcome of evolution (e.g. fixation of a trait value).

      Weaknesses:

      The model makes important assumptions that limit its generality considerably. In particular, the two "evolving traits" defined in the model are very specific and by no means the simplest possible resource competition evolutionary model that the authors claim it to be. The manuscript is not clear enough to be reproducible, and the authors do not discuss in sufficient depth the huge amount of work that is presented in the manuscript. The bibliography omits important work focused on diversification emerging from eco-evolutionary interactions similar to the ones studied in the manuscript.

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Balasubramaniam and colleagues continue this group's efforts to understand mitochondrial-derived compartments (MDCs) that bud off from yeast mitochondria in response to metabolic stress. In a previous genetic screen, they identified Ups lipid transfer proteins and the AAA-protease Yme1 as components that modulate MDC formation. In this study, the authors link these observations by showing that Yme1 modulates levels of Ups1, Ups2, as well as MICOS complex members in the mitochondrial proteome. Using genetic approaches, they then show that Yme1's role on MDCs is dependent on its catalytic activity (via an inactive mutant) and that YME1 shows genetic interactions with UPS1/2 and MIC10/MIC60. The overall model is that Yme1 activity responds to metabolic cues and acts via proteolysis of these two distinct mitochondrial machineries to regulate MDC biogenesis.

      Strengths:

      The strengths of the study are its integration of mitochondrial proteomics with strong genetic approaches, as well as synergy with the authors' previous studies on the role of lipids in MD genesis. The work is overall well carried-out and experiments are thoughtfully discussed.

      Weaknesses:

      The major weaknesses are a lack of mechanistic resolution surrounding the model, e.g., proposed or tested mechanisms by which Yme1 activity is regulated by metabolic cues, or how Ups1/2 activity and the MICOS contribute to MDC generation. The authors acknowledge these as open questions, but addressing them would still enhance the significance of the study.

    2. Reviewer #2 (Public review):

      In this manuscript, the authors report a novel regulation of the outer mitochondrial membrane remodeling domains called mitochondria-derived compartments, MDCs. The team has previously established the main principles behind this recently identified quality control pathway, but the mechanisms that control MDCs formation remain incompletely understood. Using the baker's yeast model, the authors identify the conserved mitochondrial protease Yme1 as a crucial factor that regulates MDC formation. Mechanistically, Yme1's proteolytic function controls the levels of Ups1 and Ups2 lipid transfer proteins and the components of the membrane organizing complex called MICOS, thus providing a plausible model as to how Yme1-dependent proteolysis permits MDC formation through the removal of lipid and MICOS-dependent constraints. Finally, the authors show that this Yme1-mediated activity is also defined by metabolic conditions. In principle, this study is interesting and novel, and holds potential to provide new insights into the regulation of the MDC pathway that emerged as a new fundamental mitochondrial quality control mechanism. However, the following points should be carefully addressed.

      Major points:

      (1) Yme1 has been previously shown to regulate mitochondria-specific autophagy through Atg32 processing. Given the high similarity of the MDC pathway to piecemeal autophagy and the fact that both pathways share some of the core components, the authors should address the involvement of Atg32 in their model. It would also be important to include a brief discussion addressing the differences between piecemeal autophagy and the MDC pathway.

      (2) The Rpt3 (P215L) expression experiment is interesting, but appears to be somewhat superficial due to the unclear mechanism by which the mitochondrial network morphology is restored in these cells. Could this result be replicated in the dnm1∆ mgm1∆ double deletion mutant, which is a well-established model for mitochondrial network restoration?

      (3) Figure 3E. The changes in PE levels appear to be minor. While statistically significant, the observed differences may not be physiologically relevant. More in-depth lipidomic analysis data should be presented to substantiate the authors' argument and better address the questions at hand. Related to that, could PE or PA supplementation stimulate MDC formation?

      (4) The connection between rapamycin treatment and Yme1-regulated MDC formation is unclear and puzzling and needs to be explained better.

      (5) The MICOS complex is clearly involved in the regulation of MDC, but the manuscript misses the mark on providing compelling evidence and a clear explanation as to how MICOS contributes to said regulation.

      Minor points:

      (1) The authors should discuss potential reasons for the dramatically different rates of MDC formation in the S288C and W303 background cells. Does this have anything to do with generally more robust mitochondrial functions in the latter cells?

      (2) Proper statistical analyses should be provided for all the graphs presented.

      (3) The authors should include Yme1 immunoblots to confirm the identity of strains being studied and validate the presence or overexpression of Yme1 and its catalytic mutant in their experiments.

    3. Reviewer #3 (Public review):

      Summary:

      Since describing MDCs over a decade ago, the lab of the corresponding author, Hughes, has been at the forefront of further characterizing these structures. Here, they follow up on recent work (PMID: 38497895), where a screen identified Yme1 as a potential regulator of MDCs. After confirming that Yme1-ko prevents MDCs that are usually induced via various established treatments (Rapamycin, cycloheximide, Concanavalin A), the authors confirmed that the proteolytic activity of Yme1 is required. Next, using proteomics, they identified how loss of Yme1 impacts the mitochondrial proteome with and without Rapamycin treatment to induce MDCs. From this result and based on insight from other published data implicating lipids, the focused initially on the lipid transfer protein Usp2, a known target of Yme1. Here, they showed that loss of Usp2 could partially rescue MDC formation in Yme1-ko cells. To look for other Yme1 targets that might also be involved in MDC formation, next, they investigated the MICOS complex, which was also notable in their proteomics data. They then showed that inhibiting MICOS also partially restored MDC formation in Yme1-ko cells. They then tested the combined effects of Usp2 and MDC inhibition on MDCs, which was limited by the fact that the combination of full MICOS disruption, Usp2-KO, and Yme1-KO was not viable. To circumvent this limitation, they investigated the knockout of individual MICOS subunits in combination with Usp2 and/or Yme1. Finally, they showed that growth conditions also mediate MDC formation in the context of Yme1 overexpression. In rich media, Yme1 overexpression induces MDCs on its own. However, this induction is lost upon amino acid starvation, suggesting that there are still other as-yet-unidentified factors regulating the formation of MDCs.

      Strengths:

      The authors use unbiased approaches and genetic models to begin unraveling a novel regulatory role of Yme1 in the formation of MDCs.

      Weaknesses:

      (1) The authors find both Ups1 and Ups2 in their screens, but only focus on Ups2 in this paper. It would be good to know why they did not also investigate Ups1, and its other protease Atp23, which could potentially act similarly to Yme1, or even rescue the loss of Yme1.

      (2) I'm not convinced that the data support the notion that Usp2 and MICOS have distinct effects on MDCs. In Figure S3C-D, there is no statistical analysis to indicate whether the small differences between the MICOS-ko and the double knockout are significant. If MICOS-ko and Ups2-ko were acting through different mechanisms, one would expect their combination to be additive; this does not appear to be the case, as both single deletions and the double deletion all cause similar levels of MDCs (~30-40%). Rather, this result is what you would expect if they were working through the same mechanism. There also does not appear to be an additive effect in Figure 4F-G, when using the mic60-ko rather than the complete MICOS-ko. In this regard, the authors note in their discussion that 'loss of MICOS may disrupt membrane associations or alter lipid distribution between mitochondrial subcompartments' (lines 390-392). The latter situation seems like it would be the same mechanism as Usp2 and would more accurately explain their findings.

      (3) The manuscript is missing key data confirming the re-expression or overexpression of Yme1 protein (Figure 1 E/G and Figure 5A). It is important to know the relative levels of expression of the re-expressed proteins to each other and to endogenous Yme1.

      (4) Some clarification of the details for metabolically restrictive conditions would be helpful.

      (5) Beyond just the presence/absence of MDCs, does more detailed quantification of their size/shape reveal any subtle differences between conditions?

    1. Reviewer #1 (Public review):

      Summary:

      Combining in vitro refolding, SEC-based assembly assays, peptide-library screening, MALDI-TOF, LC-MS/MS, structural analysis and immunopeptidomics, this manuscript investigates the peptide-binding principles of the promiscuous chicken MHC-I molecule BF2*21:01.

      Strengths:

      Although the peptide motif of BF2*21:01 is highly complex, this manuscript identified several principles, including a preference for 10-mer peptides, co-variation between P2 and Pc-2, effects of P3 and Pc-3, and a strong cellular preference for Leu at Pc. The results are important for avian MHC biology and poultry vaccine epitope prediction.

      Weaknesses:

      The manuscript is sometimes difficult to follow because the authors present a large amount of peptide-library, structural and immunopeptidomics data. without always clearly explaining how these datasets support the proposed simplifying principles.

      Major Issues - Points Requiring Clarification or Additional Support:

      (1)(Line 282-301, 537-545)<br /> The immunopeptidomics conclusions are mainly based on one B21 cell line with one biological replicate and at least two technical replicates. Given the complexity of the BF2*21:01 peptide repertoire, this is a major limitation. The authors should either provide additional biological replicates or clearly state this limitation in the Abstract, Results and Discussion.

      (2) (Lines 290-313)<br /> The B21 cell preparations contain both BF2 and the lowly expressed BF1 molecule. Some peptides, especially 8-mers or peptides with atypical motifs, may derive from BF1*21:01. The authors should clarify how BF2*21:01-bound peptides were distinguished from possible BF1-derived peptides, or interpret the immunopeptidomics motif more cautiously. The authors should also provide or cite evidence confirming the B21 haplotype identity of the cell line and chicken materials used for immunopeptidomics.

      (3) (Lines 217-221, 243-253)<br /> The authors acknowledge that MALDI-TOF cannot reliably distinguish peptide combinations with identical or similar masses, nor determine residue positions in some cases. Therefore, MALDI-TOF results should not be overinterpreted as precise evidence for residue preference. The authors should clearly indicate which conclusions are supported by LC-MS/MS.

      (4) (Lines 297-301, 316-330)<br /> The authors suggest that longer peptides may bulge in the middle or extend out of the groove at the C-terminal end. The rationale for the C-terminal extension is not clearly explained. Why is the C-terminal extension considered rather than the N-terminal extension? If the binding register is uncertain, long peptides should be analyzed separately from canonical-length peptides.

      (5) (Lines 406-439)<br /> In vitro assembly assays show that several hydrophobic residues can be tolerated at Pc, whereas immunopeptidomics shows a strong Leu preference at this position. The authors should clarify whether this Leu preference reflects intrinsic BF2*21:01 binding specificity, TAP-mediated peptide transport, antigen processing, peptide loading, or a cell-line-specific effect. Additional experimental support, such as TAP transport analysis, would strengthen this conclusion.

      (6) (Lines 172-178, 243-279, 442-457)<br /> The structural analysis explains some residue combinations, such as Arg at P2 with Glu at Pc-2 or Trp at Pc. However, the structural interpretation is not fully integrated with the large-scale peptide library and immunopeptidomics results. Representative high- and low-frequency combinations should be discussed structurally.

      (7) The inference of co-variation between P2 and Pc-2, as well as the modulatory effects of P3 and Pc-3, should be better explained. At present, some conclusions appear to be based mainly on residue-frequency patterns, and the logical connection between these observations and the proposed binding principles is not always clear. Statistical analyses, such as mutual information, chi-square tests or permutation tests, and representative structural explanations would strengthen this conclusion.

    2. Reviewer #2 (Public review):

      Summary:

      The study presents an in-depth analysis of the peptide repertoire bound by a promiscuous chicken MHC molecule using mass spectrometry, x-ray crystallography and modelling. While the MHC can bind a very diverse set of peptides, the authors have found some new rules that govern peptide binding to this MHC that could help to build a predictive model to study the repertoire of pathogen-derived peptides.

      Strengths:

      The study uses a range of well performed experiment across multiple techniques and provides an in-depth analysis of the peptide repertoire, including peptide sequences, length, preferred residues, stability and MHC presentation.

      Weaknesses:

      The data overall support the analysis and conclusion well. The only caveat is linked to Figure 4, which does not describe the stability of the peptide-MHC complex, but instead shows refold yield, and the two are not always linked.

    1. Reviewer #1 (Public review):

      Summary:

      The "multiple-demand" (MD) system is a well-known finding of human brain imaging and is thought to play a central role in cognitive control. To directly compare the MD system in humans and monkeys, Mione et al. used functional magnetic resonance imaging to measure whole-brain activation in a multi-step saccadic maze task. In humans, the authors found a distributed pattern of brain activity close match to the canonical MD network and extends to adjacent regions of dorsal attention and other networks. While there was good correspondence between monkey and human data, differences were also notable in the lateral frontal cortex, the dorsal parietal cortex, and the sensorimotor cortex.

      Strengths:

      Though previous data hint at a corresponding network in the macaque, there has been no direct comparison to human data. This study provides a direct cross-species comparison with whole-brain data from fMRI, and the findings suggest an extended and strongly interconnected brain network recruited by increased cognitive challenge.

      Weaknesses:

      In previous human imaging, the MD system is defined by overlapping activation for many kinds of cognitive demands. In the present work, however, the authors used just a single task. Although there is some overlap between the putative monkey MD network and the canonical MD network identified in human imaging, there should be caution in linking current findings to the MD system based on limited task events.

    2. Reviewer #2 (Public review):

      Summary:

      Mione et al. aim to resolve a long-standing question in comparative neuroscience: whether the macaque brain contains a functional analogue to the distributed human multiple-demand (MD) network. To address this, the authors employ a direct cross-species fMRI comparison using a multi-step saccadic maze task in humans and a simplified two-step version in macaques. By contrasting goal-directed navigation against a control condition that requires similar motor responses but no strategic planning, the study isolates the neural signatures of cognitive control across species.

      Strengths:

      The most compelling aspect of this work is its methodological alignment. Previous attempts to compare these systems often relied on comparisons of human BOLD signals and macaque single-unit recordings. By running parallel fMRI protocols, the authors establish a shared measurement basis that allows for a more direct comparison. The resulting activation maps clearly demonstrate conserved network topology across dorsomedial frontal, lateral, and medial parietal, and insula cortices. Combining these results with recent research on functional and structural connectivity further supports the idea that these networks evolved across species and provides a helpful starting point for future comparative studies. The findings will be highly useful for researchers investigating the evolutionary origins of domain-general cognitive control, as well as for neuroimaging methodologists developing cross-species alignment pipelines.

      Weaknesses:

      However, there are several differences in how the two groups were studied that make it harder to compare the results precisely. The human task mixed 2-, 4-, and 6-step trials within the same experimental blocks, whereas macaques performed only 2-step trials. This design difference likely places human participants in a state of sustained proactive cognitive control (Braver, 2012), as they must remain prepared for highly demanding trials at any moment. This elevated baseline arousal may artificially inflate MD network activation during the simpler 2-step trials in humans, making direct magnitude comparisons with the macaque data difficult. Additionally, the general linear model combined correct and error trials into a single regressor. Given that macaques exhibited substantially higher error rates, this approach risks diluting task-specific planning signals with activity related to error monitoring and reward prediction errors. The preprocessing pipeline also applied a 4 mm full-width half-maximum smoothing kernel to macaque data acquired at 1.5 mm resolution. Relative to the smaller size of the macaque brain, this kernel is quite large and likely blurs fine-grained topographical distinctions. This may partly explain why the macaque lateral frontal cortex shows a single dorsal activation patch rather than multiple discrete patches seen in humans. Furthermore, there is concerning inter-individual variability in the macaque data. Normally, a functional network like the MD system is identified by consistent activation across all individuals. In this study, however, the two monkeys show substantially different activation maps and behavioral patterns. This lack of consistency renders the group-level results questionable, as it is unclear whether the group-level map represents a unified biological system or merely an average of disparate individual maps. Finally, the subcortical activations shown in Figure 7 require more precise anatomical localization to confidently distinguish cerebellar nodes from adjacent brainstem structures.

      The authors demonstrate a broad functional correspondence between human and macaque cognitive control networks, moving the field beyond speculative homology. The data suggest that an extended, interconnected network is recruited by cognitive challenge in both species; however, the strength of this claim is limited by the inter-individual variability and methodological constraints noted above. Assertions of precise topological equivalence should therefore be tempered. The absence of ventrolateral prefrontal and strong dorsal parietal activations in the macaque group analysis may reflect genuine biological differences, but could also stem from limited statistical power, excessive smoothing, or task-design asymmetries. While the overall conclusions are plausible, they would be significantly strengthened by a more explicit discussion of these limitations and additional analytical clarifications regarding individual-level consistency.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript entitled "Essential function reflected in the phylodynamics of a multigene family - the pir genes of malaria parasites" by Jackson and colleagues investigates the global phylogeny of pir genes across 14 Plasmodium species and one Hepatocystis species. The authors also focus on the functional characterization of the conserved ortholog pirC1 and claim that pirC1 is not the founder of the family and that it plays an essential role in blood-stage growth.

      Strengths:

      Overall, the manuscript is well written and interesting, as it combines comparative genomics and evolutionary analysis with functional experiments. The phylogenetic analysis is rigorous and represents a major strength of the manuscript.

      Weaknesses:

      The general conclusions regarding the potential function of this gene family are not fully supported by the data presented. The manuscript moves too quickly from growth phenotype and localization studies to a specific mechanistic model. The discussion argues that PIRC1 may be involved in nutrient acquisition, host sensing, or metabolic support, but the data provided do not directly support these functions, and the manuscript in its present form remains speculative. Although the manuscript includes some experimental results, it lacks direct mechanistic validation of the specific functions of the pir genes, including pirC1. In its current form, the study does not yet establish a definitive role for pirC1 in metabolic processes.

    2. Reviewer #2 (Public review):

      Summary:

      This is an extensive study using phylogenetic comparison across multiple plasmodium species to gain new insights in relation to their evolutionary pathways and the potential function of pir. In addition to establishing a framework to identify related orthologues across species as well as expanding paralogues families within a species, the work also focuses on understanding loss and gain of different PIRs and how this indicates a relative lack of functional constraints and essentiality for most members of the gene family.

      The authors provide evidence that at least pirC has a conserved function and plays an important role in parasite growth in multiple species.

      While this study represents a significant effort and does provide interesting new insights that would help our understanding of this complex gene family in the future, it has a number of limitations.

      Strengths:

      Extensive and thorough phylogenetic analysis that is supported by some biological validation. Provides an indication that the PIR gene family has limited biological constraints and evolved independently across different species, leading to rapid expansion and deletion of orthologous groups. Identified pirC as a functional and important member of the family that is conserved across the species.

      Weaknesses:

      The phylogenetic tree is based on a truncated sequence that focuses on the more conserved parts of the pir sequence. This could potentially lead to missing the key functional drivers of evolution. The biological validation of the role of pirC has some inconsistencies that need to be addressed.

    3. Reviewer #3 (Public review):

      This paper aims to classify, from an evolutionary perspective, the multigene family PIR found in malaria parasites infecting rodents and Old World monkeys, and to link this classification to functional diversification. The authors also hypothesize that PIR members conserved across species play important roles in parasite survival, and seek to clarify their functions.

      To achieve these aims, the authors comprehensively analyze the evolution of PIR genes using genomic and transcriptomic information from many malaria parasite species. They focus on PIRC1, a member conserved across species, and attempt to clarify its function in rodent and simian malaria parasites by examining the phenotypes of parasites in which the corresponding genetic locus has been disrupted. They also attempt to determine its localization using PIRC1 tagged with an epitope sequence. However, although the locus-disrupted parasites appear to show an approximately 50% reduction in growth rate, this effect seems to be overestimated. Another weakness is that the cause of the reduced growth rate has not been clarified. The localization analysis also remains insufficiently conclusive.

      Therefore, I consider that the first half of the paper, consisting of the bioinformatics analyses, achieves the objective of comprehensively summarizing PIR and may become a reference paper for discussing the evolution and function of the PIR gene family. On the other hand, regarding the function of PIRC1, no clear conclusion can be drawn from the results presented, and several additional experiments are necessary.

      My major comments are as follows.

      (1) The claim that the failure of eight disruption attempts indicates that pirC1 is essential is too strong.

      Lines 319-321: The authors argue that a total of eight failed attempts to disrupt the pirC1 locus using two different construct designs suggest that pirC1 is essential in P. berghei. However, the failure of these attempts could also reflect technical issues with the construct design itself, such as the length of the homologous regions used for recombination, which are approximately 650 bp. Therefore, it is an overstatement to conclude that "pirC1 is essential for P. berghei blood-stage growth." Given that parasites with disruption of the corresponding locus could be obtained in both P. chabaudi and P. knowlesi, a more appropriate statement would be that "pirC1 is important for P. berghei blood-stage growth."

      (2) The data on the mCherry-expressing P. berghei line shown in Supplementary Figure 11 are insufficient.

      (a) Panel C: Southern blot analysis<br /> To conclusively identify the lower band in panel C as chromosome 1, additional probes specific to genes located on chromosomes 1 and 2 would be required. In addition, a parental parasite control should also be included. The Southern blot image of the parental parasite should show only a single band at the higher position, with no band at the lower position. Probes specific to chromosomes 1 and 2 would help demonstrate that the lower band corresponds to chromosome 1, rather than chromosome 2.

      To this end, the authors could describe the result as follows:<br /> "In the parental parasite, only a single band corresponding to chromosome 7 was detected, indicating that the smaller chromosome was genetically modified. The size of the lower band detected with the dhfr probe was identical to that of the band detected with the control chromosome 1 probe, but distinct from that detected with the chromosome 2 probe, indicating that chromosome 1 was modified."

      That said, this chromosome-level Southern blot analysis is not sufficient to demonstrate that the target PBANKA_0100500 locus was specifically modified. The authors should provide more direct evidence showing that the PBANKA_0100500 locus, rather than another genomic locus, was modified. For example, Southern blot analysis after restriction enzyme digestion would provide more definitive evidence. Diagnostic PCR may also provide more specific evidence.

      (b) Panel D: Flow cytometry analysis

      To allow a more accurate interpretation of the percentage of mCherry-positive cells, flow cytometry data for the parental parasite line should also be presented.

      (3) There are unclear points in the PCR results shown in Supplementary Figure 12.

      Supplementary Figure 12: In panel B, a PCR product should also be amplified from dPCHAS_0101200 using the P1-P3 primer pair. Why is this band absent? The authors should provide the uncropped electrophoresis image so that the larger band can be seen. In addition, if labels 1 and 2 indicate independent clones, this should be stated in the figure legend.

      (4) The growth rates of P. chabaudi and P. knowlesi parasites with disruption of the PIRC1 gene locus should be quantitatively analyzed.

      The growth rates of P. chabaudi and P. knowlesi are described only qualitatively, but they should be evaluated quantitatively. In Figure 4A, the parasitemia of wild-type P. chabaudi increases from approximately 6.1% on day 6 to approximately 15.6% on day 8, corresponding to a 3.8-fold increase. However, because parasite growth may already be affected by immune-mediated suppression at this stage, this value should be regarded as a minimum estimate. In contrast, the mutant increases from approximately 3.2% on day 8 to approximately 6.8% on day 10, corresponding to a 2.1-fold increase. Based on these values, the daily growth rate of the mutant appears to be reduced to at least approximately 56% of that of the wild type. Similarly, from the growth curve of P. knowlesi in Fig. 5A, the DMSO-treated group appears to increase approximately two-fold per day, whereas the rapamycin-treated group increases only approximately one-fold per day. Thus, P. knowlesi also appears to show an approximately 50% reduction in growth rate. Taken together, both P. chabaudi and P. knowlesi appear to reproducibly show an approximately 50% reduction in growth capacity. A reduction of this magnitude is difficult to describe as a "severe growth defect"; a more appropriate wording would be simply that the parasites "showed a growth defect." In addition, the terms "a severe growth defect" and "essential" appear to be overstated throughout the manuscript, and the wording should be toned down. Finally, I recommend presenting Figure 4A and Figure 5A on a logarithmic scale so that the trend in growth rates can be more intuitively appreciated from the graphs.

      (5) The evidence that disruption of the PIRC1 gene locus in P. knowlesi does not affect erythrocyte invasion is weak.

      The authors describe that "the developmental cycle of the parasites lacking PIRCl is slightly longer than that of parasites that produce PIRCl (line 383-384)," and appear to support this interpretation with data showing that "mutant parasites are significantly smaller than wild-type parasites (line 414)" and that "the DNA content in ML10-arrested parasites lacking PIRCl is lower than that of DMSO-treated parasites (line 417-418)" at 24 hours after invasion. However, a slightly longer developmental cycle alone does not seem sufficient to explain a 50% growth reduction.

      I think the erythrocyte invasion capacity has not been quantitatively evaluated, and therefore, the evidence supporting the conclusion that the phenotype of P. knowlesi parasites with disruption of the PIRC1 gene locus is unrelated to erythrocyte invasion is weak. The authors should assess invasion efficiency using purified merozoites. For P. chabaudi, it should also be possible to apply an in vitro or in vivo erythrocyte invasion assay similar to that used for other rodent malaria parasites, and this should be evaluated as well.

      (6) The authors should examine whether disruption of the PIRC1 gene locus results in a phenotype characterized by a reduced number of merozoites.

      Alternatively, the reduced DNA content in ML10-arrested parasites lacking PIRC1 (lines 416-417) could suggest that the number of merozoites formed per schizont may be reduced. To clarify this point, the authors should assess whether the number of merozoites per schizont is altered in P. knowlesi (and P. chabaudi parasites lacking PIRC1).

      (7) The authors propose the possibility that PIRC1 expressed in merozoites is released after invasion; however, the evidence that PIRC1 localizes to intracellular organelles is weak.

      Line 333: "a peripheral pattern around the parasite" is indicative of parasite plasma membrane, PV, or PVM. ", indicative of a parasitophorous vacuole (PV) or parasitophorous vacuole membrane (PVM) location" should be amended to ", indicative of parasite plasma membrane, a parasitophorous vacuole (PV) or parasitophorous vacuole membrane (PVM) location". In the Figure S14 image, red signals are uniformly detected from the merozoites formed in the schizont stage parasite (not really microorganelle patterns), but not from the PVM surrounding the schizont, suggesting parasite plasma membrane localization, not PVM. I agree that the signal is detected from the compartments extending into the iRBC cytosol, which may be difficult to explain if it is located on the parasite plasma membrane, but how frequently were such images seen?

      Figure 4D. In the images of liver-stage schizonts, AMA1 does not appear to localize to the micronemes in mature merozoites, suggesting this image is an immature schizont. Although PIRC1 appears to be expressed in liver-stage schizonts, it is difficult to clearly determine whether it localizes to intracellular organelles or to the parasite plasma membrane.

      To clarify the above points, the authors should examine whether PIRC1 is detected in intracellular organelles or around the merozoites by analyzing its localization in purified merozoites.

    1. Reviewer #1 (Public review):

      This is an interesting and valuable paper by Gil-Lievana, Arroyo et al. that presents an open-source method (the "Crunchometer") for quantifying biting and chewing behavior in mice using audio detection. The work addresses an important and unmet need in the field: quantitative measures of feeding behavior with solid foods, since most prior approaches have been limited to liquids. The authors make a clear and compelling case for why this problem is important, and I fully agree with their motivation.

      The system is carefully validated against human-scored video data and is shown to be at least as accurate, and in some cases more accurate, than human observers. This is a major strength of the study. I also particularly appreciate the demonstration of the technology in the context of LHA circuitry, which nicely illustrates its utility and importance for mechanistic studies of feeding. I also appreciate the ability to readily time lock neural data to individual crunches. Overall, the manuscript is well executed and represents a useful contribution to the field.

      Comments on revised version.

      The revised manuscript has addressed my minor initial concerns. I appreciate that the sample size was increased for the recording experiments.

    2. Reviewer #2 (Public review):

      Summary:

      The authors set out to develop and validate the Crunchometer, a low-cost, open-source acoustic system designed to overcome the limitations of existing methods for studying feeding behavior in rodents. Their goal was to provide a tool that could precisely capture the microstructure of solid food intake, something often overlooked in favor of liquid-based assays, while being affordable, scalable, and compatible with neural recording techniques. By doing so, they aimed to enable detailed analysis of how physiological states, drugs, and specific neural circuits shape naturalistic feeding behaviors.

      Strengths:

      (1) Introduces a low-cost, open-source acoustic tool for measuring solid food intake, filling a critical gap left by expensive and proprietary systems.

      (2) Makes the method easily adoptable across labs with detailed setup instructions and shared benchmark datasets.

      (3) Provides high temporal precision for detecting bite events compared to human observers.

      (4) Successfully distinguishes feeding microstructure (bites, bouts, IBIs, gnawing vs. consumption) with greater objectivity than manual annotation.

      (5) Demonstrates compatibility with electrophysiology and calcium imaging, enabling fine-scale alignment of neural activity with feeding behavior.

      (6) Effectively discriminates between fed vs. fasted states, validating physiological sensitivity.

      (7) Captures pharmacological effects of semaglutide, although this is really just reduced feeding and associated readouts (bouts, latency, etc.)

      (8) Has potential to distinguish consummatory vs. non-consummatory behaviors (e.g., food spillage, gnawing), however the current SVM model struggles to separate biting from gnawing due to similar acoustic profiles and manual validation is still required.

      (9) Provides potential for closed-loop experiments

      Weaknesses:

      (1) Some neuroscience findings (calcium imaging of GABAergic vs. glutamatergic neurons) are based on small pilot samples (n=2 mice per condition), limiting generalizability.

      (2) Chemogenetic and pharmacological experiments used small cohorts, raising statistical power concerns.

      (3) Correlation with actual food intake is modest and sometimes less accurate than human observers

      (4) Sensitive to hoarding behavior, which can reduce detection accuracy and requires manual correction for misclassifications (e.g., tail movements, non-food noises). However, these limitations are discussed and not ignored.

      Comments on revised version.

      The authors have addressed all my comments and have put forth a creative, accurate approach to assessing food intake in rodents.

    1. Reviewer #2 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers.]

      Summary:

      The premise of the manuscript by Matteucci et al. is interesting and elaborates a mechanism via which TNFa regulates monocyte activation and metabolism to promote murine survival during Plasmodium infection. The authors show that TNF signaling (via an unknown mechanism) induces nitrite synthesis, which (via yet an unknown mechanism), and stabilizes the transcription factor HIF1a. Furthermore, that HIF1a (via an unknown mechanism) increases GLUT1 expression and increases glycolysis in monocytes. The authors demonstrate that this metabolic rewiring towards increased glycolysis in a subset of monocytes is necessary for monocyte activation including cytokine secretion, and parasite control.

      Strengths:

      The authors provide elegant in vivo experiments to characterize metabolic consequences of Plasmodium infection, and isolate cell populations whose metabolic state is regulated downstream of TNFa. Furthermore, the authors tie together several interesting observations to propose an interesting model.

      Weaknesses:

      The authors show that TNFa induces GLUT1 in monocytes, but do not show a direct role for GLUT1 or glucose uptake in monocytes in host resistance to infection.

    1. Reviewer #1 (Public review):

      The authors have presented a revised version of their investigation into the Membrane Associated Periodic Skeleton (MPS) in iPSC derived human motor neurons. As mentioned in the earlier report, the main observations reported in this article-occurrence of patch and gap arrangement of MPS-is very interesting. The real puzzle is whether, and if so how, this structure coarsens over time to produce continuous MPS.

      Following suggestions from reviewers, the authors attempted live cell imaging, but the results were not consistent enough and the authors point out difficulties in obtaining sufficient numbers and possible artefacts of over-expression. This investigation would have been much stronger with live cell imaging data on the dynamics of patch and gap structures.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, Gazal et al., describe the presence of unique gaps and patches of BetaII-spectrin in medial sections of long human motor neuron axons. BII-spectrin, along with Alpha-spectrin forms horizontal linkers between 180nm spaced F-actin rings in axons. These F-actin rings along with the spectrin linkers form membrane periodic structures (MPS) which are critical for maintenance of the integrity, size and function of axons. The primary goal of the authors was to address if long motor axons, particularly those carrying familial mutations associated with the neurodegenerative disorder ALS, show defects in gaps and patches of BetaII-spectrin ultimately leading to degradation of these neurons.

      Strengths:

      The experiments are well designed and the authors have used the right methods and cutting-edge techniques to address the questions in this manuscript. The use of human motor neurons and the use of motor neurons with different familial ALS mutations is a strength. The use of isogenic controls is a positive. The induction of gaps and patches by the kinase inhibitor staurosporine and their rescue by Latrunculin A is novel and well executed. The use of biochemical assays to explore the role of calpains is appropriate and well designed. The use of STED imaging to define the periodicity of MPS in the gaps and patches of spectrin is a strength.

      Weaknesses:

      Primary weakness is the lack of rigorous evaluation to validate the proposed model of spectrin capture from the gaps into adjacent patches by the use of photobleaching and live-imaging. Another point is the lack of investigation into how gaps and patches change in axons carrying the familial ALS mutations as they age, since 2 weeks is not a timepoint when neurodegeneration is expected to start.

      Comment on revised version.

      The authors have given a point-by-point response to all the reviewer's concerns. They have also addressed concerns which I raised adequately. I have no further concerns.

    3. Reviewer #3 (Public review):

      Summary:

      Gazal et al present convincing evidence supporting a new model of MPS formation where a gap-and-patch MPS pattern coalesces laterally to give rise to a lattice covering the entire axon shaft.

      Strengths:

      (1) This is a very interesting study that supports a change in paradigm in the model of MPS lattice formation.

      (2) Knowledge on MPS organization is mainly derived from studies using rat hippocampal neurons. In the current manuscript, Gazal et al use human IPS-derived motor neurons, a highly relevant neuron type to further the current knowledge on MPS biology.

      (3) The quality of the images provided, specifically of those involving super-resolution is of high standards, supporting adequately the conclusions of the authors.

      Weaknesses:

      (1) The main concern raised by the manuscript is the assumption that staudosporine-induced gap and patch formation recapitulates the physiological assembly of gaps and patches of betaII-spectrin.

      (2) One technical challenge that limits a more compelling support of the new model of MPS formation, is that fixed neurons are imaged, which precludes the observation of patch coalescence.

    1. Reviewer #1 (Public review):

      The manuscript by Butler et al. explores a novel physiological role for connexin 32 (Cx32) hemichannels in Schwann cells of peripheral nerves. Building on the authors' prior work on CO<sub>2</sub>-sensitive gating of connexin hemichannels, this study proposes that axonal activity-dependent mitochondrial CO<sub>2</sub> production promotes the opening of Cx32 hemichannels in adjacent Schwann cells, a process regulated by carbonic anhydrase (CA) activity and AQP1. This work reveals a new form of intercellular communication that may contribute to the regulation of conduction velocity.

      The authors aimed to determine whether CO<sub>2</sub> acts as an activity-dependent signal in peripheral nerves through activation of Cx32 hemichannels in myelinating Schwann cells. The study is strengthened by the use of complementary techniques, including in silico approaches, pharmacological manipulation, dye uptake assays, calcium imaging, adenoviral delivery of dominant-negative Cx32 constructs targeted to Schwann cells, and extracellular recordings in isolated sciatic nerves. Together, these methods allow the authors to connect molecular mechanisms with tissue-level function.

      The study has a few technical limitations, and some aspects of the interpretation require caution. Limitations in antibody specificity complicate interpretation of the precise distribution of the signaling pathway components studied here. Dye uptake into the outer myelin layer is consistent with hemichannel opening, but it does not by itself prove that Cx32 directly mediates the observed permeability changes. Similarly, Ca<sup>2+</sup> signals associated with Cx32 activation could reflect direct Ca<sup>2+</sup> permeability through Cx32 or secondary activation of other Ca<sup>2+</sup> entry or release pathways. Finally, hemichannel opening is assessed primarily using FITC uptake, which may not fully capture the complexity of Cx32 gating or distinguish between different conductive states.

      Overall, the authors provide substantial evidence that activity-dependent CO<sub>2</sub> production can influence Schwann cells through a pathway involving CA, AQP1, and Cx32. The results support the broad conclusions of the study, although some direct mechanistic links require further validation. The work is likely to have an important impact because it proposes a novel role for CO<sub>2</sub> as a local signaling molecule in peripheral nerves and may provide new insight into how Schwann cells detect axonal activity and regulate peripheral nerve physiology.

      Comments on revised version.

      The authors have addressed all of my concerns. The manuscript is now much improved and reads very well. Congrats to all the research team.

    1. Reviewer #1 (Public review):

      In the manuscript entitled "Flexible and high-throughput simultaneous profiling of gene expression and chromatin accessibility in single cells," Soltys and colleagues present easySHARE-seq, a method described as an improvement upon SHARE-seq for the simultaneous measurement of RNA transcripts and chromatin accessibility.

      The authors demonstrate the utility of easySHARE-seq by profiling approximately 20,000 nuclei from the murine liver, successfully annotating cell types and linking cis-regulatory elements to target genes. The authors claim that easySHARE-seq supports longer read lengths potentially enabling better variant discovery or allele-specific signal assessment, though they do not provide direct evidence to support these specific claims.

      A key strength of the protocol is enhanced sequencing efficiency, achieved by shortening the Index 1 read from 99 to 17 nucleotides. This reduction does not come at a significant cost to barcode diversity, retaining approximately 3.5 million combinations. Additionally, the approach allows for the sequencing of a sub-library to assess quality prior to final barcoding and sequencing which seems quite clever.

      While the increase in RNA transcript recovery is substantial, it appears to come at a cost: there is a notable decrease in ATAC fragments per cell compared to the original SHARE-seq (and other platforms). Likely as a result, the dimensionality reduction (UMAP) shows good resolution for RNA profiles but relatively poor resolution for accessibility profiles. Furthermore, the presented data suggests potential ambient RNA contamination; specifically, the detection of Albumin in HSCs and B cells is likely an artifact of the protocol rather than a biological signal.

      Overall, the study is well-presented and represents a promising advance. However, there are significant shortcomings that should be addressed, particularly regarding "leaky" transcript recovery and reduced ATAC performance.

    2. Reviewer #2 (Public review):

      Aims:

      The authors sought to optimize SHARE-seq, a multimodal single-cell method, to improve the simultaneous profiling of gene expression and chromatin accessibility. Their goal was to enhance barcode design for better sequencing efficiency and cost savings, while improving overall data quality. They then applied their optimized method, easySHARE-seq, to study liver sinusoidal endothelial cells (LSECs) to demonstrate its utility in examining gene regulation and spatial zonation.

      Strengths:

      The improved barcode design is an advance, increasing the proportion of sequencing reads dedicated to biological information rather than barcode identification. This modification offers practical benefits in terms of sequencing costs and read length, potentially reducing alignment errors. The method also demonstrates improved RNA detection compared to the original SHARE-seq protocol. The biological applications showcase how simultaneous measurement of both modalities enables analyses that would be practically impossible with single-modality approaches, particularly in examining how chromatin states change along developmental or spatial trajectories.

      Weaknesses:

      There is a notable reduction in chromatin accessibility detection compared to the original SHARE-seq method, likely limiting the use of the method in certain situations.

      Overall:

      The authors achieve their aim of creating an optimized protocol with improved barcode design and enhanced RNA detection. The method represents a useful advance for specific experimental contexts where the trade-offs are appropriate.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript by Yang et al. investigates the relationship between multi-unit activity in the locus coeruleus, putatively noradrenergic locus coeruleus, hippocampus (HP) sharp-wave ripples (SWR) and spindles using multi-site electrophysiology in freely behaving male rats. The study focuses on SWR during quiet wake and non-REM sleep, and their relation to cortical states (identified using EEG recordings in frontal areas) and LC units.

      The manuscript highlights differential modulation of LC units as a function of HP-cortical communication during wake and sleep. They establish that ripples and LC units are inversely correlated to levels of arousal: wake, i.e. higher arousal correlates with higher LC unit activity and lower ripple rates. The authors show that LC neuron activity is strongly inhibited just before SWR detected during wake. During non-REM sleep, they distinguish "isolated" ripples from SWR coupled to spindles and show that inhibition of LC neuron activity is absent before spindle-coupled ripples but not before isolated ripples, suggesting a mechanism where noradrenaline (NA) tone is modulated by HP-cortical coupling. This result has interesting implications for the roles of noradrenaline in the modulation of sleep-dependent memory consolidation, as ripple-spindle coupling is a mechanism favoring consolidation. The authors further show that NA neuronal activity is downregulated before spindles.

      Strengths:

      In continuity with previous work from the laboratory, this work expands our understanding of the activity of neuromodulatory systems in relation to vigilance states and brain oscillations, an area of research that is timely and impactful. The manuscript presents strong results suggesting that NA tone varies differentially depending on coupling of HP SWR with cortical spindles. The authors place their findings back in the context of identified roles of HP ripples and coupling to cortical oscillations for memory formation in a very interesting discussion. The distinction of LC neuron activity between awake, ripple-spindle coupled events and isolated ripples is an exciting result and its relation to arousal and memory opens fascinating lines of research.

      Weaknesses:

      I regretted that the paper fell short of trying to push this line of idea a bit further, for example by contrasting in the same rats the LC unit-HP ripple coupling during exploration of a highly familiar context (as seemingly was the case in their study) versus a novel context, which would increase arousal and trigger memory-related mechanisms. Any kind of manipulation of arousal levels and investigation of the impact on awake vs nonREM sleep LC-HP ripple coordination would considerably strengthen the scope of the study.

      Comments on revised version.

      The authors have added methodological details to the results section after the first round of reviews, improving the manuscript readability. Some points might still be improved, for example, the authors use a delta/gamma ratio to track cortical states for example, but there is no methods section corresponding to this metric. Authors write that higher SI corresponds to a lower arousal state that is associated with "more synchronized cortical population activity, higher ripple rate and reduced LC neurons firing" but there are no references or analysis to support this statement, only examples showing changes in SI over a few minutes.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, authors studied the synchrony between ripple events in Hippocampus, cortical spindles and Locus Coeruleus spiking. The results in this study together with the established literature on the relationship of hippocampal ripples with widespread thalamic and cortical waves, guided authors to propose a role for Locus Coeruleus spiking patterns in memory consolidation. The findings provided here, i.e. correlations between LC spiking activity and Hippocampal ripples, could provide basis for future studies probing the directional flow or the necessity of these correlations in the memory consolidation process. Hence, the paper provides enough scientific advance to highlight the elusive yet important role of Norepinephrine circuitry in the memory processes.

      Strengths:

      Authors were able to demonstrate correlations of Locus Coeruleus spikes with hippocampal ripples as well as with cortical spindles. Specific strength of the paper is in the demonstration that the spindles that activate with the ripples are comparatively different in their correlations with Locus Coeruleus than those which do not.

      Weaknesses:

      The claims regarding the roles of these specific interactions were mostly derived from the literature that these processes individually contribute to the memory process, without any evidence of these specific interactions being necessary for memory processes. There are also issues with the description of methods, validation of shuffling procedures and unclear presentation and the interpretation of the findings, which are described in points that follow. I believe addressing these weaknesses might improve and add to the strength of the findings.

      Comments on revised version.

      The authors addressed all of my major concerns during the revision. As a result, the study now provides convincing evidence as well as improved presentation of results, that makes this manuscript important to the broader field of neuroscience, beyond the specific sub-field.

    3. Reviewer #3 (Public review):

      This manuscript examines how locus coeruleus (LC) activity relates to hippocampal ripple events across behavioral states in freely moving rats. Using multi-site electrophysiological recordings, the authors report that LC activity is suppressed prior to ripple events, with the magnitude of suppression depending on ripple subtype. Suppression is stronger during wakefulness than during NREM sleep and least pronounced for ripples coupled to spindles.

      The study is technically sound and addresses a timely and important question regarding how LC activity interacts with hippocampal and thalamocortical network events across vigilance states. While the findings are interesting, they remain observational in nature. Following revision, the manuscript has substantially improved in both presentation and interpretation of the results, and most concerns have been addressed satisfactorily. I therefore only have a few minor considerations that the authors may wish to explore further in the current study or in future work, as these directions could provide additional mechanistic insight and would likely be of considerable interest to the field.

      The authors demonstrate clearly that tonic LC firing rates preceding ripples differ significantly between wake-associated ripples (highest LC firing), isolated ripples during NREM sleep (lower LC firing), and spindle-coupled ripples (lowest LC firing). They also appropriately note that baseline firing differences will naturally influence the magnitude of LC suppression, which they also observe (highest LC reduction for wake ripples, then isolated ripples and last spindle-coupled ripples). However, this aspect could be explored further, as it may provide additional insight into the regulation of spindle-associated ripple events. Since LC activity appears to decline gradually prior to ripple occurrence (Suppl. Figure 2), it would be interesting to test whether this gradual reduction helps organize the emergence of isolated versus spindle-coupled ripples. For example, isolated ripples may occur during the initial phase of LC decline, whereas spindle-coupled ripples may preferentially emerge when LC activity reaches its lowest levels. Such a relationship could also be consistent with the stronger synchronization observed for spindle-ripple coupling.

      Related to this point, it would also be informative to examine whether isolated spindles occur more randomly in time, whereas spindle-associated ripple events appear more temporally clustered. If a single isolated spindle occurs, the associated LC suppression might be more pronounced. In contrast, when multiple spindle-associated ripple events occur in succession, LC activity may already be reduced following the first event, resulting in smaller additional suppression preceding subsequent events. Exploring this possibility could help clarify how LC dynamics shape the temporal emergence of ripple-subtypes

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      This paper aims to improve the accuracy of predictions of the impact of ITN strategies by developing a method to estimate duration of ITN access and use over time on a subnational scale from cross-sectional survey data and the numbers ITNs received annually. The subnational estimates are then input into a mathematical model to predict clinical cases under different ITN distribution strategies.

      Strengths:

      The approach is novel and addresses a useful and timely topic. It makes use of available routine data, and has considered all of the relevant components of ITN distributions.

      The authors have made revisions, particularly to the methods, appendices and title - leaving the paper easier to follow, and with a clear, consistent aim. The assumptions are clearly stated.

    2. Reviewer #2 (Public review):

      Summary:

      The authors design a custom Bayesian model to estimate the probabilities of access, use and use given access of insecticide-treated nets in six African countries, providing sub-national estimates and inferring the average duration of ITN use and access. An individual-based model was employed to simulate malaria epidemics and estimate the effectiveness of different ITN distribution strategies. The study finds that the mean probability of use or access did not reach 80% (a universal coverage formerly targeted by WHO) for any of the regions even for biennial campaigns, demonstrates that switching from triennial to biennial distribution campaigns increases population use by 7.9%, and evaluates the impact of employing more efficient ITNs on P. falciparum prevalence.

      Strengths:

      The authors developed a data-driven model that accounts for data collection imperfections and sources of uncertainty while differentiating between ITN use and access. They developed a methodology to infer the timing of mass campaign from publicly available data instead of assuming fixed dates. The probability of use given access allows determining the regions where ITN distribution is least effective. This work can help better inform future interventions by identifying regions where increasing mass campaign frequency or employing better ITNs are most effective. Finally, in addition to insights on ITN access and use for the six countries analyzed, the paper contributes with a methodological framework that can likely be extended to other countries.

    1. Reviewer #1 (Public review):

      This work addresses a question of practical importance that had never been systematically analysed in the cryo-ET field: when collecting tilt-series data, what is the optimal angular step size between successive tilt images? Due to the upper limit in electron exposure (100 - 150 e⁻/Ų), this question is important, since finer angular sampling improves attainable reconstruction resolution (Crowther criterion) but reduces the signal-to-noise ratio of each individual image, potentially compromising both image quality and the ability to computationally align successive frames. To address this, the authors designed a thorough benchmarking study comparing five tilt increments (1{degree sign}, 2{degree sign}, 3{degree sign}, 5{degree sign}, and 10{degree sign}) while keeping the total dose and tilt range constant. They evaluated the consequences at every stage of the cryo-ET workflow - from raw image quality and tilt-series alignment, through template matching for ribosome detection, to high-resolution subtomogram averaging - with the goal of providing the community with an evidence-based recommendation for data acquisition.

      The manuscript is well written, and the experimental design is carefully thought out. The work provides valuable practical insights into cryo-ET data acquisition by demonstrating that balancing two competing demands - sufficient dose per individual tilt image and fine angular sampling - is essential to achieve high-quality tomographic reconstructions. The identification of a practical optimum at 3{degree sign} tilt increment is the key contribution of the work. It will be interesting to see in the future whether this optimum shifts for smaller molecular targets, and how emerging tilt interpolation strategies such as cryoTIGER may interact with the choice of experimental angular increment.

      The conclusions of this paper are mostly well supported by data, but some aspects of data analysis need to be clarified and/or extended, including:

      (1) Line 109: The authors state that the tilt range was kept at {plus minus}60{degree sign} relative to the lamella plane. Assuming a typical lamella pre-tilt of ~10{degree sign}, the absolute stage tilt would approach its mechanical limit. Two clarifications would be appreciated: (a) What was the average pre-tilt across all lamellae? (b) How many dark tilt images, if any, were excluded during tomogram reconstruction?

      (2) Line 148: "When analysing tomographic volumes, we found that tomograms from data with a smaller increment displayed higher SNR values (see Fig. 2B)." It would be helpful to specify which comparisons are statistically meaningful (e.g. Mann-Whitney U test?). While the difference between 1{degree sign} and 2{degree sign} appears pronounced, the differences between 2{degree sign}, 3{degree sign}, and 5{degree sign} seem minimal. From my point of view, reporting the mean SNR values +/- standard deviations for each condition would already indicate some significance. Furthermore, since SNR is expected to depend on lamella thickness, it should be clarified whether the average lamella thickness is comparable across the five datasets.

      (3) Line 167: "Indeed, the variation in maximum resolution correlates with lamella thickness across all datasets (see Fig. 2F)." The reported R² values of 0.30 (1{degree sign}), 0.38 (2{degree sign}), 0.66 (3{degree sign}), 0.61 (5{degree sign}), and 0.60 (10{degree sign}) reveal a notably weak linear relationship for the finer tilt increments. It is also difficult to assess whether the lamella thickness distributions are comparable across conditions from the current figures - visually, the 1{degree sign} dataset appears to be based on thinner lamellae, while the 10{degree sign} dataset appears to include thicker samples. A histogram of lamella thickness distributions for each condition, provided as supplementary material, would greatly aid interpretation. Given this thickness dependency, reporting mean +/- standard deviation of lamella thickness per condition is highly appreciated.

      (4) Figure 4: It should be specified which tomogram subsets were used for the Rosenthal-Henderson analysis, whether lamella thickness was taken into account in the subset selection, and whether ribosomes too close to the lamella edges were excluded. Finally, linear fits should be displayed across the full x-axis range for all tilt increments to facilitate direct visual comparison.

      (5) General: Were ribosomes located at the lamella edges excluded from the analysis? As demonstrated in the authors' own prior work (Tuijtel et al., Science Advances, 2024), Ga-FIB milling induces structural damage at the lamella surfaces. To exclude the influence on the STA results, particles near the lamella edges should be removed prior to analysis, and the criteria for this exclusion should be stated explicitly.

      The aim of the authors was to provide the cryo-ET community with an evidence-based recommendation for the choice of tilt increment, and they largely succeeded in this goal. The identification of 3{degree sign} as a practical optimum - balancing sufficient dose per tilt image for effective per-particle refinement with fine enough angular sampling for accurate tilt-series alignment - is well supported by the data and consistent across the multiple quality metrics employed. The conclusion that coarser increments (5{degree sign} and 10{degree sign}) compromise tomogram quality, template matching accuracy, and STA resolution is robust and clearly demonstrated. However, the conclusion rests entirely on a single biological system using ribosomes as the sole molecular target, which are exceptionally favourable due to their abundance, size, and electron contrast. Whether the identified optimum holds for smaller, lower-abundance, or lower-contrast targets remains an open question.

      In future, it would be particularly interesting to test whether emerging tilt interpolation strategies, such as cryoTIGER, which is particularly intriguing, can effectively compensate for coarser experimental angular sampling in post-processing. Here, the optimal experimental increment may shift, and the interaction between these two approaches represents a promising direction for future work. More broadly, as cryo-ET datasets grow larger and public repositories expand, the practical tradeoffs between acquisition time, data storage, and structural quality identified here will become increasingly relevant to the field.

    2. Reviewer #2 (Public review):

      The determination of macromolecular structures directly within their native cellular environment is becoming increasingly routine, making standardized data collection strategies essential. In this manuscript, Tuijtel et al. provide a timely and valuable contribution by benchmarking key acquisition parameters and establishing practical guidelines for in situ cryo-electron tomography (cryo-ET). Critically, the authors present a systematic framework for optimizing data collection to achieve the highest attainable resolution.

      Using Dictyostelium cells as a model system, the authors generate multiple datasets at a constant total dose while varying the tilt increment. They demonstrate that tilt-series acquired with finer increments (1-3 degrees) yield superior alignment accuracy and improved template-matching performance, resulting in higher-quality reconstructions than those collected with coarser increments (5 degrees or above). Furthermore, the authors show that for subtomogram averaging, a 3-degree tilt increment outperforms all other conditions tested, particularly after per-particle refinement as implemented in M.

      Overall, the manuscript is clearly written, and the conclusions are well supported by the data presented. I have no major concerns. There are some minor points that the authors should address, including:

      (1) The phrase "electron optical density distribution" (line 31, Introduction) should be revised to "electrostatic potential" or "Coulomb potential distribution," which more accurately reflects what is measured in cryo-EM/ET.

      (2) The authors state that the maximum tolerable electron dose is approximately 100-150 e⁻/Ų (line 34, Introduction). This is an oversimplification, as bacterial specimens, for example, have been shown to tolerate doses of 200 e⁻/Ų or higher (see Breigel et al., PNAS, 2009; https://www.pnas.org/doi/10.1073/pnas.0905181106#T1). The statement should be revised to reflect this variability.

      (3) Lines 56-57: The authors do not cite their own prior work benchmarking tilt-series acquisition strategies on in vitro samples. This earlier study provides important context and should be referenced and briefly discussed.

    1. Reviewer #1 (Public review):

      This study addresses an important clinical challenge by proposing muscle network analysis as a tool to evaluate rehabilitation outcomes. The research direction is relevant and the findings suggest further research.

      The revised manuscript included additional methodological details and a supplementary comparison with conventional NMF.

      Comments on latest version:

      No additional comments.

    2. Reviewer #2 (Public review):

      This study presents an important analysis of how interactions between muscles can serve as biomarkers to quantify therapeutic responses in post-stroke patients. To do so, the authors employ an information-theoretical metric (co-information) to define muscle networks and perform cluster analysis.

      Comments on revised version.

      Thank the authors for the carefully revised article. I have no further comments.

    1. Reviewer #1 (Public review):

      Summary

      In the presented paper, Lu and colleagues focus on how items held in working memory bias someone's attention. In a series of three experiments, they utilized a similar paradigm in which subjects were asked to maintain two colored squares in memory for a short and variable time. After this delay, they either tested one of the memory items or asked subjects to perform a search task.

      In the search task, items could share colors with the memory items, and the authors were interested in how these would capture attention, using reaction time as a proxy. The behavioral data suggest that attention oscillates between the two items. At different maintenance intervals, the authors observed that items in memory captured different amounts of attention (attentional capture effect).

      This attentional bias fluctuates over time at approximately the theta frequency range of the EEG spectrum. This part of the study is a replication of Peters and colleagues (2020).

      Next, the authors used EEG recordings to better understand the neural mechanisms underlying this process. They present results suggesting that this attentional capture effect is positively correlated with the mean amplitude of alpha power. Furthermore, they show that the weighted phase lag index (wPLI) between the alpha and theta bands across different electrodes also fluctuates at the theta frequency.

      Strengths

      The authors focus on an interesting and timely topic: how items in working memory can bias our attention. This line of research could improve our understanding of the neural mechanisms underlying working memory, specifically how we maintain multiple items and how these interact with attentional processes. This approach is intriguing because it can shed light on neuronal mechanisms not only through behavioral measures but also by incorporating brain recordings, which is definitely a strength.

      Subjects performed several blocks of experiments, ranging from 4 to 30, over a few days depending on the experiment. This makes the results - especially those from behavioral experiments 2 and 3, which included the most repetitions - particularly robust.

      Comments on revision:

      The authors have adequately addressed my concerns. No further comments.

    1. Reviewer #1 (Public review):

      Summary:

      Laaker et al. investigates the immunological role of the cribriform plate during neuroinflammation using the EAE model. The authors combine immunohistochemistry, flow cytometry and single-cell RNA sequencing to characterize CD11b+CD11c+ myeloid cells that accumulate at podoplanin (PDPN)-rich meningeal-lymphatic niches surrounding olfactory nerve bundles. They identified distinct populations of migratory dendritic cells (DCs) and macrophages retained at the cribriform plate that exhibit transcriptional signatures consistent with immune tolerance, reduced interferon signaling, and programmed cell death, including Pdcd1 (PD-1) expression. In parallel, CCR2+ monocytes and alternatively activated (M2-like) Arg1+/CHI3L3+ macrophages integrate into this niche, suggesting the establishment of a locally immunosuppressive myeloid network.

      Strengths:

      (1) Overall, the study postulates a novel model in which the cribriform plate functions as a specialized perineural immune interface that reshapes myeloid phenotypes during neuroinflammation.

      (2) Suggests broader relevance for shaping peripheral immunity and therapeutic targeting. If DCs are being "tuned" at this exit site, it could influence what reaches cervical lymph nodes and how peripheral responses are set during CNS autoimmunity; the authors explicitly position this as relevant to CNS autoimmunity and possibly other CNS diseases (while acknowledging the need for human validation).

      (3) Technical sound and highly original work. Convergent multi-method support: the central narrative is backed by immunohistochemistry + flow cytometry + scRNA-seq, rather than a single assay. The headline conclusion (tolerogenic/suppressive skew at the cribriform plate during EAE) is explicitly built from these combined modalities.

      Comments on revised version.

      All my points were adequately addressed by the authors.

    2. Reviewer #2 (Public review):

      Summary:

      In this article, Laaker et al described diverse populations of macrophages and dendritic cells found in and around the cribriform plate in a context of a neuroinflammation caused by an autoimmune disease (EAE). The authors utilize elegant histochemical staining and a nifty approach to sort doublets to interrogate cells that are in contact with one another, presumably in vivo. Notably, they uncover a population of CD11c+CD11b+ cells interacting with M2 macrophages and PDPN+ fibroblasts and lymphatics. These cells are heterogenous but some of these DCs express PD-1 and transcriptional profiling suggests they may have immunosuppressive behavior. Altogether, this article explains well the complexity of cell populations found around the cribriform plate during inflammation and are suggestive of different interactions that trigger these different phenotypes from immune cells.

      Strengths:

      Beautiful images of a unique CNS: peripheral interface that support a novel scRNA approach to understanding how different cell populations engage in functional interactions in vivo.

      Weaknesses:

      It is unclear how the sorted populations reflect in vivo interactions, or a propensity to form aggregates during ex vivo processing. Future work will be needed to address which poplanin expressing cells are most relevant.

    1. Reviewer #1 (Public review):

      Summary:

      Kaku and Flenniken investigate the mechanistic pathways through which specific viral infections alter the flight capabilities of honey bees. Building on their previous discovery that DWV impairs flight while SBV unexpectedly enhances it, the authors hypothesized that these behavioral shifts are driven by interactions with the insect's octopamine (OA) signaling pathway, which is responsible for the "fight-or-flight" neurohormonal stress response and energy mobilization. To test this, the authors experimentally infected adult honey bees with DWV or SBV and pharmacologically manipulated the OA pathway using either octopamine supplementation or epinastine (EP), an OA-receptor antagonist. They then evaluated the bees' flight performance (distance, duration, and speed) on custom flight mills and profiled their gene expression using qPCR and RNA sequencing.

      Strengths:

      A major strength of this study is the high prevalence of preexisting background DWV and SBV infections in the honey bee cohorts, which meant there were no completely "virus-free" control groups. However, the authors successfully mitigated this limitation by rigorously quantifying viral RNA copies for every individual bee via qPCR and utilizing these viral abundances as continuous variables in powerful linear mixed-effect models.

      Weaknesses:

      The primary weakness lies in the methodology used for targeted pharmacological manipulations, as well as the lack of OA quantification across different treatments. Thus, their claims are not sufficiently supported by the current data.

      (1) The authors utilize Epinastine to block octopamine signaling, describing it as a highly specific OA receptor antagonist. However, pharmacological inhibitors often lack absolute specificity. Epinastine might bind to other octopamine receptor subtypes present in honey bee neural and flight muscle tissues, or it could potentially cross-react with tyramine and dopamine receptors. Without further genetic validation (e.g., RNA interference targeting specific receptors), it is difficult to definitively conclude that the altered flight performance is solely due to the blockade of the specific Oβ−2R pathway.

      (2) As a natural neurotransmitter, insects have evolved highly efficient "cleanup" mechanisms. OA is rapidly cleared from the synaptic cleft via reuptake transporters and quickly inactivated by enzymes such as N-acetyltransferase (NAT) or Monoamine Oxidase (MAO). Consequently, an injection of OA produces only a transient "pulse" of activity. It is often a poor "tool" for inducing prolonged physiological effects compared to synthetic formamidines like Amitraz.

      (3) The study relies heavily on transcriptomics and quantitative PCR to measure the mRNA expression of key synthesizing enzymes, namely tyrosine decarboxylase (tdc) and tyramine β-hydroxylase (tβh), to infer the activation or suppression of the octopamine pathway. However, changes in enzyme synthesis at the RNA level are often insufficient to accurately reflect the true physiological levels of biogenic amines. To robustly prove the authors' hypothesis of a "feedback loop that regulates intracellular OA concentrations", direct quantification of actual octopamine and tyramine titers in the bees (e.g., using high-performance liquid chromatography or mass spectrometry) is necessary.

    2. Reviewer #2 (Public review):

      Summary:

      This highly original and well-designed study provides insight into how honeybee picorna-like viruses, Deformed wing virus ( DWV) and Sacbrood virus (SBV), affect flight performance, and reveals the role of the octopamine (OA) pathway in virus-honeybee interactions. The authors used a flight mill to quantify the flight performance of bees with different levels of DWV and SBV. Bees were treated with OA and/or epinastine (EP) - an OA receptor antagonist; the study also quantified virus loads and expression of two key genes involved in OA biosynthesis.

      The results showed that reduced flight performance associated with high DWV levels could be alleviated by OA administration. In contrast, increased levels of SBV had the opposite effect, leading to enhanced flight performance. This suggests distinct physiological responses to DWV and SBV infections. Administration of EP had led to a reduction of flight performance in SBV-infected bees, indicating the involvement of the OA pathway.

      The authors also quantified levels of mRNAs of enzymes involved in OA synthesis, tyrosine decarboxylase (TDC) and tyramine beta-hydroxylase (TbH), and concluded that DWV induced expression of TbH, while SBV upregulated expression of TDC. Furthermore, the study identified upregulated and downregulated genes in response to SBV, DWV and DWV in combination with OA.

      Strengths:

      The study reported opposing effects of infections of related viruses, SBV and DWV, on honeybee flight performance, and identified the central role of the octopamine (OA) signaling pathway in the effect of viruses on honeybee flights.

      These findings were achieved by using a combination of approaches, including experimental measurement of flight distance, virus infections, and introduction of OA and EP. Experimental work with honeybees is technically challenging and requires specialized expertise, which makes the results produced in this study more valuable.

      DWV and SBV are among the most important honeybee pathogens affecting honeybee health and threatening the pollination service. Therefore, an understanding of the mechanisms underlying DWV and SBV pathogenesis has the potential to develop novel approaches to mitigate the negative impact of these viruses.

      Weaknesses:

      No weaknesses were identified by this reviewer.

    1. Reviewer #1 (Public review):

      Summary:

      The authors aim to characterize Huntingtin (HTT) aggregates in various cells and tissues and propose that mutant polyQ HTT (mHTT) assembles at the Golgi apparatus, thereby impairing Golgi organization and function. They further suggest that such Golgi defects might contribute to disease pathology, including neurodegeneration.

      Strengths:

      The study spans a wide range of disciplines, including genetics, cell biology, neuroscience, and systems biology, and employs diverse methodologies such as iPSC, 3D SIM microscopy, omics approaches, organoid culture, electrophysiology, and antisense depletion.

      Weaknesses:

      While the breadth of techniques is impressive, the central premise of the work-the structural and functional relationship between polyQ assemblies and the Golgi apparatus-is not supported by sufficiently rigorous cell biological evidence.

      A major concern is that much of the cell biology data remains descriptive and lacks mechanistic depth. The findings are fragmented and not integrated into a coherent molecular or cellular model. Instead of building a logical progression of experiments, the study presents a collection of observations that appear disconnected and, at times, driven more by technical capability than by hypothesis-driven design.

      Critically, the key claim that polyQ HTT functionally disrupts the Golgi (Golgipathy) is not convincingly demonstrated. Many observations could be more simply explained by the polyQ HTT localization to the Golgi and known Golgi sensitivities to perturbations (e.g., starvation or Brefeldin A treatment), rather than by a specific mechanistic role of polyQ HTT.

      The manuscript also suffers from issues in organization and clarity, including imprecise descriptions and figures that are difficult to interpret.

      Major Concerns:

      (1) Golgi localization

      The localization of polyQ HTT relies entirely on the antibody 3B5H10, which is foundational to the study. However, previous reports using the same antibody have described predominantly cytosolic localization. This discrepancy must be addressed rigorously by independent validation using alternative antibodies or tagged, exogenously expressed polyQ HTT constructs that should be shown to colocalize with 3B5H10 signals.

      Furthermore, the Golgi is identified solely using GM130, a cis-Golgi and ER exit site marker. This raises ambiguity: does polyQ HTT associate with the entire Golgi or only recruit GM130? Could the observed signal correspond to a sub-Golgi compartment?

      If polyQ HTT is indeed Golgi-associated, several key observations become expected rather than novel. For example, in Figure 4I-M, sensitivity to Brefeldin A is unsurprising, as Golgi structure collapses upon such treatment; in Figure 4N-O, co-fragmentation with the Golgi is expected under Golgi-disrupting conditions.

      (2) 3D rendering

      The extensive use of 3D rendering appears unnecessary and, in some cases, misleading. The rendered images do not provide additional insight beyond conventional 2D fluorescence images. Serial 2D fluorescence sections should be more objective in representing the 3D organization. In Figure 2A and Figure 5A, red line features in 3D beige polyQ HTT structures resemble unrelated biological structures, such as vasculature, which is inappropriate.

      There is also an inconsistency in rendering. For example, fine mesh-like structures are shown in some figures (e.g., Figure 2A, Figure 4A), whereas others appear as amorphous aggregates (e.g., Figure 5A, Figure S2B), without explanation.

      (3) Quantification of area and volume

      The manuscript extensively quantifies the area and volume of polyQ assemblies (e.g., Figure 2B, C and Figure 3B, C, E, G, H). These measurements are not reliable. First, the structures appear filamentous and likely below the diffraction limit. Second, fluorescence signals are broadened by the point spread function (PSF), artificially inflating measured dimensions. Last, even with 3D SIM (~100 nm resolution), fine structural details remain unresolved. Thus, these quantitative measurements lack physical meaning and might not be used to support conclusions.

      (4) Interpretation of structural features (Figure 2A)

      Descriptions such as "parallel spindles" and "ring-like assemblies" are not clearly supported by the data. The terminology is ambiguous, and the claimed structures are not discernible. The use of the term "interaction" with the nuclear membrane is also inappropriate. At best, the data suggest colocalization, which itself is not convincingly demonstrated.

      (5) Mitotic fragmentation (Figure 2E)

      The conclusion that polyQ assemblies fragment during mitosis lacks proper controls. It is unclear whether these cells exhibited intact "fabric-like" assemblies during interphase, or the observed structures were already fragmented prior to mitosis.

      (6) Fixation-induced fragmentation (Figure 2F)

      The claim that fixation-induced fragmentation reflects a unique dynamic property of polyQ assemblies is likely an overinterpretation. This phenomenon may simply represent a fixation artifact. Therefore, it cannot be used as evidence for in-cellulo structural dynamics.

      (7) Nuclear localization claims (Figure 5A)

      The assertion that polyQ assemblies "almost completely occupy the nucleus" is not supported. The images are more consistent with perinuclear localization, typical of the Golgi region. There is no clear evidence for nucleoplasmic distribution.

      (8) Drug treatment and data interpretation (Figure 3D-E)

      The x-axis in Figure 3E is non-linear, which is inappropriate unless explicitly justified. Furthermore, the rationale for using Onjisaponin F is unclear. What is its known mechanism? Does it affect Golgi organization? Without this context, observed effects may reflect Golgi perturbation rather than specific effects on polyQ assemblies.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, the authors report the hitherto unobserved types of HTT assemblies observed in human fibroblasts and iPCS-derived neurons in 2D and 3D culture, applying a state-of-the-art confocal microscopy imaging with near 64 nm resolution to decode their structures. They further demonstrate that these assemblies closely contact with various types of Golgi ribbons, stacks, vesicles, and Golgi-derived clathrin-coated vesicles but not mitochondria. They also used single-cell RNAseq to show some interesting findings that supported the suggested defects in Golgi-related function, specifically by downregulation of various cellular processes related to Golgi and vesicle transport functions. They also replicate mHTT nuclear accumulations in striatal neurons, which is considered to be a hallmark of HD pathology, by using long-term neuronal culture. Furthermore, the assemblies showed differential responses to glucose starvation and to autophagy enhancer treatment by onjisaponin F for mutant HTT assemblies, but not for the healthy siblings, in fibroblasts and neurons. Onjisaponin F treatment did not reverse nuclear deposition. They also showed that ASO shortens these polyQ assemblies but does not change neuronal firings that are detected by HD-MEA. Notably, they also used human brain samples to show the existence of polyQ assemblies in fetal and child brain samples. This part is impressive.

      Overall, this work reports a novel polyQ assembly, which was previously reported as a pathogenic factor, has not been reported before for HTT, is related to Golgi activities and vesicular transport, and is dismantled in HD patient cells. The intensive immunostaining and super-resolution scanning are impressive and definitely strengthened by the impact of the findings. The scRNAseq data adds another layer to the observed Golgi impairments and their suggested relationship to Golgi function. The drug testing for polyQ assemblies, especially polyQ assemblies in HD cells, is preliminary. However, the data in this study are enough to support the existence of polyQ assemblies in human cells and their specific relationships with the Golgi apparatus.

      Strengths:

      In this study, the authors used the cells from a large HD family and fetal/child brain samples to decode the structure of endogenous polyQ assemblies. This part is impressive. The intensive staining and super-resolution scanning are amazing. The spatial relationships of polyQ assemblies with the Golgi apparatus and mitochondria are well illustrated.

      Weaknesses:

      Although they used healthy sibling cells as a control, an isogenic control (genetic correction of the mutant gene) is lacking. Based on the Golgipathy of mHTT, they did a drug screening. The drug testing for polyQ assemblies is preliminary. More rigorous validation, such as scRNA seq and proteomic analysis, etc., is necessary to reach a systemic conclusion.

    1. Reviewer #1 (Public review):

      Summary:

      This paper describes an application of the high-resolution cryo-EM 2D template matching technique to sub-50kDa complexes. The paper describes how density for ligands can be reconstructed without having to process cryo-EM data through the conventional single particle analysis pipelines.

      Strengths:

      Improved insights in which particles contribute to the density of ligands that is absent from the templates are valuable.

      Weaknesses:

      Although the convenient visualisation of small molecules bound to protein targets of a known structure would be relevant for the pharmaceutical industry, the evidence described for the claim that this technique "significantly" improves alignment of reconstruction of small complexes is incomplete. In a revised paper, the authors are encouraged to better evaluate the effects of model bias on the reconstructed densities.

      In the revised version, the refinement of atomic occupancies in the 2DTM-generated maps has been insightful: densities only come back at values ranging from 0.55-0.80, whereas residues included in the template remain at 1, suggesting that the 2DTM-reconstruction does suffer from model bias. Their newly added Omega calculations, which are helpful, also suggest that model bias is present in the 2DTM-based reconstructions. These observations therefore contradict the first subsection heading of the Results, which claims "unbiased reconstruction of omitted residues".

      Both the Omega analysis and the refined atomic occupancies provide insights into the "real-space aspect" of the model bias. The question to what extent the model bias affects the map in Fourier space remains unanswered. The authors base some of their claim in the paper on FSC curves in Figures 1b and 3b, but these will suffer from the same model bias. To assess this, I had requested the authors to reconstruct an OMIT map and to assess its resolution using FSCs. The authors have indeed performed a careful reconstruction of an OMIT map, which is currently shown in Figure 5. I liked how they implemented this, as described in detail in the Methods section. However, the measurement of how much model bias is present in this OMIT map by FSC calculations is still pending. This could be done in two ways, and I would encourage the authors to present the results of both in (hopefully a last) revised version of their manuscript. My original suggestion was to calculate a map-to-model FSC for the OMIT map and the full reference. This should be compared with a similar map-to-model FSC on the map where only the ligand was omitted. Alternatively, they can use the cisTEM FSC_uncorr procedure on the OMIT half-reconstructions and compare the resulting curve with the one presented in Figure 1b.

      The reason that I am keen to see these FSCs is because high-resolution model bias is a fundamental danger of the 2DTM approach. It will therefore also be in the interest of the authors to quantify the extent to which it happens. For now, I have kept the above public review and short assessment the same as they were, but I will consider raising the assessment after the suggested experiments (which I hope will be relatively easy to do!) are incorporated.

    2. Reviewer #3 (Public review):

      Summary:

      Due to the low SNR of cryo-EM micrographs necessitated by radiation damage, determining the structure of proteins smaller than 50 kDa is exceedingly challenging, such that only a handful have been solved to date. This work aims to improve the reconstruction of small proteins in single-particle cryo-EM by using high-resolution 2D template matching, an algorithm previously used to locate and align macromolecules in situ, to align and reconstruct small proteins. This approach uses an existing macromolecular structure, either experimentally determined or predicted by AlphaFold, to simulate a noise-free 3D reference and generates whitened projections, crucially including high-spatial-frequency information, to align particles by the orientation with maximal cross-correlation. They demonstrate the success of this approach by generating a 3D reconstruction from an existing dataset of a 41.3 kDa protein kinase that had previously evaded attempts at high-resolution structure determination. To alleviate concerns that this is purely from template bias, they demonstrate clear density at two regions that were not present in the template: 6 residues in an alpha helix and an ATP in the ligand binding pocket. The latter is particularly important for its implications in determining structures of ligand-bound proteins for drug discovery. They also produce a composite omit map from 36 partial-deletion reconstructions spanning the entire protein, demonstrating a reconstruction can be obtained without template bias. Additionally, the authors provide an update to the classic calculation in Henderson 1995 to predict the minimum molecular mass of a protein that can be solved by single-particle cryo-EM.

      Strengths:

      I am in no doubt that this technique can be used to gain valuable insights into the structures of small proteins, and this is an important advancement for the field. It is complementary to single-particle cryo-EM and provides an extra tool for the experimentalist that may work better in certain cases. For cases where only a small region of the structure is of interest, such as in drug screening, this method provides a simple workflow to screen many structures.

      The claim that using high-spatial frequency information is essential for aligning small proteins is a valuable insight. A recent pre-print published at a similar time to this manuscript used high-resolution information in standard ab-initio reconstruction to generate a high-resolution reconstruction from the same dataset, supporting the claims made in the manuscript.

      The theoretical section outlined in the appendix is also theoretically sound. It uses the same logic as Henderson, but applies more up-to-date knowledge, such as incorporating dose-weighting and altering the cross-correlation based noise estimation. This update is valuable for understanding factors preventing us from reaching the theoretical limit.

      Weaknesses:

      The applicability of this technique to more than a single target was not demonstrated. Nor was it compared to more recent strategies for processing SPA data from small molecules, such as Blush regularization or HR-HAIR. Additionally, although the authors have demonstrated convincingly that their method selects a stack of high-quality particles, it is less clear whether it performs better than RELION when using the same stack of particles, particularly in the ATP binding pocket. This places this method as a complementary technique, and whether it outperforms those methods for a wide variety of molecules is yet to be determined. The method presented here also introduces template bias, so only parts of the reconstruction not in the initial template are free of template bias. Producing a full reconstruction through a composite omit map is computationally expensive, meaning that unless this method outperforms modern SPA methods, its major use case will be ligand binding studies instead of 3D reconstructions.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript by Singh et al. presents an application of MOA-seq to better define transcriptional control underlying the hypoxia response in human endothelial cells. This group's previously described MOA-seq technique allows for precise, identity-agnostic mapping of occupied sites of DNA-binding proteins across the epigenome and over time. Here, they applied MOA-seq to HUVECs under normal oxygen conditions or variable lengths of hypoxia treatment, comparing changes in occupancy over time and associating these changes with corresponding transcriptome alterations. This approach revealed thousands of dynamically occupied sites comprising 10 major kinetic clusters that appear to define distinct subsets and phases of the hypoxia response. Analysis of DNA motifs in these dynamically occupied regions captured the known major roles of HIF1A in the hypoxia response and also implicated new HIF1A-associated regulators. Importantly, they also identified many potential HIF1A-independent candidate TFs that act at HREs, which has been an outstanding question in the field. Additionally, this study identified ~7K additional sites not previously defined as regulatory elements by ENCODE.

      Strengths:

      Overall, this study is well executed and described, providing new biological insights as well as a rich data resource for the field. As MOA-seq was previously developed for use in plants, this work demonstrates the application of this method in mammalian cells and highlights its utility in identifying new potential regulatory sites not captured by DNase-seq or ATAC-seq. The conclusions made by the authors are well supported by the results, with the caveat that extensive use of DNA motif identification and ontology analyses invariably leads to some uncertainty regarding factor identity and gene network properties.

      Weaknesses:

      There are several areas where the clarity of presentation could be improved:

      (1) Given the importance of the methodology, the methods section needs more detail on how the extent of MNase digestion is chosen to achieve optimal results with MOA-seq. This is described to some extent in the description of control library preparation, but not for the experimental samples.

      (2) The abstract describes this approach as "native cistrome profiling" but this is misleading since formaldehyde fixation is used.

      (3) Species- and field-specific jargon and abbreviations need to be clarified on first usage. For example, on page 9: "Downsampling analysis was carried out for two sets of published reference peaks; the CTCF cCRE peak midpoints and for the ERG motif under the ERG ReMap ChIP-seq peaks." The different categories of cCREs were not clearly defined, nor will it be clear what the term ReMap refers to for those outside the field. The sentence after this refers to IDR, which also should be defined.

      (4) Figure 4C: Are these motifs examined under MOA sites specifically or anywhere in the genes in question?

      (5) Figure 5B shows that up-DEGs with diff-MOA footprints tend to show more losses of footprints. Do the authors interpret this as a loss of repressor binding?

    2. Reviewer #2 (Public review):

      Summary:

      Singh et al. apply MOA-seq to map transcription factor occupancy genome-wide in HUVECs across a hypoxia time course. The study provides a well-validated, high-resolution view of cistrome dynamics and identifies both HIF1A-associated and independent regulatory programs.

      Major Comments:

      Methodological validation is strong. MOA-seq's ability to map protein-bound DNA at near-nucleotide resolution without factor-specific antibodies is a genuine advance, and the cross-validation against independent ChIP-seq and ENCODE datasets is convincing. As noted, future work with additional biological replicates could further strengthen confidence in the smaller kinetic clusters.

      Imaging-based validation would strengthen the key biological claims. The kinetic clustering and pathway enrichments are computationally inferred. Orthogonal approaches, for example, live-cell fluorescence imaging of HIF1A nuclear translocation to confirm the proposed temporal binding waves, would provide independent experimental support.

    1. Reviewer #1 (Public review):

      Summary:

      White et al. explore the role of synaptotagmin isoforms in mediating neurotransmitter release from EPN terminals in the LHb. The authors show a relatively high expression of Syt2 and Syt3 in the EPN relative to other Syt isoforms. The authors then perform a series of experiments to show that Syt2 preferentially regulates glutamatergic transmission while Syt3 regulates GABAergic transmission.

      Strengths:

      Interesting, timely topic.

      Weaknesses:

      While interesting, the study is rather preliminary. There are a number of issues the authors need to address.

    2. Reviewer #2 (Public review):

      Summary:

      This is an important study of the molecular mechanisms of GABA vs. glutamate release by coreleasing neurons that project from the EPN to LHB. The conclusion is that separate pools of vesicles release each transmitter and use different molecular machinery to do so. This is in contrast to and in disagreement with functional studies of the same synapse that conclude that the transmitters are copackaged.

      As detailed below, the study has a major flaw. It uses an incorrect Cre line, which is also expressed in a purely glutamatergic population in the EPN that also projects to the LHB. In addition, there is little quantification and validation of important tools and no histological confirmation of the sites of expression of viral-encoded proteins.

      Strengths:

      The strength of the study is in the importance of the question addressed and in the ambition of the tools used.

      Weaknesses:

      (1) The study uses Vglut2-IRES-Cre mouse to gain control over EPN to LHB projections. However, as has been shown by several groups, this line is not exclusive to the EPN co-releasing population. It is also expressed in glutamatergic EPN PV neurons that project solely to the EPN. Therefore, all of the studies here are contaminated with analysis of a purely glutamatergic Vglut2+ projection. This calls into question all the conclusions about the differential localization and function of synaptic proteins.

      (2) It is unclear from the paper, but it seems that some experiments may have been done with no Cre control, which likely led to contamination in neighboring brain regions, some of which project to LHB as well.

      (3) Histology: There is no histology shown for the mice used in the study. This is a crucial point. We need to see that the injection was clean and specific for each mouse used in the study (although, given the use of Vglut2-Cre, it cannot be specific to the coreleasing population). Whole-brain histology is necessary.

      (4) ASO KO: Unfortunately, there is little validation of the ASO KO. The effects shown in Figure S2 show a very small effect, if any. There appear to be no statistics. The functional effects in the main figure are also relatively subtle.

      (5) Other concerns: There are many typos and errors, including in important claims.

    1. Reviewer #1 (Public review):

      Public Review

      This paper presents an fNIRS neuroimaging study with a relatively large sample of preschool children (aged 3-5) that measures both positive and negative empathy within a single task. Children watch emotional events and are asked questions about both their own emotions and the emotions of others, allowing the authors to distinguish between affective and cognitive empathy. The authors propose "foundational" models of affective and cognitive empathy and argue that their findings support the idea that cognitive empathy emerges before affective empathy in early childhood.

      Strengths:

      The paper addresses a valuable question by measuring both positive and negative empathy within a single cognitive task. The use of fNIRS with a relatively large preschool sample is commendable, and the pre-registered design strengthens the contribution. The task itself is innovative, well-suited to this age group, and achieves high compliance, which is essential and notably difficult with young children. Overall, the methods are appropriate, and the empirical work is valuable.

      Weaknesses:

      The main concerns relate to the framing of the paper rather than the empirical work itself.

      The introduction contains several claims that are overstated or inaccurate. The statement that "we know very little about the development of this fundamental social skill during the first years of life" does not reflect the state of the field; empathy in early development has been quite extensively studied (e.g., Davidov et al., Malti et al., Uzefovsky et al., Decety et al., Feldman et al., among others). The view that emotional contagion directly develops into affective empathy is based on early theoretical accounts that have since been challenged by empirical evidence (see Davidov et al., 2025). The claim that cognitive empathy does not require theory of mind is also overstated - it is hard to see how theory of mind, the understanding that others have thoughts, beliefs, and emotions that may differ from our own, would not be required for cognitive empathy. Furthermore, the introduction neglects recent and directly relevant work (e.g., Zach et al., 2025; Uzefovsky et al., 2020; Davidov et al., 2021).

      Most critically, the claim that "no neuroimaging studies have yet investigated brain regions supporting empathy in preschoolers" is inaccurate. Multiple studies have examined brain regions supporting empathy in children within this age range, including work using fNIRS and studies of positive empathy (e.g., Decety et al., 2018; Light et al., 2009; Levy et al., 2019; Bray et al., 2022; Brink et al., 2011). This is also not the first study to measure brain activation in response to positive and negative emotional events in children (e.g., Cheng et al., 2014; Light et al., 2009). These novelty claims need to be corrected.

      The use of "explicit" to describe cognitive empathy and "implicit" or "spontaneous" to describe affective empathy is problematic. Affective empathy can be expressed quite explicitly, through facial expressions, verbal statements, and gestures, and framing it as spontaneous overlooks the motivational dimensions of empathy (e.g., Zaki and colleagues). The authors' use of "foundational affective empathy model" and "foundational cognitive empathy model" as though these are established concepts is not well supported by the current evidence base.

      The conclusions in the discussion go beyond what the data can support. The question of whether cognitive or affective empathy emerges first cannot be adequately addressed with a cross-sectional sample aged 3-5, an age at which affective empathy is likely already well established and cognitive empathy is expected to be developing around the lower end of this range. The cross-sectional design further limits what can be inferred about developmental trajectories during a period of substantial individual variability. Together, these issues make the developmental-precedence conclusions difficult to sustain. The claim that the results demonstrate "the first time that this brain specialisation for stimuli of different emotional valence may be rooted in childhood" is also inaccurate, as there is prior evidence for brain specialisation of emotional valence in early childhood (e.g., Grossmann et al., 2007).

      Appraisal:

      The empirical contribution, the task design, the fNIRS data, and the analyses are sound and have value for the field. However, in its current form, the paper does not achieve what it sets out to do. The novelty claims are undermined by the omission of a substantial body of relevant prior work, and the developmental conclusions are not adequately supported by the cross-sectional design and age range studied. The abstract similarly overstates the support this study provides for the early emergence of cognitive over affective empathy.

      Impact:

      With appropriate revision, this work could make a meaningful contribution. The task is well-designed for studying empathy in young children and could be useful to other researchers in the field. The fNIRS data from a large preschool sample are a valuable resource. However, the contribution needs to be framed accurately, both in terms of what is genuinely novel relative to the existing literature and in terms of what conclusions the data can and cannot support.

    2. Reviewer #2 (Public review):

      Summary:

      Authors examined neural substrates for cognitive empathy (conceptually understanding others' emotions) versus affective empathy (automatically sharing others' emotions) development in 3-5-year-old toddlers, and argued that cognitive empathy emerges earlier than affective empathy, challenging the predominant view that affective empathy develops earlier. The authors developed an empathy test for toddlers while measuring their brain activity with fNIRS (particularly in MPFC, STG, DLPFC, and TPJ) and heart rate. They found different brain region activation in cognitive versus affective empathy tasks, as well as age-related changes in the activation of right MPFC and right TPJ.

      Strengths:

      This work investigated the development of different components of empathy, which is a quite understudied topic. The authors developed an age-appropriate task for toddlers to measure their cognitive empathy and affective empathy, which is likely useful for future research in this field. Their methods are sound, and give a relatively large sample; the results look interesting and relatively solid, except for certain details in the reporting of methods and results.

      Weaknesses:

      (1) My major concern is the roles of brain regions hypothesized and found in this paper (MPFC, STG, DLPFC, and TPJ) - the authors seemed to have omitted a large portion of the literature on this topic. Prior works have found that these brain regions may be involved in more than one process, or involved in processes that are common to both cognitive and affective empathy (see Schurz et al., 2021). In particular, MPFC seems to be indicated more often in cognitive empathy, and STG may be involved in both cognitive empathy and intermediate processes, which is contradictory to what the author claimed and hypothesized. Relatedly, when the authors made statements like "these results highlight that regions underpinning affective and cognitive empathy in preschoolers largely resemble those documented in adults" (without proper citations), I found it unconvincing due to the disagreements in the past adult research about brain regions related to empathy, which were not quite discussed in the current paper. It may be helpful if the authors do a more thorough literature review and provide a more comprehensive view of how their results fit in the existing literature.

      (2) Given the disagreement in the past research about the roles of these brain regions, I feel like the authors' hypotheses may be insufficiently justified, and their claim that cognitive empathy develops earlier than affective empathy is a bit overly strong - would it be possible that these brain regions' different rates/patterns of development are irrelevant to specific components of empathy? Given that behavioral data did not show any age difference, and that each brain region can engage in many functions besides empathy (e.g., generic social and emotional processing), I would be more cautious when interpreting these results.

      (3) It would be helpful if the authors report certain parts of their methods and results in more detail.<br /> a) During the cognitive/affective empathy tasks, it is not explicitly clear which part of the fNIRS data were included in the analysis.<br /> b) When the authors did FDR corrections, they should include the q values and adjusted p values. I was also confused about how the FDR correction was conducted - were analyses performed on all 10 ROIs or only the hypothesized regions? I think if the authors have hypotheses about specific regions, they should test their hypotheses first, and then everything else would be exploratory analyses.<br /> c) Additionally, it is unclear what brain template was used and what procedure was followed to map channels of fNIRS data to the template.

      References:

      Schurz, M., Radua, J., Tholen, M. G., Maliske, L., Margulies, D. S., Mars, R. B., ... & Kanske, P. (2021). Toward a hierarchical model of social cognition: A neuroimaging meta-analysis and integrative review of empathy and theory of mind. Psychological bulletin, 147(3), 293.

    1. Reviewer #1 (Public review):

      The study by He and colleagues aims to investigate the molecular mechanisms driving key cell potency transitions, particularly the naïve-to-primed pluripotency transition. The authors explore the relationship between cell polarity and stemness using stem cell models combined with a comprehensive panel of experiments, including pharmacological inhibition and co-culture/conditioned medium rescue approaches. Overall, the study provides interesting observations and contributes to the understanding of the molecular mechanisms dynamically regulating stem cell differentiation.

      However, several conceptual and interpretational aspects could be strengthened:

      First, the Introduction would benefit from being more focused on what is currently known regarding cell polarity during early embryogenesis and pluripotent stem cell transitions, rather than emphasizing later neurogenesis events. Such reorientation would better match the main topic of the manuscript and improve the conceptual coherence of the study.

      Similarly, Figure 6, where the authors attempt to provide clinical relevance through neural organoid formation experiments, feels somewhat disconnected from the central theme of the naïve-to-primed transition. Although this section is interesting on its own, there is already extensive literature describing polarization and morphogenetic events occurring much earlier during pluripotent state transitions. Therefore, the developmental relevance of the neural differentiation phenotypes could be better contextualized in relation to earlier morphogenetic events associated with pluripotency progression.

      The manuscript contains a substantial amount of experimental work; however, several results would benefit from deeper discussion. For example, in Figure 1, what is the rationale behind ZO1 downregulation being observed specifically in primed PAR knockout cells but not under naïve culture conditions? In addition, in Figure 3, the authors perform co-culture and conditioned medium experiments between wild-type and knockout cells. While the authors focus on the secreted protein fraction that rescues the phenotype, they also mention that other fractions display rescuing activity. Could the authors briefly discuss what additional components may contribute to this rescue effect? For example, could other molecules within these fractions also converge on AKT signaling regulation?

      Importantly, transitions in cell potency are frequently associated with coordinated morphogenetic changes. For example, during mouse embryogenesis, naïve pluripotent inner cell mass cells progressively polarize into a rosette-like structure with apical domain specification before lumen formation and epithelialization during progression toward the primed epiblast state. This developmental context could help strengthen the biological interpretation of the study.

      There are also several claims throughout the manuscript that appear to be overinterpreted or insufficiently quantified. For example, in Figure 1, the authors state that CDH1 expression is uniform; however, this is difficult to appreciate from the images shown, and quantitative analysis would be necessary to support this conclusion.

      Another example appears in Figure 2, where the authors claim that "heatmap analysis revealed that transcriptomic profiles of PAR knockout cells progressively diverged from wild type from day 3 onwards". This conclusion is not fully supported by the presented data for two reasons: (1) transcriptomic divergence is more appropriately assessed through principal component analysis, clustering, or distance-based methods rather than by visual inspection of a heatmap alone; and (2) although some genes displayed in panel E begin to show genotype-associated differences from day 3, the overall transcriptomic structure shown in the PCA and heatmap remains primarily dominated by temporal progression rather than genotype.

      In this context, it remains unclear whether PAR knockout cells truly retain a more naïve pluripotent transcriptomic identity. To support this claim, the authors should compare the knockout transcriptome directly against a naïve pluripotent population. The phenotype observed in the knockout cells may instead represent an incomplete or aberrant primed transition rather than maintenance of naïve pluripotency itself. Intermediate morphogenetic states, such as rosette-like epithelial stages, could also explain the observed phenotype.

      Strengthening this aspect of the study would substantially improve its developmental and in vivo relevance, which currently appears somewhat limited. In particular, it would be interesting to determine whether this mechanism operates during embryogenesis itself. The authors could consider relatively simple but informative experiments, such as perturbing PAR signaling or Furin activity during embryo culture.

      Along the same lines, some statements in the manuscript appear overly speculative. For example, the statement that "these findings may reveal a developmental compensation mechanism during embryogenesis, whereby normal cells rescue defective cells or increase their own proportion" extends well beyond the experimental evidence presented. Such claims invoke concepts related to cell competition, abnormal cell recognition, or developmental quality control mechanisms in vivo, none of which are directly demonstrated in this study. The authors are encouraged either to substantially tone down these statements or move them to the Discussion as speculative possibilities.

      Another important conceptual point concerns the relationship between PAR complex regulation and Lefty signaling. If this mechanism indeed reflects a physiological or homeostatic process operating during embryogenesis, what would be the developmental rationale for the PAR complex regulation of Lefty? Lefty is well known for its role during gastrulation and anterior epiblast patterning. It would therefore be interesting if the authors could further discuss potential links between these developmental contexts.

      Minor points:

      (1) The authors state that PAR knockout cells do not exhibit major differences in self-renewal capacity; however, they simultaneously claim that these cells remain in a more naïve-like state. This interpretation requires clarification, as naïve pluripotent cells are typically associated with increased clonogenicity, enhanced self-renewal, and expression of markers such as alkaline phosphatase and SSEA1 compared to primed cells. The relationship between the observed phenotype and the proposed "naïve-like" state should therefore be discussed more carefully.

      (2) The authors generated several independent knockout clones, but appear to use only one clone for downstream analyses after observing similar morphogenetic phenotypes. Is this sufficient to account for potential clonal heterogeneity? Would the use of pooled clones provide a more robust experimental system?

      (3) The rescue experiments using pathway inhibitors are interesting; however, the interpretation again relies primarily on colony morphology. Readers may question whether these experiments truly represent rescue of the naïve-to-primed transition itself without additional transcriptomic or molecular characterization.

      (4) In Figure 4, the manuscript could be strengthened by integrating transcriptomic analyses from pharmacological treatments with the secreted-factor and co-culture datasets.

      (5) The authors could better clarify the context of Furin downregulation in the knockout cells. Is this a direct consequence of altered transcriptional regulation by the PAR complex, or could it instead represent a secondary consequence of impaired progression through the primed pluripotent transition?

    2. Reviewer #2 (Public review):

      Summary:

      The study demonstrated that Par, but not other polarity genes, Crumbs or Scrib, regulates cell polarity during PSC transition to primed state as well as neural tube formation.

      Strengths:

      The use of KO convinces the role of Par in NPT. Scrib and Crumbs KO data are informative to the field. The conditioned medium experiment is informative. They suggested the potential secreted factors over 50kDa are responsible for maintaining the polarity of NPT in Par KO.

      Weaknesses:

      Most importantly, how Par is important for PSC maintenance and differentiation is not clear. The data provided are dome shape formation, endoderm lineage tendency, and neural tube formation reduction. The manuscript lacks a core message of the physiological importance of Par. Is Par critical of PSC maintenance? Is Par critical for neural system development?

      Secondly, AKT-FURIN-...... axis still lacks supportive data. Various inhibitors were used to rescue the Par KO. But the link between each component in the axis is missing and rather superficial.

    1. Reviewer #1 (Public review):

      The wide-ranging serotonergic projections emerging from the Dorsal Raphe nucleus (DRN) is suggestive of a central role in regulating brain-wide activity and behavioural states. DRN activity has been associated to diverse functions, ranging from mood, motivation and pain regulation to sleep and cognitive flexibility. Its far-reaching connectivity made it challenging to assess the brain-wide effect of its activation, especially during behaviour.

      The present study by Qi et al. addresses these challenges by combining state-of-the-art tracking microscopy with the whole-brain accessibility of the larval zebrafish model. To investigate the effect of DRN activation, the authors leveraged the Tg(tph2:ChrimsonR) line to optogenetically activate tph2-positive neurons in the DRN, while monitoring changes in brain-wide activity, locomotion and auditory-stimuli evoked responses.

      Optogenetic activation had a suppressing effect on locomotion, which the authors distinguished from inducing sleep by the maintenance of posture and its sleep disturbing effect of nighttime stimulations. Further, the authors report a distinct effect of DRN activation on motor-related, but not auditory-related neuronal subspaces, identified by demixed principal component analysis.

      In addition, rather than affecting all motor-correlated neurons similarly, tph2+ DRN-mediated suppression focused on neurons encoding high-amplitude or turning motion.

      In summary, the work of Qi et al. provides solid evidence for a predominant role of the DRN in wake-state motor suppression by aptly combining the vast data-acquisition possibilities of the larval zebrafish model with computational methods to extract relevant information.

      The brain-wide scope of the analysis is a key strength, reducing bias, confirming the involvement of known motor and auditory regions, and providing a valuable dataset for future analyses.

      While the results well support the conclusion of the authors, certain biological and technical aspects demand discussion.

      Comments on revised version.

      The authors successfully addressed my points.

    2. Reviewer #2 (Public review):

      Summary:

      The authors examine the effects of activating the dorsal raphe nucleus serotonergic system using a combination of calcium imaging and optogenetics in freely moving larval zebrafish. Their findings show that optogenetic stimulation induces a state of behavioral quiescence.

      They further investigate whether this state corresponds to sleep or reduced motor activity. Analyses of posture and sleep-related paradigms indicate that serotonergic activation primarily suppresses motor output rather than promoting sleep. Notably, this suppression appears to be bout type-dependent, with stronger effects on neurons associated with larger tail amplitudes and turning angles.

      In addition, auditory stimulation experiments reveal no significant impact of serotonin on sound encoding.

      Strengths:

      The study combines advanced experimental techniques with state-of-the-art analytical methods, enabling precise and compelling insights into the role of serotonergic modulation. The experiments and analyses are well aligned with the questions being addressed, and the results appear robust and reliable.

      Moreover, the implementation of experiments that combine calcium imaging and optogenetics in freely moving animals is technically challenging and appears well justified in the context of the research questions.

      Weaknesses:

      While the authors discuss different quiescent states mediated by serotonin reported in previous studies, more thorough attempt to determine whether the observed state corresponds to any of the previously described forms of quiescence, or represents a subset or variant of them, would strengthen the manuscript. This would help better integrate the findings with the existing literature.

      While addressing these questions may require substantial further work, potentially beyond the scope of the present study, the availability of whole-brain data provides an opportunity to at least explore or discuss these possibilities. In particular, it would be interesting to examine the recruitment of regions not directly stimulated but known to be associated with other neuromodulatory systems or promoting glial activation (e.g., the locus coeruleus).

    1. Reviewer #1 (Public review):

      Summary:

      In this article by Xiao et al. the authors aimed to identify the precise targets by which magnesium isoglycyrrhizinate (MgIG) functions to improve liver injury in response to ethanol treatment. The authors found through a series of in-vivo and molecular approaches that MgIG treatment attenuates alcohol-induced liver injury through a potential SREBP2-IdI1 axis. The revised manuscript adds to a previous set of literature showing MgIG improves liver function across a variety of etiologies, and also provides mechanistic insight into its mechanism of action. All major weaknesses were addressed in the revised submission.

      Strengths:

      (1) The authors use a combination of approaches from both in-vivo mouse models to in-vitro approaches with AML12 hepatocytes to support the notion that MgIG does improve liver function in response to ethanol treatment.

      (2) The authors use both knockdown and overexpression approaches, in-vivo and in-vitro, to support most of the claims provided.

      (3) Identification of HSD11B1 as the protein target of MgIG, as well as confirmation of direct protein-protein interactions between HSD11B1/SREBP2/IDI1 is novel.

      Comments on revision:

      The authors addressed all my concerns. No additional comments.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors investigated magnesium isoglycyrrhizinate (MgIG)'s hepatoprotective actions in chronic-binge alcohol-associated liver disease (ALD) mouse models and ethanol/palmitic acid-challenged AML-12 hepatocytes. They found that MgIG markedly attenuated alcohol-induced liver injury, evidenced by ameliorated histological damage, reduced hepatic steatosis, and normalized liver-to-body weight ratios. RNA sequencing identified isopentenyl diphosphate delta isomerase 1 (IDI1) as a key downstream effector. Hepatocyte-specific genetic manipulations confirmed that MgIG modulates the SREBP2-IDI1 axis. The mechanistic studies suggested that MgIG could directly target HSD11B1 and modulate the HSD11B1-SREBP2-IDI1 axis to attenuate ALD. This manuscript is of interest to the research field of ALD.

      Strengths:

      The authors have performed both in vivo and in vitro studies to demonstrate the action of magnesium isoglycyrrhizinate on hepatocytes and an animal model of alcohol-associated liver disease.

      My first question: All the treatment arms (A-control, MgIG-25 mg/kg, MgIG-50 mg/kg) showed significant body weight loss compared to the untreated controls (Supplemental Figure 1A), but the body weight significantly increased in the treatment arms (A-control and MgIG-50 mg/kg) compared to the untreated controls (Figure 1E). Why?

      My second question: Mice with MgIG (25 mg/kg) showed the lowest body weight, compared to either A-control or MgIG (50 mg/kg) treatment. According to the authors' explanation, the MgIG (25 mg/kg) caused bodyweight loss are attributed to inter-individual variability, differences in metabolic adaptation, or sample size-related variation. Did these differences happen in MgIG (25 mg/kg) only? or in all other groups? The mouse group assignment should be randomized; however, a large variation in bodyweight was seen in MgIG (25 mg/kg) group. It is not convincing for the author to select MgIG (50 mg/kg) group for subsequent animal experiments, because of a large variation in MgIG (25 mg/kg) group, and because that MgIG (50 mg/kg) group demonstrated more consistent and stable improvements across multiple parameters. The author should reanalyze and compare all the raw data between MgIG (50 mg/kg) group and MgIG (25 mg/kg) group, and address the issues being pointed out and justify rationale for the animal group assignment.

      The author's response did not answer my question. If the authors believe it could be experimental constraints associated with the MgIG formulation, then it is questionable for this MgIG formulation used in all other associated experiments. The experiments, at least those the MgIG formulation associated experiments, need to be repeated.

      The author explained the relative expression was normalized to GAPDH (fold change), but they did not answer my question. My question is for Figure 5B. in Figure 5B (left, Hsd11b1-KD), scramble control showed over 100 (unit), however, in Figure 5B (right, Hsd11b1-OE), scramble control showed only 0.5-1 (unit). The data seemed that authors used same scramble control for both KD and OE? If yes, they should provide more details of the KD and OE experiments and explain why this happened. If they used plasmid for OE control, they also need to clarify it. In addition, qPCR is not a good assay to show the success of KD or OE, Western blotting should be done as convincing data to show the success of KD or OE.

      Comments on revised version.

      In this revision, all the issues are addressed.

    1. Reviewer #1 (Public review):

      Summary:

      The authors combine PSMC and habitat modeling to try to connect habitat change during the Last Glacial Period to changes in Ne.

      Strengths:

      Observing how tropical single-island endemic bird species responded to habitat change in the past may help inform conservation interventions for these particularly vulnerable species. The combination of genomics and habitat modeling is a good idea-this sort of interdisciplinary thinking is what is needed to tackle these complex questions. Additionally, the use of PSMC makes it possible to perform this analysis on poorly-studied species with only a single genome available.

      Room for Improvement:

      A paper was cited to support the idea, but why coalescent Ne is a better predictor of extinction risk than current genomic diversity or current Ne isn't explicitly explained in this paper.

      Differing PSMC parameters may also impact results: the differences between passerines and non-passerines was one of their main results. They explain why they chose different mutation rates for the two groups, but they do not provide any analysis to show this difference was not driven by the different mutation rates used for the two groups.

      For five of the species tested, PSMC parameter differences led to different results, but the species shown in table S4 are different from what is listed in the manuscript.

      Ecosystems are highly complex; there may also be other variables influencing past demographic change other than those explored here. Results should be interpreted with caution.

    2. Reviewer #2 (Public review):

      Summary and strengths:

      In this manuscript, Karjee and colleagues used coalescent based effective population size reconstruction (PSMC) from single genomes to understand past population trends in island birds and related this to life history traits and glacial patterns. In this analysis they chose to use a generation time of 2 years for passerines and 1 year for non-passerines. Non-passerine birds include Amazona vittata which only reaches sexual maturity at 3-5 years; Amazona guildingii which reaches sexual maturity at ~5 years; Amblyornis subalaris at 7 years etc. This means that the choice of generation time is very poorly matched to the species biology of many of the focal systems. What this will do is to "squash" the PSMC plot, meaning that population trends will not match with when they actually occurred. As a result, glaciation windows are not correctly placed. It is my opinion that the results are not interpretable in the current form.

      The authors must adjust the generation time to roughly the median period between average age of sexual maturity and age of death. It should represent the time when an individual has had 50% of their offspring. After which all analyses must be repeated.

    1. Reviewer #1 (Public review):

      Summary of goals:

      The authors' stated goal (line 226) was to compare gene expression levels for gut hormones between males and females. As female flies contain more fat than males, they also sought to identify hormones that control this sex difference. Finally, they attempted to place their findings in the broader context of what is already known about established underlying mechanisms.

      Strengths:

      (1) The core research question of this work is interesting. The authors provide a reasonable hypothesis (neuro/entero-peptides may be involved) and well-designed experiments to address it.

      (2) Some of the data are compelling, especially positive results that clearly implicate enteropeptides in sex-biased fat contents.

      Comments on revised version:

      There are small but useful improvements in the revised manuscript. Textual revisions have helped clarify some points, and I particularly appreciate the model (Figure 5). It gives a broader overview of fat storage regulation, even if new insights are limited to a generic statement that this phenomenon is complex (e.g. line 261).

      One crucial sticking point is again the handling of statistics. As the authors now explain, peptide knockdown effects are significant only if the experimental group differs from both parental controls (lines 191-194). By this definition (which is indeed the field standard and I also agree with), Tk knockdown had no significant effect (Figure 3B). The authors partially acknowledge this, initially calling the result a trend (line 198), but in many other places in their manuscript (e.g. lines 258-259, line 333) including in the Abstract (line 30) they (misre)present it as if it were significant. I have a huge problem with this, and it is the reason why I evaluate the strength of the evidence as Incomplete.

      Overall, I do not think it is meaningful for authors to undergo a new (second) revision if they do not carry out experiments to address key points.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      Summary:

      The manuscript by Bohra et al. describes the indirect effects of ligand-dependent gene activation on neighboring non-target genes. The authors utilized single-molecule RNA-FISH (targeting both mature and intronic regions), 4C-seq, and enhancer deletions to demonstrate that the non-enhancer-targeted gene TFF3, located in the same TAD as the target gene TFF1, alters its expression when TFF1 expression declines at the end of the estrogen signaling peak. Since the enhancer does not loop with TFF3, the authors conclude that mechanisms other than estrogen receptor or enhancer-driven induction are responsible for TFF3 expression. Moreover, ERα intensity correlations show that both high and low levels of ERα are unfavorable for TFF1 expression. The ERa level correlations are further supported by overexpression of GFP-ERa. The authors conclude that transcriptional machinery used by TFF1 for its acute activation can negatively impact the TFF3 at peak of signaling but once, the condensate dissolves, TFF3 benefits from it for its low expression.

      Strengths:

      The findings are indeed intriguing. The authors have maintained appropriate experimental controls, and their conclusions are well-supported by the data.

    2. Reviewer #3 (Public review):

      Summary:

      In this manuscript Bohra et al. measure the effects of estrogen responsive gene expression upon induction on nearby target genes using a TAD containing the genes TFF1 and TFF3 as a model. The authors propose that there is a sort competition for transcriptional machinery between TFF1 (estrogen responsive) and TFF3 (not responsive) such that when TFF1 is activated and machinery is recruited, TFF3 is activated after a time delay. The authors attribute this time delay to transcriptional machinery that was being sequestered at TFF1 becomes available to the proximal TFF3 locus. The authors demonstrate that this activation is not dependent on contact with the TFF1 enhancer through deletion, instead they conclude that it is dependent on a phase-separated condensate which can sequester transcriptional machinery. Although the manuscript reports an interesting observation that there is a dose dependence and time delay on the expression of TFF1 relative to TFF3, there is much room for improvement in the analysis and reporting of the data. Most importantly there is no direct test of condensate formation at the locus in the context of this study: i.e. dissolution upon the enhancer deletion, decay in a temporal manner, and dependence of TFF1 expression on condensate formation. Using 1,6' hexanediol to draw conclusion on this matter is not adequate to draw conclusions on the effect of condensates on a specific genes activity given current knowledge on its non-specificity and multitude of indirect effects. Thus, in my opinion the major claim that this effect of a time delayed expression of TFF3 being dependent on condensates in not supported by the current data.

      Strengths:

      The depends of TFF1 expression on a single enhancer and the temporal delay in TFF3 is a very interesting finding.

      The non-linear dependence of TFF1 and TTF3 expression on ER concentration is very interesting with potentially broader implications.

      The combined use of smFISH, enhancer deletion, and 4C to build a coherent model is a good approach.

    1. Reviewer #2 (Public review):

      Summary:

      Zhang and colleagues investigate the molecular mechanisms by which the small brown planthopper (SBPH, Laodelphax striatellus) manipulates host rice carbohydrate metabolism to enhance its own fitness. Using a combination of molecular, pharmacological, and biochemical approaches, they demonstrate that SBPH infestation induces systemic glucose reallocation in rice, as evidenced by the upregulation of glucose levels in aerial tissues and simultaneous reduction in root glucose levels. Notably, host-derived glucose acts as a central signaling molecule, driving two key adaptive traits: enhanced fecundity via the glucose-TOR-JH-Vg signaling cascade, and increased imidacloprid tolerance through synergistic metabolic (GCL-GSH) and regulatory (TOR-JH-GST) pathways targeting GST activity. These findings uncover a sophisticated resource-manipulation strategy in SBPH and identify nutrient-sensing and detoxification pathways as potential targets for pest control.

      Strengths:

      (1) The study addresses a gap in plant-insect coevolution research by identifying glucose as a dual-function signaling molecule that coordinates SBPH reproduction and insecticide tolerance, providing valuable insights into how herbivores exploit host nutritional signals.

      (2) The experimental design is well structured and multifaceted, integrating RNAi, RT-qPCR, Western blotting, pharmacological inhibition, and biochemical assays. The use of appropriate controls (e.g., osmotic controls with mannitol and hydrolase-inhibitor rescue experiments) strengthens the causal interpretation of the results.

      (3) The mechanistic framework is clear and well-supported. The authors delineate two interconnected molecular cascades (glucose-TOR-JH-Vg for fecundity and GCL-GSH/TOR-JH-GST for tolerance) with hierarchical validation (e.g., rescue experiments with JHA), ensuring the reliability of conclusions.

      Weaknesses:

      (1) The study focuses exclusively on SBPH without validating whether the observed phenomena and mechanisms are conserved in closely related planthopper species (e.g., brown planthopper Nilaparvata lugens). This limitation restricts the generalizability of the findings to other economically important rice pests.

      (2) The specific upstream signals that trigger glucose reallocation in rice (e.g., SBPH salivary effectors or oviposition-associated factors) are not identified. Although this represents a complex and independent research direction, the absence of such information limits the depth and completeness of the mechanistic framework and leaves open questions regarding the initiation of host metabolic manipulation.

      (3) Insecticide tolerance assays are limited to imidacloprid. Extending these analyses to one or two additional commonly used insecticides (e.g., thiamethoxam) would help determine whether the glucose-mediated detoxification pathway is specific to imidacloprid or reflects a broader resistance mechanism, thereby strengthening conclusions regarding the generality of the GST activation cascade.

      (4) Given the study's potential implications for pest management, the manuscript would benefit from a brief discussion of possible practical applications, such as manipulating rice glucose metabolism through breeding strategies or developing small-molecule inhibitors targeting the TOR-JH axis. Including such perspectives would enhance the translational relevance of the work by linking mechanistic insights to real-world pest control strategies.

      Comments on revised version.

      The authors have comprehensively and satisfactorily addressed all my comments. The revised manuscript shows significant improvement in quality. I have no further questions or suggestions.

    1. Reviewer #1 (Public review):

      Summary:

      In this study, the authors propose that HSV-1 infection degrades the class I histone deacetylases HDAC1 and HDAC2. The MDM2 E3 ubiquitin ligase from the DNA damage response pathway is responsible for ubiquitinating these HDACs that are subsequently degraded via proteasomes. The authors hypothesize that HDAC degradation will cause hyperacetylation of viral chromatin and enable viral gene transcription.

      Strengths:

      The ubiquitination of HDAC1 & HDAC2 by Mdm2 and the mapping studies are clear.

      Comments on revised version:

      The authors enhanced their manuscript by more supportive data and providing clarification and the necessary corrections. However, a few more issues pertain:

      (1) In Figure 4j at 2 h post-infection we typically see the input virus and not progeny virus production. The input seems to have about 1-log difference that is expected to impact the results.

      (2) Figs 1A, 1E, 2H it seems unclear why ICP4 becomes detectable at 12 h post-infection in HeLa cells? How about other a-genes? How about other cells? ICP4 is typically detectable within 2-3 h post-infection.

      (3) In responses 2-2, Fig 5K: An infection without transfection has not been included. This is important to understand kinetics of infection in transfected cells.

      (4) Why HDAC1 with deleted NES does not accumulate or looks like it is degraded? Why then ICP4 does not accumulate?

    2. Reviewer #2 (Public review):

      Summary:

      The authors discovered that HDAC1/2 are degraded in HSV-1 and PRV infections. They attempted to establish a new mechanism by which HDAC1/2 are translocated to the cytoplasm to be degraded in HSV-1 infection, and the degradation causes changes in histone acetylation to affect the DDR pathway.

      Strengths:

      (1) Interesting findings of HDAC1/2 degradation during HSV-1 and PRV infection, and it may impact more than the virology field.

      (2) Significant work to identify the ubiquitin site in HDAC1/2 and K63 linkage.

      Comments on revised version:

      The authors added experiments to address the previous comments. The added knockdown and overexpression experiments provided sufficient support for the proposed mechanism. The conclusions are now strengthened. However, a few essential controls are still missing.

      (1) Figure 3K: How does the expression level of Flag-HDAC1 variants compare to the endogenous HDAC1 level? The stripe probed by Flag antibody should be reprobed by HDAC1 antibody. Also, how does the K74R mutant affect histone acetylation? Moreover, the numbers between the panels are hard to read and have not been explained.

      (2) Figure 3M and 3L: DNA transfection per se frequently stimulates cell reactions that inhibit HSV-1 replication. Is the HSV-1 only sample transfected by empty vector or untransfected?

      (3) Figure 4G-4J: What is the MDM2 knockdown efficiency?

      (4) Figure 5F and line 400-401: "thereby preventing HDAC1 degradation-markedly impaired HSV-1 replication (Fig. 5F)." However, viral replication is not demonstrated in Figure 5F.

      (5) Figure 5K: also need a control of empty vector. Furthermore, how does the HDAC1 NES expression affect histone acetylation and DDR responses?

      (6) Statements listed below are better moved to discussion after all data being presented. They are quite a stretch when looking at each figure by itself.

      (i) Line 268-270: "Together, these findings indicate that HSV-1 selectively degrades class I HDACs, resulting in widespread histone hyperacetylation that fosters a chromatin state conducive to viral replication". ----may be okay for a statement.

      (ii) Line 291-292: "providing initial evidence that HSV-1 infection promotes DDR activation through downregulation of HDAC1 expression"

      (iii) Line 331-333: "Together, these results indicate that HSV-1 infection promotes K63-linked polyubiquitination of HDAC1/2 at conserved lysine residues, ultimately leading to their proteasomal degradation."

      (iv) Line 334-336 is a repeated sentence.

    1. Reviewer #2 (Public review):

      Summary:

      This paper presents results interpreted to indicate that sequences upstream of stop codons capable of base-pairing with the 3' end of 18S rRNA prolong the dwell time of 80S ribosomes at stop codons in a manner impeded by Rps26 in the 40S subunit exit channel, which leads to the proper completion of termination and ribosome recycling and prevents spurious translation of 3'UTR sequences by one or more unconventional mechanisms.

      Strengths:

      The standard 80S and selective eRF1 80S ribosome profiling data obtained using EZRA-Seq are of high quality, allowing the authors to detect an enrichment for purine-rich sequences upstream of stop codons at sites where termination is relatively slow and ribosomal complexes are paused with eRF1 still engaged in the A site.

      Weaknesses:

      There are many weaknesses in the experimental design and interpretation of results that undermine several of the final conclusions of the study described in the abstract, as described in detail below.

      (1) It's not indicated how far upstream of the stop codon the sequences were searched to find the enriched motifs in Figs. 1C and 2D. If it's further upstream of -15 then the sequence would generally not be found in the exit channel of a terminating ribosome positioned with the stop codon in the A site in the manner expected from their final model of mRNA:18S rRNA pairing. (This would be analogous to the occurrence of the Shine-Dalgarno within 15 nt of the initiation codon for most mRNAs in E. coli.) They could have depicted nucleotide percentages at each nucleotide from -1 to -15 for the high and low pause stop codons to better facilitate consideration of their proposed mechanism of termination pausing involving the 3' end of 18S rRNA.

      (2) lines 234-242: Their reporter data in Fig. 4B suggest that only the presence of GGG triplets at any location in the 9 nt substantially prevents downstream translation. If their interpretation about these G-rich sequences promoting termination by forming G-quadruplexes is correct, then this would have little to do with the purine-rich motifs identified by the profiling experiments (and their proposed function in base-pairing with rRNA), as the purine-rich motifs do not feature GG bases (as shown in Fig. 2D in particular). The authors point out that the MPRA can sample sequence space not represented in living cells. While true, this doesn't change the fact that it failed identify sequences conforming to the purine rich motifs found by the profiling experiments and identified instead sequences capable of forming G-quadruplexes that may well function by a different mechanism than that employed in cells. The authors cannot persist in claiming that the MPRA results confirm the findings of the profiling experiments regarding the purine-rich motif. Also, the claim of enrichment for C-rich sequences in the MPRA results is not compelling as only 3 of the 11 triplets showing the smallest M/P ratios contain more than 1 C and three of them contain no Cs. Also, there was no evidence for depletion of C's upstream of the stop codons with low pause scores from the ribosome profiling data in Fig. 1, so it's inaccurate to claim "mirroring" of results from the ribosome profiling and MPRA data on this point as well.

      (3) lines 256-260: I still contend that the different results shown in Fig. 4E for the C-rich and GA-rich sequences are not compelling as results for only a single sequence of each type are shown, which might not be typical of the entire class. In fact, the GA-rich sequence has two GG's and could form a G-quadruplex, whereas the GA-rich motifs identified by ribosome profiling and eRF1-seq do not exhibit consecutive GGs, such that the single G-rich sequence chosen for analysis might function by G-quadruplex mediated stalling rather than base-pairing with the 3' end of 18S rRNA, as they actually suggested in their rebuttal. Even the second GA-rich sequence analyzed in Fig. S3G has two GGs. Thus, while the results in Fig. 4 provide support for the notion that C-rich sequences preceding the stop codon promote stop codon read-through, it's important to note that no evidence was obtained by ribosome-profiling in Fig. 1 that the increased 3'UTR translation seen for low-pause stop codons is associated with C-rich sequences. It's unclear why they would be unable to observe this in the manner they document for the eRF1-Seq data in Fig. 2D for the three C-rich triplets enriched at stop codons lacking eRF1 peaks.<br /> - lines 278-282: These differences are quite small and could arise from the different sequences of the GFP-HiBit fusion proteins, as observed in Fig. 4C (top two control constructs), precluding mechanistic interpretations.

      (4) Notwithstanding their claim in the rebuttal, I still find no definition of the GA-rich and C-rich mRNAs described in Fig. 5C in the Methods or legends, nor whether the compilation is restricted to -15 from the stop codons. In addition, if expression of the mutant 18S rRNA is sufficient to alter the height of the termination peaks as shown in Fig. 5C and to alter reporter expression in Fig. 5D, I see no reason why they cannot carry out the pause score/motif enrichment of Fig. 1C to determine if they see the expected diminished enrichment for the GA-motif shown there on expressing the mutant 18S vs. the WT 18S control strain. If not, this would undermine their interpretation of the results in Figs. 5C-D as favoring base-pairing between the 3' end of 18S rRNA and sequences upstream of the stop codon.

      (5) I still find a significant shortcoming in their failure to analyze the 18S rRNA 3' end biochemically to show that the expected ~15% with the mutant sequence. Stating simply that they followed a previous protocol is not sufficient to document their success in this notoriously challenging experimental approach.

      (6) lines 382-384: The level of the control protein RACK1 is diminished in testis polysomes, and it's unclear that the ratio of Rps26:RACK1 is actually lower in testis polysomes in the manner claimed.

      (7) lines 414-427: I still contend that the authors should have quantified the ratio of the stop codon peak to the adjacent coding sequences in Figures 7E to establish that Rps26 OE decreased the stop codon peaks selectively on the GA-rich cohort of mRNAs. In addition, they still have not explained why the C-rich reporter behaves like the GA-rich reporter in Fig. 7F in showing reduced HiBiT expression on Rps26 OE when it should be unaffected. As such, the reporter data do not support the conclusion reached from the data in Fig. 7E.

      (8) Notwithstanding their rebuttal I still contend that the failure to measure Rps26 association with 80S ribsoomes or polysomes and show that it is depleted by the shRNA knockdown and increased by Rps26 OE is a significant shortcoming, especially since their interpretation of the OE data depends on the occurrence of 40S subunits lacking Rps26 in unstressed WT cells, which seems improbable based on the prior work on yeast.

      (9) Overall, examining the claims in the revised Abstract, I feel that I am in agreement with the claim "We identify a sequence motif upstream of the stop codon that promotes termination pausing,.." but disagree that the function of this motif was "validated by massively paralleled reporter assays", for the reasons stated above in point 2. Regarding the statement "Unexpectedly, reduced termination pausing increases the likelihood of stop codon slippage, giving rise to proteins with heterogenous C-terminal extensions." , I believe it would be more cautious to say that "reduced pausing is associated with stop codon read-through accompanied by frameshifting" since the MRPA did not provide compelling evidence for causality for the reasons described in point 3 above. Regarding the statement "Mechanistically, we show that sequence-dependent termination pausing arises from post-decoding mRNA scanning by the 3' end of 18S rRNA", I find this statement too strong in view of the shortcomings described above in points 4-5 and think it would be more correct to say that their findings are consistent with (rather than showing) this point, and also think they should add qualifying statements to the manuscript acknowledging the limitations of these experiments. I further contend that there are shortcomings in the experiments leading to the conclusion that the stoichiometry of Rps26... modulates mRNA:rRNA interactions, described above in points 6-9. Finally, in the last sentence, the claims that termination pausing is shaped by ribosome heterogeneity, and cell type-specific translational control is too strong.

    2. Reviewer #3 (Public review):

      Summary:

      This study from Jia et al carried out a variety of analyses of terminating ribosomes, including the development of eRF1-seq to map termination sites, identification of a GA-rich motif that promotes ribosome pausing, characterization of tissue-specific termination dynamics, and elucidation of the regulatory roles of 18S rRNA and RPS26. Overall, the study is thoughtfully designed, and its biological conclusions are well supported by complementary experiments. The tools and datasets generated provide valuable resources for researchers investigating the mechanisms of RNA translation.

      Strengths:

      (1) The study introduces eRF1-seq, a novel approach for mapping translation termination sites, providing a methodological advance for studying ribosome termination.

      (2) Through integrative bioinformatic analyses and complementary MPRA experiments, the authors demonstrate that GA-rich motifs promote ribosome pausing at termination sites and reveal possible regulatory roles of 18S rRNA in this process.

      (3) The study characterizes tissue-specific ribosome termination dynamics, showing that the testis exhibits stronger ribosome pausing at stop codons compared to other tissues. Follow-up experiments suggest that RPS26 may contribute to this tissue specificity.

      Weaknesses:

      The biological significance of ribosome pausing regulation at translation termination sites or of translational readthrough, for example across different tissue types, remains unclear. Nevertheless, this question lies beyond the primary scope of the current study.

      Comments on the latest version:

      The authors addressed my comments by revising the claims in the manuscript.

    1. Reviewer #1 (Public review):

      Summary:

      Plasmodesmata are channels that allow cell-cell communication in plants; based on the functional similarities between facilitated transport within plasmodesmata and into the nucleus, the authors speculate that nuclear pore complex proteins might be involved in plasmodesmata function. In this manuscript, they localize nuclear pore complex proteins to plasmodesmata using proteomics and heterologous overexpression. They also document a possible plasmodesmata transport defect in a mutant affecting one nuclear pore complex protein.

      Strengths:

      The main strength of this manuscript is the interesting and novel hypothesis. This work could open exciting new directions in our understanding of plasmodesmata function and cell-cell communication in plants. They also localized many NUPs (12/35 Arabidopsis NUPs).

      Weaknesses:

      The main weakness of this manuscript is that the data are solid, but could benefit from further controls. The authors appropriately and frequently acknowledge caveats to their data, which include: 1) that the proteomics preparations cannot completely purify plasmodesmata; 2) heterologous expression does not allow them to assess the function of the fluorescently-tagged NUPs; 3) some NUPs may be overexpressed, especially in the heterologous system, which can lead to localization artefacts; 4) ER-localized proteins can appear partially localized to plasmodesmata.

      Comments on revised version.

      In the revised version of the manuscript, the authors have addressed my main concerns from the previous review and they acknowledge the caveats and alternative interpretations to their results in the text. However, although some important controls have been added, the rationale for why different NUPs were used in different control experiments is often unclear, and it is also unclear why specific NUPs (corresponding to different locations in the nuclear pore complex) were selected for each experiment. This includes:

      a) Expression level analysis via proteomics: NUP62 (core FG NUP)<br /> b) Colocalization with known PD protein: HOS1 (outer ring)<br /> c) Colocalization with ER marker: NUP43 (outer ring)<br /> d) Complementation assays: CPR5 (membrane anchor) - only the rationale for this choice is articulated clearly (lines 224-228).

      However, they have not systematically conducted all controls for one NUP, nor explained why they selected specific different NUPs, corresponding to different localizations within the complex, for the control experiments.

      Generally, the manuscript needs careful proofreading. There are a number of typos, misused punctuation, sentence fragments, etc.

      - As one example, see the legend for Figure 5: there are two different definitions of white arrowheads, yet green are not defined; there is a sentence fragment on line 1320 ("And aniline blue."); there is double punctuation on line 1321 "localization.,"; and red arrows are defined as "mCherry-HDEL specific localization., without overly with other markers" yet in several cases, they point to either 1) regions of only mCherry-HDEL in cells not expressing NUP43-mVenus (both red arrows in the second row of images, which are biologically meaningless and potentially misleading) or 2) red arrows pointing to sites where mCherry-HDEL and NUP43-mVenus are colocalized (top two red arrows in the first row of images, which are biologically meaningful yet incorrectly interpreted by the authors). These are just a small example set of the proofreading required.

    2. Reviewer #2 (Public review):

      Summary:

      The authors aim to address whether nuclear pore complex components localize and function at PD in plant cells to mediate cell-to-cell communication.

      Strengths:

      (1) Novelty and Significance:

      The core hypothesis, drawing parallels between PD and NPC transport, is highly original and addresses a critical gap in understanding plant intercellular communication. The idea that phase-separated domains formed by FG-NUPs could act as diffusion barriers at PD offers an alternative and plausible explanation for their complex transport properties, including size exclusion and facilitated translocation. This could fundamentally change how we view PD transport and function.

      (2) Comprehensive Evidence:

      The study employs a rigorous and diverse set of experimental approaches, including a comprehensive bioinformatic analysis of both moss and Arabidopsis NUPs in available PD proteomic datasets, extensive imaging analysis of Nup localization in vivo, and functional transport assays using a loss-of-function nup mutant (cpr5). The transport assay is particularly important to provide functional evidence linking CPR5 to PD-mediated transport. The finding that callose levels were not significantly different in cpr5 mutants under these conditions is helpful and supports a distinct, callose-independent mechanism of transport regulation.

      (3) Objectivity:

      The authors are forthright in discussing the limitations and potential artifacts of their own data, clearly distinguishing between observations and definitive conclusions.

      Weaknesses:

      While the claims are generally justified as hypotheses or consistent observations, the authors themselves extensively detail the caveats, which are worth reiterating for clarity:

      (1) Potential Overexpression Artifacts in Localization:

      Although efforts were made to control expression levels, the authors acknowledge that transient overexpression could still lead to NUP accumulation at PD, either as a physiologically irrelevant accumulation under excess conditions or due to mis-targeting. Note that they provided data showing Nup62 PD localization at a near native level.

      (2) CPR5 Mutant Interpretation:

      While cpr5 mutants exhibited reduced macromolecular transport, the authors state that they cannot exclude that the reduced transport is due to secondary effects in the cpr5 mutants, which show rather severe phenotypic defects. This is an important distinction, as CPR5 has known roles in defense responses and hormone signaling that could indirectly influence PD integrity, independent of callose deposition. The lack of effect on small molecule transport is a good control, but the broader pleiotropic effects of cpr5 mutants remain a consideration.

      (3) Conceptual Distinction between NPC and PD:

      The authors correctly point out that while similarities exist, the physical assembly of NUPs at PD must differ from that at the NPC due to the presence of the desmotubule and smaller cytoplasmic sleeve width at PD. Moreover, nucleocytoplasmic transport depends on kayropherin proteins (importins) that interact with the NPC central channel to complete the transport. Yet the role of karyopherins in this case is not clear. Therefore, the proposed "PD pore complex" may bear some NPC features, but not identical.

    3. Reviewer #3 (Public review):

      Summary:

      This manuscript presents a step towards testing the hypothesis that plasmodesmata have homology to nuclear pores. The similarities between the two structures have long been noted as both structures allow the transport of proteins and nucleic acids and both structures are composed of curved membranes. The manuscript has identified nuclear pore proteins (NUPs) in plasmodesmal protein fractions and uses live imaging in a non-endogenous system and functional assays of a mutant to propose that this might be a bone fide association.

      The conclusions the authors seek to draw are that: NUPs are present in plasmodesmal protein fractions; NUPs localise at plasmodesmata; NUPs might form a pore-gating complex at plasmodesmata, regulating non-specific (2xGFP) and specific (SHR) transport through plasmodesmata.

      The authors then use these conclusions to propose the possibility that phase separation mediates transport through plasmodesmata. If there is phase separation at plasmodesmata or a nuclear pore-like complex, it would revolutionise the community. However, this data is insufficient to act as a cornerstone for such a discovery.

      Strengths:

      The strength of the manuscript lies in the boldness and novelty of the idea.

      Weaknesses:

      The weaknesses lie in the lack of resolution over the specificity of the plasmodesmal association of the NUPs. The authors' own assessments of their data suggest they agree with this - in their abstract alone they point out that the transport defects they observe might be off-target effects and suggest there is a requirement in the future to determine whether the NUPs are bona fide PD components.

      Across the proteomic and live imaging experiments, the authors have tried to make their initial conclusions stronger by comparing the NUP localisation and accumulation with ER proteins. Thus, they have demonstrated that there are some differences in the localisations between the NUPs and an ER-lumen marker, although there are also many similarities. Indeed, for CPR5 they have demonstrated that the protein in ER located and their imaging shows a very clear association with ER beyond the plasmodesmata. Residence in the ER does not prevent the possibility that the protein has a plasmodesmal function, but it does raise questions of specificity of the localisation at the plasmodesmata (and nuclear envelope) when it is evident throughout the ER. The authors acknowledge the possibility that PD accumulation is artefactual, so they are aware of this.

      In my initial review I suggested that super-resolution imaging of an ER marker would help interpret the structures revealed by CPR5 in Figure 6. The authors indicated that because the localisation of NUPs looked different to the ER luminal marker that this wasn't a priority. However, they have shown that CPR5 is an ER-resident protein and so I disagree with this conclusion. I think this experiment would provide valuable information regarding whether there is any specificity in CPR5 accumulation at plasmodesmata.

      Regarding the proteomic identification of NUPs in plasmodesmal fractions, the authors place significant weight on their own metric for PD enrichment, the PD score. As I understand it, this a metric derived from addition of two factors: a two component enrichment score that is the difference between intensity of peptides of a given protein in the PD fraction and cell wall fraction, added to the difference between intensity of peptides of a given protein in the PD fraction and total cell fraction, and a feature score that is a factor that describes representation of protein domains contained in said given protein in the plasmodesmal fraction relative to the representation of that domain in proteins in the whole proteome. The features chosen for analysis are not indicated and the feature factor, as I understand it is a score common to all proteins with a given feature. While each of the factors carries a measure of meaning and information, I do not understand how adding them is mathematically or biologically meaningful.

      Regarding the possibility that there is a pore-gating complex at plasmodesmata. If NUPs are specifically located at plasmodesmata, this is a strong hypothesis. The authors approach this functionally by assaying for protein and dye movement through plasmodesmata in the cpr5 mutants. These experiments suggest that cpr5 mutants have reduced transport through plasmodesmata for both proteins, but not for a smaller dye. In their introduction the authors identify how PD structure can modify transport capacity so there are many technical and biological phenomena that could explain these data. Further, as the authors themselves acknowledge, altered protein movement might also arise from an off-target developmental phenotype. Many proteins have been shown to have no association with plasmodesmata but an indirect effect on their function. This hasn't been investigated and so cannot be ruled out.

    1. Reviewer #1 (Public review):

      Summary:

      Some of the authors proposed in a PNAS paper in 2016 the occurrence of the Entner-Doudoroff (ED) pathway in cyanobacteria and plants, on the basis of several lines of biochemical and genetic evidence. However, more recent results indicated that one of the two specific enzymes of the ED pathway (EDD) is missing in Synechocystis PCC 6803. The authors carried out additional experiments, which demonstrated that EDD is missing, and one of the enzymes (ED aldolase) is a promiscuous enzyme which seems to be involved in proline metabolism and is not actually participating in the ED pathway as initially believed. The results described in this paper are strong evidence that this new interpretation is appropriate, and therefore, it corrects the previous proposal, providing an honest description of the reasons why the authors had reached the wrong conclusion about the existence of the ED pathway in cyanobacteria and plants.

      Strengths:

      Thorough reanalysis of the experimental results obtained in previous studies, which led to the publication of the PNAS paper in 2016.

      New experimental evidence to confirm that enzymes previously considered as participating in the ED actually are not catalyzing the ED biochemical reactions, but are involved in other metabolic pathways. Also, the authors completely discarded the occurrence of the GDH/GK shunt in Synechocystis PCC 6803. Generally speaking, the manuscript is very clearly written, with a precise description of the previous findings, the mistakes which took place in the 2016 paper, and the strategies they have used to address those issues, in order to reach a thoroughly revised vision of the glucose metabolic pathways in Synechocystis PCC 6803. In this regard, the drawings shown in Figures 1 and 7 are very helpful for the reader to follow the story and understand the possible metabolic transformations depending on the working hypothesis.

      Also, I commend the authors for openly describing previous mistakes. In this paper, they reassess past observations in light of more recent findings and to integrate the information in this manuscript. The scientific conclusions are solid and very interesting, and besides, they use the opportunity to offer valuable advice to researchers. This is especially focused on the importance of careful biochemical characterization of enzymes, which should always be carried out when studying proteins which have been identified as a specific enzyme on the basis of sequence homology. In a similar way, they found that an insertional mutant was the cause of the absence of specific metabolites, which had been attributed to particularities of a metabolic pathway in that mutant, when it was actually due to a nucleotide insertion; this could have been easily prevented by confirming the correct generation of the mutant by DNA sequencing.

      Weaknesses:

      The authors propose that EDA might be involved in the PEP-pyruvate-OAA node, or in the proline metabolism, but this requires further experimental work for clarification; what their results indicate clearly is that this enzyme is not actually catalyzing the transformation of KDPG to GAP, which is the second specific enzyme of the ED pathway. But the real physiological function in this cyanobacterium is still unconfirmed.

      Another aspect which could be improved is that the recombinant expression of some genes was carried out in E. coli; even if this is a useful and valid research strategy, in studies like this (where there is a strong focus on the physiological function of enzymes in the original organism, Synechocystis PCC 6803), I think it would have been more appropriate to express the 6803 genes in another cyanobacterium easily amenable for genetic transformation and gene expression, which would produce the protein in a physiological environment more similar to another cyanobacterium (compared to E. coli, which is an heterotrophic bacterium). I am not sure this would change any of the obtained results, but it certainly would confer additional robustness to the enzymatic results.

      Bibliography:

      I think the list of papers used in this manuscript is complete and up to date. However, I do miss recent papers which addressed one aspect that was proposed in the original 2016 PNAS paper: the authors wrote, "We therefore suggest that Prochlorococcus might oxidize glucose via the ED pathway under mixotrophic conditions, as shown for Synechocystis." Recent studies checked this hypothesis and have shown that the ED pathway seems to be also missing in Prochlorococcus and marine Synechococcus, and I think this manuscript is a good place to cite them, since these results are consistent with the findings of this paper.

    2. Reviewer #2 (Public review):

      Summary:

      The study presents novel results on the presence of the Entner-Doudoroff pathway in Synechocystis sp. PCC 6803. In contrast to an earlier study, compelling evidence is given that this strain lacks both an ED pathway and a glucose dehydrogenase/glucokinase bypass but contains a promiscuous aldolase, which also decarboxylates oxaloacetate and cleaves 2-keto-4-hydroxyglutarate (as it occurs in proline degradation). The study concludes with successfully reconciling data from different studies and with lessons learned from the previous misconception.

      Strengths:

      Solid biochemical data are presented to reconcile contradicting data of earlier studies and to serve as a basis for disclosing possible functions of a promiscuous aldolase. Earlier misconceptions and lessons to be learned are well discussed.

      Weaknesses:

      The materials and methods section is rather lengthy, suffering from a lack of conciseness and repetition, and nevertheless misses some specifications.

    1. Reviewer #1 (Public review):

      Summary:

      The authors attempt to use a combination of behavioural and EEG analyses in order to investigate whether expectation of task difficulty influences spatial focus narrowing in the context of a spatially cued task, alongside an expected attention-related amplitude effect. This distinguishes the experiment from previous tasks which looked at this potential spatial narrowing in the context of more non-cued diffuse attention tasks. The authors present 2 major findings.<br /> (1) Behaviourally, they analysed the effects of cue validity and difficulty expectation on response accuracy and found that participants displayed an effect of difficulty expectation in validly cued trials, showing relatively enhanced behaviour to Hard Expectation trials, but no effect of expectation in invalidly cued trials.<br /> (2) Inverted encoding modelling on broadband EEG showed greater pre-target attentional processing in the Hard Expectation blocks. They go on to show that this enhancement comes in the form of greater amplitude of the Channel Tuning Functions (CTFs) approximately 300 to 400ms post-cue, in the absence of any spatial tuning specificity enhancement (as would be evident in a difference in CTF fit width). Together these results provide valuable findings for those investigating the separable effects of expectation and attention on target detection in visual search.

      Strengths:

      (1) This is a very solidly performed experiment and analysis, with different streams of evidence convincingly pointing in the same direction, i.e. a gain effect of Expectation in the absence of a spatial tuning effect.

      (2) EEG is competently analysed and interpreted, and the paper is well written, and simple in its motivation.

      (3) The authors report appropriately on the results in the Discussion, without overreaching.

      Comments on revised version:

      The authors have addressed all of my comments. Very interesting work, thank you!

    2. Reviewer #2 (Public review):

      Summary:

      The authors set out to determine whether people can adjust how narrowly or broadly they focus attention in advance based on expectations about how difficult an upcoming visual task will be. Specifically, they aimed to test whether expecting a more demanding search leads to a narrower focus of attention or instead strengthens attention at the relevant location without changing its spatial extent.

      Strengths:

      The study addresses a timely and interesting question about how expectations influence the preparation of attention before a task begins. The experimental design is well suited to isolating anticipatory effects by manipulating expectations about task difficulty independently of moment-to-moment stimulus information. The manuscript is clearly written, and the methods are described in sufficient detail to support transparency and reproducibility.

      Comments on revised version.

      During the review process the authors addressed my previous concerns. The revisions have improved the clarity of the analyses and the interpretation of the results, and I have no further substantive comments.

    1. Reviewer #1 (Public review):

      Summary:

      This useful study provides incomplete evidence of an association between atovaquone-proguanil use (as well as toxoplasmosis seropositivity) and reduced Alzheimer's dementia risk. The study reinforces findings that VZ vaccine lowers AD risk and suggests that this vaccine may be an effect modifier of A-P's protective effect. Strengths of the study include two extremely large cohorts, including a massive validation cohort in the US. Statistical analyses are sound, and the effect sizes are significant and meaningful. The CI curves are certainly impressive.

      Weaknesses include the inability to control for potentially important confounding variables. In my view, the findings are intriguing but remain correlative / hypothesis generating rather than causative. Significant mechanistic work needs to be done to link interventions which limit the impact of Toxoplasmosis and VZV reactivation on AD.

      Weaknesses:

      Major:

      (1) Most of the individuals in the study received A-P for malaria prophylaxis as it is not first line for Toxo treatment. Many (probably most) of these individuals were likely to be Toxo negative (~15% seropositive in the US), thereby eliminating a potential benefit of the drug in most people in the cohort. Finally, A-P is not a first line treatment for Toxo because of lower efficacy.

      (2) A-P exposure may be a marker of subtle demographic features not captured in the dataset such as wealth allowing for global travel and/or genetic predisposition to AD. This raises my suspicion of correlative rather than casual relationships between A-P exposure and AD reduction. The size of the cohort does not eliminate this issue, but rather narrows confidence intervals around potentially misleading odds ratios which have not been adjusted for the multitude of other variables driving incident AD.

      (3) The relationship between herpes virus reactivation and Toxo reactivation seems speculative.

      (4) A direct effect on A-P on AD lesions independent on infection is not considered as a hypothesis. Given the limitations above and effects on metabolic pathways, it probably should be. The Toxo hypothesis would be more convincing if the authors could demonstrate an enhanced effect of the drug in Toxo positive individuals without no effect in Toxo negative individuals.

      Minor:

      (5) "Clinically meaningful" should be eliminated from the discussion given that this is correlative evidence.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript examines the association between atovaquone/proguanil use, zoster vaccination, toxoplasmosis serostatus and Alzheimer's Disease, using 2 databases of claims data. The manuscript is well written and concise. The major concerns about the manuscript center around the indications of atovaquone/proguanil use, which would not typically be active against toxoplasmosis at doses given, and the lack of control for potential confounders in the analysis.

      Strengths:

      (1) Use of 2 databases of claims data.

      (2) Unbiased review of medications associated with AD, which identified zoster vaccination associated with decreased risk of AD, replicating findings from other studies.

      Weaknesses:

      (1) Given that atovaquone/proguanil is likely to be given to a healthy population who is able to travel, concern that there are unmeasured confounders driving the association.

      (2) The dose of atovaquone in atovaquone/proguanil is unlikely to be adequate suppression of toxo (much less for treatment/elimination of toxo), raising questions about the mechanism.

      (3) Unmeasured bias in the small number of people who had toxoplasma serology in the TriNetX cohort.

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Torro et al. presented CellDetective, an open-source software designed for a user-friendly execution of single cell segmentation, tracking and analysis of time-lapse microscopy data. The authors demonstrated the applications of the software by measuring NK cell spreading events acquired with reflection interference contrast microscopy (RICM), as well as detecting target cell death events and their interaction with neighboring NK cells in a multichannel widefield microscopy datasets.

      Strengths:

      The segmentation (StarDist, Cellpose) and tracking (bTrack) modules implemented were based on existing and published software packages, while the event detection, classification and analysis modules were added by the authors to enable an end-to-end time-lapse microscopy data processing and analysis pipeline, complete with graphical user interface (GUI) to minimize coding experience required from the user. The latest iteration of CellDetective also incorporates new features that enable multiple cell subsets to be examined and visualized. The documentation that accompanies CellDetective is also well written.

      Weaknesses:

      The current iteration of CellDetective is still limited to 2D 'widefield' analysis, although the authors have provided convincing justification for the current implementation for 2D + time analysis and clarified such limitations of the software in the manuscript. This reviewer maintains that support for 3D + time analysis in future iterations of CellDetective will substantially improve its applicability across broad disciplines, especially with emerging focus on 3D organoid studies.

      Additionally, this reviewer has also encountered a key technical issue with the latest version of CellDetective (v1.5.2, installed on Windows 11 25H2) where the main CellDetective window is displayed in a fixed size that prevented the user from accessing the user interface/buttons that are essential for operating the software. As an example, in the very first demo (https://celldetective.readthedocs.io/en/latest/first-experiment.html), the fixed window size prevented this reviewer from accessing the "Submit" button in Step 2: Segment Cells (which is not visible as the fixed window size only displayed a certain portion of the GUI) of the workflow. This limitation made it near impossible to evaluate the useability and stability of the software. Fixing this issue by making the window size adjustable such that these buttons of the interface can be accessed by the user will be important to ensure the useability of the software.

      This reviewer understands the difficulties and time involved in bug fixing, and hope that the experience could have been much smoother and the software behaves much more stably in order to maximize its useability.

    1. Reviewer #1 (Public review):

      Summary:

      This paper investigates the physical basis of epithelial invagination in the morphogenesis of the ascidian siphon tube. The authors observe changes in actin and myosin distribution during siphon tube morphogenesis using fixed specimens and immunohistochemistry. They discover that there is a biphasic change in the actomyosin localization that correlates with changes in cell shapes. Initially, there is the well-known relocation of actomyosin from the lateral sides to the apical surface of cells that will invaginate, accompanied by a concomitant lengthening of the central cells within the invagination, but not a lot of invagination. Coincident with a second, more rapid, phase of invagination, the authors see a relocalization of actomyosin back to the lateral sides of the cells. This 2nd "bidirectional" relocation of actin appears to be important because optogenetic inhibition of myosin in the lateral domain after the initial invaginations phase resulted in a block of further invagination. Although not noted in the paper, that the second phase of siphon invagination is dependent on actomyosin is interesting and important because it has been shown that during Drosophila mesoderm invagination that a second "folding" phase of invagination is independent of actomyosin contraction (Guo et al. eLife 2022), so there appear to be important differences between the Drosophila mesoderm system and the ascidian siphon tube systems.

      Using the experimental data, the authors create a vertex model of the invagination, and simulations reveal a coupled mechanism of apicobasal tension imbalance and lateral contraction that creates the invagination. The resultant model appears to recapitulate many aspects of the observed cell behaviors, although there are some caveats to consider (described below).

      Strengths:

      The studies and presented results are well done and provide important insights into the physical forces of epithelial invagination, which is important because invaginations are how a large fraction of organs in multicellular organisms are formed.

      Weaknesses:

      (1) This reviewer has concerns about two aspects of the computational model. First, the model in Fig. 5D shows a simulation of a flat epithelial sheet creating an invagination. However, the actual invagination is occurring in a small embryo that has significant curvature, such that nine or so cells occupy a 90-degree arc of the 360-degree circle that defines the embryo's cross-section (e.g., see Fig. 1A). This curvature could have important effects on cell behavior.

      (2) The second concern about the model is that Figure 5 D shows the vertex model developing significant "puckering" (bulging) surrounding the invagination. Such "puckering" is not seen in the in vivo invagination (Fig. 1A, 2A). This issue is not discussed in the text, so it is unclear how big an issue this is for the developed model, but the model does not recapitulate all aspects of the siphon invagination system.

      (3) In Fig. 2A Top View and the schematic in Fig. 2C, the developing invagination is surrounded by a ring of aligned cell edges characteristic of a "purse string" type actomyosin cable that would create pressure on the invaginating cells that has been documented in multiple systems. Notably, the schematic in Fig 2C shows myosin II localizing to aligned "purse string" edges, suggesting the purse string is actively compressing the more central cells. If the purse string consistently appears during siphon invagination, a complete understanding of siphon invagination will require understanding the contributions of the purse string to the invagination process.

      (4) The introduction and discussion put the work in context of work on physical forces in invagination, but there is not much discussion of how the modeling fits into the literature.

      Comment on revised version.

      This is an extensively revised version of a previously submitted manuscript that, as detailed in their 20-page response to the first reviews, satisfactorily addresses the reviewers' comments. In particular, the revised manuscript makes it much clearer how this work fits into and advances the field. The added experiments strengthen the rigor of the manuscript as well. Overall, this paper is ready to go.

    2. Reviewer #2 (Public review):

      Summary:

      The authors propose that bidirectional redistribution of actomyosin drives tissue invagination in Ciona siphon tube formation. They suggest a two-stage model where actomyosin first accumulates apically to drive a slow initial invagination, followed by redistribution to lateral domains to accelerate the invagination process through cell shortening. They have shown that actomyosin activity is important for invagination - modulation of myosin activity through expression of myosin mutants altered the timing and speed of invagination; furthermore, optogenetic inhibition of myosin during the transition of the slow and fast stages disrupted invagination. The authors further developed a vertex model to validate the relationship between contractile force distribution and epithelial invagination.

      Strengths:

      (1) The authors employed various techniques to address the research question, including optogenetics, use of MRLC mutants, and vertex modelling.

      (2) The authors provide quantitative analyses for a substantial portion of their imaging data, including cell and tissue geometry parameters as well as actin and myosin distributions. The sample sizes used in these analyses appear appropriate.

      (3) The authors combined experimental measurements with computer modeling to test the proposed mechanical models, which represents a strength of the study. It provides a framework to explore the mechanical principles underlying the observed morphogenesis.

      Comments on the revision.

      The revised manuscript has been substantially improved. The authors have addressed many of my previous concerns through the addition of new data, analyses, and discussion. The characterization of epithelial folding in the ascidian Ciona provides valuable insight into a comparatively less explored morphogenetic system, and the imaging and quantitative analyses are overall compelling. That said, a few important points remain to be addressed.

      One remaining issue concerns the mechanistic novelty of the actomyosin redistribution described in this study. The authors emphasize that the key novelty lies in the stepwise translocation of actomyosin from the lateral membrane to the apical domain during the initial stage (apical constriction), followed by redistribution from the apical domain back to the lateral domain during the accelerated stage (invagination). I agree that the dynamic redistribution itself is potentially interesting and may represent an underexplored aspect of epithelial morphogenesis. However, as I discussed in my previous review comments, from a mechanics perspective, the role of apical actomyosin in driving apical constriction and of lateral actomyosin in contributing to tissue folding/invagination have already been demonstrated in multiple systems, although to varying extents depending on the model. Therefore, while the current study convincingly documents a distinct spatiotemporal sequence of actomyosin localization in Ciona atrial siphon tube formation, it could be clarified further to what extent this work advances new mechanical principles underlying epithelial folding, as opposed to revealing a variation in the deployment of previously described force-generating modules.

      Importantly, I think the manuscript has the potential to provide deeper conceptual insight if the authors more explicitly consider the significance of the "redistribution" process itself. Redistribution does not only involve the appearance of actomyosin at a new membrane domain; it also necessarily involves its disappearance from the previous domain. The latter aspect has, in my view, been much less explored in the literature. For example: Is the removal of lateral actomyosin during the early phase important for efficient apical constriction? Conversely, is the reduction of apical actomyosin during the later accelerated phase important for proper invagination mechanics? These questions are particularly interesting because they address whether redistribution between domains serves an active mechanical regulatory role, rather than focusing on the role of force-generating actomyosin at a given location.

      I acknowledge that addressing these questions experimentally could be technically challenging. One potentially powerful way to address this would be through the revised computational model. For example, the authors could test whether tissue folding is altered when actomyosin is allowed to accumulate at a new domain without being concomitantly depleted from the original domain. Such analyses could help distinguish whether redistribution itself has functional mechanical importance, rather than merely reflecting sequential recruitment to different cellular regions. In my opinion, incorporating this aspect would substantially strengthen the conceptual and mechanistic novelty of the study.

      My other concern relates to the new optogenetic data presented in Figure 4-figure supplement 2. In the "Dark" samples, active myosin does not appear to be clearly enriched along the membrane, but instead seems relatively diffuse within the cytoplasm. This appears distinct from the images shown in Figure 2, where active myosin exhibits clear membrane enrichment. Could the authors provide top-view images for the samples shown in Figure 4-figure supplement 2? This would help clarify whether active myosin is indeed enriched along the apical membrane at 16 hpf and along the lateral membrane at 17 hpf in the "Dark" condition.

      In addition, the tissue morphology in the "17 hpf Light 1 hr" panel of Figure 4-figure supplement 2 appears noticeably different from that shown in Figure 4. Specifically, the apical side of the tissue in Figure 4 appears substantially more relaxed than in Figure 4-figure supplement 2. Based on the authors' interpretation of the optogenetic experiments, apical active myosin is not strongly affected by the treatment described in Figure 4. If so, one would expect apical constriction to remain largely intact. However, the more relaxed apical domain shown in Figure 4 seems to suggest that apical constriction may in fact be perturbed by the optogenetic manipulation. This apparent discrepancy complicates the interpretation of the experiment and seems somewhat inconsistent with the authors' main conclusion from this figure.

    3. Reviewer #3 (Public review):

      Summary:

      In this revised manuscript by Qiao et al., the authors seek to uncover force and contractility dynamics that drive tissue morphogenesis, using the Ciona atrial siphon primordium as a model. Specifically, the authors perform a detailed examination of epithelial folding dynamics. Generally, the authors' claims were supported by their data, and the conceptual advances may have broader implications for other epithelial morphogenesis processes in other systems.

      Strengths:

      The strengths of this manuscript include the variety of experimental and theoretical methods, including generally rigorous imaging and quantitative analyses of actomyosin dynamics during this epithelial folding process, and the derivation of a mathematical model based on their empirical data, which they perturb in order to gain novel insights into the process of epithelial morphogenesis.

      Weaknesses:

      Concerns raised in the initial submission were addressed in the revised manuscript.

    1. Reviewer #4 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      Summary:

      The authors demonstrate a computational rational design approach for developing RNA aptamers with improved binding to the Receptor Binding Domain (RBD) of the SARS-CoV-2 spike protein. They demonstrate the ability of their approach to improve binding affinity using a previously identified RNA aptamer, RBD-PB6-Ta, which binds to the RBD. They also computationally estimate the binding energies of various RNA aptamers with the RBD and compare against RBD binding energies for a few neutralizing antibodies from the literature. Finally, experimental binding affinities are estimated by electrophoretic mobility shift assays (EMSA) for various RNA aptamers and a single commercially available neutralizing antibody to support the conclusions from computational studies on binding. The authors conclude that their computational framework, CAAMO, can provide reliable structure predictions and effectively support rational design of improved affinity for RNA aptamers towards target proteins. Additionally, they claim that their approach achieved design of high affinity RNA aptamer variants that bind to the RBD as well or better than a commercially available neutralizing antibody.

      Strengths:

      The thorough computational approaches employed in the study provide solid evidence of the value of their approach for computational design of high affinity RNA aptamers. The theoretical analysis using Free Energy Perturbation (FEP) to estimate relative binding energies supports the claimed improvement of affinity for RNA aptamers and provides valuable insight into the binding model for the tested RNA aptamers in comparison to previously studied neutralizing antibodies. The multimodal structure prediction in the early stages of the presented CAAMO framework, combined with the demonstrated outcome of improved affinity using the structural predictions as a starting point for rational design, provide moderate confidence in the structure predictions.

    1. Reviewer #1 (Public review):

      Summary:

      Authors have investigated the role of FMRP in the formation and function of RNA granules in mouse brain/cultured hippocampal neurons. Most of their results indicate that FMRP does not have a role in the formation or function of RNA granules with specific mRNAs but may have some role in distal RNA granules in neurons and their response to synaptic stimulation. This is an important work (though the results are mostly negative) in understanding the composition and function of neuronal RNA granules. the last part of the work in cultured neurons is disjointed from the rest of the manuscript and the results are neither convincing nor provide any mechanistic insight.

      Strengths:

      (1) The study is quite thorough, the methods and analysis used are robust and the conclusion and interpretation are diligent.

      (2) The comparative study of Rat and Mouse RNA granules is very helpful for future studies

      (3) The conclusion that the absence of FMRP does not affect the RNA granule composition and many of its properties in the system the authors have chosen to study is well supported by the results

      (4) The difference in the response to DHPG stimulation concerning RNA granules described here is very interesting and could provide a basis for further studies though it has some serious technical issues (see below)

      Weaknesses:

      (1) The system used for the study (P5 mouse brain or DIV 8-10 cultured neuron) is surprising as the majority of defects in the absence of FMRP are reported in later stages (P30+ brain and DIV 14+ neurons). It is important to test if the conclusions drawn here hold good at different developmental stages.

      (2) The term 'distal granules' is very vague. Since there is no structural or biochemical characterization of these granules it is difficult to understand how they are different from the proximal granules and why FMRP has an effect only on these granules.

      (3) Since the manuscript does not find any effect of FMRP on neuronal RNA granules, it does not provide any new molecular insight with respect to the function of FMRP

      Comments on revised version.

      The authors have answered several questions raised by the reviewers. But for me, the critical issue of using only the brain from P5 animals and relatively early DIV neurons is still not convincingly addressed. FMRP may still play a role in determining the stalled ribosomes on its target mRNAs at a later stage of development, when there is more scope for activity-mediated protein synthesis.

      I agree with the authors that this work helps the molecular understanding of FMRP functions by disproving one of the long-standing hypotheses.

    2. Reviewer #2 (Public review):

      In the present manuscript, Li et al. use biochemical fractionation of "RNA granules" from P5 wildtype and FMR1 knock-out mouse brains to analyze their protein/RNA content, determine a single particle cryo-EM structure of contained ribosomes, and perform ribo-seq analysis of ribosome-protected RNA fragments (RPFs). The authors conclude from these that neither the composition of the ribosome granules, nor the state of their contained ribosomes, nor the mRNA positions with high ribosome occupancy change significantly. Besides minor changes in mRNA occupancy, the one change the authors identified is a decrease in puromycylated punctae in distal neurites of cultured primary neurons of the same mice, and their enhanced resistance to different pharmacological treatments. These results directly build on their earlier work (Anadolu et al., 2023) using analogous preparations of rat brains; the authors now perform a very similar study using WT and FMR1-KO mouse brains. This is an important topic, aiming to identify the molecular underpinnings of the FMRP protein, which is the basis of a major neurological disease. Unfortunately, several limitations of this study prevent it from being more convincing in its present form.

      In order to improve this study, our main suggestions are as follows:

      (1) The authors equate their biochemically purified "RG" fraction with their imaging-based detection of puromycin-positive punctae. They claim essentially no differences in RGs but detect differences in the latter (mostly their abundance and sensitivity to DHPG/HHT/Aniso). In the discussion the authors acknowledge the inconsistency between these two modalities: "An inconsistency in our findings is the loss of distal RPM puncta coupled with an increase in the immunoreactivity for S6 in the RG." and "Thus, it may be that the RG is not simply made up of ribosomes from the large liquid-liquid phase RNA granules."<br /> How can the authors be sure that they are in fact analysing the same entities in both modalities? A more parsimonious explanation of their results would be that, while there might be some overlap, two different entities are analyzed. Much of the main message rests on this equivalence and I believe the authors should show its validity.

      (2) The authors show that increased nuclease digestion (and magnesium concentration) led to a reduction of their RPF sizes down to levels also seen by other researchers. Analyzing these now properly digested RPFs, the authors state that the CDS coverage and periodicity drastically improved, and that spurious enrichments of secretory mRNAs, which made up one of the major fractions in their previous work, are now reduced. In my opinion this would be more appropriately communicated as a correction to their previous work, not as a main Figure in another manuscript.

      (3) The fold changes reported in Figure 7 (ranging between log2(-0.2) and log2(+0.25)) are all extremely small and in my opinion should not be used to derive claims such as "The loss of FMRP significantly affected the abundance and occupancy of FMRP-Clipped mRNAs in WT and FMR1-KO RG (Fig 7A, 7B), but not their enrichment between RG and RCs".

      (4) Fig 8 / S8-1 - The authors show that ~2/3 of their reads stem from PCR duplicates, but that even after removing those, the majority of peaks remains unaltered. At the same time, Fig S8-1 shows the total number of peaks to be 615 compared with 1392 before duplicate removal. Can the authors comment on this discrepancy? In addition, the dataset with properly removed artefacts should be used for their main display item instead of the current Fig 8.

      (5) Fig 9 / S9-1, the density of punctae in both WT and FMR1-KO actually increases after treatment of HHT or Anisomycin (Fig S9-1 B-C). Even if a large fraction would now be "resistant to run-off", there should not be an increase. While this effect is deemed not significant, a much smaller effect in Fig 9C is deemed significant. Can the authors explain this? Given how vastly different the sample sizes are (ranging from 23 neurites in Fig S9-1 to 5,171 neurites in Fig 9), the authors should (randomly) sample to the same size and repeat their statistical analysis again to improve their credibility.

      Comments on revised version.

      We can see that the authors invested substantial effort to improve the manuscript and we believe it is improved.

    3. Reviewer #3 (Public review):

      Summary:

      Li et al describe a set of experiments to probe the role of FMRP in ribosome stalling and RNA granule composition. The authors are able to recapitulate findings from a previous study performed in rats (this one is in mice).

      Strengths:

      (1) The work addresses an important and challenging issue, investigating mechanisms that regulate stalled ribosomes that are part of stress granules, and focusing on the role of FMRP. This is a complicated problem, given the heterogeneity of the granules and the challenges related to their purification. This work is a solid attempt at addressing this issue, which is widely understudied.

      (2) The interpretation of the results could be interesting, if supported by solid data. The idea that FMRP could control the formation and release of stress granules, rather than the elongation by stalled ribosomes is of high importance to the field, offering a fresh perspective into translational regulation by FMRP.

      (3) The authors focused on recapitulating previous findings, published elsewhere (Anadolu et al., 2023) by the same group, but using rat tissue, rather than mouse tissue. Overall, they succeeded in doing so, demonstrating, among other findings, that stalled ribosomes are enriched in consensus mRNA motifs that are linked to FMRP. These interesting findings reinforce the role of FMRP in formation and stabilization of RNA granules. It would be nice to see extensive characterization of the mouse granules as performed in Figure 1 of Anadolu and colleagues, 2023.

      (4) Some of the techniques incorporated aid in creating novel hypotheses, such as the ribopuromycilation assay and the cryo-EM of granule ribosomes.

      Comments on revised version:

      I am satisfied with the authors response to my comments.

    1. Reviewer #1 (Public review):

      Summary:

      The article by Zdraljevic et al. reports the discovery of a third toxin-antidote (TA) element in C. elegans, composed of the genes mll-1 (toxin) and smll-1 (antidote). Unlike previously characterized TA systems in C. elegans, this element induces larval arrest rather than embryonic lethality. The study identifies three distinct haplotypes at the TA locus, including a hyper-divergent version in the standard laboratory strain N2, which retains a functional toxin but lacks a functional antidote. The authors propose that small RNA-mediated silencing mechanisms, dependent on MUT-16 and PRG-1, suppress the toxicity of the divergent toxin allele. This work provides insights into the evolutionary dynamics of TA elements and their regulation through RNA interference (RNAi).

      Overall, there are many things to like about this paper and only a few small quibbles, which will not require more than a little rewriting or relatively minor analyses.

      Strengths of the Paper:

      (1) The discovery of a maternally deposited TA element with delayed toxicity due to delayed mRNA translation of the maternally deposited toxin mRNA is a significant addition to the literature on selfish genetic elements in metazoans.

      (2) Identifying three haplotypes at the TA locus provides a snapshot of potential evolutionary trajectories for these elements, which are often inferred but rarely demonstrated in naturally occurring strains. The genomic analysis of 550 wild isolates contextualizes the findings within natural populations, revealing geographic clustering and evolutionary pressures acting on the TA locus.

      (3) The study employs various techniques, including CRISPR/Cas9 knockouts, FISH, long-read RNA sequencing, and population genomics. The use of inducible systems to confirm toxicity and antidote functionality is particularly robust. This multifaceted approach strengthens the validity of the findings.

      (4) The authors provide compelling evidence that small RNA pathways suppress toxin activity in strains lacking a functional antidote. This highlights an alternative mechanism for neutralizing selfish genetic elements.

      Comments on revised version.

      The authors have addressed all my (relatively minor) comments from the first round of reviews. However, the most substantial comments came from Reviewer 2, mostly focused on the conclusions that "Multiple lines of evidence suggest that the N2 tmrl-1 allele is recognized by piRNAs, leading to MUT-16-dependent 22G siRNA production and post-transcriptional silencing of the transcript." This is beyond my expertise to fully evaluate what is state-of-the-art in terms of acceptable evidence, so I will defer to Reviewer #2 for this.

    2. Reviewer #2 (Public review):

      Summary:

      In the manuscript by Walter-McNeill, Kruglyak and team, the authors provide solid evidence of another toxin-antidote (TA) system in C. elegans. Generally, TA systems involve selfish and linked genetic elements, one encoding a toxin that kills progeny inheriting it, unless an antidote (the second element) is also present. Currently, only two TA systems have been characterized in this species, pointing to the importance of identifying new instances of such systems to understand their transmission dynamics, prevalence, and functions in shaping worm populations.

      The manuscript has been improved in some aspects upon revision. We remain enthusiastic for the overall findings and the identification of a new toxin/anti-toxin system and note that the strengths and weaknesses we detailed previously remain. We reiterate our critique regarding the strength of conclusions that can be made about small RNA pathway regulation based on meta-analysis of other datasets. While we agree that the observations presented are suggestive of small RNA regulation, likely due to piRNA targeting and subsequent 22G-RNA regulation, until these hypotheses are tested experimentally in the future by mutation of the piRNA target sites, testing ago/piRNA pathway and other 22G-RNA pathway mutants for tmrl-1 expression, etc., we think it is important to use precise language in presenting the conclusions. In particular, the abstract states:

      "Multiple lines of evidence suggest that the N2 tmrl-1 allele is recognized by piRNAs, leading to MUT-16-dependent 22G siRNA production and post-transcriptional silencing of the transcript. The N2 haplotype represents the first naturally occurring unlinked toxin-antidote system where the toxin is post-transcriptionally suppressed by endogenous small RNA pathways."

      We therefore recommend moderating this statement to "...is likely to be post-transcriptionally suppressed by endogenous small RNA pathways."

      Previously noted strengths and weaknesses remain relevant to this revision.

      Strengths:

      This novel TA system (mll-1/smll-1) was identified on LGV in wild C. elegans isolates from the Hawaiian Islands, by crossing divergent strains and observing allele frequency distortions by high throughput genome sequencing after 10 generations. These allele frequency distortions were subsequently confirmed in another set of crosses with a separate divergent strain, and crosses of heterozygous males or hermaphrodites resulted in a pattern of L1 lethality in progeny (with a rod arrest phenotype) that suggested the maternal transmission of this TA system from the XZ1516 genetic background. By elegantly combining the use of near-isogenic lines, CRISPR editing to generate knock-outs, and a transgene rescue of the antidote gene, the authors identified the genes encoding the toxin and the antidote, which they refer to as mll-1 and smll-1. Moreover, the specific mll-1 isoform responsible for the production of the toxin was identified and mll-1 transcripts were observed by FISH in early and late embryos, as well as in larvae. Inducible expression of the toxin in various strains resulted in larval arrest and rod phenotypes. The authors then characterized the genetic variation of 550 wild isolates at the toxin/antidote region on LGV and distinguished three clades: 1) one with the conserved TA system, 2) one having lost the toxin and retaining a mostly functional antidote, and 3) one having lost the antidote and retaining a divergent yet coding toxin (this includes the reference strain Bristol N2, in which the homologous toxin gene has acquired mutations and is known as B0250.8). Further, the authors show that this region is under positive selection. These data are compelling and provide very strong evidence of a new TA system in this species.

      Weaknesses:

      The question remained as to how one clade, including N2, could retain the toxin gene but not possess a functional antidote. In the second part of the manuscript, the authors hypothesized that small RNA targeting (RNAi) of the toxin transcript could provide the necessary repression to allow worms to survive without the antidote. Through a meta-analysis of multiple small RNA datasets from the literature, the authors found evidence to support this idea, in which the toxin transcript is targeted by 22G siRNAs whose biogenesis is dependent on the Mutator foci protein, MUT-16. They note that from previous studies, mut-16 null mutants displayed a varied penetrance of larval arrest. In their own hands, mut-16 mutants displayed 15% varied larval arrest and 2% rod phenotypes. In an attempt to link B0250.8 to mut-16/siRNAs, they made a double mutant and examined body length as a proxy for developmental stage. Here, they observed a partial rescue of the mut-16 size defect by B0250.8 mutation. Finally, the authors also highlight data from further meta-analysis which predicts the recognition of B0250.8 by several piRNAs. Also based on existing data from the literature, the authors link loss of Piwi (PRG-1), which binds piRNAs, to a depletion of 22G-RNAs targeting B0250.8 and an upregulation of B0250.8 expression in gonads, suggesting that piRNAs are the primary small RNAs that target B0250.8 for down-regulation. The data in this portion of the manuscript are intriguing, but somewhat incomplete, as they are based on little primary experimentation and a collection of different datasets (which have been acquired by slightly different methods in most cases). This portion of the study would require subsequent experimentation to firmly establish this mechanistic link. For example, to be able to claim that "the N2 toxin allele has acquired mutations that enable piRNA binding to initiate MUT-16-dependent 22G small RNA amplification that targets the transcript for degradation" the identified piRNA sites should be mutated and protein and transcript levels analysed in wild-type and in the strain with mutated piRNA sites. At a minimum, the protein levels in wild-type and mut-16, prg-1, and/or wago-1 mutants should be measured by western blot and/or by live imaging (introducing a GFP or some other tag to the endogenous protein via CRISPR editing) to show that the toxin is not accumulated as a protein in wt, but increases in levels in these mutants. mRNA levels in Fig S5A suggest there is still some expression of the B0250.8 transcript in a wild type situation.

      Comments on revised version.

      We have no further recommendations for the authors, other than those provided above.

    1. Reviewer #3 (Public review):

      Summary:

      This study uses large-scale all-atom molecular dynamics simulations to examine the conformational plasticity of the HIV-1 envelope glycoprotein (Env) in a membrane context, with particular emphasis on how the transmembrane domain (TMD), cytoplasmic tail (CT), protomer cleavage, and membrane environment influence ectodomain orientation and antibody epitope exposure. By comparing Env constructs with and without the CT, explicitly modeling glycosylation, and embedding Env in an asymmetric lipid bilayer, the authors aim to provide an integrated view of how membrane-proximal regions and lipid interactions shape Env antigenicity, including epitopes targeted by MPER-directed antibodies.

      Strengths:

      The authors have made a heroic effort to address the concerns raised in the first two rounds of review, and the revised manuscript is substantively improved. The addition of dynamical cross-correlation maps, expanded citation of prior computational work, clarification of the membrane composition rationale, data deposition to Zenodo, and new contextualization has improved the flow and interpretation of the manuscript throughout. Several scientifically interesting aspects of the work merit highlighting with a brief discussion on how future studies can leverage this data to build upon its impact.

      A key strength of this work remains the scope, scale, and realism of the simulation systems. The authors construct a very large, nearly complete-Env-scale model that includes a glycosylated Env trimer embedded in an asymmetric bilayer, enabling analysis of membrane-protein interactions that are difficult to capture experimentally. The inclusion of specific glycans at reported sites, and the focus on constructs with and without the CT or cleavage, are well motivated by existing biological and structural data.

      The observation that R696 orientation and its interacting partners give rise to asymmetric protomer conformations and distinct TMD tilts is a notable finding. The statement that interactions between R696 and lipid headgroups or CT residues can be strong enough to introduce a kink into the TMD is well-supported by representative snapshots and consistent with prior isolated-TMD simulations. The use of two initialization depths ("high" and "low") to probe R696 leaflet preference is methodologically interesting and the authors' interpretation - that there is a slight bias toward cytoplasmic leaflet interactions, but that these contacts could be highly dynamic over the course of viral entry - is appropriately cautious. It would be valuable to explicitly frame this as a hypothesis with testable predictions that future experimental or enhanced-sampling work could address. Similarly, the equilibration-driven kinking of the TMD core, consistent with prior isolated-TMD studies, represents a useful validation that extends those earlier observations to the intact trimeric context.

      The simulations reveal substantial tilting motions of the ectodomain relative to the membrane, with angles spanning roughly 0-30{degree sign} (and up to ~40{degree sign} in some analyses), while the ectodomain itself remains relatively rigid. This framing, that much of Env's conformational variability arises from rigid-body tilting rather than large internal rearrangements, is an important conceptual contribution. The authors also provide interesting observations regarding asymmetric bilayer deformations, including localized thinning and altered lipid headgroup interactions near the TMD and CT, which suggest a reciprocal coupling between Env and the surrounding membrane.

      The analysis of antibody-relevant epitopes across the prefusion state, including the V1/V2 and V3 loops, the CD4 binding site, and the MPER, is another strength. The study makes effective use of existing experimental knowledge in this context, for example by focusing on specific glycans known to occlude antibody binding, to motivate and interpret the simulations.

      Finally, the revised text provides clear context that situates the study's findings and discrepancies within the broader literature, strengthening the manuscript's clarity and interpretability.

      Future work in the field:

      As the authors appropriately acknowledge within in the text, these microsecond simulations capture only the closed ground state and with limited sampling due to the already computationally intensive nature of these simulations. Their simulation setup provides interesting foundational knowledge of this state and a framework for these additional important questions.

      Additionally, the authors appropriately acknowledge that CT-TMD and CT-ectodomain correlations are difficult to interpret given limited structural confidence in these regions. Future experimental and computational work in the field can extend and build upon the author's framework, particularly as the authors have made their trajectories available for the public. Re-analysis of the authors' deposited MD trajectories-such as probing for exposure of cryptic epitopes and potential allosteric coupling-could serve as valuable extensions of this work, particularly as advancements in computational analysis has reached an inflection point.

      Comments on revised version.

      Bravo! The improved clarity was a delight to read and will increase the impact this study has on the field.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      Summary:

      Mitotic kinesins carry out crucial roles in intracellular motility and mitotic spindle organization. Although many mitotic kinesins have been extensively studied, a few conserved mitotic motors remain poorly explored, including chromosome-associated kinesins. Here, Furusaki et al reconstitute recombinant chromosome-associated kinesin or chromokinesin (Kid) and reveal processive plus-end motility along microtubules. The authors purify multiple versions of Kid, revealing dimeric organization and their processive microtubule plus-ended motility which depends on their conserved motor domains, neck linkers, and coiled-coil regions. The study reveals for the first time that KID can recruit and transport duplex DNA along microtubules using its conserved C-terminal DNA binding domain. The work provides crucial revised thinking about the mechanisms of Chromokinesins mitosis as physical processive motors that mobilize chromosomes towards the microtubule plus ends in early metaphase.

      Strengths:

      The authors reconstitute multiple chromosome-associated kinesin (KID) orthologs from Xenopus and humans with microtubules and determine their oligomerization. The study shows how coiled-coil and neck linker regions of KID are essential for its function as its deletion leads to non-processive motility. Chimeras placing the KID coiled-coil and neck linker on the KIF1A motor domain led to the production of a processive recombinant motor supporting the compatibility of their motility mechanisms. The KID c-terminal tail binds and transports only double-stranded DNA and its deletion or single-stranded DNA leads to defects in this activity.

    2. Reviewer #2 (Public review):

      Summary:

      Previous work in the field highlighted the role of the kinesin-10 motor protein Kid (KIF22) in the polar ejection force during prometaphase. However, the biochemical and biophysical properties of Kid that enabled it to serve in this role were unclear. The authors demonstrate that human and xenopus Kid proteins are processive kinesins that function as homodimeric molecules. The data are solid and support the findings although the text could use some editing to improve clarity.

      Strengths:

      A highlight of the work is the reconstitution of DNA transport in vitro.

      A second highlight is the demonstration that the monomer vs dimer state is dependent on protein concentration.

    1. Reviewer #1 (Public review):

      This is a well-written and fully documented methods paper.

      The authors have established a clear rationale for their new packages, especially for real-time use, and demonstrate significant speed improvements that will likely appeal to many users of tools like DLC, SLEAP, and LightningPose. The inclusion of a graphical user interface will help make the package more accessible to neuroscientists with limited computational expertise. While it may be challenging to get users to switch from their established workflows for video analysis, the speed gains offered by this package make it worth considering. The hardware aspects of the project are well-documented, and the GitHub repository for this part of the setup is also thorough. Overall, this paper provides a clear summary of the tools, their uses, setup, and benefits.

      I have a few minor questions about the collective set of tools.

      First, the GitHub repository for SqueakPoseStudio appears to be missing a testing routine and associated badge, and the package has not been formally released. This means users would need to download the repository to install it, correct? I suggest the authors consider publishing a formal release of the package, making it installable via pip, and including a basic testing routine to clearly display the package's status on the repository page. Adding a DOI from Zenodo would also be helpful. A testing routine is especially useful when updates are made, as many users avoid repositories with failing tests.

      Second, the installation instructions simply state "Create a virtualenv and install:". This may not be sufficient for many researchers, as most neuroscientists are not experienced Python programmers and require clear guidance on the environment specific to this package. The installation instructions should be expanded to provide more detailed guidance and encourage more users. It would also be helpful to verify that the setups work across Windows, Mac, and Linux.

      Third, the package defaults to UMAP for non-linear dimensionality reduction, which has some known issues. Can the package be modified to allow for alternative mapping methods, such as PaCMAP, PyDiffMap, or the more comprehensive topometry package?

      Finally, what specific GPUs have been tested with the package, and are there any limitations based on the age of the video card or the available libraries for the deep learning component of the package?

    2. Reviewer #2 (Public review):

      Summary:

      This work presents three tools: SqueakPose Studio, which is used for pose estimation; SqueakView, which is used for real-time video and sensor data capture and analysis; and MouseHouse, which is a behavioral and sensor suite for mouse experiments. Together, these tools provide a comprehensive behavioral platform for acquiring and analyzing video, sensor, and behavioral data. The work is open source and provided as a resource for the field.

      Strengths:

      (1) Squeakpose Studio was relatively easy to install and use. We were impressed that we were able to install it and test our own videos with minimal struggles. The authors provide installation tutorial videos that were very helpful.

      (2) The GUI environment for SqueakPose Studio was very usable, and the authors should be commended on the time and effort that went into improving the useability of their system. The keypoint and skeleton configuration was flexible, allowing us to define custom body part sets without modifying code directly. The pose estimation accuracy on our own videos was good right out of the box, without requiring fine-tuning or retraining. For a tool being evaluated for the first time, this was all very impressive!

      Weaknesses:

      (1) While we were able to install and test Squeakpose Studio, it was not entirely seamless. The primary installation resource is a tutorial video, and we would recommend supplementing this with a written installation checklist that explicitly lists all required software dependencies (e.g. Python, UV, Visual Studio). The tutorial video was also at times unclear in distinguishing required from optional components. For example, Visual Studio is described as not necessary, yet the tutorial demonstrates the workflow entirely within that environment, so it may be challenging for a user to follow along without that. We recommend that the authors adopt a stricter, step-by-step installation guide that is prescriptive about required software and leaves little room for confusion.

      (2) The paper also describes SqueakView and MouseHouse. Unfortunately, we were unable to evaluate these components as both require the MouseHouse hardware platform. Even without directly using MouseHouse, we noticed some incompleteness here, as we could not locate a bill of materials, component pricing, or assembly guide in the paper or associated GitHub repositories. Given that affordability and accessibility are central claims, a consolidated parts list, approximate costs, and a build guide or video would be necessary for most labs to realistically decide whether they plan to replicate the hardware and evaluate this functionality that the paper describes. In this regard, we felt that MouseHouse and potentially SqueakView were not sufficiently documented for publication.

      (3) The benchmarking comparison to DeepLabCut (DLC) introduced multiple challenges that left us unclear if the head-to-head comparison was appropriate as described. First, the dataset used for benchmarking was small and homogeneous, from the methods they used "10 min open-field tasks of single mice with bilateral photometry cables." As such, the claims about comparisons between SqueakPose Studio and DLC may be too broad, given this single test case. Specifically, this dataset does not test robustness across lighting conditions, coat colors, species, occlusions, different-shaped arenas, etc. Second, the comparison to DLC in Figure 1 does not include any quantitative statistical comparisons, which are needed to evaluate the claims that were made. For instance, the error in Figure 1e looks worse for their system than DLC, although statistical comparisons were not made. Third, there are many settings and optimizations that can be made for both systems. Without more detail, this makes it hard to know if the head-to-head comparison is really fair. Fourth - the metrics are given as very specific numbers from single runs, i.e., an inference time of 71.59 minutes in Figure 1d. This metric would be more meaningful if it reported the mean of multiple runs, with error estimation. Finally, while the code is available, the trained datasets are made available only on "reasonable request". Given the importance of these datasets to evaluating the method and allowing others to benchmark it against other systems, these should be made available on GitHub. Overall, I would recommend toning down the comparison to DLC and focusing on the strengths of Squeakpose Studio on its own merits.

      (4) The paper at times makes general statements that are beyond what is shown. For instance, discussions of use in human applications are aspirational and should be treated much more conservatively in the discussion, or possibly even removed. As it stands, the discussion implies that this system can already do "zero-shot tracking of human posture and movement", enabling "a bridge between preclinical and clinical behavioral analysis". In principle, this may be true, but even for a Discussion section, this goes far beyond the capabilities that the paper actually shows.

      (5) While the comprehensive nature of the system and its 3 parts is impressive, I felt that it also detracted from the main focus of the paper, which was Squeakpose Studio. I might recommend dropping the other two parts, as they also require a much higher bar for a user to evaluate, and only present the Squeakpose Studio in this paper, presenting this as a general resource for pose estimation. This would also allow them more space to more comprehensively benchmark SqueakPose Studio.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      Summary:

      In this manuscript, the role of the insulin receptor and the insulin growth factor receptor was investigated in podocytes. Mice, where both receptors were deleted, developed glomerular dysfunction and developed proteinuria and glomerulrosclerosis over several months. Because of concerns about incomplete KO, the authors generated and studied podocyte cell lines where both receptors were deleted. Loss of both receptors was highly deleterious with greater than 50% cell death. To elucidate the mechanism of cell death, the authors performed global proteomics and found that spliceosome proteins were downregulated. They confirmed this directly by using long-read sequencing. These results suggest a novel role for insulin and IGF1R signaling in RNA splicing in podocytes.

      This is primarily a descriptive study and no technical concerns are raised. The mechanism of how insulin and IGF1 signaling regulates splicing is not directly addressed but implicates potentially the phosphorylation downstream of these receptors. In the revised manuscript, it is shown that the mouse KO is incomplete potentially explaining the slow onset of renal insufficiency. Direct measurement of GFR and serial serum creatinines might also enhance our understanding of progression of disease, proteinuria is a strong sign of renal injury. An attempt to rescue the phenotype by overexpression of SF3B4 would also be useful but may be masked by defects in other spliceosome genes. As insulin and IGF are regulators of metabolism, some assessment of metabolic parameters would be an optional add-on.

      Significance:

      With the GLP1 agonists providing renal protection, there is great interest in understanding the role of insulin and other incretins in kidney cell biology. It is already known that Insulin and IGFR signaling play important roles in other cells of the kidney. So, there is great interest in understanding these pathways in podocytes. The major advance is that these two pathways appear to have a role in RNA metabolism.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, Coward and colleagues report on the role of insulin/IGF axis in podocyte gene transcription. They knocked out both the insulin and IGFR1 mice. Dual KO mice manifested a severe phenotype, with albuminuria, glomerulosclerosis, renal failure and death at 4-24 weeks.

      Long read RNA sequencing was used to assess splicing events. Podocyte transcripts manifesting intron retention were identified. Dual knock-out podocytes manifested more transcripts with intron retention (18%) compared wild-type controls (18%), with an overlap between experiments of ~30%.

      Transcript productivity was also assessed using FLAIR-mark-intron-retention software. Intron retention w seen in 18% of ciDKO podocyte transcripts compared to 14% of wild-type podocyte transcripts (P=0.004), with an overlap between experiments of ~30% (indicating the variability of results with this method). Interestingly, ciDKO podocytes showed downregulation of proteins involved in spliceosome function and RNA processing, as suggested by LC/MS and confirmed by Western blot.

      Pladienolide (a spliceosome inhibitor) was cytotoxic to HeLa cells and to mouse podocytes, but no toxicity was seen in murine glomerular endothelial cells.

      The manuscript is generally clear and well-written. Mouse work was approved in advance. The four figures are generally well-designed, bars/superimposed dot-plots.

      Methods are generally well described.

      Comments on previous version:

      Coward and colleagues have done an excellent job of responding to all the reviewer comments.

    3. Reviewer #4 (Public review):

      This report entitled "The insulin/IGF axis is critically important (for) controlling gene transcription in the podocyte" from Hurcombe et al is based on a mouse double knockdown of the IR and IGF1R and a parallel cultured mouse podocyte model. Insulin/IGF signaling system in mammals evolved as three gene reduplicated peptides (insulin, IGF-1, and IGF-2) and their two receptors IR and IGF1R that cross-react to variable extents with the peptides, are ubiquitously expressed, and signal through parallel pathways. The major downstream effect of insulin is to regulate glucose uptake and metabolism, while that of the IGF pathways is to regulate growth and cell cycling in part through mTORC1. The GH-IGF-1-IGF1R pathway regulates post-natal growth. IGF-2 signaling is thought to play a major role in regulating intrauterine growth and development, although IGF-2 is also present at high levels in post-natal life. Thus, one would anticipate that reducing IR/IGF1R signaling in any cell would slow growth and cell cycling by reducing growth factor and metabolic mTORC1-mediated and other processes including the splicing of RNA for protein synthesis.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      Summary:

      The authors describe the results of a single study designed to investigate the extent to which horizontal orientation energy plays a key role in supporting view-invariant face recognition. The authors collected behavioral data from adult observers who were asked to complete an old/new face matching task by learning broad-spectrum faces (not orientation filtered) during a familiarization phase and subsequently trying to label filtered faces as previously seen or novel at test. This data revealed a clear bias favoring the use of horizontal orientation energy across viewpoint changes in the target images. The authors then compared different ideal observer models (cross-correlations between target and probe stimuli) to examine how this profile might be reflected in the image-level appearance of their filtered images. This revealed that a model looking for the best matching face within a viewpoint differed substantially from human data, exhibiting a vertical orientation bias for extreme profiles. However, a model forced to match targets to probes at different viewing angles exhibited a consistent horizontal bias in much the same manner as human observers.

      Strengths:

      I think the question is an important one: The horizontal orientation bias is a great example of a low-level image property being linked to high-level recognition outcomes and understanding the nature of that connection is important. I found the old/new task to be a straightforward task that was implemented ably and that has the benefit of being simple for participants to carry out and simple to analyze. I particularly appreciated that the authors chose to describe human data via a lower-dimensional model (their Gaussian fits to individual data) for further analysis. This was a nice way to express the nature of the tuning function favoring horizontal orientation bias in a way that makes key parameters explicit. Broadly speaking, I also thought that the model comparison they include between the view-selective and view-tolerant models was a great next step. This analysis has the potential to reveal some good insights into how this bias emerges and ask fine-grained questions about the parameters in their model fits to the behavioral data.

      Weaknesses:

      I'll start with what I think is the biggest difficulty I had with the paper. Much as I liked the model comparison analysis, I also don't quite know what to make of the view-tolerant model. As I understand the authors' description, the key feature of this model is that it does not get to compare target and probe at the same yaw angle, but must instead pick a best match from candidates that are at different yaws. While it is interesting to see that this leads to a very different orientation profile, it also isn't obvious to me why such a comparison would be reflective of what the visual system is probably doing. I can see that the view-specific model is more or less assuming something like an exemplar representation of each face: You have the opportunity to compare a new image to a whole library of viewpoints and presumably it isn't hard to start with some kind of first pass that identifies the best matching view first before trying to identify/match the individual in question. What I don't get about the view-tolerant model is that it seems almost like an anti-exemplar model: You specifically lack the best viewpoint in the library but have to make do with the other options. I sort of understand the reasoning that this enforces tolerance of viewpoint variability, but I'm not clear on whether or not this is a version of face familiarity and recognition that the authors think has an analog in human visual processing.

      I do think that this model is interesting in terms of the differential tuning it exhibits, but don't find it easy to align with any theoretical perspective on face recognition. Specifically, do the authors think there is a stage of face processing in which tolerance as they've operationalized it in the model is extant? What I'm looking for is a concrete description of the circumstances that the authors are saying lead to this kind of model potentially being a meaningful analog of face recognition. For example, is the idea that one may become familiar with a face in some very limited set of viewpoints and then be presented with that face in other views?

      Alternatively, if the authors prefer to say that they simply thought this was a nice exercise in terms of identifying a different model and that it may not be a meaningful proxy for face recognition. I think that's fine, to be clear! I just still don't see anything in the text that convinces me of the ecological validity of this version of view-tolerance.

    2. Reviewer #2 (Public review):

      This study investigates the visual information that is used for the recognition of faces. This is an important question in vision research and is critical for social interactions more generally. The authors ask whether our ability to recognise faces, across different viewpoints, varies as a function of the orientation information available in the image. Consistent with previous findings from this group and others, they find that horizontally filtered faces were recognised better than vertically filtered faces. Next, they probe the mechanism underlying this pattern of data by designing two model observers. The first was optimised for faces at a specific viewpoint (view-selective). The second was generalised across viewpoints (view-tolerant). In contrast to the human data, the view-specific model shows that the information that is useful for identity judgements varies according to viewpoint. For example, frontal face identities are again optimally discriminated with horizontal orientation information, but profiles are optimally discriminated with more vertical orientation information. These findings show human face recognition is biased toward horizontal orientation information, even though this may be suboptimal for the recognition of profile views of the face.

      One issue in the design of this study was the lowering of the signal-to-noise ratio in the view-selective observer. This decision was taken to avoid ceiling effects. However, it is not clear how this affects the similarity with the human observers.

      Another issue is the decision to normalise image energy across orientations and viewpoints. I can see the logic in wanting to control for these effects, but this does reflect natural variation in image properties. So, again, I wonder what the results would look like without this step.

      Despite the bias toward horizontal orientations in human observers, there were some differences in the orientation preference at each viewpoint. For example, frontal faces were biased to horizontal (90 deg) but other viewpoints had biases that were slightly off horizontal (e.g. right profile: 80 deg, left profile: 100 deg). This does seem to show that differences in statistical information at different viewpoints (more horizontal information for frontal and more vertical information for profile) do influence human perception. It would be good to reflect on this nuance in the data.

      Comments on revisions:

      I am happy with the response and changes to the comments in my review. The key findings from this study are: (1) that there is bias toward the use of horizontal information across all viewpoints for face recognition in humans using an old-new recognition task. (2) In contrast, the optimal information for matching faces varies as a function of viewpoint. The view-selective model shows horizontal information is dominant for frontal views and vertical information is dominant for profile views.

      The data from the view-tolerant model is less easy to interpret as it doesn't fit with any theoretically plausible model of face recognition. It might be a useful model for a face matching task in which participants had to match unfamiliar faces across viewpoints. This might be a possible extension of the current work.

      Nonetheless, I still think this is an interesting contribution to the literature.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      Summary:

      The authors address whether theta/beta ratio /TBR) can be used as a clinical biomarker for ADHD.

      Strengths:

      The data were acquired independently from 2 separate datasets, and there are sufficient subjects for adequate statistical power. The authors applied up-to-date EEG data preprocessing, state-of-the-art feature extraction, and statistical analyses, using a multiverse approach. By testing and comparing all meaningful approaches, defined a priori in the previous meta-analysis, the author convincingly demonstrates that TBR cannot be used as a clinical biomarker, and previous positive results can be explained by interactions between different factors (alpha peak frequency, aperiodic component, age).

      Weaknesses:

      There are no apparent issues with data, separate datasets, large sample sizes, and state-of-the-art data analysis.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript examines whether the theta-beta ratio as derived from EEG data relates to ADHD diagnoses. To do so, it performs a multiverse analysis across a large number of analytical choices, applied to a large EEG dataset, and corroborated in an additional validation set. The results overall show that the TBR is not a reliable indicator of ADHD diagnosis. In discussing the patterns of results across analytical choices, the authors also demonstrate some key points about what appears to be driving the ratio measures, noting that significant results appear to be driven by choices regarding aperiodic-correction and the use of individualized alpha frequencies, suggesting TBR measures can be affected by these features rather than reflecting theta and/or beta activity.

      Strengths:

      This manuscript addresses a clearly posed and important question in the literature, addressing a longstanding discussion on the relationship between TBR and ADHD, and uses a large dataset and an expansive analysis approach to provide a definitive answer. The strengths of the approach allow for a clear answer, providing a notable contribution to the field.

      Weaknesses:

      I find no notable weaknesses in the current manuscript nor any major issues that I think challenge the key findings of this manuscript.

    3. Reviewer #3 (Public review):

      Summary:

      In this manuscript, Strzelczyk, Vetsch, and Langer tackle an incredibly important question in clinical neuroscience: the use of the theta/beta ratio as a biomarker of attention deficit hyperactivity disorder (ADHD). The theta/beta ratio is argued to be so reliable as an ADHD biomarker that, in the United States, the Food and Drug Administration has approved its use as a biomarker for ADHD diagnosis. However, there is mounting evidence that the theta/beta ratio is likely not really measuring the relative power between two oscillations - the theta rhythm and the beta rhythm - but rather reflects differences in a singular, non-oscillatory aperiodic process. In this very convincing study, Strzelczyk and colleagues take a "multiverse" analysis approach to show that aperiodic activity differences between healthy controls and people with ADHD are driving the apparent theta/beta ratio differences. While in a vacuum, where a measure is a measure and if it's related to a diagnosis it's still useful no matter what, this distinction might not seem important, from a neuroscientific perspective this is a critical distinction, because the ratio between two oscillations has fundamentally very different underlying physiological mechanisms than aperiodic differences, and this framing has a major impact on guiding research on the diagnosis and treatment of ADHD.

      Strengths:

      While smaller studies and analyses have already hinted at similar results as shown here, the current study's multiverse analysis approach is comprehensive, convincing, and very well done. The large sample size of 1,499 participants is very impressive, as is the use of an independent validation sample of 381 participants.

      Overall, the technical and statistical aspects are very well done: the multiverse approach, the validation set, the resampling methods, and even the shiny apps. The authors should be applauded for being so thorough and making their data and analyses publicly accessible.

      Weaknesses:

      To be clear, I see no breaking weaknesses in the theoretical foundations, methods, statistical analyses, or interpretations.

    1. Reviewer #1 (Public review):

      Summary:

      GPR52 is an orphan receptor implicated in neuropsychiatric disorders; however, the absence of tools capable of monitoring GPR52 activity in real time has stalled both mechanistic research and ligand discovery. This study addresses this gap by reporting the development of GPR52-1.0, a genetically encoded fluorescent sensor designed to detect activation of GPR52. The sensor was systematically engineered using the established GRAB platform, yielding a construct with micromolar sensitivity and high selectivity in cell culture. The authors largely achieve their stated aims, however the biological relevance of their aims is unclear, as GPR52 is reported to be a constitutively active receptor (PMID: 32076264, PMID: 26384023). GPR52-1.0 is a validated, specific, and sensitive sensor that functions in vitro and ex vivo. The claim that electrically stimulated endogenous GPR52 ligand release occurs in the striatum is supported by the specificity of the GPR52 antagonist block using ex vivo brain slices, however, once again this aim is clouded by evidence that GPR52 is constitutively active. The sensor is presented as a tool for future deorphanization; however, this assumes that the physiological ligand is an agonist, which is unclear based on the evidence that GPR52 is constitutively active. If the authors can explain or adapt their experiments and manuscript in the context of GPR52 constitutive activity, this will be useful work to the community. The impact of this work is likely to be moderate to high within the specialized communities studying orphan GPCRs, neuronal signaling, and neuropsychiatric disease. The GRAB sensor strategy has already generated widely adopted tools for other receptors, and a validated GPR52 sensor would fill a genuine gap. The GRAB technology makes GPR52-1.0 directly applicable to in vivo studies. It is likely that GPR52-1.0 could be replicated for other orphan receptors to facilitate their deorphanization.

      Strengths:

      (1) Systematic and rigorous sensor optimization and characterization by screening ~800 variants with iterative linker and cpEGFP mutation step. The resulting EC50 values are characterized in HEK293T and cultured neurons.

      (2) Testing GPR52-1.0 against a broad panel of neurotransmitters with no detectable off-target activation strengthens confidence in sensor specificity.

      (3) The use of a selective antagonist to confirm specificity, both in cell lines and in brain slices, strengthens the conclusions significantly.

      (4) Electrically stimulated GPR52-1.0 fluorescence changes in ex vivo striatal slices are blocked by a GPR52 antagonist. This is the most biologically significant result in the manuscript, as GPR52-related diseases can involve the striatum.

      Weaknesses:

      (1) The work, both experimentally and in its presentation, is not put into the context of what is known about GPR52 pharmacology and signaling. It is reported by multiple groups that GPR52 has high constitutive activity and does not require a ligand for high levels of signaling (PMID: 32076264, PMID: 26384023). The authors should clarify whether GPR52-1.0 senses constitutive activation and whether baseline fluorescence is stable over the timescale of their experiments. The cell and mouse work needs to be reframed and conducted in the context of the high basal activity of the receptor, or the authors need to explain the differences between their study and other studies.

      (2) The electrical stimulation used in brain slice experiments is non-specific. This could be activating many cell types and neurotransmitter systems simultaneously. The pharmacological block by the GPR52 antagonist is reassuring, but the identity of the molecules driving the signal remains unknown. It could be that GPR52 is constitutively active, and that the electrical stimulation drives higher expression of GPR52 and thus constitutive signaling. This constitutive signaling can then be inhibited by the GPR52 antagonist. In this scenario, there would be no endogenous GPR52 agonist invoked by electrical stimulation.

      (3) The ex vivo brain slice data rely on n=9 slices without reporting the number of animals that the slices come from. Given the importance of this result, more biological replicates and clear reporting of animal numbers would strengthen confidence.

      (4) The manuscript does not benchmark GPR52-1.0 against existing approaches (e.g., HTRF, BRET, or calcium mobilization assays) to contextualize its advantages in a drug-discovery or screening workflow.

      (5) The paper's title references deorphanization, but the authors have made no attempts toward this deorphanization. No candidate ligand molecules are identified or tested.

    2. Reviewer #2 (Public review):

      Summary:

      This study describes the development of GPR52-1.0, a novel genetically encoded fluorescent sensor for the orphan GPCR, GPR52. The authors also utilized this sensor in vivo in brain slices and discovered that striatal neuron excititation may activate GPR52.

      Strengths:

      (1) The design and validation of the sensor are elegant, thorough, and rigorous. The authors conducted a systematic and impressive optimization screen of numerous variants to arrive at the top-performing GPR52-1.0 sensor. The subsequent characterization is thorough, showing excellent membrane trafficking, appropriate pharmacological profiles (EC50, IC50) by the GPR52 chemical agonist/antagonist, rapid kinetics, and high specificity against a panel of common neurotransmitters. The functional characterization was also performed in multiple experimental systems.

      (2) The most exciting result is the observation that electrical stimulation may activate GPR52 in the striatum, an area where GPR52 is natively expressed. The blockade by a specific GPR52 antagonist confirms its specificity and provides the first direct evidence for activity-dependent, native GPR52 ligand in striata. This finding alone is a significant step forward and strongly justifies the sensor's development.

      (3) The manuscript is well-written and logically structured. The figures are clear and effectively illustrate the key data, from the initial screening process to the final ex vivo validation. The authors did not overstate their discoveries.

      Weaknesses:

      (1) The sensor specificity is largely based on a single agonist/antagonist, and it might be desired for future studies to confirm this by additional agonists/antagonists or by point mutagenesis that is known to influence GPR52 activation (for example, the ones reported in (PMID: 40087539).

      (2) The discovery of the existence of activity-dependent, native GPR52 ligand(s) in striata is extremely exciting. This might be further strengthened by inhibiting synaptic transmitter release with TTX, calcium channel blockers, or SNARE complex disruptors, etc.

    1. Reviewer #1 (Public review):

      The manuscript titled," Sleep-Wake Transitions Are Impaired in the AppNL-G-F Mouse Model of Early Onset Alzheimer's Disease", is about a study of sleep/wake phenomena in a knockin mouse strain carrying, "three mutations in the human App gene associated with elevated risk for early onset AD". Traditional, in-depth, characterization of sleep/wake states, EEG parameters and response to sleep loss are employed to provide evidence, "supporting the use of this strain as a model to investigate interventions that mitigate AD burden during early disease stages". The sleep/wake findings of earlier studies (especially, Maezono, et al., 2020, as noted by the authors) were extended by several important, genotype-related observations, including age-related hyperactivity onset that is typically associated with increased arousal, a normal response to loss of sleep and to multiple sleep latency testing, and a stronger AD-like phenotype in females.

      The authors conclude that the AppNL-G-F mice demonstrate many of the human AD prodromal symptoms and suggest that this strain may serve as a model for prodromal AD in humans, confirming the earlier results and conclusions of Maezono, et al. Finally, based on state bout frequency and duration analyses, it is suggested that the AppNL-G-F mice may develop disruptions in mechanism(s) involved in state transition.

      The study appears to have been, technically, rigorously conducted with high quality, in depth traditional assessment of both state and EEG characteristics with the concordant addition of activity and temperature.

      The major strengths of this study derive from observations that the AppNL-G-F mice: 1) are more hyperactive in association with decreased transitions between states; 2) maintain a normal response to sleep deprivation and have normal MSLT results; and 3) display a sex specific, "stronger" insomnia-like effect of the knockin in females.

      The weaknesses stem from the study's impact being limited due to its being largely confirmatory of the Maezono et al. study with advances of import to a potentially, more focused field. Further, the authors conclude that AppNL-G-F mice have disrupted mechanism(s) responsible for state transition, however these were not directly examined. The rationale for this conclusion is stated by the authors as based on the observations that bouts of both W and NREM tend to be longer in duration and decreased in frequency in AppNL-G-F mice. Although altered mechanism(s) of state transition (it is not clear what mechanisms are referenced here) cannot be ruled out, other explanations require careful consideration. It is acknowledged in the discussion that increased arousal in association with hyperactivity would be expected to result in increased duration of W bouts during the active phase. This would also predictably result in greater sleep pressure that is typically associated with more consolidated NREM bouts, consistent with the observations of bout duration and frequency. The results from the MSLT tests and lack of increased EEG slow wave activity are problematic to interpret in the context of increased arousal (evidenced by the hyperactivity) since these phenomena, known to be enhanced in association with increased sleep pressure, may be masked by arousal (or by some other effect of the altered genotype). Perhaps, the effect on consolidation is less sensitive. Thus, understanding the underlying mechanism(s) involved is needed for conclusion(s) about sleep pressure.

      Overall, this study's findings are valuable but with respect to the claims, incomplete.