10,000 Matching Annotations
  1. Dec 2025
    1. Reviewer #1 (Public review):

      Summary:

      Marchand et al. seek to understand how basement membrane (BM) is initially assembled around developing vasculature (and by extension basement membrane assembly generally progresses). To do this, they use an established cell culture system that is amenable to advanced microscopy techniques, including high-resolution fluorescence imaging and atomic force microscopy. This allows them to produce very high-quality imaging data that includes both protein localization and matrix topography in 3D. They show that fibronectin (FN) is remodeled as collagen IV (Col IV) assembles. Lysyl oxidase-like-2 (LOXL2) is needed for this process, and without it, BM does not form correctly, cells cannot adhere to BM, and cells also don't correctly form junctions with other cells.

      Detailed Review:

      The authors provide quantitative measures of BM fibril assembly at the earliest timepoints. They show two phases of growth - initial deposition, elongation, and interconnection of short fibers; the second is a significant thickening. As the BM forms, FN is immediately associated with filaments, but laminin and Col IV are not associated with fibers as detected by AFM. LOXL2 is associated with fibers, similar to FN. At a later time point, Col IV becomes associated with fibers, but laminin never does. Likely FN templates LOXL2, which crosslink Col IV into fibrils over time. Could the authors comment on how this data fits with in vivo data from model organisms? Also, I would like to know if they can uncouple LOXL2 from the FN matrix? Could you express a mutated form of LOXL2 that cannot interact with FN but still is able to crosslink Col IV?)

      Depletion of LOXL2 supports this mechanism. Without it, Col IV and FN are uncoupled and accumulate as large aggregates rather than a complex fibrous network. Further, long-term thickening/growth of the fibronectin network is inhibited, indicating LOXL2 and/or the Col IV network positively reinforces fibronectin assembly. (Does LOXL2 directly act on FN, or is this effect dependent on Col IV? The nature of the molecular interactions between COL IV, LOXL2, and FN will be an important future research area.)

      Next, Marchand et al. ask if failure to produce mature BM (induced by LOXL2 depletion) has consequences for underlying cells. They demonstrate a clear shift in the orientation of actin towards a linear alignment, and similarly, cells change shape from round to very elongated. Cell junctions also shifted to a linear arrangement in LOXL2 depletion. This fits with the known balance between cell-ECM and cell-cell adhesion. The changes in actin network and cell shape/adhesion correlate with a change in B1 integrin localization upon LOXL2 depletion. B1 integrin colocalized with sparse early FN fibers, but was absent from large FN aggregates that occur if LOXL2 is depleted. Similar reorganization of integrin adhesion components (FAK, Vinc, Pax). Clearly, there is feedback between BM assembly and cell junction organization. But I think the authors might emphasize to the reader that this normally reinforces the epithelial fate of these cells. It's less a balance and more like a tipping point. (Related to this section, I could not read Figure 4C graphs unless I enlarged them to 300%.)

      Finally, they culture cells on micro groove plates, with or without LOXL2. The grooved substrate can orient the cells, and they show this is superseded by BM once it assembles. Without LOXL2 cells on micro-grooved substrates become disorganized, similar to their observation on flat surfaces (elongated cells, linear actin, etc.). This demonstrates a switch from external topographical cues to self-generated BM. This is consistent with the idea of reorganizing junctions to produce a stable epithelial tube. I was very interested in their 3D culture. The effect of BM assembly on tube diameter makes sense. But how does BM assembly support complex capillary functions like branching? (Can they force branching with targeted mutations that decouple integrin from the BM?) Is this a question of change to cell fate? (Are other remodeling enzymes activated after initial BM assembly that could support growth and/or branching?)

    2. Reviewer #2 (Public review):

      Summary:

      The manuscript entitled "Adaptation of endothelial cells to microenvironment 1 topographical cues through lysyl oxidase like-2-mediated basement membrane scaffolding" by Marchand et al., aims to determine the impact of LOXL2 on the dynamic formation of vascular basement membranes (BMs).

      Strengths:

      This manuscript includes a nice combination of different methods and presents the results in an appropriate manner.

      Furthermore, the results clearly demonstrate an impact of LOXL2 on collagen IV-fibronectin organization and topography. Finally, the proper arrangement of collagen IV-fibronectin impacts cell alignment.

      Weaknesses:

      An open question for this reviewer is what the real take-home message of the present study is? Can the authors deliver novel insight into BM formation transferable to the in vivo situation? Why do the authors not see a "real" BM? Could it be that in vivo endothelial cells do not build the vascular BM alone? Thus, are additional cell types needed? And what will happen then if LOXL2 expression is altered?

      Major comments:

      (1) Can the authors show that LOXL2 cross-links fibronectin and collagen IV?

      (2) The authors stated that LOXL2 depletion affects cytoskeleton arrangements and cell shape. Could it be that this is simply a secondary effect mediated primarily through the altered cross-linking of fibronectin and collagen IV?

      (3) Can the authors perform cell adhesion studies on CDMs derived from wild-type versus LOXL2-deficient cells?

      (4) Line 226-230: Can the authors provide the proliferation data of wildtype and LOXL2-depleted cells supporting their Src and Akt activity findings?

      (5) Line 298-299: The authors made a statement about laminin. Can the authors think of a co-culture of wild-type versus LOXL2-depleted endothelial cells in combination with pericytes or fibroblasts, as these cells contribute to the efficient assembly of a functional vascular basement membrane (10.1182/blood-2009-05-222364). Here, you can determine the impact of altered fibronectin-collagen IV cross-linking on laminin network formation. This will affect their conclusion in lines 299-304, as these facts are solely based on endothelial cells.

      (6) Suggestion: can the authors supplement recombinant LOXL2 protein in its active version to the LOXL2-depleted endothelial cells to rescue the observed changes? And further compare LOXL2 enzymatic function with LOXL2 protein harbouring Zn instead of Cu, making it enzymatic inactive. Here, the authors might be able to strengthen their hypothesis that LOXL2 might bridge fibronectin and collagen IV or link both proteins.

      (7) There are grammatical errors in the manuscript that the authors should work on.

    3. Reviewer #3 (Public review):

      This important study shows that basement membrane (BM) generation is a key event mediating cell 3D organization in response to microenvironmental cues. Such a mechanism participates in the endothelial cell capacity to organize into a capillary vessel segment through the shift of interactions with the interstitial ECM to interactions with vascular BM. This is particularly important for the developing, sprouting vasculature. The authors conclusively show, using TIRF and atomic force microscopy substantiated by 3D sprouting assays, that the lysyl oxidase Loxl2 plays a key role herein. With respect to translation into clinical practice, the dysregulation of adherens junctions and barrier properties associated with Loxl2 dysfunction mediated defects in BM supports its involvement in the progression of long-term microvascular diseases.

      An outstanding question not answered in the current MS is how Loxl2 integrates into the Dll4-Notch mediated control of tip-stalk-phalanx cell differentiation in the developing (embryonic) vasculature. The authors focused a lot on Loxl2 loss of function; however, in a (patho)physiological context, Loxl2 gain of function would be relevant. Loxl2 is a hypoxia target and Loxl2 accumulates in the ECM upon hypoxic stress (as occurs during ischemic CVD, stroke/heart infarct). It would be interesting to know how Loxl2 gain-of-function impacts BM assembly, endothelial behavior, mechanosensing, and vessel angiogenic remodeling.

    1. Reviewer #1 (Public review):

      Summary:

      The authors utilize genetic code expansion to tag TDP-43 and G3BP1, and evaluate this protein tagging system (ANAP) compared to antibodies, and evaluate protein trafficking and stress granule formation in response to stress with sodium arsenite treatment. They find similar staining to antibodies in HeLa cells, mouse embryonic stem cells, and primary mouse cortical neurons. This is a useful study that demonstrates the utility of ANAP tagging to evaluate ALS proteins.

      Strengths:

      Rescue of cell survival by ANAP-tagged TDP-43 is compelling

      Weaknesses:

      While the ANAP-tagged proteins had similar distributions to antibody staining, there were some discrepancies that may be more explained by the technique than by novel findings, as the authors suggested. The inclusion of additional controls to evaluate this would be helpful.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, Chen and colleagues describe a novel means of labeling two RNA-binding proteins, G3BP1 and TDP-43, using genetic code expansion. Overexpressed constructs that incorporate the intrinsically-fluorescent non-canonical amino acid Anap redistribute to cytoplasmic granules upon application of external stressors such as sodium arsenite. Similar labeling and redistribution of overexpressed G3BP1 and TDP-43 were observed in cultures of mouse primary neurons.

      Strengths:

      Genetic code expansion and non-canonical amino acid labeling have quite a few advantages over traditional fusion proteins for tracking protein redistribution in living cells. The authors show that they are able to label exogenous G3BP1 and TDP-43 with the non-canonical amino acid Anap and follow labeled proteins in living cells with and without stress.

      Weaknesses:

      The authors do not convincingly leverage the advantages of genetic code expansion in the current study. There is no specific question posed by the authors that can be or is answered using this approach, and several of the experiments lack critical controls. This is also not the first example of TDP-43 labeling by genetic code expansion (see PMID: 38290242). As a result, the study as a whole adds little to our understanding of protein trafficking and behavior under stress.

    1. Reviewer #2 (Public review):

      This paper proposes two changes to classic RSA, a popular method to probe neural representation in neuroimaging experiments: computing RSA at row/column level of RDM, and using linear mixed modeling to compute second level statistics, using the individual row/columns to estimate a random effect of stimulus. The benefit of the new method is demonstrated using simulations and a re-analysis of a prior fMRI dataset on object perception and memory encoding.

      The author's claim that tRSA is a promising approach to perform more complete modeling of cogneuro data, and to conceptualize representation at the single trial/event level (cf Discussion section on P42), is appealing.

      In their revised manuscript, the authors have addressed some previous concerns, now referencing more literature aiming to improve RSA and its associated statistical inferences, and providing more guidance on methodological considerations in the Discussion. However, I wish the authors had more extensively edited the Introduction to better contextualize the work and clarify the specific settings in which they see the method as being beneficial over classic RSA. For example, some of the limitations of cRSA mentioned on page 6, e.g. related to presenting the same stimuli to multiple subjects, seem to be quite specific to settings where the researcher expects differential responses across subjects to fundamentally alter the interpretation, rather than something that will just average out by repeatedly offering the same stimulus, or combining data across subjects. It's not clear to me how the switch from 'matrix-level' to 'row-level' analysis in tRSA necessarily addresses this problem. I would be very helpful if the authors would more explicitly outline what problem the row-level aspect of tRSA is solving; what problem statistical inference via LMM is solving; and walk the reader through a very specific use case (perhaps a toy version of the real-data experiment which is now at the end of the paper). Explaining the utility of tRSA for experimental settings in which assessing representational strength for a single-events is crucial would clarify the contribution of this new method better.

      A few weaknesses mentioned in my previous review were not adequately addressed. To demonstrate the utility of the method on real neural recordings, only a single dataset is used with a quite complicated experimental design; it's not clear if there is any benefit of using tRSA on a simpler real dataset. Moreover, the cells of an RDM/RSM reflect pairwise comparisons between response patterns. Because the response patterns are repeatedly compared, the cells of this matrix are not independent of one another. While the authors show examples that failure to meet independence assumptions do not affect results in their specific dataset, it does not get acknowledged as a problem at a more fundamental level. Finally, while the paper now states that 'simulations and example tRSA code' are publicly available, the link points to the lab's general github page containing many lab repositories, in which I could not identify a specific repository related to this paper. This is disappointing given that the main goal of this manuscript is to provide a new method that they encourage others to use; a clear pointer to available code is only a minimal requirement to achieve that goal. A dedicated repository, including documentation, READMEs and tutorials/demo's to run simulations, compare methods, etc. would greatly enhance the paper's contribution.

    1. Reviewer #2 (Public review):

      Summary:

      Egawa et al describe the developmental timeline of the assembly of nodes of Ranvier in the chick brainstem auditory circuit. In this unique system, the spacing between nodes varies significantly in different regions of the same axon from early stages, which the authors suggest is critical for accurate sound localization. Egawa et al set out to determine which factors regulate this differential node spacing. They do this by using immunohistological analyses to test the correlation of node spacing with morphological properties of the axons, and properties of oligodendrocytes, glial cells that wrap axons with the myelin sheaths that flank the nodes of Ranvier. They find that axonal structure does not vary significantly, but that oligodendrocyte density and morphology varies in the different regions traversed by these axons, which suggests this is a key determinant of the region-specific differences in node density and myelin sheath length. They also find that differential oligodendrocyte density is partly determined by secreted neuronal signals, as (presumed) blockage of vesicle fusion with tetanus toxin reduced oligodendrocyte density in the region where it is normally higher. Based on these findings, the authors propose that oligodendrocyte morphology, myelin sheath length, and consequently nodal distribution are primarily determined by intrinsic oligodendrocyte properties rather than neuronal factors such as activity.

      Significance:

      In our view the study tackles a fundamental question likely to be of interest to a specialized audience of cellular neuroscientists. This descriptive study is suggestive that in the studied system, oligodendrocyte density determines the spacing between nodes of Ranvier, but further manipulations of oligodendrocyte density per se are needed to test this convincingly.

    2. Reviewer #3 (Public review):

      Summary:

      The authors have investigated the myelination pattern along the axons of chick avian cochlear nucleus. It has already been shown that there are regional differences in the internodal length of axons in the nucleus magnocellularis. In the tract region across the midline, internodes are longer than in the nucleus laminaris region. Here the authors suggest that the difference in internodal length is attributed to heterogeneity of oligodendrocytes. In the tract region oligodendrocytes would contribute longer myelin internodes, while oligodendrocytes in the nucleus laminaris region would synthesize shorter myelin internodes. Not only length of myelin internodes differs, but also along the same axon unmyelinated areas between two internodes may vary. This is an interesting contribution since all these differences contribute to differential conduction velocity regulating ipsilateral and contralateral innervation of coincidence detector neurons. However, the demonstration falls rather short of being convincing.

      Significance:

      The authors suggest that the difference in internodal length is attributed to heterogeneity of oligodendrocytes. In the tract region oligodendrocytes would contribute longer myelin internodes, while oligodendrocytes in the nucleus laminaris region would synthesize shorter myelin internodes. Not only length of myelin internodes differs, but also along the same axon unmyelinated areas between two internodes may vary. This is an interesting contribution since all these differences contribute to differential conduction velocity regulating ipsilateral and contralateral innervation of coincidence detector neurons.

      Editors' note: The authors have written an effective rebuttal to the previous round of reviews.

    1. Reviewer #1 (Public review):

      Summary:

      The authors attempted to clarify the impact of N protein mutations on ribonucleoprotein (RNP) assembly and stability using analytical ultracentrifugation (AUC) and mass photometry (MP). These complementary approaches provide a more comprehensive understanding of the underlying processes. Both SV-AUC and MP results consistently showed enhanced RNP assembly and stability due to N protein mutations.<br /> The overall research design appears well planned, and the experiments were carefully executed.

      Strengths:

      SV-AUC, performed at higher concentrations (3 µM), captured the hydrodynamic properties of bulk assembled complexes, while MP provided crucial information on dissociation rates and complex lifetimes at nanomolar concentrations. Together, the methods offered detailed insights into association states and dissociation kinetics across a broad concentration range. This represents a thorough application of solution physicochemistry.

      Weaknesses:

      Unlike AUC, MP observes only a part of solution. In MP, bound molecules are accumulated on the glass surface (not dissociated) thus concentration in solution should change as time develops. How does such concentration change impact the result shown here?

      Comments on revisions:

      The response from the authors is appropriate and reasonable.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors apply a variety of biophysical and computational techniques to characterize the effects of mutations in the SARS-CoV-2 N protein on the formation of ribonucleoprotein particles (RNPs). They find convergent evolution in multiple repeated independent mutations strengthening binding interfaces, compensating for other mutations that reduce RNP stability but which enhance viral replication.

      Strengths:

      The authors assay the effects of a variety of mutations found in SARS-CoV-2 variants of concern using a variety of approaches, including biophysical characterization of assembly properties of RNPs, combined with computational prediction of the effects of mutations on molecular structures and interactions. The findings of the paper contribute to our increasing understanding of the principles driving viral self-assembly, and increases the foundation for potential future design of therapeutics such as assembly inhibitors.

      Weaknesses:

      For the most part, the paper is well-written, the data presented support the claims made, and the arguments made easy to follow. However, I believe that parts of the presentation could be substantially improved. I found portions of the text to be overly long and verbose and likely could be substantially edited; the use of acronyms and initialisms is pervasive, making parts of the exposition laborious to follow; and portions of the figures are too small and difficult to read/understand.

      Comments on revisions:

      The authors have adequately addressed all of my concerns.

    3. Reviewer #3 (Public review):

      Summary:

      This manuscript investigates how mutations in the SARS-CoV-2 nucleocapsid protein (N) alter ribonucleoprotein (RNP) assembly, stability, and viral fitness. The authors focus on mutations such as P13L, G214C, G215C combining biophysical assays (SV-AUC, mass photometry, CD spectroscopy, EM), VLP formation, and reverse genetics. They propose that SARS-CoV-2 exploits "fuzzy complex" principles, where distributed weak interfaces in disordered regions allow both stability and plasticity, with measurable consequences for viral replication.

      Strengths:

      * The paper demonstrates a comprehensive integration of structural biophysics, peptide/protein assays, VLP systems, and reverse genetics.

      * Identification of both de novo (P13L) and stabilizing (G214C/G215C) interfaces provides a mechanistic insight into RNP formation.

      * Strong application of the "fuzzy complex" framework to viral assembly, showing how weak/disordered interactions support evolvability, is a significant conceptual advance in viral capsid assembly.

      * Overall, the study provides a mechanistic context for mutations that have arisen in major SARS-CoV-2 variants (Omicron, Delta, Lambda) and a mechanistic basis for how mutations influence phenotype via altered biomolecular interactions.

      Weaknesses:

      The weaknesses are shared via detailed comments to follow.

      Comments on revisions:

      The authors have addressed the criticisms of the original manuscript satisfactorily.

    1. Reviewer #1 (Public review):

      Summary:

      Cai et al have investigated the role of msiCAT-tailed mitochondrial proteins that frequently exist in glioblastoma stem cells. Overexpression of msiCAT-tailed mitochondrial ATP synthase F1 subunit alpha (ATP5) protein increases the mitochondrial membrane potential and blocks mitochondrial permeability transition pore formation/opening. These changes in mitochondrial properties provide resistance to staurosporine (STS)-induced apoptosis in GBM cells. Therefore, msiCAT-tailing can promote cell survival and migration, while genetic and pharmacological inhibition of msiCAT-tailing can prevent the overgrowth of GBM cells.

      Strengths:

      The CATailing concept has not been explored in cancer settings. Therefore, the present provides new insights for widening the therapeutic avenue.

    2. Reviewer #2 (Public Review):

      This work explores the connection between glioblastoma, mito-RQC, and msiCAT-tailing. They build upon previous work concluding that ATP5alpha is CAT-tailed and explore how CAT-tailing may affect cell physiology and sensitivity to chemotherapy. The authors conclude that when ATP5alpha is CAT-tailed, it either incorporates into the proton pump or aggregates and that these events dysregulate MPTP opening and mitochondrial membrane potential and that this regulates drug sensitivity. This work includes several intriguing and novel observations connecting cell physiology, RQC, and drug sensitivity. This is also the first time this reviewer has seen an investigation of how a CAT tail may specifically affect the function of a protein.

      Comment from the Reviewing Editor:

      The revisions made the work more valuable and convincing. The authors adequately made point-by-point response to the reviewers comments by providing new data. Image acquisition and data analysis were further clarified. NEMF knockdown experiments and additional control data for ATP5α featuring a poly-glycine-serine (GS) tail support their conclusion.

    1. Reviewer #1 (Public review):

      This study uses a new 'hidden multivariate pattern method' to parse in time and space the neural events intervening between stimulus and response in an immediately-reported perceptual decision, and use the resultant neural event timing information to show quite convincingly that Pieron's and Fechner's laws can apply in concert at distinct processing levels.

      They designed a clever contrast comparison paradigm in which the contrast difference is kept constant while widely manipulating mean contrast, so that sensory encoding of the overall stimulus would be boosted with increasing mean contrast, whereas decision difficulty and hence duration would increase. With this, they found that the time intervening between early sensory-evoked components, up to an 'N200'-type component associated with launching the decision process, varies inversely with contrast according to Pieron's law. Meanwhile, the time intervals running up to neural events peaking near the time of response, consistent with decision termination, increases with contrast, fitting Fechner's law. Further, a diffusion model whose drift rates are scaled by Fechner's law, fit to RT, predicts the observed proportion of correct responses very well.

      In the process of review and revision it was highlighted that presumably the full sequence of neural events intervening between stimulus and response is massively task dependent, but;

      (1) The method is intended to capture all key components that specifically covary with RT, as opposed to each and every component in general, and

      (2) The main conclusions of the study mentioned above do not change whether the method is set up to track three neural events, or five, as was done in the final analysis.

      The propensity for topographic parsing algorithms to potentially lump-together distinct processes that partially co-evolve was acknowledged, but a key clarification in review was that even though the method entails a specification of neural event duration - which was changed from 50 to 25 ms - the success of the method is not strongly contingent on the actual underlying neural events in question having that very duration - indeed, the components extracted using that short template duration can be observed to evolve over a longer time frame associated with the Fechner diffusion process.

      Notably, standard average event-related potential analysis was able to show expected amplitude effects - where sensory signals increased with contrast but decision signals decreased - but assessment of the by-trial distribution of their timings was grealy aided by the HMP method.

      One of the stages of processing implicated in the parsing analysis was linked to attention orientation, and the authors speculate on whether this might reflect a spatially-selective deployment of attention or a resource allocation, but sensibly refrain from speculating too far since the focus here was on the sensory and decision process durations and their respective adherence to Pieron and Fechner's laws.

    2. Reviewer #2 (Public review):

      Summary:

      The authors decomposed response times into component processes and manipulated the duration of these processes in opposing directions by varying contrast, and overall by manipulating speed-accuracy tradeoffs. They identify different processes and their durations by identifying neural states in time and validate their functional significance by showing that their properties vary selectively as expected with predicted effects of the contrast manipulation. They identify 4 processes: stimulus encoding, attention orienting, decision and motor execution. These map onto 5 classical event related potentials. The decision-making component matched the CPP and its properties varied with contrast and predicted decision-accuracy.

      Strengths:

      The design of the experiment is remarkable and offers crucial insights. The analyses techniques are beyond-state-of-the art and the analyses are well motivated and offer clear insights.

      Weaknesses:

      The number of identified events depends on the parameter setting of the analysis. While the authors discuss weaknesses of the approach this needs to be made explicit as well. It is also unclear to what extent topographies map onto processes since e.g., different combinations of sources can lead to the same scalp topography.

    3. Reviewer #3 (Public review):

      Summary:

      In this manuscript the authors examine the processing stages involved in perceptual decision-making using a new approach to analysing EEG data, combined with a critical stimulus manipulation. This new EEG analysis method enables single-trial estimates of the timing and amplitude of transient changes in EEG time-series recurrent across trials in a behavioural task. The authors find evidence for five events between stimulus onset and the response in a two-spatial-interval visual discrimination task. By analysing the timing and amplitude of these events in relation to behaviour and the stimulus manipulation, the authors interpret these events as related to separable processing stages for stimulus encoding (first two events), attention orientation (second event), motor planning (fourth event) and decision (deliberation, final event). This is largely consistent with previous findings from both event-related potentials (across trials) and single-trial estimates using decoding techniques and neural network approaches. However, by taking a data-driven approach (as opposed to theory-driven decoding analyses) a more nuanced picture emerges: there are several stimulus encoding steps which may contribute differently to behaviour, and decision processes extend beyond the planning of the motor response.

      Strengths:

      This work is not only important for the conceptual advance, but also in promoting this new analysis technique, which will likely prove useful in future research. For the broader picture, this work is an excellent example of the utility of neural measures for mental chronometry.

      Weaknesses:

      Though beyond the scope of this manuscript, these results should be considered within the broader decision-making literature, where task or domain-specific processes may not generalise (for example, in value-based decision-making).

    1. Reviewer #1 (Public review):

      Summary:

      The authors report the results of a tDCS brain stimulation study (verum vs sham stimulation of left DLPFC; between-subjects) in 46 participants, using an intense stimulation protocol over 2 weeks, combined with an experience-sampling approach, plus follow-up measures after 6 months.

      Strengths:

      The authors are studying a relevant and interesting research question using an intriguing design, following participants quite intensely over time and even at a follow-up time point. The use of an experience-sampling approach is another strength of the work.

      Weaknesses:

      There are quite a few weaknesses, some related to the actual study and some more strongly related to the reporting about the study in the manuscript. The concerns are listed roughly in the order in which they appear in the manuscript.

      (1) In the introduction, the authors present procrastination nearly as if it were the most relevant and problematic issue there is in psychology. Surely, procrastination is a relevant and study-worthy topic, but that is also true if it is presented in more modest (and appropriate) terms. The manuscript mentions that procrastination is a main cause of psychopathology and bodily disease. These claims could possibly be described as 'sensationalized'. Also, the studies to support these claims seem to report associations, not causal mechanisms, as is implied in the manuscript.

      (2) It is laudable that the study was pre-registered; however, the cited OSF repository cannot be accessed and therefore, the OSF materials cannot be used to (a) check the preregistration or to (b) fill in the gaps and uncertainties about the exact analyses the authors conducted (this is important because the description of the analyses is insufficiently detailed and it is often unclear how they analyzed the data).

      (3) Related to the previous point: I find it impossible to check the analyses with respect to their appropriateness because too little detail and/or explanation is given. Therefore, I find it impossible to evaluate whether the conclusions are valid and warranted.

      (4) Why is a medium effect size chosen for the a priori power analysis? Is it reasonable to assume a medium effect size? This should be discussed/motivated. Related: 18 participants for a medium effect size in a between-subjects design strikes me as implausibly low; even for a within-subjects design, it would appear low (but perhaps I am just not fully understanding the details of the power analysis).

      (5) It remains somewhat ambiguous whether the sham group had the same number of stimulation sessions as the verum stimulation group; please clarify: Did both groups come in the same number of times into the lab? I.e., were all procedures identical except whether the stimulation was verum or sham?

      (6) The TDM analysis and hyperbolic discounting approach were unclear to me; this needs to be described in more detail, otherwise it cannot be evaluated.

      (7) Coming back to the point about the statistical analyses not being described in enough detail: One important example of this is the inclusion of random slopes in their mixed-effects model which is unclear. This is highly relevant as omission of random slopes has been repeatedly shown that it can lead to extremely inflated Type 1 errors (e.g., inflating Type 1 errors by a factor of then, e.g., a significant p value of .05 might be obtained when the true p value is .5). Thus, if indeed random slopes have been omitted, then it is possible that significant effects are significant only due to inflated Type 1 error. Without more information about the models, this cannot be ruled out.

      (8) Related to the previous point: The authors report, for example, on the first results page, line 420, an F-test as F(1, 269). This means the test has 269 residual degrees of freedom despite a sample size of about 50 participants. This likely suggests that relevant random slopes for this test were omitted, meaning that this statistical test likely suffers from inflated Type 1 error, and the reported p-value < .001 might be severely inflated. If that is the case, each observation was treated as independent instead of accounting for the nestedness of data within participants. The authors should check this carefully for this and all other statistical tests using mixed-effects models.

      (9) Many of the statistical procedures seem quite complex and hard to follow. If the results are indeed so robust as they are presented to be, would it make sense to use simpler analysis approaches (perhaps in addition to the complex ones) that are easier for the average reader to understand and comprehend?

      (10) As was noted by an earlier reviewer, the paper reports nearly exclusively about the role of the left DLPFC, while there is also work that demonstrates the role of the right DLPFC in self-control. A more balanced presentation of the relevant scientific literature would be desirable.

      (11) Active stimulation reduced procrastination, reduced task aversiveness, and increased the outcome value. If I am not mistaken, the authors claim based on these results that the brain stimulation effect operates via self-control, but - unless I missed it - the authors do not have any direct evidence (such as measures or specific task measures) that actually capture self-control. Thus, that self-control is involved seems speculation, but there is no empirical evidence for this; or am I mistaken about this? If that is indeed correct, I think it needs to be made explicit that it is an untested assumption (which might be very plausible, but it is still in the current study not empirically tested) that self-control plays any role in the reported results.

      (12) Figures 3F and 3H show that procrastination rates in the active modulation group go to 0 in all participants by sessions 6 and 7. This seems surprising and, to be honest, rather unlikely that there is absolutely no individual variation in this group anymore. In any case, this is quite extraordinary and should be explicitly discussed, if this is indeed correct: What might be the reasons that this is such an extreme pattern? Just a random fluctuation? Are the results robust if these extreme cells are ignored? The authors remove other cells in their design due to unusual patterns, so perhaps the same should be done here, at least as a robustness check.

      (13) The supplemental materials, unfortunately, do not give more information, which would be needed to understand the analyses the authors actually conducted. I had hoped I would find the missing information there, but it's not there.

      In sum, the reported/cited/discussed literature gives the impression of being incomplete/selectively reported; the analyses are not reported sufficiently transparently/fully to evaluate whether they are appropriate and thus whether the results are trustworthy or not. At least some of the patterns in the results seem highly unlikely (0 procrastination in the verum group in the last 2 observation periods), and the sample size seems very small for a between-subjects design.

    2. Reviewer #2 (Public review):

      Summary:

      Chen and colleagues conducted a cross-sectional longitudinal study, administering high-definition transcranial direct stimulation targeting the left DLPFC to examine the effect of HD-tDCS on real-world procrastination behavior. They find that seven sessions of active neuromodulation to the left DLPFC elicited greater modulation of procrastination measures (e.g., task-execution willingness, procrastination rates, task aversiveness, outcome value) relative to sham. They report that tDCS effects on task-execution willingness and procrastination are mediated by task outcome value and claim that this neuromodulatory intervention reduces procrastination rates quantified by their task. Although the study addresses an interesting question regarding the role of DLPFC on procrastination, concerns about the validity of the procrastination moderate enthusiasm for the study and limit the interpretability of the mechanism underlying the reported findings.

      Strengths:

      (1) This is a well-designed protocol with rigorous administration of high-definition transcranial direct current stimulation across multiple sessions. The approach is solid and aims to address an important question regarding the putative role of DLPFC in modulating chronic procrastination behavior.

      (2) The quantification of task aversiveness through AUC metrics is a clever approach to account for the temporal dynamics of task aversiveness, which is notoriously difficult to quantify.

      Weaknesses:

      (1) The lack of specificity surrounding the "real-world measures" of procrastination is problematic and undermines the strength of the evidence surrounding the DLPFC effects on procrastination behavior. It would be helpful to detail what "real-world tasks" individuals reported, which would inform the efficacy of the intervention on procrastination performance across the diversity of tasks. It is also unclear when and how tasks were reported using the ESM procedure. Providing greater detail of these measures overall would enhance the paper's impact.

      (2) Additionally, it is unclear whether the reported effects could be due to differential reporting of tasks (e.g., it could be that participants learned across sessions to report more achievable or less aversive task goals, rather than stimulation of DLPFC reducing procrastination per se). It would be helpful to demonstrate whether these self-reported tasks are consistent across sessions and similar in difficulty within each participant, which would strengthen the claims regarding the intervention.

      (3) It would be helpful to show evidence that the procrastination measures are valid and consistent, and detail how each of these measures was quantified and differed across sessions and by intervention. For instance, while the AUC metric is an innovative way to quantify the temporal dynamics of task-aversiveness, it was unclear how the timepoints were collected relative to the task deadline. It would be helpful to include greater detail on how these self-reported tasks and deadlines were determined and collected, which would clarify how these procrastination measures were quantified and varied across time.

      (4) There are strong claims about the multi-session neuromodulation alleviating chronic procrastination, which should be moderated, given the concerns regarding how procrastination was quantified. It would also be helpful to clarify whether DLPFC stimulation modulates subjective measures of procrastination, or alternatively, whether these effects could be driven by improved working memory or attention to the reported tasks. In general, more work is needed to clarify whether the targeted mechanisms are specific to procrastination and/or to rule out alternative explanations.

    3. Reviewer #3 (Public review):

      This manuscript explores whether high-definition transcranial direct current stimulation (HD-tDCS) of the left DLPFC can reduce real-world procrastination, as predicted by the Temporal Decision Model (TDM). The research question is interesting, and the topic - neuromodulation of self-regulatory behavior - is timely.

      However, the study also suffers from a limited sample size, and sometimes it was difficult to follow the statistics.

      The preregistration and ecological design (ESM) are commendable, but I was not able the find the preregistration, as reported in the paper.

      Overall, the paper requires substantial clarification and tightening.

    1. Reviewer #1 (Public review):

      Summary:

      This paper is a comprehensive review of perturbation studies and the state-dependence of the brain's response to perturbation at the circuit, mesoscale, and macroscale levels.

      Strengths:

      The strengths of the paper are the thorough description of many perturbation studies at different levels of organization, and the integration of both experimental and modeling studies. The review clearly communicates the need to consider (1) brain or local-population state, and (2) multiple levels of organization, in order to understand perturbation responses. Another major strength is the ability for the reader to reproduce figures using the EBRAINS platform.

      Weaknesses:

      Two major points of improvement should be resolved with the review, in order to make it useful for a broad audience.

      The first is that the review does not include a significant integration across scales, and as a result, reads like three separate (though comprehensive) reviews. Currently, the only integration across the scales is in the brief conclusion paragraph. I would recommend adding an additional section, in which the overarching picture is discussed. (i.e. a unifying view of state dependence, and what is learned by considering across scales). This need not be too long, but it should be longer than a single conclusion paragraph.

      The second major weakness is that there is a lack of clarity on many points throughout, which is needed for the reader to fully understand the results described.

    2. Reviewer #2 (Public review):

      Summary:

      In this review article, the authors discuss the whole-brain activity changes induced by brain stimulation. They review the literature on how these activity changes depend on the cognitive state of the brain and divide the results by the scale of the change being induced, from microscale changes across small groups of neurons, up to macroscale changes across the entire brain. Finally, they describe attempts to model these changes using computational models.

      Strengths:

      The review provides an overview of the results within this subfield of neuroscience, and the authors are able to discuss a lot of prior results. The framing of the changes in neuronal activity in terms of computational changes is also a helpful approach.

      Weaknesses:

      However, the authors are not able to contextualize these results within a single framework, i.e. explaining from first principles how different aspects of stimulus-induced changes interact to generate functional changes in the brain, and how different changes - at distinct spatiotemporal scales - combine to form larger effects. This is a significant weakness in generating a review of the literature, since the authors do not provide a cohesive conceptual framework on which to frame the results. Similarly, the authors do not explain how their different computational models fit together, and how one can get a singular computational understanding of the distinct mechanisms of brain activity changes due to stimulation under different brain states, by combining the results derived from each separate model.

      Major Comments:

      (1) The authors have written this review as if it were intended for an audience who is already familiar with the topics. For example, they introduce concepts like complexity, spiral vs planar waves, without much explanation.

      (2) Regarding complexity, the authors present a quantification termed PCI. However, in the associated box, they state that PCI could be implemented in a number of different ways, using analogous metrics (which are, nonetheless, not identical). Yet the authors simply claim that all these metrics are sufficiently similar to be grouped together as "PCI". The authors do not provide much intuition about this, and they also don't present any other potential quantifications. This makes any interpretation of their results strongly dependent on your understanding of the concept of PCI. It would be helpful to present some other, analogous metric to demonstrate that the results that the authors are focusing on are not somehow tied to the specific computational structure of the PCI metric.

      (3) The authors divide the review into sections organized by the spatial extent of the effects that they are exploring (e.g. from microscale to macroscale). However, they don't bring together these insights into a cohesive structure - for example, by providing potential explanations of the macroscale effects by using the microscale changes.

      (4) The authors completely ignore any aspect of cell-type specificity in their review, despite the known importance of specific cell types at the microcircuit scale. This makes it difficult to map their results onto the true biological system.

      (5) The authors introduce several different computational models, such as the Hopf model, the AdEx model, and the MPR model. However, they do not provide the reader with a conceptual understanding of the structure of each of these models (except through potentially more complex terminology, e.g. the Hopf model is a "phenomenological Stuart-Landau nonlinear oscillator"). Additionally, though they present the results of each simulation, they don't provide the reader with intuition about how these models compare against each other, and how best to interpret results derived from each model.

      (6) In several cases, the authors make statements that they appear to believe to be completely straightforward (and require no justification), but that do not appear so to the reader. For example, they mention: "In wakefulness and REM sleep, ..., the membrane potential is depolarized and close to the spike threshold, which explains why neurons respond more reliably and with less response variability compared with slow-wave sleep". However, this statement is not obvious to the reader and requires explanation (for example, in a system that is close to balance, bringing cells closer to the firing threshold can result in increased response jitter).

    1. Reviewer #1 (Public review):

      Summary:

      In the paper, the authors review literature on synchronous activity, its relationship to brain state, and the multi-scale mechanisms underlying it.

      Strengths:

      The overall strength of the paper is the wide range of information reviewed, and the diversity of perspectives/approaches it brings together.

      Weaknesses:

      However, this strength is also the source of its major weaknesses - namely, that the overall structure lacks clarity, and there are inconsistencies throughout. Overall, in the opinion of this reviewer, the manuscript reads as disorganized and incomplete. Major and minor points are delineated below.

      Major points:

      (1) Most of the text in many figures was too small to read.

      (2) Terminology is inconsistent throughout the manuscript. What is the difference between slow oscillations and delta waves? Sometimes the term slow waves is used instead. For sleep state, sometimes the term SWS is used, sometimes non-REM. Similarly, "spindle activity" is not defined, but simply stated as if the reader knows. This brings up two issues: (a) the manuscript should be clearer and more consistent about its terminology, and (b) it's unclear who is the intended readership of the review - is it a pedagogical review for people outside the field of sleep and slow oscillations, or is it meant to be a consensus statement for readers who are already in the field in which a pressing concern has been addressed? It seems part way between these two, and as a result, is ineffective at either goal.

      (3) I suggest the authors look again at the overall structure and flow of the review... many sections feel redundant, and it's unclear how they fit together into a single review.

      (4) There are many speculative statements in the review that are not justified or explained sufficiently for the reader. For example: "While highly regular slow waves in vivo suggest a single mechanism of generation, namely local cortical circuits, irregular cycles are compatible with a larger role of subcortical nuclei, ..."; "The involvement of different cortical areas and subcortical nuclei can form the basis of these different roles in memory.". For these statements, I assume the relationship between slow wave statistics, subcortical nuclei, and memory either has been written about before, and then should be cited and summarized, or is a novel claim of the authors, which then should be explained and defended rather than stated. There are other similar examples, and I suggest the authors go through the manuscript and make sure that it's clear what is a novel claim of the authors vs a cited claim, and make sure that both are sufficiently justified for the reader.

      (5) An especially notable example can be found in the section on the role of the thalamus, where the authors state that they "hold that slow oscillations are fundamentally cortical". However, this section is far too short, and very little evidence is provided to back up this claim. Please review the ways in which the thalamus modulates, and, e.g., ways in which up-down is similar/different without the thalamus.

    2. Reviewer #2 (Public review):

      Summary:

      In this review article, the authors discuss the correlated dynamical states associated with distinct cognitive states, including those associated with anesthesia and sleep. They present evidence that these states are primarily cortically generated, and demonstrate the properties of these dynamical states at different levels, from the microscale dynamics in individual neurons to the macroscale dynamics across the brain.

      Strengths:

      Multiple groups have been adding to this field over the past decades, and therefore, a review of this literature is very helpful. This review collates a large amount of the literature within this field into a single document, which should make it a valuable resource within this area of neuroscience.

      Weaknesses:

      Unfortunately, this review does not seem to be a balanced viewpoint of the field in question. Although there are a lot of authors in the review, it feels as if they are from a common school of thought. The authors provide only a single perspective on these dynamical states, focusing on the perspective of wave-like electrical dynamics across the cortex. Their perspective is embedded in methods such as EEG and LFP recordings. This makes the work hard to interpret outside of the field in which the authors reside. Indeed, the review seems intended for a more specialized audience.

      In addition, the article reads more like a catalog of prior studies as opposed to a true synthesis across the large volume of data in this field that highlights links across multiple sources. Hence, it does not seem to provide a novel way of understanding the dynamics involved in cognitive state transitions.

      We have included more details on these general comments below:

      Major Comments:

      (1) The authors have written this review as if it were intended for an audience who is already familiar with these topics. They do not define many of the terms that they introduce within the review, including concepts like complexity, metastability, and oscillations that are fundamental to the concepts that the authors are introducing. Though these may seem like first principles concepts to the authors, they often introduce assumptions that may be unfamiliar to the general reader. For example, are slow wave oscillations periodic? A naïve reader may assume that oscillations - characterized by their frequency - should be somewhat periodic, but that is often not the case. For a journal with a general biological science readership, it would be particularly helpful for each of these terms to be formally defined and characterized.

      (2) It would be helpful for the authors to reframe their work in different perspectives and to incorporate all the literature on the dynamics of cortical brain states, and not simply the work that is most familiar to them. As one example, the authors do not discuss cell-type-specific changes in brain state during anesthesia and in altered states of consciousness (including dissociative states and hallucinatory states). There is recent work in this vein (Suzuki and Larkum, 2020; Vesuna et al, 2020; Bharioke, Munz et al, 2023), and yet the authors do not discuss these papers.

      (3) Given the authors' clear, extensive knowledge of their field, it would also be extremely helpful for the authors to reframe fundamental concepts in terms of neuronal population activity, trajectory analyses, etc. This would enable a more general audience to better understand their work.

      (4) The authors have one section focused on thalamic contributions to cortical wave-like activity. This is a cursory treatment of a subject that is quite controversial in the field. It would be helpful if the authors could provide a more balanced consideration of all the evidence regarding potential thalamocortical interactions and their role in wave-like activity.

      (5) The authors present many computational models and describe the results of simulations with these different models. However, this doesn't provide the reader with intuition about what each model adds or removes from the true biological picture. It would be helpful for the authors to provide some intuition about the assumptions and constraints that underlie each model.

      (6) The authors state that "The main mechanism [of slow oscillatory dynamics] consists of a combination of two ingredients: the recurrent connectivity, which maintains the excitability in the network, and adaptation, an activity-dependent fatigue variable that provides inhibitory feedback". They make this statement as a fact, yet they don't provide much justification for it. Additionally, it's not clear that any other possible combination of ingredients would be able to produce slow oscillatory dynamics.

      (7) The authors often define one concept in terms of other equally complex concepts. For example: "EIA (excitatory-inhibitory with adaptation) cortical circuits then display the typical slow-fast dynamics of relaxation oscillators". The reader would need an explanation of slow-fast dynamics and relaxation oscillators to understand this line, neither of which is provided in the text.

      (8) When discussing sleep, the authors do not discuss REM sleep, focusing on slow-wave non-REM sleep. It would be helpful if the authors could at least frame the full sleep cycle and discuss why they are focusing on one part of it.

      (9) The authors introduce the concept of sleep spindles without any explanation.

    1. Reviewer #1 (Public review):

      Summary:

      This report demonstrates that the gene expression output of the Wnt pathway, when controlled precisely by a synthetic light-based input, depends substantially on the frequency of stimulation. The particular frequency-dependent trend that is observed - anti-resonance, a suppression of target gene expression at intermediate frequencies given a constant duty cycle - is a novel aspect that has not been clearly shown before for this or other signaling pathways. The paper provides both clear experimental evidence of the phenomenon with engineered cellular systems and a model-based analysis of how the pairing of rate constants in pathway activation/deactivation could result in such a trend.

      Strengths:

      This report couples in vitro experimental data with an abstracted mathematical model. Both of these approaches appear to be technically sound and to provide consistent and strong support for the main conclusion. The experimental data are particularly clear, and the demonstration that Brachyury expression is subject to anti-resonance in ESCs is particularly compelling. The modeling approach is reasonably scaled for the system at the level of detail that is needed in this case, and the hidden variable analysis provides some insight into how the anti-resonance works.

      In this revised manuscript, the authors have addressed issues in presentation and in discussing the broader relevance of their study to other pathways. Other limitations of the paper, including the fact that the anti-resonance phenomenon has not yet been demonstrated using physiological Wnt ligands and that the model has not been validated using experimental manipulations to establish that the mechanisms of the cell system and the model are the same, were deemed out of the scope of this initial demonstration by both the reviewers and authors. These questions will provide an interesting basis for further studies.

    2. Reviewer #2 (Public review):

      Summary:

      By combining optogenetics with theoretical modelling the authors identify an anti-resonance behavior in the WnT signaling pathway. This behavior is manifested as a minimal response at a certain stimulation frequency. Using an abstracted hidden variable model, the authors explain their findings by a competition of timescales. Furthermore, they experimentally show that this anti-resonance influences the cell fate decision involved in human gastrulation.

      Strengths:

      - This interdisciplinary study combines precise optogenetic manipulation with advanced modelling.<br /> - The results are directly tested in two different systems: HEK293T cells and H9 human embryonic stem cells.<br /> - The model is implemented based on previous literature and has two levels of detail: i) a detailed biochemical model and ii) an abstract model with a hidden parameter

      Weaknesses:

      - While the experiments provide both single-cell data and population data, the model only considers population data.<br /> - Although the model captures the experimental data for TopFlash very well, the beta-Cat curves (Fig 2B) are only described qualitatively. This discrepancy is not discussed.

      Overall Assessment:

      The authors convincingly identified an anti-resonance behavior in a signaling pathway that is involved in cell fate decisions. The focus on a dynamic signal and the identification of such a behavior is important. I believe that the model approach of abstracting a complicated pathway with a hidden variable is an important tool to obtain an intuitive understanding of complicated dependencies in biology. Such a combination of precise ontogenetical manipulation with effective models will provide a new perspective on causal dependencies in signaling pathways and should not be limited only to the system that the authors study.

      Comments on revisions:

      I don't have any more comments for the authors and would like to congratulate them for the nice piece of work!

    1. Reviewer #1 (Public review):

      Summary:

      This study presents a valuable contribution of NO signaling in zebrafish retinal regeneration in larval animals. The data on NO signaling are solid. There are multiple limitations to the study, but these are largely acknowledged by the authors in the revised text.

      Strengths:

      New data on NO signaling is valuable to the field but may be limited to larval "regeneration".

      Weaknesses:

      A weakness of the approach is testing cone ablation and regeneration in early larval animals. A near identical study was already done by Hoang et al 2020 in the adult zebrafish, a more relevant biological timepoint.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript Ye at al. examine the sequence of events that occur in the damaged zebrafish Muller glia (MG) in states between quiescence and the onset of proliferation. Using an inducible metronidazole (MTZ) and nitroreductase system to ablate red/green cones in larval zebrafish, they identify a novel transitional MG state that is characterized by the expression of cxcl18b. Using trajectory analysis from single-cell RNA-seq datasets, they find that cxcl18b is expressed before MG expression PCNA and become proliferative. They find that cxcl18b expression peaks in MG at approximately 24 hours post injury (hpi) and rapidly declines as MG proliferate following injury. In a most interesting finding, the authors find a link between nos2b-dependent nitric oxide signaling and cxcl18b-mediated proliferation. Mutagenesis of nos2b decreases MG proliferation. The mechanism linking NO signaling to proliferation was suggested to function via notch signaling as pharmacological inhibition of nitric oxide signaling resulted in elevated Notch activity, thus preventing MG proliferation. The authors suggest a model whereby cxcl18b induces autocrine NO signaling in MG to reduce activity of Notch3, thereby promoting MG proliferation.

      Strengths:

      The authors utilize a number of sophisticated transgenic approaches and generate novel lines that will have value to the field. The identification of a novel cxcl18b transition state is exciting and the putative link between NO signaling and Notch activity would provide new insight into the drivers of Muller glia proliferation.

      Weaknesses:

      While the overall model is appealing and may serve as a foundation for future studies, some information gaps remain and certain conclusions rely on correlational data. The cellular expression of nos2b remains unclear as the single-cell RNA-seq data cannot provide expression data that matches RT-PCR results. The temporal sequence of events are based on transgene expression in the Tg(cxcl18b:GFP) lines, where persistence of the GFP fluorescence may not reflect endogenous cxcl18b. The identity of putative cxcl18b receptors on MG to support an autocrine signaling pathway remains unclear. Nevertheless, this is an interesting study that should open new avenues of exploration.

    1. Reviewer #1 (Public review):

      Summary:

      Lai and Doe address the integration of spatial information with temporal patterning and genes that specify cell fate. They identify the Forkhead transcription factor Fd4 as a lineage-restricted cell fate regulator that bridges transient spatial transcription factors to terminal selector genes in the developing Drosophila ventral nerve cord. The experimental evidence convincingly demonstrates that Fd4 is both necessary for late-born NB7-1 neurons, but also sufficient to transform other neural stem cell lineages toward the NB7-1 identity. This work addresses an important question that will be of interest to developmental neurobiologists: How can cell identities defined by initial transient developmental cues be maintained in the progeny cells, even if the molecular mechanism remains to be investigated? In addition, the study proposes a broader concept of lineage identity genes that could be utilized in other lineages and regions in the Drosophila nervous system and in other species.

      Strengths:

      While the spatial factors patterning the neuroepithelium to define the neuroblast lineages in the Drosophila ventral nerve cord are known, these factors are sometimes absent or not required during neurogenesis. In the current work, Lai and Doe identified Fd4 in the NB7-1 lineage that bridges this gap and explains how NB7-1 neurons are specified after Engrailed (En) and Vnd cease their expression. They show that Fd4 is transiently co-expressed with En and Vnd and is present in all nascent NB7-1 progenies. They further demonstrate that Fd4 is required for later-born NB7-1 progenies and sufficient for the induction of NB7-1 markers (Eve and Dbx) while repressing markers of other lineages when force-expressed in neural progenitors, e.g., in the NB5-6 lineage and in the NB7-3 lineage. They also demonstrate that, when Fd4 is ectopically expressed in NB7-3 and NB5-6 lineages, this leads to the ectopic generation of dorsal muscle-innervating neurons. The inclusion of functional validation using axon projections demonstrates that the transformed neurons acquire appropriate NB7-1 characteristics beyond just molecular markers. Quantitative analyses are thorough and well-presented for all experiments.

      Weaknesses:

      (1) While Fd4 is required and sufficient for several later-born NB7-1 progeny features, a comparison between early-born (Hb/Eve) and later-born (Run/Eve) appears missing for pan-progenitor gain of Fd4 (with sca-Gal4; Figure 4) and for the NB7-3 lineage (Figure 6). Having a quantification for both could make it clearer whether Fd4 preferentially induces later-born neurons or is sufficient for NB7-1 features without temporal restriction.

      (2) Fd4 and Fd5 are shown to be partially redundant, as Fd4 loss of function alone does not alter the number of Eve+ and Dbx+ neurons. This information is critical and should be included in Figure 3.

      (3) Several observations suggest that lineage identity maintenance involves both Fd4-dependent and Fd4-independent mechanisms. In particular, the fact that fd4-Gal4 reporter remains active in fd4/fd5 mutants even after Vnd and En disappear indicates that Fd4's own expression, a key feature of NB7-1 identity, is maintained independently of Fd4 protein. This raises questions about what proportion of lineage identity features require Fd4 versus other maintenance mechanisms, which deserves discussion.

      (4) Similarly, while gain of Fd4 induces NB7-1 lineage markers and dorsal muscle innervation in NB5-6 and NB7-3 lineages, drivers for the two lineages remain active despite the loss of molecular markers, indicating some regulatory elements retain activity consistent with their original lineage identity. It is therefore important to understand the degree of functional conversion in the gain-of-function experiments. Sparse labeling of Fd4 overexpressing NB5-6 and NB7-3 progenies, as was done in Seroka and Doe (2019), would be an option.

      (5) The less-penetrant induction of Dbx+ neurons in NB5-6 with Fd4-overexpression is interesting. It might be worth the authors discussing whether it is an Fd4 feature or an NB5-6 feature by examining Dbx+ neuron number in NB7-3 with Fd4-overexpression.

      (6) It is logical to hypothesize that spatial factors specify early-born neurons directly, so only late-born neurons require Fd4, but it was not tested. The model would be strengthened by examining whether Fd4-Gal4-driven Vnd rescues the generation of later-born neurons in fd4/fd5 mutants.

      (7) It is mentioned that Fd5 is not sufficient for the NB7-1 lineage identity. The observation is intriguing in how similar regulators serve distinct roles, but the data are not shown. The analysis in Figure 4 should be performed for Fd5 as supplemental information.

    2. Reviewer #2 (Public review):

      Summary:

      Via a detailed expression analysis, they find that Fd4 is selectively expressed in embryonic NB7-1 and newly born neurons within this lineage. They also undertake a comprehensive genetic analysis to provide evidence that fd4 is necessary and sufficient for the identity of NB7-1 progeny.

      Strengths:

      The analysis is both careful and rigorous, and the findings are of interest to developmental neurobiologists interested in molecular mechanisms underlying the generation of neuronal diversity. Great care was taken to make the figures clear and accessible. This work takes great advantage of years of painstaking descriptive work that has mapped embryonic neuroblast lineages in Drosophila.

      Weaknesses:

      The argument that Fd4 is necessary for NB7-1 lineage identity is based on a Fd4/Fd5 double mutant. Loss of fd4 alone did not alter the number of NB7-1-derived Eve+ or Dbx+ neurons. The authors clearly demonstrate redundancy between fd4 and fd5, and the fact that the LOF analysis is based on a double mutant should be better woven through the text. The authors generated an Fd5 mutant. I assume that Fd5 single mutants do not display NB7-1 lineage defects, but this is not stated. The focus on Fd4 over Fd5 is based on its highly specific expression profile and the dramatic misexpression phenotypes. But the LOF analysis demonstrates redundancy, and the conclusions in the abstract and through the results should reflect the existence of Fd5 in the conclusions of this manuscript.

      It is notable that Fd4 overexpression can rewire motor circuits. This analysis adds another dimension to the changes in transcription factor expression and, importantly, demonstrates functional consequences. Could the authors test whether U4 and U5 motor axon targeting changes in the fd4/fd5 double mutant? To strengthen claims regarding the importance of fd4/fd5 for lineage identity, it would help to address terminal features of U motorneuron identity in the LOF condition.

    3. Reviewer #3 (Public review):

      The goal of the work is to establish the linkage between the spatial transcription factors (STFs) that function transiently to establish the identities of the individual NBs and the terminal selector genes (typically homeodomain genes) that appear in the newborn post-mitotic neurons. How is the identity of the NB maintained and carried forward after the spatial genes have faded away? Focusing on a single neuroblast (NB 7-1), the authors present evidence that the fork-head transcription factor, fd4, provides a bridge linking the transient spatial cues that initially specified neuroblast identity with the terminal selector genes that establish and maintain the identity of the stem cell's progeny.

      The study is systematic, concise, and takes full advantage of 40+ years of work on the molecular players that establish neuronal identities in the Drosophila CNS. In the embryonic VNC, fd4 is expressed only in the NB 7-1 and its lineage. They show that Fd4 appears in the NB while the latter is still expressing the Spatial Transcription Factors and continues after the expression of the latter fades out. Fd4 is maintained through the early life of the neuronal progeny but then declines as the neurons turn on their terminal selector genes. Hence, fd4 expression is compatible with it being a bridging factor between the two sets of genes.

      Experimental support for the "bridging" role of Fd4 comes from a set of loss-of-function and gain-of-function manipulations. The loss of function of Fd4, and the partially redundant gene Fd5, from lineage 7-1 does not affect the size of the lineage, but terminal markers of late-born neuronal phenotypes, like Eve and Dbx, are reduced or missing. By contrast, ectopic expression of fd4, but not fd5, results in ectopic expression of the terminal markers eve and Dbx throughout diverse VNC lineages.

      A detailed test of fd4's expression was then carried out using lineages 7-3 and 5-6, two well-characterized lineages in Drosophila. Lineage 7-3 is much smaller than 7-1 and continues to be so when subjected to fd4 misexpression. However, under the influence of ectopic Fd4 expression, the lineage 7-3 neurons lost their expected serotonin and corazonin expression and showed Eve expression as well as motoneuron phenotypes that partially mimic the U motoneurons of lineage 7-1.

      Ectopic expression of Fd4 also produced changes in the 5-6 lineage. Expression of apterous, a feature of lineage 5-6, was suppressed, and expression of the 7-1 marker, Eve, was evident. Dbx expression was also evident in the transformed 5-6 lineages, but extremely restricted as compared to a normal 7-1 lineage. Considering the partial redundancy of fd4 and fd5, it would have been interesting to express both genes in the 5-6 lineage. The anatomical changes that are exhibited by motoneurons in response to Fd4 expression confirm that these cells do, indeed, show a shift in their cellular identity.

    1. Reviewer #1 (Public review):

      The study introduces an open-source, cost-effective method for automating the quantification of male social behaviors in Drosophila melanogaster. It combines machine-learning based behavioral classifiers developed using JAABA (Janelia Automatic Animal Behavior Annotator) with inexpensive hardware constructed from off-the-shelf components. This approach addresses the limitations of existing methods, which often require expensive hardware and specialized setups. The authors demonstrate that their new "DANCE" classifiers accurately identify aggression (lunges) and courtship behaviors (wing extension, following, circling, attempted copulation, and copulation), closely matching manually annotated ground-truth data. Furthermore, DANCE classifiers outperform existing rule-based methods in accuracy. Finally, the study shows that DANCE classifiers perform as well when used with low-cost experimental hardware as with standard experimental setups across multiple paradigms, including RNAi knockdown of the neuropeptide Dsk and optogenetic silencing of dopaminergic neurons.

      The authors make creative use of existing resources and technology to develop an inexpensive, flexible, and robust experimental tool for the quantitative analysis of Drosophila behavior. A key strength of this work is the thorough benchmarking of both the behavioral classifiers and the experimental hardware against existing methods. In particular, the direct comparison of their low-cost experimental system with established systems across different experimental paradigms is compelling. A weakness of the study is that the use of JAABA-based classifiers to analyze aggression and courtship is not novel (Tao et al., J. Neurosci., 2024; Sten et al., Cell, 2023; Chiu et al., Cell, 2021; Isshi et al., eLife, 2020; Duistermars et al., Neuron, 2018). However, the demonstration the JAABA classifiers they developed work as well without expensive experimental hardware opens the door to more low-cost systems for quantitative behavior analysis.

      In summary, this work provides a practical and accessible approach to quantifying Drosophila behavior, reducing the economic barriers to the study of the neural and molecular mechanisms underlying social behavior.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript addresses the development of a low-cost behavioural setup and standardised open-source high performing classifiers for aggression and courtship behaviour. It does so by using readily available laboratory equipment and previously developed software packages. By comparing the performance of the setup and the classifiers to previously developed ones, this study shows the classifier's overperformance and the reliability of the low-cost setup in recapitulating previously described effects of different manipulations on aggression and courtship.

      Strengths:

      The newly developed classifiers for lunges, wing extension, attempted copulation, copulation, following, circling, perform better than previously available developed ones. The behavioural setup developed is low cost and reliably allows analysis of both aggression and courtship behaviour, validated through social experience manipulation (social isolation), gene knock (Dsk in Dilp2 neurons) and neuronal inactivation (dopaminergic neurons) know to affect courtship and aggression.

      Weaknesses:

      This framework only encompasses analysis of lunges, while aggression encompasses multiple behaviours. Even though DANCE can serve as a template allowing future development of additional classifiers, the current study compares performance to CADABRA which analyses further aggression behaviours, making the comparisons incomplete.

    3. Reviewer #3 (Public review):

      The study by Yadav et al. describes a new setup to quantify a number of aggression and mating behaviors in Drosophila melanogaster. The investigation of these behaviors requires the analysis of large number of videos to identify each kind of behavior displayed by a fly. Several approaches to automatize this process have been published before, but each of them has their limitations. The authors set out to develop a new setup that includes a very low-cost, easy to acquire hardware and open-source machine-learning classifiers to identify and quantify the behavior.

      Strengths:

      (1) The study demonstrates that their cheap, simple, and easy to obtain hardware works just as well as custom-made, specialized hardware for analyzing aggression and mating behavior. This enables the setup to be used in a wide range of settings, from research with limited resources to classroom teaching.

      (2) The authors used previously published software to train new classifiers for detecting a range of behaviors related to aggression and mating and make them freely available. The classifiers are very positively benchmarked against a manually acquired ground-truth as well as existing algorithms.

      (3) The study demonstrates the applicability of the setup (hardware and classifiers) to common methods in the field by confirming a number of expected phenotypes with their setup.

      Taken together, this work can greatly facilitate research of aggression and mating in Drosophila. The combination of low-cost, off-the-shelf hardware and open-source, robust software enables researchers with very little funding or technical expertise to contribute to the scientific process, and also allows large-scale experiments, for example, in classroom teaching with many students, or for systematic screenings.

    1. Reviewer #1 (Public review):

      Summary:

      Biomolecular condensates are essential part of cellular homeostatic regulation. In this manuscript, authors develop a theoretical framework for phase separation of membrane bound proteins. They show the effect of non-dilute surface binding and phase separation on tight junction protein organization.

      Strengths:

      It is an important study considering the phase separation of membrane bound molecules are taking the center stage of signaling, spanning from immune signaling to cell-cell adhesion. A theoretical framework will help biologists to quantitatively interpret their findings.

      Weaknesses:

      Understandably, authors used one system to test their theory (ZO-1). However, to establish a theoretical framework, this is sufficient.

      Comments on revisions:

      I do not recommend new experiments. The manuscript is clear and establishes a new step in understanding the physical chemistry of biomolecular condensates.

    1. Reviewer #1 (Public review):

      Summary:

      This useful study provides incomplete evidence of an association between atovaquone-proguanil use (as well as toxoplasmosis seropositivity) and reduced Alzheimer's dementia risk. The study reinforces findings that VZ vaccine lowers AD risk and suggests that this vaccine may be an effect modifier of A-P's protective effect. Strengths of the study include two extremely large cohorts, including a massive validation cohort in the US. Statistical analyses are sound, and the effect sizes are significant and meaningful. The CI curves are certainly impressive.

      Weaknesses include the inability to control for potentially important confounding variables. In my view, the findings are intriguing but remain correlative / hypothesis generating rather than causative. Significant mechanistic work needs to be done to link interventions which limit the impact of Toxoplasmosis and VZV reactivation on AD.

      Weaknesses:

      Major:

      (1) Most of the individuals in the study received A-P for malaria prophylaxis as it is not first line for Toxo treatment. Many (probably most) of these individuals were likely to be Toxo negative (~15% seropositive in the US), thereby eliminating a potential benefit of the drug in most people in the cohort. Finally, A-P is not a first line treatment for Toxo because of lower efficacy.

      (2) A-P exposure may be a marker of subtle demographic features not captured in the dataset such as wealth allowing for global travel and/or genetic predisposition to AD. This raises my suspicion of correlative rather than casual relationships between A-P exposure and AD reduction. The size of the cohort does not eliminate this issue, but rather narrows confidence intervals around potentially misleading odds ratios which have not been adjusted for the multitude of other variables driving incident AD.

      (3) The relationship between herpes virus reactivation and Toxo reactivation seems speculative.

      (4) A direct effect on A-P on AD lesions independent on infection is not considered as a hypothesis. Given the limitations above and effects on metabolic pathways, it probably should be. The Toxo hypothesis would be more convincing if the authors could demonstrate an enhanced effect of the drug in Toxo positive individuals without no effect in Toxo negative individuals.

      Minor:

      (5) "Clinically meaningful" should be eliminated from the discussion given that this is correlative evidence.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript examines the association between atovaquone/proguanil use, zoster vaccination, toxoplasmosis serostatus and Alzheimer's Disease, using 2 databases of claims data. The manuscript is well written and concise. The major concerns about the manuscript center around the indications of atovaquone/proguanil use, which would not typically be active against toxoplasmosis at doses given, and the lack of control for potential confounders in the analysis.

      Strengths:

      (1) Use of 2 databases of claims data.

      (2) Unbiased review of medications associated with AD, which identified zoster vaccination associated with decreased risk of AD, replicating findings from other studies.

      Weaknesses:

      (1) Given that atovaquone/proguanil is likely to be given to a healthy population who is able to travel, concern that there are unmeasured confounders driving the association.

      (2) The dose of atovaquone in atovaquone/proguanil is unlikely to be adequate suppression of toxo (much less for treatment/elimination of toxo), raising questions about the mechanism.

      (3) Unmeasured bias in the small number of people who had toxoplasma serology in the TriNetX cohort.

    1. Reviewer #1 (Public review):

      Disclaimer: While I am familiar with the CFS method and the CFS literature, I am not familiar with primate research or two-photon calcium imaging. Additionally, I may be biased regarding unconscious processing under CFS, as I have extensively investigated this area but have found no compelling evidence in favor of unconscious processing under CFS.

      This manuscript reports the results of a nonhuman-primate study (N=2 behaving macaque monkeys) investigating V1 responses under continuous flash suppression (CFS). The results show that CFS substantially suppressed V1 orientation responses, albeit slightly differently in the two monkeys. The authors conclude that CFS-suppressed orientation information "may not suffice for high-level visual and cognitive processing" (abstract).

      The manuscript is clearly written and well-organized. The conclusions are supported by the data and analyses presented (but see disclaimer). However, I believe that the manuscript would benefit from a more detailed discussion of the different results observed for monkeys A and B (i.e., inter-individual differences), and how exactly the observed results are related to findings of higher-order cognitive processing under CFS, on the one hand, and the "dorsal-ventral CFS hypothesis", on the other hand.

      Major Comments:

      (1) Some references are imprecise. For example, l.53: "Nevertheless, two fMRI studies reported that V1 activity is either unaffected or only weakly affected (Watanabe et al., 2011; Yuval-Greenberg & Heeger, 2013)". "To the best of my understanding, the second study reaches a conclusion that is entirely opposite to that of the first, specifically that for low-contrast, invisible stimuli, stimulus-evoked fMRI BOLD activity in the early visual cortex (V1-V3) is statistically indistinguishable from activity observed during stimulus-absent (mask-only) trials. Therefore, high-level unconscious processing under CFS should not be possible if Yuval-Greenberg & Heeger are correct. The two studies contradict each other; they do not imply the same thing.

      (2) Line 354: "The flashing masker was a circular white noise pattern with a diameter of 1.89{degree sign}{degree sign}, a contrast of 0.5, and a flickering rate of 10 Hz. The white noise consisted of randomly generated black and white blocks (0.07 × 0.07 each)." Why did the authors choose a white noise stimulus as the CFS mask? It has previously been shown that the depth of suppression engendered by CFS depends jointly on the spatiotemporal composition of the CFS and the stimulus it is competing with (Yang & Blake, 2012). For example, Hesselmann et al. (2016) compared Mondrian versus random dot masks using the probe detection technique (see Supplementary Figure S4 in the reference below) and found only a poor masking performance of the random dot masks.

      Yang, E., & Blake, R. (2012). Deconstructing continuous flash suppression. Journal of Vision, 12(3), 8. https://doi.org/10.1167/12.3.8

      Hesselmann, G., Darcy, N., Ludwig, K., & Sterzer, P. (2016). Priming in a shape task but not in a category task under continuous flash suppression. Journal of Vision, 16, 1-17.

      (3) Related to my previous point: I guess we do not know whether the monkeys saw the CF-suppressed grating stimuli or not? Therefore, could it be that the differences between monkey A and B are due to a different individual visibility of the suppressed stimuli? Interocular suppression has been shown to be extremely variable between participants (see reference below). This inter-individual variability may, in fact, be one of the reasons why the CFS literature is so heterogeneous in terms of unconscious cognitive processing: due to the variability in interocular suppression, a significant amount of data is often excluded prior to analysis, leading to statistical inconsistencies. Moreover, the authors' main conclusion (lines 305-307) builds on the assumption that the stimuli were rendered invisible, but isn't this speculation without a measure of awareness?

      Yamashiro, H., Yamamoto, H., Mano, H., Umeda, M., Higuchi, T., & Saiki, J. (2014). Activity in early visual areas predicts interindividual differences in binocular rivalry dynamics. Journal of Neurophysiology, 111(6), 1190-1202. https://doi.org/10.1152/jn.00509.2013

      (4) The authors refer to the "tool priming" CFS studies by Almeida et al. (l.33, l.280, and elsewhere) and Sakuraba et al. (l.284). A thorough critique of this line of research can be found here:

      Hesselmann, G., Darcy, N., Rothkirch, M., & Sterzer, P. (2018). Investigating Masked Priming Along the "Vision-for-Perception" and "Vision-for-Action" Dimensions of Unconscious Processing. Journal of Experimental Psychology. General. https://doi.org/10.1037/xge0000420

      This line of research ("dorsal-ventral CFS hypothesis") has inspired a significant body of behavioral and fMRI/EEG studies (see reference for a review below). The manuscript would benefit from a brief paragraph in the discussion section that addresses how the observed results contribute to this area of research.

      Ludwig, K., & Hesselmann, G. (2015). Weighing the evidence for a dorsal processing bias under continuous flash suppression. Consciousness and Cognition, 35, 251-259. https://doi.org/10.1016/j.concog.2014.12.010

    2. Reviewer #2 (Public review):

      Summary:

      The goal of this study was to investigate the degree to which low-level stimulus features (i.e., grating orientation) are processed in V1 when stimuli are not consciously perceived under conditions of continuous flash suppression (CFS). The authors measured the activity of a population of V1 neurons at single neuron resolution in awake fixating monkeys while they viewed dichoptic stimuli that consisted of an oriented grating presented to one eye and a noise stimulus to the other eye. Under such conditions, the mask stimulus can prevent conscious perception of the grating stimulus. By measuring the activity of neurons (with Ca2+ imaging) that preferred one or the other eye, the authors tested the degree of orientation processing that occurs during CFS.

      Strengths:

      The greatest strength of this study is the spatial resolution of the measurement and the ability to quantify stimulus representations during CSF in populations of neurons, preferring the eye stimulated by either the grating or the mask. There have been a number of prominent fMRI studies of CFS, but all of them have had the limitation of pooling responses across neurons preferring either eye, effectively measuring the summed response across ocular dominance columns. The ability to isolate separate populations offers an exciting opportunity to study the precise neural mechanisms that give rise to CFS, and potentially provide insights into nonconscious stimulus processing.

      Weaknesses:

      While this is an impressive experimental setup, the major weakness of this study is that the experiments don't advance any theoretical account of why CFS occurs or what CFS implies for conscious visual perception. There are two broad camps of thinking with regard to CFS. On the one hand, Watanabe et al. (2011) reported that V1 activity remained intact during CFS, implying that CFS interrupts stimulus processing downstream of V1. On the other hand, Yuval-Greenberg and Heeger (2013) showed that V1 activity is, in fact, reduced during CFS. By using a parametric experimental design, they measured the impact of the mask on the stimulus response as a function of contrast and concluded that the mask reduces the gain of neural responses to the grating stimulus. They presented a theoretical model in which the mask effectively reduced the SNR of the grating, making it invisible in the same way that reducing contrast makes a stimulus invisible.

      An important discussion point of Yuval-Greenberg and Heeger is that null results (such as those presented by Watanabe et al.) are difficult to interpret, as the lack of an effect may be simply due to insufficient data. I am afraid that this critique also applies to the present study. Here, the authors report that CFS effectively 'abolishes' tuning for stimuli in neurons preferring the eye with the grating stimulus. The authors would have been in a much stronger position to make this claim if they had varied the contrast of the stimulus to show that the loss of tuning was not simply due to masking. So, while this is an incredibly impressive set of measurements that in many ways raises the bar for in vivo Ca2+ imaging in behaving macaques, there isn't anything in the results that constitutes a real theoretical advance.

    3. Reviewer #3 (Public review):

      Summary:

      In this study, Tang, Yu & colleagues investigate the impact of continuous flash suppression (CFS) on the responses of V1 neurons using 2-photon calcium imaging. The report that CFS substantially suppressed V1 orientation responses. This suppression happens in a graded fashion depending on the binocular preference of the neuron: neurons preferring the eye that was presented with the marker stimuli were most suppressed, while the neurons preferring the eye to which the grating stimuli were presented were least suppressed. The binocular neuron exhibited an intermediate level of suppression.

      Strengths:

      The imaging techniques are cutting-edge, and the imaging results are convincing and consistent across animals.

      Weaknesses:

      I am not totally convinced by the conclusions that the authors draw based on their machine learning models.

    1. Reviewer #1 (Public review):

      Summary:

      In this study, participants completed two different tasks. A perceptual choice task in which they compared the sizes of pairs of items and a value-different task in which they identified the higher value option among pairs of items with the two tasks involving the same stimuli. Based on previous fMRI research, the authors sought to determine whether the superior frontal sulcus (SFS) is involved in both perceptual and value-based decisions or just one or the other. Initial fMRI analyses were devised to isolate brain regions that were activated for both types of choices and also regions that were unique to each. Transcranial magnetic stimulation was applied to the SFS in between fMRI sessions and it was found to lead to a significant decrease in accuracy and RT on the perceptual choice task but only a decrease in RT on the value-different task. Hierarchical drift diffusion modelling of the data indicated that the TMS had led to a lowering of decision boundaries in the perceptual task and a lowering of non-decision times on the value-based task. Additional analyses show that SFS covaries with model derived estimates of cumulative evidence, that this relationship is weakened by TMS.

      The paper has many strengths including the rigorous multi-pronged approach of causal manipulation, fMRI and computational modelling which offers a fresh perspective on the neural drivers of decision making. Some additional strengths include the careful paradigm design which ensured that the two types of tasks were matched for their perceptual content while orthogonalizing trial-to-trial variations in choice difficulty. The paper also lays out a number of specific hypotheses at the outset regarding the behavioural outcomes that are tied to decision model parameters and well justified.

    2. Reviewer #2 (Public review):

      Summary:

      The authors set out to test whether a TMS-induced reduction in excitability of the left Superior Frontal Sulcus influenced evidence integration in perceptual and value-based decisions. They directly compared behaviour-including fits to a computational decision process model---and fMRI pre and post TMS in one of each type of decision-making task. Their goal was to test domain-specific theories of the prefrontal cortex by examining whether the proposed role of the SFS in evidence integration was selective for perceptual but not value-based evidence.

      Strengths:

      The paper presents multiple credible sources of evidence for the role of the left SFS in perceptual decision making, finding similar mechanisms to prior literature and a nuanced discussion of where they diverge from prior findings. The value-based and perceptual decision making tasks were carefully matched in terms of stimulus display and motor response, making their comparison credible.

    1. Reviewer #1 (Public review):

      Summary:

      Age-related synaptic dysfunction can have detrimental effects on cognitive and locomotor function. Additionally, aging makes the nervous system vulnerable to late-onset neurodegenerative diseases. This manuscript by Marques et al. seeks to profile the cell surface proteomes of glia to uncover signaling pathways that are implicated in age-related neurodegeneration. They compared the glial cell-surface proteomes in the central brain of young (day 5) and old (day 50) flies, and identified the most up- and down-regulated proteins during the aging process. 48 genes were selected for analysis in a lifespan screen, and interestingly, most sex-specific phenotypes. Among these, adult-specific pan-glial DIP-β overexpression (OE) significantly increased the lifespan of both males and females and improved their motor control ability. To investigate the effect of DIP-β in the aging brain, Marques et al. performed snRNA-seq on 50-day-old Drosophila brains with or without DIP-β OE in glia. Cortex and ensheathing glia showed the most differentially expressed genes. Computational analysis revealed that glial DIP-β OE increased cell-cell communication, particularly with neurons and fat cells.

      Strengths:

      (1) State-of-the-art methodology to reveal the cell surface proteomes of glia in young and old flies.

      (2) Rigorous analyses to identify differentially expressed proteins.

      (3) Examination of up- and down-regulated candidates and identification of glial-expressed mediators that impact fly lifespan.

      (4) Intriguing sex-specific glial genes that regulate life span.

      (5) Follow-up RNA-seq analysis to examine cellular transcriptomes upon overexpression of an identified candidate (DIP-β).

      (6) A compelling dataset for the community that should generate extensive interest and spawn many projects.

      Weaknesses:

      (1) DIP-β OE using flySAM:

      a) These flies showed a larger increase in lifespan compared to using UAS-DIP-β (Figure 2 C, D). Do the authors think that flySAM is a more efficient way of OE than UAS? Also, the UAS construct would be specific to one DIP-β isoform, while flySAM would likely express all isoforms. Could this also contribute to the phenotypes observed?

      b) The Glial-GS>DIP-β flySAM flies without RU-486 have significantly shorter lifespans (Figure 2C) than their UAS-DIP-β counterparts. flySAM is lethal when expressed under the control of tubulin-GAL4 (Jia et al. 2018), likely due tothe toxicity of such high levels of overexpression. Is it possible that a larger increase in lifespan is due to the already reduced viability of these flies?

      c) Statistics: It is stated in the Methods that "statistical methods used are described in the figure legend of each relevant panel." However, there is no description of the statistics or sample sizes used in Figure 2.

      (2) Figure 3: The authors use a glial GeneSwitch (GS) to knock down and overexpress candidate genes. In Figure 3A, they look at glial-GS>UAS-GFP with and without RU. Without RU, there is no GFP expression, as expected. With RU, there is GFP expression. It is expected that all cell body GFP signal should colocalize with a glial nuclear marker (Repo). However, there is some signal that does not appear to be glia. Also, many glia do not express GFP, suggesting the glial GS driver does not label all glia. This could impact which glia are being targeted in several experiments.

      (3) It is interesting that sex-specific lifespan effects were observed in the candidate screen.

      a) The authors should provide a discussion about these sex-specific differences and their thoughts about why these were observed.

      b) The authors should also provide information regarding the sex of the flies used in the glial cell surface proteome study.

      c) Also, beyond the scope of this study, examining sex-specific glial proteomes could reveal additional insights into age-related pathways affecting males and females differentially.

      (4) The behavioral assay used in this study (climbing) tests locomotion driven by motor neurons. The proteomic analysis was performed with the central adult brain, which does not include the nerve cord, where motor neurons reside. While likely beyond the scope of this study, it would be informative to test other behaviors, including learning, circadian rhythms, etc.

      (5) It is surprising that overexpressing a CAM in glia has such a broad impact on the transcriptomes of so many different cell types. Could this be due to DIP-β OE maintaining the brain in a "younger" state and indirectly influencing the transcriptomes? Instead of DIP-β OE in glia directly influencing cell-cell interactions? Can the authors comment on this?

    2. Reviewer #2 (Public review):

      This manuscript presents an ambitious and technically innovative study that combines in situ cell-surface proteomics, functional genetic screening, and single-nucleus RNA sequencing to uncover glial factors that influence aging in Drosophila. The authors identify DIP-β as a glial protein whose overexpression extends lifespan and report intriguing sex-specific differences in lifespan outcomes. Overall, the study is conceptually compelling and offers a valuable dataset that will be of considerable interest to researchers studying glia-neuron communication, aging biology, and proteomic profiling in vivo.

      The in-situ proteomic labeling approach represents a notable methodological advance. If validated more extensively, it has the potential to become a widely used resource for probing glial aging mechanisms. The use of an inducible glial GeneSwitch driver is another strength, enabling the authors to carefully separate aging-relevant effects from developmental confounds. These technical choices meaningfully elevate the rigor of the study and support its central conclusions. The discovery of new candidate genes from the proteomics pipeline, including DIP-β, is intriguing and opens new avenues for understanding glial contributions to organismal lifespan. The observation of sex-specific lifespan effects is particularly interesting and warrants further exploration; the study sets the stage for future work in this direction.

      At the same time, several areas would benefit from clarification or additional analysis to fully support the manuscript's claims:

      (1) The manuscript frequently refers to "improved" or "increased" cell-cell communication following DIP-β overexpression, but the meaning of this term remains somewhat vague. Because the current analysis relies largely on transcriptomic predictions, it would be helpful to define precisely what metric is being used, e.g., increased numbers of predicted ligand-receptor interactions, enrichment of specific signaling pathways, or altered expression of communication-related components. Strengthening the mechanistic link between DIP-β, cell-cell communication, and lifespan extension, potentially through targeted validation of specific glial interactions, would substantially reinforce the interpretation.

      (2) The lifespan screen is central to the paper, and clearer visualization and contextualization of these results would significantly improve the manuscript's impact. For example, Figure 3D is challenging to interpret in its current form. More explicit presentation of which manipulations extend lifespan in each sex, along with effect sizes and significance values, would provide clarity. Including positive controls for lifespan extension would also help contextualize the magnitude of the observed effects. The reported effects of DIP-β, while promising, are modest relative to baseline effects of RU feeding, and a discussion of this would help appropriately calibrate the conclusions.

      (3) Several figures would benefit from improved labeling or more detailed legends. For instance, the meaning of "N" and "C" in Figure 1D is unclear; Figure 3A should clarify that Repo is a glial marker; and Figure 5C appears to have truncated labels. Reordering certain panels (e.g., moving control data in Figure 4A-B) may also improve narrative flow. These refinements would greatly aid reader comprehension.

      (4) A few claims would be strengthened by more specific references or acknowledgment of alternative interpretations. Examples include the phenoxy-radical labeling radius, the impact of H₂O₂ exposure, and the specificity of neutravidin. Additionally, downregulation of synapse-related GO terms may reflect age-related transcriptional changes rather than impaired glia-neuron communication per se, and this possibility should be recognized. The term "unbiased" to describe the screen may also be reconsidered, given the preselection of candidate genes.

      (5) Clarifying the rationale for focusing on central brain glia over optic-lobe glia would be useful.

    1. Reviewer #3 (Public review):

      Summary:

      Razlan and colleagues provide a detailed anatomical characterization of lamina I projection neurons in the mouse spinal cord that are densely innervated by primary afferents activated by cooling of the skin. The authors, building on their previous anatomical work, validate a Trpm8-Flp mouse line, show synaptic contacts between Trpm8⁺ boutons and projection neurons at the ultrastructural level, and demonstrate at the physiological level that these neurons specifically respond to cooling stimuli. Next, by taking advantage of their previous transcriptomic analysis of ALS neurons, they identify calbindin as a marker for cold-activated lamina I projection neurons and map their ascending projections to the rostral lateral parabrachial area, caudal periaqueductal gray, and ventral posterolateral thalamus, well-known thermosensory and thermoregulatory centers. Altogether, these findings provide strong anatomical and functional evidence for a direct line of transmission from Trpm8⁺ sensory afferents through Calb1⁺ lamina I neurons to key supraspinal centers controlling perception of cold and thermoregulatory responses.

      Strengths:

      The combination of mouse genetics, electron microscopy, ex vivo physiology, and viral tracing provides convincing evidence for a direct cold pathway. The work validates the Trpm8-Flp line by extensive anatomical and molecular characterization. Integration with previous transcriptomic and anatomical data neatly links the cold-selective lamina I neurons to a molecularly defined cluster of ALS neurons, strengthening the bridge between molecular identity, anatomy, and physiological function.

      Weaknesses:

      While anatomical evidence for direct synaptic connectivity between Trpm8+ afferents and lamina I projection neurons is compelling, a physiological demonstration of strict monosynaptic transmission is not shown. The conclusion that these inputs are exclusively monosynaptic should be toned down. Similarly, the statement that "Lamina I ALS neurons that are surrounded by Trpm8 afferents are cold-selective" should also be toned down as only a few neurons have been tested and it cannot be excluded that other neurons with similar characteristics may be polymodal.

    1. Reviewer #2 (Public review):

      Summary:

      The current article adapts standard rhythmic measures to describe the temporal organisation of whale song units.

      Strengths:

      The detailed description of the internal temporal structure of whale songs is something that has thus far been lacking.

      Weaknesses:

      Conceptual and terminological bases of the paper are problematical and hamper comparison with other taxa, including humans. According to signal theory, codas are indexical rather than symbolic. They signal an individual's group identity. Borrowing from humans and linguistics, coda inter-group variation represents a case of accents -- phonologically different varieties of the same call -- not dialects, confirming they are an index. Moreover, symbolism is not a feature detectable or confirmed through rhythmic analyses or temporal characterisation. This raises serious doubt whether alleged "dialects," "symbolism" and similarity between whales and humans is factual. The comparative scope and relevance of this paper for the broader field is limited and evolutionary claims are potentially misleading and perilous.

    1. Reviewer #2 (Public review):

      Summary:

      In the manuscript entitled "Ω-Loop mutations control dynamics 2 of the active site by modulating the 3 hydrogen-bonding network in PDC-3 4 β-lactamase", Chen and coworkers provide a computational investigation of the dynamics of the enzyme Pseudomonas-derived chephalosporinase 3 (PDC3) and some mutants associated with increased antibiotic resistance. After an initial analysis of the enzyme dynamics provided by RMSD/RMSF, the author conclude that the mutations alter the local dynamics within the omega loop and the R2 loop. The authors show that the network of hydrogen bonds in disrupted in the mutants. Constant pH calculations showed that the mutations also change the pKa of the catalytic lysine 67 and pocket volume calculations showed that the mutations expand the catalytic pocket. Finally, time-independent componente analysis (tiCA) showed different profiles for the mutant enzyme as compared to the wild type.

      Strengths:

      The scope of the manuscript is definitely relevant. Antibiotic resistance is an important problem and, in particular, Pseudomonas aeruginosa resistance is associated with an increasing number of deaths. The choice of the computational methods is also something to highlight here. Although I am not familiar with Adaptive Bandit Molecular Dynamics (ABMD), the description provided in the manuscript that this simulation strategy is well suited for the problem under evaluation.

      Weaknesses:

      In the revised version, the authors addressed my concerns regarding their use of the MSM, and in my view, their conclusions are now much more robust and well-supported by the data. While it would be very interesting to see a quantitative correlation between the effects of the mutations observed in the MD data and relevant experimental findings, I understand that this may be beyond the scope of the manuscript.

    2. Reviewer #3 (Public review):

      Summary:

      This manuscript aims to explore how mutations in the PDC-3 3 β-lactamase alter its ability to bind and catalyse reactions of antibiotic compounds. The topic is interesting and the study uses MD simulations and to provide hypotheses about how the size of the binding site is altered by mutations that change the conformation and flexibility of two loops that line the binding pocket. Some greater consideration of the uncertainties and how the method choice affect the ability to compare equilibrium properties would strengthen the quantitative conclusions. While many results appear significant by eye, quantifying this and ensuring convergence would strengthen the conclusions.

      Strengths:

      The significance of the problem is clearly described the relationship to prior literature is discussed extensively.

      Comments on revised version:

      I am concerned that the authors state in the response to reviews that it is not possible to get error bars on values due to the use of the AB-MD protocol that guides the simulations to unexplored basins. Yet the authors want to compare these values between the WT and mutants. This relates to RMSD, RMSF, % H-bond and volume calculations. I don't accept that you cannot calculate an uncertainty on a time averaged property calculated across the entire simulation. In these cases you can either run repeat simulations to get multiple values on which to do statistical analysis, or you can break the simulation into blocks and check both convergence and calculate uncertainties.

      I note that the authors do provide error bars on the volumes, but the statistics given for these need closer scrutiny (I cant test this without the raw data). For example the authors have p<0.0001 for the following pair of volumes 1072 {plus minus} 158 and 1115 {plus minus} 242, or for SASA p<0.0001 is given for 2 identical numbers 155+/- 3.

      I also remain concerned about comparisons between simulations run with the AB-MD scheme. While each simulation is an equilibrium simulation run without biasing forces, new simulations are seeded to expand the conformational sampling of the system. This means that by definition the ensemble of simulations does not represent and equilibrium ensemble. For example, the frequency at which conformations are sampled would not be the same as in a single much longer equilibrium simulation. While you may be able to see trends in the differences between conditions run in this way, I still don't understand how you can compare quantitative information without some method of reweighing the ensemble. It is not clear that such a rewieghting exists for this methods, in which case I advise some more caution in the wording of the comparisons made from this data.

      At this stage I don't feel the revision has directly addressed the main comments I raised in the earlier review, although there is a stronger response to the comments of Reviewer #2.

    1. Reviewer #1 (Public review):

      Summary:

      In this descriptive study, Tateishi et al. report a Tn-seq based analysis of genetic requirements for growth and fitness in 8 clinical strains of Mycobacterium intracellulare Mi), and compare the findings with a type strain ATCC13950. The study finds a core set of 131 genes that are essential in all nine strains, and therefore are reasonably argued as potential drug targets. Multiple other genes required for fitness in clinical isolates have been found to be important for hypoxic growth in the type strain.

      Strengths:

      The study has generated a large volume of Tn-seq datasets of multiple clinical strains of Mi from multiple growth conditions, including from mouse lungs. The dataset can serve as an important resource for future studies on Mi, which despite being clinically significant, remains a relatively understudied species of mycobacteria.

      Weaknesses:

      The primary claim of the study that the clinical strains are better adapted for hypoxic growth is yet to be comprehensively investigated. However, this reviewer thinks such an investigation would require a complex experimental design and perhaps form an independent study.

      Comments on revisions:

      The revised paper has satisfactorily addressed my previous concerns, and I have no further issues with this paper.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript addresses the important problem of the uncoupling of oxidative phosphorylation due to hypoxia-ischemia injury in the neonatal brain and provides insight into the neuroprotective mechanisms of hypothermia treatment.

      Strengths:

      The authors used a combination of in vivo imaging of awake P10 mice and experiments on isolated mitochondria to assess various key parameters of brain metabolism during hypoxia-ischemia with and without hypothermia treatment. This unique approach resulted in a comprehensive data set that provides solid evidence to support the derived conclusions.

      Weaknesses:

      Several potential weaknesses were identified in the original submission, which the authors subsequently addressed in the revised manuscript. Here is the brief list of the questions:

      (1) Is it possible that the observed relatively low baseline OEF and trends of increased OEF and CBF over several hours after the imaging start were partially due to slow recovery from anesthesia?

      (2) What was the pain management, and is there a possibility that some of the observations were influenced by the pain-reducing drugs or their absence?

      (3) Were P10 mice significantly stressed during imaging in the awake state because they didn't have head-restraint habituation training?

      (4) Considering high metabolism and blood flow in the cortex, it could be potentially challenging to predict cortical temperature based on the skull temperature, particularly in the deeper part of the cortex.

      (5) The map of estimated CMRO2 looks quite heterogeneous across the brain surface. Could this be partially resulting from the measurement artefact?

      (6) It would be beneficial to provide more detailed justification for using P10 mice in the experiments.

    2. Reviewer #3 (Public review):

      Sun et al. present a comprehensive study using a novel photoacoustic microscopy setup and mitochondrial analysis to investigate the impact of hypoxia-ischemia (HI) on brain metabolism and the protective role of therapeutic hypothermia. The authors elegantly demonstrate three connected findings: (1) HI initially suppresses brain metabolism, (2) subsequently triggers a metabolic surge linked to oxidative phosphorylation uncoupling and brain damage, and (3) therapeutic hypothermia mitigates HI-induced damage by blocking this surge and reducing mitochondrial stress.

      The study's design and execution are great, with a clear presentation of results and methods. Data is nicely presented, and methodological details are thorough.

      However, a minor concern is the extensive use of abbreviations, which can hinder readability. As all the abbreviations are introduced in the text, their overuse may render the text hard to read to non-specialist audiences. Additionally, sharing the custom Matlab and other software scripts online, particularly those used for blood vessel segmentation, would be a valuable resource for the scientific community. In addition, while the study focuses on the short-term effects of HI, exploring the long-term consequences and definitively elucidating HI's impact on mitochondria would further strengthen the manuscript's impact.

      Despite these minor points, this manuscript is very interesting.

      Comments on revisions:

      All addressed.

    1. Reviewer #2 (Public Review):

      There is increasing evidence that viruses manipulate vectors and hosts to facilitate transmission. For arthropods, saliva plays an essential role for successful feeding on a host and consequently for arthropod-borne viruses that are transmitted during arthropod feeding on new hosts. This is so because saliva constitutes the interaction interface between arthropod and host and contains many enzymes and effectors that allow feeding on a compatible host by neutralizing host defenses. Therefore, it is not surprising that viruses change saliva composition or use saliva proteins to provoke altered vector-host interactions that are favorable for virus transmission. However, detailed mechanistic analyses are scarce. Here, Zhao and coworkers study transmission of rice stripe virus (RSV) by the planthopper Laodelphax striatellus. RSV infects plants as well as the vector, accumulates in salivary glands and is injected together with saliva into a new host during vector feeding.

      The authors present evidence that a saliva-contained enzyme - carbonic anhydrase (CA) - might facilitate virus infection of rice by interfering with callose deposition, a plant defense response. In vitro pull-down experiments, yeast two hybrid assay and binding affinity assays show convincingly interaction between CA and a plant thaumatin-like protein (TLP) that degrades callose. Similar experiments show that CA and TLP interact with the RSV nuclear capsid protein NT to form a complex. Formation of the CA-TLP complex increases TLP activity by roughly 30% and integration of NT increases TLP activity further. This correlates with lower callose content in RSV-infected plants and higher virus titer. Further, silencing CA in vectors decreases virus titers in infected plants. Interestingly, aphid CA was found to play a role in plant infection with two non-persistent non-circulative viruses, turnip mosaic virus and cucumber mosaic virus (Guo et al. 2023 doi.org/10.1073/pnas.2222040120), but the proposed mode of action is entirely different.

      Editors' note: this version was assessed by the editors, without further input from the reviewers.

    1. Reviewer #1 (Public review):

      The medicinal leech preparation is an amenable system in which to understand how the underlying cellular networks for locomotion function. A previously identified non-spiking neuron (NS) was studied and found to alter the mean firing frequency of a crawl-related motoneuron (DE-3), which fires during the contraction phase of crawling. The data are solid. Identifying upstream neurons responsible for crawl motor patterning is essential for understanding how rhythmic behavior is controlled.

    2. Reviewer #2 (Public review):

      This study by Radice et al., takes advantage of the very well-established leach preparation to investigate questions related to motor control, more precisely the question of how the activity of motoneurons taking part in leach crawling behavior are finely tuned.

      The paper is overall well written. The findings are clearly presented, and the data seems solid overall.

    1. Reviewer #1 (Public review):

      Summary:

      This study advances the lab's growing body of evidence exploring higher-order learning and its neural mechanisms. They recently found that NMDA receptor activity in the perirhinal cortex was necessary for integrating stimulus-stimulus associations with stimulus-shock associations (mediated learning) to produce preconditioned fear, but it was not necessary for forming stimulus-shock associations. On the other hand, basolateral amygdala NMDA receptor activity is required for forming stimulus-shock memories. Based on these facts, the authors assessed: 1. why the perirhinal cortex is necessary for mediated learning but not direct fear learning and 2. the determinants of perirhinal cortex versus basolateral amygdala necessity for forming direct versus indirect fear memories. The authors used standard sensory preconditioning and variants designed to manipulate the novelty and temporal relationship between stimuli and shock and, therefore, the attentional state under which associative information might be processed. Under experimental conditions where information would presumably be processed primarily in the periphery of attention (temporal distance between stimulus/shock or stimulus pre-exposure), perirhinal cortex NMDA receptor activation was required for learning indirect associations. On the other hand, when information would likely be processed in focal attention (novel stimulus contiguous with shock), basolateral amygdala NMDA activity was required for learning direct associations. Together, the findings indicate that the perirhinal cortex and basolateral amygdala subserve peripheral and focal attention, respectively. The authors provide support for their conclusions using careful, hypothesis-driven experimental design, rigorous methods, and integrating their findings with the relevant literature on learning theory, information processing, and neurobiology. Therefore, this work will be highly interesting to several fields.

      Strengths:

      (1) The experiments were carefully constructed and designed to test hypotheses that were rooted in the lab's previous work, in addition to established learning theory and information processing background literature.

      (2) There are clear predictions and alternative outcomes. The provided table does an excellent job of condensing and enhancing the readability of a large amount of data.

      (3) In a broad sense, attention states are a component of nearly every behavioral experiment. Therefore, identifying their engagement by dissociable brain areas and under different learning conditions is an important area of research.

      (4) The authors clearly note where they replicated their own findings, report full statistical measures, effect sizes, and confidence intervals, indicating the level of scientific rigor.

      (5) The findings raise questions for future experiments that will further test the authors' hypotheses; this is well discussed.

    2. Reviewer #2 (Public review):

      This paper continues the authors' research on the roles of the basolateral amygdala (BLA) and the perirhinal cortex (PRh) in sensory preconditioning (SPC) and second order conditioning (SOC). In this manuscript, the authors explore how prior exposure to stimuli may influence which regions are necessary for conditioning to the second-order cue (S2). The authors perform a series of experiments which first confirm prior results shown by the author - that NMDA receptors in the PRh are necessary in SPC during conditioning of the first-order cue (S1) with shock to allow for freezing to S2 at test; and that NMDA receptors in the BLA are necessary for S1 conditioning during the S1-shock pairings. The authors then set out to test the hypothesis that the PRh encodes associations in a peripheral state of attention whereas the BLA encodes associations in a focal state of attention, similar to the A1 and A2 states in Wagner's theory of SOP. To do this, they show that BLA is necessary for conditioning to S2 when the S2 is first exposed during a serial compound procedure - S2-S1-shock. To determine whether pre-exposure of S2 will shift S2 to a peripheral focal state, the authors run a design in which S2-S1 presentations are given prior to the serial compound phase. The authors show that this restores NMDA receptor activity within the PRh as necessary for fear response to S2 at test. They then test whether the presence of S1 during the serial compound conditioning allows the PRh to support the fear responses to S2 by introducing a delay conditioning paradigm in which S1 is no longer present. The authors find that PRh is no longer required and suggest that this is due to S2 remaining in the primary focal state.

      Strengths:

      As with their earlier work, the authors have performed a rigorous series of experiments to better understand the roles of the BLA and PRh in the learning of first- and second-order stimuli. The experiments are well-designed and clearly presented, and the results show definitive differences in functionality between the PRh and BLA. The first experiment confirms earlier findings from the lab (and others), and the authors then build on their previous work to more deeply reveal how these regions differ in how they encode associations between stimuli. The authors have done a commendable job on pursuing these questions.

      Table 1 is an excellent way to highlight the results and provide the reader with a quick look-up table of the findings.

    3. Reviewer #3 (Public review):

      Summary:

      This manuscript presents a series of experiments that further investigate the roles of the BLA and PRH in sensory preconditioning, with a particular focus on understanding their differential involvement in the association of S1 and S2 with shock.

      Strengths:

      The motivation for the study is clearly articulated, and the experimental designs are thoughtfully constructed. I especially appreciate the inclusion of Table 1, which makes the designs easy to follow. The results are clearly presented, and the statistical analyses are rigorous.

      During the revision, the authors have adequately addressed my minor suggestions from the original version.

    1. Reviewer #1 (Public review):

      I have to preface my evaluation with a disclosure that I lack the mathematical expertise to fully assess what seems to be the authors' main theoretical contribution. I am providing this assessment to the best of my ability, but I cannot substitute for a reviewer with more advanced mathematical/physical training.

      Summary:

      This paper describes a new theoretical framework for measuring parsimony preferences in human judgments. The authors derive four metrics that they associate with parsimony (dimensionality, boundary, volume, and robustness) and measure whether human adults are sensitive to these metrics. In two tasks, adults had to choose one of two flower beds which a statistical sample was generated from, with or without explicit instruction to choose the flower bed perceptually closest to the sample. The authors conduct extensive statistical analyses showing that humans are sensitive to most of the derived quantities, even when the instructions encouraged participants to choose only based on perceptual distance. The authors complement their study with a computational neural network model that learns to make judgments about the same stimuli with feedback. They show that the computational model is sensitive to the tasks communicated by feedback and only uses the parsimony-associated metrics when feedback trains it to do so.

      Strengths:

      (1) The paper derives and applies new mathematical quantities associated with parsimony. The mathematical rigor is very impressive and is much more extensive than in most other work in the field, where studies often adopt only one metric (such as the number of causes or parameters). These formal metrics can be very useful for the field.

      (2) The studies are preregistered, and the statistical analyses are strong.

      (3) The computational model complements the behavioral findings, showing that the derived quantities are not simply equivalent to maximum-likelihood inference in the task.

      (4) The speculations in the discussion section (e.g., the idea that human sensitivity is driven by the computational demands each metric requires) are intriguing and could usefully guide future work.

      Weaknesses:

      (1) The paper is very hard to understand. Many of the key details of the derived metrics are in the appendix, with very little accessible explanation in the main text. The figures helped me understand the metrics somewhat, although I am still not sure how some of them (such as boundary or robustness as measured here) are linked to parsimony. I understand that this is addressed by the derivations in the appendix, but as a computational cognitive scientist, I would have benefited from more accessible explanations. Important aspects of the human studies are also missing from the main text, such as the sample size for Experiment 2.

      (2) It is not fully clear whether the sensitivity of human participants to some of the quantities convincingly reported here actually means that participants preferred shapes according to the corresponding aspect of parsimony. The title and framing suggest that parsimony "guides" human decision-making, which may lead readers to conclude that humans prefer more parsimonious shapes. I am not sure the sensitivity findings alone support this framing, but it might just be my misunderstanding of the analyses.

      (3) The stimulus set included only four combinations of shapes, each designed to diagnostically target one of the theoretical quantities. It is unclear whether the results are robust or specific to these particular 4 stimuli.

      (4) The study is framed as measuring "decision-making," but the task resembles statistical inference (e.g., which shape generated the data) or perceptual judgment. This is a minor point since "decision-making" is not well defined in the literature, yet the current framing in the title gave me the initial impression that humans would be making preference choices and learning about them over time with feedback.

    2. Reviewer #2 (Public review):

      This manuscript presents a sophisticated investigation into the computational mechanisms underlying human decision-making, and it presents evidence for a preference for simpler explanations (Occam's razor). The authors dissect the simplicity bias into four different components, and they design experiments to target each of them by presenting choices whose underlying models differ only in one of these components. In the learning tasks, participants must infer a "law" (a logical rule) from observed data in a way that operationalizes the process of scientific reasoning in a controlled laboratory setting. The tasks are complex enough to be engaging but simple enough to allow for precise computational modeling.

      As a further novel feature, authors derive a further term in the expansion of the log-evidence, which arises from boundary terms. This is combined with a choice model, which is the one that is tested in experiments. Experiments are run, but with humans and with artificial intelligence agents, showing that humans have an enhanced preference for simplicity as compared to artificial neural networks.

      Overall, the work is well written, interesting, and timely, bridging concepts in statistical inference and human decision making. Although technical details are rather elaborate, my understanding is that they represent the state of the art.

      I have only one main comment that I think deserves more comments. Computing the complexity penalty of models may be hard. It is unlikely that humans can perform such a calculation on the fly. As authors discuss in the final section, while the dimensionality term may be easier to compute, others (e.g., the volume term, which requires an integral) may be considerably harder to compute (it is true that they should be computed once and for all for each task, but still...). I wonder whether the sensitivity of human decision making with reference to the different terms is so different, and in particular whether it aligns with computational simplicity, or with the possibility of approximating each term by simple heuristics. Indeed, the sensitivity to the volume term is significantly and systematically lower than that of other terms. I wonder whether this relation could be made more quantitative using neural networks, using as a proxy of computational hardness the number of samples needed to reach a given error level in learning each of these terms.

    3. Reviewer #3 (Public review):

      Summary:

      This is a very interesting paper that documents how humans use a variety of factors that penalize model complexity and integrate over a possible set of parameters within each model. By comparison, trained neural networks also use these biases, but only on tasks where model selection was part of the reward structure. In the situation where training emphasizes maximum-likelihood decisions, only neural networks, but not humans, were able to adapt their decision-making. Humans continue to use model integration simplicity biases.

      Strengths:

      This study used a pre-registered plan for analyzing human data, which exceeds the standards compared to other current studies.

      The results are technically correct.

      Weaknesses:

      The presentation of the results could be improved.

    1. Reviewer #1 (Public review):

      Summary:

      The study by Klotzsche et al. examines whether emotional facial expressions can be decoded from EEG while participants view 3D faces in immersive VR and whether stereoscopic depth cues affect these neural representations. Participants viewed computer-generated faces (three identities, four emotions) rendered either stereoscopically or monoscopically, while performing an emotion recognition task. Time-resolved multivariate decoding revealed above-chance decodability of facial expressions from EEG. Importantly, decoding accuracy did not differ between monoscopic and stereoscopic viewing. This indicates that the neural representation of expressions is robust against stereoscopic disparity for the relevant features. However, a separate classifier could distinguish the depth condition (mono vs. stereo) from EEG, i.e., the pattern of neuronal activity differs between conditions, but not in ways relevant for the decoding of emotions. It had an early peak and a temporal profile similar to identity decoding, suggesting that early, task-irrelevant visual differences are captured neurally. Cross-decoding further demonstrated that expression decoders trained in one depth condition could generalize to the other, supporting the idea of representational invariance. Eye-tracking analyses showed that expressions and identities could be decoded from gaze patterns, but not the depth condition, and EEG- and gaze-based decoding performances were not correlated across participants. Overall, this work shows that EEG decoding in VR is feasible and sensitive, and suggests that stereoscopic cues are represented in the brain but do not influence the neural processing of facial expressions. This study addresses a relevant question with state-of-the-art experimental and data analysis techniques.

      Strengths:

      (1) It combines EEG, virtual reality stereoscoptic and monoscopic presentation of visual stimuli, and advanced data analysis methods to address a timely question.

      (2) The figures are of very high quality.

      (3) The reference list is appropriate and up to date.

      Weaknesses:

      (1) The introduction-results-discussion-methods order makes it hard to follow the Results without repeatedly consulting the Methods. Please introduce minimal, critical methodological context at the start of each Results subsection; reserve technical details for Methods/Supplement.

      (2) Many Results subsections begin with a crisp question and present rich analyses, but end without a short synthesis. Please add 1-2 sentences that explicitly answer the opening question and state what the analyses demonstrate.

      (3) The Results compellingly show that (a) expressions are decodable from EEG and (b) mono vs stereo trials are decodable from EEG; yet expression decoding is comparable across mono and stereo. It would help if you articulate why depth is neurally distinguishable while leaving expression representations unchanged. Maybe improve the discussion of the results of source localization and give a more detailed connection to what we already know about the processing of disparity.

    2. Reviewer #2 (Public review):

      Summary:

      The authors' main aim was to determine the extent to which the emotional expression of face images could be inferred from electrophysiological data under the viewing conditions imposed by immersive virtual reality displays. Further, given that stereoscopic depth cues can be easily manipulated in such displays, the authors wished to investigate whether successful emotion decoding was affected by the presence or absence of these depth cues, and also if the presence/absence of depth cues was itself a property of the viewing experience that could be decoded from neural data.

      Overall, the authors use fairly standard approaches to decoding neural data to demonstrate that above-chance results (slightly above the 0.5 chance threshold for their measure of choice) are in general achievable for emotion decoding, decoding the identity of faces from neural data, and decoding the presence/absence of depth cues in an immersive virtual reality display. They further examine the contribution of specific components of the response to visual stimuli with similar outcomes.

      Strengths:

      The main contribution of the manuscript is methodological. Rather than shedding particular light on the neural mechanisms supporting depth processing or face perception, what is on offer is primarily a straightforward examination of an applied question. With regard to the goal of answering that applied question, I think the paper succeeds. The overall experimental design is not novel, but in this case, that is a good thing. The authors have used relatively unadorned tasks and previous approaches to applying decoding tools to EEG data to see what they can get out of the neural data collected under these viewing conditions. While I would say that there is not a great deal that is especially surprising about these results, the authors do meet the goal they set for themselves.

      Weaknesses:

      Some of the key weaknesses I see are points that the authors raise themselves in their discussion, particularly with regard to the generalizability of their results. In particular, the 3D faces they have employed here perhaps exhibit a somewhat limited repertoire of emotional expression and do not necessarily cover a representative gamut of emotional face appearances, such as one would encounter in naturalistic settings. Then again, part of the goal of the paper was to examine the decodability of emotional expression in a specific, non-natural viewing environment - a viewing environment in which one could reasonably expect to encounter artificial faces like these. Still, the limitations of the stimuli potentially limit the scope of the conclusions one should draw from the data. I also think that there is a great deal of room for low-level image properties to drive the decoding results for faces, which could have been addressed in a number of ways (matching power spectra, for example, or using an inverted-image control condition). The absence of such control comparisons means that it is difficult to know if this is really a result that reflects face processing or much lower-level image differences that are diagnostic of emotion or identity in this subset of images. Again, to some extent, this is potentially acceptable - if one is mostly interested in whether this result is achievable at all (by hook or by crook), then it is not so important how the goal is met. Then again, one would perhaps like to know if what has been measured here is more a reflection of spatial vision vs. face processing mechanisms.

    3. Reviewer #3 (Public review):

      Summary:

      This study investigates two main questions:

      (1) whether brain activity recorded during immersive virtual reality can differentiate facial expressions and stereoscopic depth, and

      (2) whether depth cues modulate facial information processing.

      The results show that both expression and depth information can be decoded from multivariate EEG recorded in a head-mounted VR setup. However, the results show that the decoding performance of facial expressions does not benefit from depth information.

      Strengths:

      The study is technically strong and well executed. EEG data are of high quality despite the challenges of recording inside a head-mounted VR system. The work effectively combines stereoscopic stimulus presentation, eye-tracking to monitor gaze behavior, and time-resolved multivariate decoding techniques. Together, these elements provide an exemplary demonstration of how to collect and analyze high-quality EEG data in immersive VR environments.

      Weaknesses:

      The major limitation concerns the theoretical question about how stereoscopic depth modulates facial expression processing. While previous work has suggested that stereoscopic depth cues can shape natural face perception and emphasize the importance of binocular information in recognizing facial expressions (lines 95-97), the present study reports a null effect of depth. However, the stimulus configuration they used likely constrained the ability to detect any depth-related effects. All facial stimuli were static, frontal, and presented at a fixed distance. This design leads to near-ceiling behavioral performance and no behavioral effect of depth on expression recognition. It makes the null modulation of depth on expression processing unsurprising and limits the theoretical reach of the study. Adding more subtle or naturalistic features (such as various viewing angles and dynamic expressions) to the stimulus set if the authors aim to advance a strong theoretical claim about the role of binocular disparity. Or reframing the work as a technical validation of EEG decoding in this context.

      Another issue relates to the claim that eye movements cannot explain the EEG decoding results. It is a real challenge to remove eye-movement-related artifacts and confounds, as the VR setup tends to encourage viewers to explore the environment freely. However, nearly half of the eye-tracking datasets were lost (usable in only 17 of 33 participants), which substantially weakens the evidence for EEG-gaze dissociation. Moreover, it would be almost impossible to decode facial information from only two-dimensional gaze direction, given that with 60 EEG channels, the decoding accuracy was modest (AUC ≈ 0.60). These two factors together limited the strength of the reported null correlation between neural and eye-data decoding.

      The decoding analysis appears to use all 60 EEG channels as input features. I wonder why the authors did not examine using more spatially specific channel subsets. Facial expression and depth cues are known to preferentially engage occipito-temporal regions (e.g., N170-related sites), yet the current approach treats all sensors equally. Including all the channels may add noise and irrelevant signals to facial information decoding. Besides, using a subset of spatial-specific channels would align more directly with the subsequent source reconstruction.

    1. Reviewer #1 (Public review):

      Summary:

      In the present manuscript, de Bos and Kutay investigate the functional implications of persistent microtubule-ER contacts as cells go through mitosis. To do so, they resorted to investigating phosphorylation mutants of the ER-Microtubule crosslinker Climp63. They found that phosphodeficient Climp63 mutants induce a severe SAC-dependent mitotic delay after normal chromosome alignment, with an impressive mitotic index of approximately 75%. Strikingly, this was often associated with massive nuclear fragmentation into up to 30 micronuclei that are able to recruit both core and non-core nuclear envelope components. One particular residue (S17) that is phosphorylated by Cdk1 seems to account for most, if not all, these phenotypes. Furthermore, the authors use the impact on mitosis as an indirect way to map the microtubule binding domain of Climp63, which has remained controversial, and found that it is mostly restricted to the N-terminal 28 residues of Climp63. Of note, despite the strong impact on mitosis, persistent microtubule-ER contacts did not affect the distribution of other organelles during mitosis, such as mitochondria or lysosomes.

      Strengths:

      Overall, this work provides important mechanistic insight into the functional implications of ER-microtubule network remodelling during mitosis and should be of great interest to a vast readership of cell biologists.

      Weaknesses:

      Some of the key findings appear somewhat preliminary and would be worth exploring further to substantiate some of the claims and clarify the respective impact on mitosis and nuclear envelope reassembly on the resulting micronuclei.

      The following suggestions would significantly clarify some key points:

      (1) The striking increase in mitotic index in cells expressing the Climp63 phosphodefective mutant, together with their live cell imaging data indicating extensive mitotic delays that can be relieved by SAC inhibition, suggests that SAC silencing is significantly delayed or even impossible to achieve. The fact that most chromosomes align in 12 min, irrespective of the expression of the Climp63 phosphodefective mutant, suggests that initial microtubule-kinetochore interactions are not compromised, but maybe cannot be stably maintained. Alternatively, the stripping of SAC proteins from kinetochores by dynein along attached microtubules might be compromised, despite normal microtubule-kinetochore attachments. The authors allude to both these possibilities, but unfortunately, they never really test them. This could easily be done by immunofluorescence with a Mad1 or c-Mad2 antibody to inspect which fraction of kinetochores (co-stained with a constitutive kinetochore marker, such as CENP-A or CENP-C) are positive for these SAC proteins. If just a small fraction, then the stability of some attachments is likely the cause. If most/all kinetochores retain Mad1/c-Mad2, then it is probably an issue of silencing the SAC.

      (2) The authors use the increase in mitotic index (H3 S10 phosphorylation levels) as a readout for the MT binding efficiency of Climp63 and respective mutants. Although suggestive, this is fairly indirect and requires additional confirmation. For example, the authors could perform basic immunofluorescence in fixed cells to inspect co-localization of Climp63 (and its mutants) with microtubules.

      (3) The authors refer in the discussion that the striking nuclear fragmentation seen upon mitotic exit of cells expressing Climp63 phosphodefective mutant has not been reported before, and yet it is strikingly similar to what has been previously observed in cells treated with taxol (they cite Samwer et al. 2017, but they might elect to cite also Mitchison et al., Open Biol, 2017 and most relevantly Jordan et al., Cancer Res, 1996). This striking similarity and given the extensive mitotic delay observed in the Climp63 phosphodefective mutant, it is tempting to speculate that these cells are undergoing mitotic slippage (i.e., cells exit mitosis without ever satisfying the SAC) because they are unable to silence/satisfy the SAC. Indeed, the scattered micronuclei morphology has also been observed in cells undergoing mitotic slippage (e.g., Brito and Rieder, Curr Biol., 2006). The experiment suggested in point #1 should also shed light on this problem. The authors might want to consider discussing this possible explanation to interpret the observed phenotypes.

      (4) One of the most significant implications of the findings reported in this paper is that microtubule proximity does not seem to impact the assembly of either core or non-core nuclear envelope proteins on micronuclei (that possibly form due to mitotic slippage, rather than normal anaphase). These results challenge some models explaining nuclear envelope defects in micronuclei derived from lagging chromosomes due to the proximity of microtubules, and, as the authors point out at the very end, other reasons might underlie these defects. Along this line, the authors might elect to cite Afonso et al. Science, 2014, and Orr et al., Cell Reports, 2022, who provide evidence that a spindle midzone-based Aurora B gradient, rather than microtubules per se, underlie the nuclear envelope defects commonly seen in micronuclei derived from lagging chromosomes during anaphase.

    2. Reviewer #2 (Public review):

      Mitotic phosphorylation of the ER-microtubule linker CLIMP63 was discovered decades ago and was shown to release CLIMP63 from microtubules. Here, the authors describe for the first time the significance of CLIMP63 phosphorylation for mitotic division in cells. Expression of non-phosphorylatable CLIMP63 led to a massive re-localization of ER into the area of the mitotic spindle. This was not unexpected, as another ER-microtubule linker, STIM1, is phosphorylated during mitosis to release it from microtubules, and unphosphorylatable STIM1 also leads to an invasion of the ER into the spindle. The authors map CLIMP63's microtubule-binding domain and define S17 as the critical residue that needs to be phosphorylated for release from microtubules and as a target of Cdk1, albeit with an indirect assay that is based on the ability of overexpressed mutants to disrupt mitosis. The authors further demonstrate that aberrant, microtubule-tethered membranes in the spindle disrupt spindle function. This is in line with the group's prior findings that chromosome-tethered membranes lead to severe chromosome segregation defects. Cells overexpressing phospho-deficient CLIMP63 arrested in prometaphase with an active checkpoint. When these cells were forced to exit mitosis, a large number of micronuclei formed. Interestingly, these micronuclei had different compositions and properties from previously described ones, suggesting that there are diverse paths for a cell to become multinucleated. Lastly, the authors asked whether mitochondria and lysosomes depend on ER for their distribution in mitotic cells. However, the position of these other organelles was unchanged in cells in which ER was re-localized due to the overexpression of phospho-deficient CLIMP63. This is an interesting observation in the context of how the interior organisation of mitotic cells is achieved.

      Suggestions:

      (1) The authors should confirm the mapping of the microtubule-binding domain by more direct assays, such as microtubule co-pelleting or proximity ligation assays.

      (2) The authors should clarify why they performed phenotypic studies and live microscopy experiments (Figures 4 and 5) using the CLIMP63(3A) mutant, despite knowing that the relevant phosphorylation site was S17. Were the phenotypes different for S17A versus the triple mutant?

    1. Reviewer #1 (Public review):

      Summary:

      The goal of this paper was to determine whether the T cell receptor (TCR) repertoire differs between a male and a female human. To address this, this group sequenced TCRs from double-positive and single-positive thymocytes in male and female humans of various ages. Such an analysis on sorted thymocyte subsets has not been performed in the past. The only comparable dataset is a pediatric thymocyte dataset where total thymocytes were sorted.

      They report on participant ages and sexes, but not on ethnicity, race, nor provide information about HLA typing of individuals. Though the experiments themselves are heroic, they do represent a relatively small sampling of diverse humans. They observed no differences in TCRbeta or TCRalpha usage, combinational diversity, or differences in the length of the CDR3 region, or amino acid usage in the CD3aa region between males or females. Though they observed some TCRbeta CD3aa sequence motifs that differed between males and females, these findings could not be replicated using an external dataset and therefore were not generalizable to the human population.

      They also compared TCRbeta sequences against those identified in the past using computational approaches to recognize cancer-, bacterial-, viral-, or autoimmune-antigens. They found very little overlap of their sequences with these annotated sequences (depending on the individual, ranging from 0.82-3.58% of sequences). Within the sequences that were in overlap, they found that certain sequences against autoimmune or bacterial antigens were significantly over-represented in female versus male CD8 SP cells. Since no other comparable dataset is available, they could not conclude whether this is a finding that is generalizable to the human population.

      Strengths:

      This is a novel dataset. Overall, the methodologies appear to be sound. There was an attempt to replicate their findings in cases where an appropriate dataset was available. I agree that there are no gross differences in TCR diversity between males and females.

      Weaknesses:

      Overall, the sample size is small given that it is an outbred population. The cleaner experiment would have been to study the impact of sex in a number of inbred MHC I/II identical mouse strains or in humans with HLA-identical backgrounds.

      It is unclear whether there was consensus between the three databases they used regarding the antigens recognized by the TCR sequences. Given the very low overlap between the TCR sequences identified in these databases and their dataset, and the lack of replication, they should tone down their excitement about the CD8 T cell sequences recognizing autoimmune and bacterial antigens being over-represented in females.

      The dataset could be valuable to the community.

    2. Reviewer #2 (Public review):

      Summary:

      This study addresses the hypothesis that the strikingly higher prevalence of autoimmune diseases in women could be the result of biased thymic generation or selection of TCR repertoires. The biological question is important, and the hypothesis is valuable. Although the topic is conceptually interesting and the dataset is rich, the study has a number of major issues that require substantial improvement. In several instances, the authors conclude that there are no sex-associated differences for specific parameters, yet inspection of the data suggests visible trends that are not properly quantified. The authors should either apply more appropriate statistical approaches to test these trends or provide stronger evidence that the observed differences are not significant. In other analyses, the authors report the differences between sexes based on a pulled analysis of TCR sequences from all the donors, which could result in differences driven by one or two single donors (e.g., having particular HLA variants) rather than reflect sex-related differences.

      Strengths:

      The key strength of this work is the newly generated dataset of TCR repertoires from sorted thymocyte subsets (DP and SP populations). This approach enables the authors to distinguish between biases in TCR generation (DP) and thymic selection (SP). Bulk TCR sequencing allows deeper repertoire coverage than single-cell approaches, which is valuable here, although the absence of TRA-TRB pairing and HLA context limits the interpretability of antigen specificity analyses. Importantly, this dataset represents a valuable community resource and should be openly deposited rather than being "available upon request."

      Weaknesses:

      Major:

      (1) The authors state that there is "no clear separation in PCA for both TRA and TRB across all subsets." However, Figure 2 shows a visible separation for DP thymocytes (especially TRA, and to a lesser degree TRB) and also for TRA of Tregs. This apparent structure should be acknowledged and discussed rather than dismissed.

      (2) Supplementary Figures 2-5 involve many comparisons, yet no correction for multiple testing appears to be applied. After appropriate correction, all the reported differences would likely lose significance. These analyses must be re-evaluated with proper multiple-testing correction, and apparent differences should be tested for reproducibility in an external dataset (for example, the pediatric thymus and peripheral blood repertoires later used for motif validation).

      (3) Supplementary Figure 6 suggests that women consistently show higher Rényi entropies across all subsets. Although individual p-values are borderline, the consistent direction of change is notable. The authors should apply an integrated statistical test across subsets (for example, a mixed-effects model) to determine whether there is an overall significant trend toward higher diversity in females.

      (4) Figures 4B and S8 clearly indicate enrichment of hydrophobic residues in female CDR3s for both TRA and TRB (excluding alanine, which is not strongly hydrophobic). Because CDR3 hydrophobicity has been linked to increased cross-reactivity and self-reactivity (see, e.g., Stadinski et al., Nat Immunol 2016), this observation is biologically meaningful and consistent with higher autoimmune susceptibility in females.

      (5) The majority of "hundreds of sex-specific motifs" are probably donor-specific motifs confounded by HLA restriction. This interpretation is supported by the failure to validate motifs in external datasets (pediatric thymus, peripheral blood). The authors should restrict analysis to public motifs (shared across multiple donors) and report the number of donors contributing to each motif.

      (6) When comparing TCRs to VDJdb or other databases, it is critical to consider HLA restriction. Only database matches corresponding to epitopes that can be presented by the donor's HLA should be counted. The authors must either perform HLA typing or explicitly discuss this limitation and how it affects their conclusions.

      (7) Although the age distributions of male and female donors are similar, the key question is whether HLA alleles are similarly distributed. If women in the cohort happen to carry autoimmune-associated alleles more often, this alone could explain observed repertoire differences. HLA typing and HLA comparison between sexes are therefore essential.

      (8) In some analyses (e.g., Figures 8C-D) data are shown per donor, while others (e.g., Fig. 8A-B) pool all sequences. This inconsistency is concerning. The apparent enrichment of autoimmune or bacterial specificities in females could be driven by one or two donors with particular HLAs. All analyses should display donor-level values, not pooled data.

      (9) The reported enrichment of matches to certain specificities relative to the database composition is conceptually problematic. Because the reference database has an arbitrary distribution of epitopes, enrichment relative to it lacks biological meaning. HLA distribution in the studied patients and HLA restrictions of antigens in the database could be completely different, which could alone explain enrichment and depletions for particular specificities. Moreover, differences in Pgen distributions across epitopes can produce apparent enrichment artifacts. Exact matches typically correspond to high-Pgen "public" sequences; thus, the enrichment analysis may simply reflect variation in Pgen of specific TCRs (i.e., fraction of high-Pgen TCRs) across epitopes rather than true selection. Consequently, statements such as "We observed a significant enrichment of unique TRB CDR3aa sequences specific to self-antigens" should be removed.

      (10) The overrepresentation of self-specific TCRs in females is the manuscript's most interesting finding, yet it is not described in detail. The authors should list the corresponding self-antigens, indicate which autoimmune diseases they relate to, and show per-donor distributions of these matches.

      (11) The concept of polyspecificity is controversial. The authors should clearly explain how polyspecific TCRs were defined in this study and highlight that the experimental evidence supporting true polyspecificity is very limited (e.g., just a single TCR from Figure 5 from Quiniou et al.).

      Minor:

      (1) Clarify why the Pgen model was used only for DP and CD8 subsets and not for others.

      (2) The Methods section should define what a "high sequence reliability score" is and describe precisely how the "harmonized" database was constructed.

      (3) The statement "we generated 20,000 permuted mixed-sex groups" is unclear. It is not evident how this permutation corrects for individual variation or sex bias. A more appropriate approach would be to train the Pgen model separately for each individual's nonproductive sequences (if the number of sequences is large enough).

    1. Reviewer #1 (Public review):

      Summary:

      This is a careful, well-powered treatment of age effects in resting-state MEG. Rather than extracting (say) complex connectivity measures, the authors look at the 'simplest possible thing': changes in the overall power spectrum across age.

      Strengths:

      They find significant age-related changes at different frequency bands: broadly, attenuation at low-frequency (alpha) and increased beta. These patterns are identified in a large dataset (CamCAN) and then verified in other public data.

      Weaknesses:

      Some secondary interpretations (what is "unique" to age vs global anatomy) may go beyond what the statistics strictly warrant in the current form, but these can be tightened with (I think, fairly quick) additions already foreshadowed by the authors' own analyses.

      Aims:

      The authors set out to replace piecemeal, band-by-band ageing claims with t-maps, and Cohen's f2 over sensors×frequency ("GLM-Spectrum").

      On CamCAN, six spatio-spectral peaks survive relatively strict statistical controls. The larger effects are in low-frequency and upper-alpha/beta ranges (f2 approx 0.2-0.3), while lower-alpha and gamma reach significance but with small practical impact (f2 < 0.075). A nice finding is that the same qualitative profile appears in three additional independent datasets.

      Two analyses are especially interesting. First, the authors show a difference between absolute and relative spectral magnitude (basically, within-subject normalization). Relative scaling sharpens the spectral specificity of the spatial maps, while absolute magnitude is dominated by a broad spatial mode that correlates positively across frequencies, likely reflecting head-position/field-spread factors. The replication of the main age profile is robust to preprocessing decisions (e.g., SSS movement compensation choices) - the bigger determinant of the effect is whether they apply sensor normalization (relative vs absolute).

      Second, lots of brain-related things might be related to age, and the authors spend some time trying to back out confounds/covariates. This section is handled transparently (in general, I found the writing style very clear throughout) - they examine single covariates (sex, BP, GGMV, etc.) and compare simple vs partial age effects. For example, aging is correlated with reductions in global grey-matter volume (GGMV), but it would be nice to find a measure that is independent of this: controlling for GGMV (via a linear model) reduces age-related effect sizes heterogeneously across space/frequency but does not eliminate them, a nuance the authors treat carefully.

      This is a nice paper, and I have only a few concrete suggestions:

      (1) High-gamma:

      There can be a lot of EMG / eye movement contamination (I know these were RS eyes closed data, but still..) above 30-40 Hz, and these effects are the weakest anyway. Could you add an analysis (e.g., ICA/label-based muscle component removal) and show the gamma band's sensitivity to that step? Or just note this point more clearly?

      (2) GGMV confound control:

      Controlling for GGMV reduces, but does not eliminate, age effects. I have a few questions about this: a) Could we see the residuals as a function of age? I wonder if there are non-linear effects or something else that the regression is not accounting for. Also, b) GGMV and age are highly colinear - is this an issue? Can regression really split them apart robustly? I think by some cunning orthogonalisation, you can compute the effect of age independent of GGVM. I don't think this is the same as the effect 'adjusted' for GGMV (which is what is shown here if I'm reading it correctly). Finally, of course, GGMV might actually be the thing you want to look at (because it might more accurately reflect clinical issues) - so strong correlations are not really a problem: I think really the focus might even be on using MEG to predict GGMV and controlling for age.

    2. Reviewer #2 (Public review):

      This paper describes the application of the "GLM-Spectrum" mass univariate approach to examine the effects of age on M/EEG power spectra. Its strengths include promotion of the unbiased approach, suitable for future meta/mega-analyses, and the provision of effect sizes for powering future studies. These are useful contributions to the literature. What is perhaps lacking is a discussion of the limitations of this approach, in comparison to other methods.

      An analogy is the mass univariate approach to spatial localisation of effects in fMRI/PET images. This approach is unbiased by prior assumptions about the organisation of the brain, but potentially also less sensitive, by ignoring that prior knowledge. For example, a voxelwise univariate approach is less sensitive to detecting effects in functionally homogeneous brain regions, where SNR can be increased by averaging over voxels. In the context of power spectra, the authors' approach deliberately ignores knowledge about the dominant frequency bands/oscillations in human power spectra. This is in contrast to approaches like FOOOF and IRASA, which explicitly parametrise frequency components. I am not saying these methods are better; I just think that the authors should acknowledge that these approaches have advantages over their mass univariate approach (in sensitivity and interpretation; see below). I guess it is a type of bias-sensitivity trade-off: the authors want to avoid bias, but they should acknowledge the corresponding loss of sensitivity, as well as loss of interpretation compared to model-based approaches (i.e, models that parameterise frequency; I don't mean the statistical models for each frequency separately).

      An example of the interpretational loss can be seen in the authors' observation of opposite-signed effects of age around the alpha peak. While the authors acknowledge that this pattern can arise from a reduction in alpha frequency with age, this is an indirect inference, and a direct (and likely much more sensitive) approach would be to parametrise and estimate the peak alpha frequency directly for each participant, as done with FOOOF for example (possibly with group priors, as in Medrano et al, 2025, EJN). The authors emphasise the nonlinear effects of age in Figure 2A, but their approach cannot test this directly (e.g., in terms of plotting effects of age on frequency, magnitude, and width for each participant), so for me, this figure illustrates a weakness of their approach, not a strength.

      Then I think the section "Two dissociable and opposite effects in the alpha range" in the Discussion section is confusing, because if there is a single reduction in alpha peak frequency and magnitude with age, then there is only one "effect", not "two dissociable" ones. If the authors do want to claim that there are two dissociable age effects within the alpha range, then they need to do a statistical test, e.g., that the topographies of low and high alpha are significantly different. This then reveals another limitation of the mass univariate approach - that space (channel) is not parametrised either - so one cannot test for significant channel x effect interactions within this framework, as necessary to really claim a dissociation (e.g., in underlying neural generators).

      While the authors show that normalisation of each person's power spectra by the sum across frequencies helps improve some statistics, they might want to say more about disadvantages of this approach, e.g., loss of sensitivity to any effects (eg of age) that are broadly distributed across majority of frequencies, loss of real SI units (absolute effect sizes) (as well as problems if normalisation were used for techniques like FOOOF, where the 1/f exponent would be affected).

      The authors should give more information on how artifactual ICs were defined. This may be important for cardiac artefacts, since Schmidt et al (2004, eLife) have pointed out how "standard" ICA thresholds can fail to remove all cardiac effects. This is very important for the effects of age, given that age affects cardiac dynamics (even though the focus of Schmidt et al is the 1/f exponent, could residual cardiac effects cause artifactual age effects in current results, even above ~1Hz?).

      The authors should clarify the precise maxfilter arguments, and explain what "reference" was used for the "trans" option - e.g., did the authors consider transforming the data to match a sphere at the centre of the helmet, which might not only remove some of the global power differences due to different head positions, but also be best for generalisation of the effect sizes they report to future studies (assuming the centre of the helmet is the most likely location on average)? And on that matter, did head positions actually differ by age at all?

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript investigates how exogenous attention modulates spatial frequency sensitivity within the foveola. Using high-precision eye-tracking and gaze-contingent stimulus control, the authors show that exogenous attention selectively improves contrast sensitivity for low- to mid-range spatial frequencies (4-8 cycles/degree), but not for higher frequencies (12-20 CPD). In contrast, improvements in asymptotic performance at the highest contrast levels occur across all spatial frequencies. These results suggest that, even within the foveola, exogenous attention operates through a mechanism similar to that observed in peripheral vision, preferentially enhancing lower spatial frequencies.

      Strengths:

      The study shows strong methodological rigor. Eye position was carefully controlled, and the stimulus generation and calibration were highly precise. The authors also situate their work well within the existing literature, providing a clear rationale for examining the fine-grained effects of exogenous attention within the foveola. The combination of high spatial precision, gaze-contingent presentation, and detailed modeling makes this a valuable technical contribution.

      Weaknesses:

      The manipulation of attention raises some interpretive concerns. Clarifying this issue, together with additional detail about statistics, participant profiles, other methodological elements, and further discussion in relation to oculomotor control in general, could broaden the impact of the findings.

    2. Reviewer #2 (Public review):

      Summary:

      This study aims to test whether foveal and non-foveal vision share the same mechanisms for endogenous attention. Specifically, they aim to test whether they can replicate at the foveola previous results regarding the effects of exogenous attention for different spatial frequencies.

      Strengths:

      Monitoring the exact place where the gaze is located at this scale requires very precise eye-tracking methods and accurate and stable calibration. This study uses state-of-the-art methods to achieve this goal. The study builds on many other studies that show similarities between foveal vision and non-foveal vision, adding more data supporting this parallel.

      Weaknesses:

      The study lacks a discussion of the strength of the effect and how it relates to previous studies done away from the fovea. It would be valuable to know if not just the range of frequencies, but the size of the effect is also comparable.

    3. Reviewer #3 (Public review):

      Summary:

      This paper explores how spatial attention affects foveal information processing across different spatial frequencies. The results indicate that exogenously directed attention enhances contrast sensitivity for low- to mid-range spatial frequencies (4-8 CPD), with no significant benefits for higher spatial frequencies (12-20 CPD). However, asymptotic performance increased as a result of spatial attention independently of spatial frequency.

      Strengths:

      The strengths of this article lie in its methodological approach, which combines a psychophysical experiment with precise control over the information presented in the foveola.

      Weaknesses:

      The authors acknowledge that they used the standard approach of analyzing observer-averaged data, but recognize that this method has limitations: it ignores the uncertainty associated with parameter estimates and the relationships between different parameters of the psychometric model. This may affect the interpretation of attentional effects. In the future, mixed-effects models at the trial level could overcome these limitations.

    1. Reviewer #1 (Public review):

      Summary:

      The study from Wu and Turrigiano investigates how disruption of taste coding in a mouse model of autism spectrum disorders (ASDs) affects aversive learning in the context of a conditioned taste aversion (CTA) paradigm. The experiments combine 2-photon calcium imaging of neurons in the gustatory portion of the anterior insular cortex (i.e., gustatory cortex) with behavioral training and testing. The authors rely on Shank3 knockout mice as a model for ASDs. The authors found that Shank3 mice learn CTA more slowly and extinguish the memory more rapidly than control subjects. Calcium imaging identified impairments in taste-evoked activity associated with memory encoding and extinction. During memory encoding, the authors found less suppressed neuronal activity and increased correlated variability in Shank3 mice compared to controls. During extinction, they observed a faster loss of taste selectivity and degradation of taste discriminability in mutants compared to controls.

      Strengths:

      This is a well-written manuscript that presents interesting findings. The results on the learning and extinction deficits in Shank3 mice are of particular interest. Analyses of neural activity are well conducted and provide important information on the type of impaired cortical activity that may correlate with behavioral deficits.

      Weaknesses:

      (1) The experiments rely on three groups: CS-only WT, CTA WT, and CTA KO. Can the authors provide a rationale for not having a CS-only KO group?

      (2) The authors design an effective behavioral paradigm comparing consumption of water and saccharin and tracking extinction (Figure 3). This paradigm shows differences in licking across distinct behavioral conditions. For instance, during T1, licking to water strongly differs from licking to saccharin for both WT and KO. During T2, licking to water strongly differs from licking to saccharin only for WT (much less for KO), and licking to saccharin in WT differs from that in KO. These differences in taste sampling across conditions could contribute to some of the effects on neural activity and discriminability reported in Figures 5 and 6. That is sucrose and water trials may be highly discriminable because in one case the mouse licks and in the other it does not (or licks much less). The author may want to address this issue.

      (3) Are there any omission trials following CTA? If so, they should be quantified and reported. How are the omission trials treated with regard to the analyses?

      (4) The authors describe the extinction paradigm as "alternative choice". In decision-making, alternative choice paradigms typically require 2 lateral spouts to report decisions following the sampling from a central spout. To avoid confusion, the authors may want to define their paradigm as alternative sampling.

      (5) Figure 4 reports that CTA increases the proportion of neurons that consistently respond to saccharin and water across days. While the saccharin result could be an effect of aversive learning, it is less clear why the phenomenon would generalize to water as well. Can the authors provide an explanation?

      (6) The recordings are performed in the part of the anterior insular cortex that is typically defined as "gustatory cortex" (GC). Given the functional heterogeneity of the anterior insular cortex (AIC) and given that the authors do not sample all of the anteroposterior extent of AIC, I would suggest being more explicit about their positioning in GC. Also, some citations (e.g., Gogolla et al, 2014) refer to the posterior insular cortex, which is considered more inherently multimodal than GC. GC multimodality is typically associative in nature, as only a few neurons respond to sound and light in naïve animals.

      (7) It would be useful to add summary figures showing the extent of viral spread as well as GRIN lens placement.

      (8) I encourage the authors to add Ns every time percentages are reported. How many neurons have been recorded in each condition? Can the authors provide the average number of neurons recorded per session and per animal?

      (9) It looks like some animals learned more than others (Figure 1E or Figure 3C). Is it possible to compare neural activity across animals that showed different degrees of learning?

    2. Reviewer #2 (Public review):

      Wu and Turrigiano investigated how cortical taste coding during conditioned taste aversion (CTA) learning is affected in Shank3 knockout (KO) mice, a model of monogenic ASD. Using longitudinal two-photon calcium imaging of AIC neurons, the authors show that Shank3 KO mice exhibit reduced suppression of activity in a subset of neurons and a higher correlated variability in neural activity. This is accompanied by slower learning and faster extinction of aversive taste memories. These results suggest that Shank3 loss compromises the flexibility and stability of cortical representations underlying adaptive behaviour.

      Major strengths:

      (1) Conceptual significance: The study connects a molecular ASD risk gene (Shank3) to flexible sensory encoding, bridging genetics, systems neuroscience, and behaviour.

      (2) Technical rigour: Longitudinal calcium imaging with cell-registration across learning and extinction sessions is technically demanding and well-executed.

      (3) Behavioural paradigm: The use of both acquisition and extinction paradigms provides a more nuanced picture of learning dynamics.

      (4) Analyses: Correlated variability, discriminability indices, and population decoding analyses are robust and appropriate for addressing behavioural and network-level coding changes.

      Major weaknesses:

      (1) Causality: The paper infers that increased correlated variability causes learning deficits, but no causal tests (e.g., optogenetic modulation of inhibition or interneuron rescue) are presented to confirm this.

      (2) Behavioural scope: The study focuses exclusively on taste aversion; generalisation to other flexible learning paradigms (e.g., reversal or probabilistic tasks) is not addressed.

      (3) Mechanistic insights: While providing interesting findings of altered sensory perception and extinction of learning-related signals in AIC, it offered nearly no mechanistic insights. This makes the interpretation, especially on how generalisable these findings are, difficult. Also, different reported findings are "potentially" connected, but the exact relation between increased correlated variability and faster loss of taste selectivity cannot be assessed.

    3. Reviewer #3 (Public review):

      In this study, Wu & Turrigiano investigate an ethologically relevant form of associative learning (conditioned taste aversion - CTA) and its extinction in the Shank3 KO mouse model of ASD. They also examine the underlying circuits in the anterior insular cortex (AIC) simultaneously, using two-photon calcium imaging through a GRIN lens. They report that Shank3 KO mice learn CTA slower and suggest that this is mediated by a reduction in tastant-stimulus activity suppression of AIC neurons and a reduced signal-to-noise ratio due to increased noise correlations in AIC neurons. Interestingly, once Shank3 KO mice acquire CTA, they extinguish the aversive memory more rapidly than wild-type mice. This accelerated extinction is accompanied by a faster loss of neuronal and population-level taste selectivity and coding in the AIC compared to WT mice.

      This is an important study that uses in vivo methods to assess circuit dysfunction in a mouse model of ASD, related to sensory perception valence (in this case, taste). The study is well executed, the data are of high quality, and the analytical procedures are detailed. Furthermore, the behavioural paradigm is well thought out, particularly the approach for assessing extinction through repeated retrieval sessions (T1-T5), which effectively tests discrimination between saccharin and water rather than relying solely on lick counts or total consumption as a measure of extinction. Finally, the statistical tests used are appropriate and justified.

      There is, however, a missing link between the behavioural findings and the underlying mechanisms. More specifically:

      (1) The authors don't make a causal link between the behaviour and AIC neurophysiology, both the percentage of suppressed cells and the coactivity measurements. For the % of suppressed cells, it seems that both WT and KO cells are suppressed in the transition between CST1 and CST2 (Figure 1L), yet only the WT mice exhibit CTA (at least by CST2). For the taste-elicited coactivity measure, it seems that there is an increase in coactivity from CST1 to CST2 in WT (Figure 2C - blue, although not statistically tested?), but persistently higher coactivity in KO. Is this change of coactivity in WT important for the expression of CTA? Plotting behavioral performance (from Figure 1G) against coactivity (from Figure 2C) for each animal would be informative.

      (2) Shank3 KO cells already show an increase in baseline coactivity (Figure 2- figure supplement 1), and the authors never examine CS-only responses in the KO group, therefore making it difficult to determine whether elevated coactivity and noise correlations reflect a generalized AIC abnormality in Shank3 KOs (perhaps through impaired PV-mediated inhibition in insular cortex - Gogolla et al, 2014) that is not directly responsible/related to CTA?

      (3) How do the authors interpret the large range of lick ratios (Figure 1G) for WT (almost bi-modal distribution)? Is there a within-subject correlation with any of the neurophysiological measurements to suggest a relationship between AIC neurophysiology and behavioural expression of CTA?

      (4) Indeed, CTA appears to be successfully achieved for Shank3 KO mice delayed by 1 day, as the level of saccharin aversion during the first retrieval session (T1) is comparable between Shank3 KO and WTs. In this context, not extending the first part of the paradigm to include CST3 seems to be a missed opportunity. Doing so would have allowed for within-cell and within-subject comparison of taste-elicited pairwise correlation across the learning and to investigate the neural mechanism of delayed extinction in KOs more effectively.

      (5) How to interpret Figure 5F: Absolute discriminability is lower for T5 for CTA WT and CTA KO compared to CS-only? Why would AIC neurons have less information on taste identity by the end of extinction than during the unconditioned (CS-only) condition? And if that is the case, how is decoding accuracy in Figure 6C higher in T5 for CTA WT vs CS-only?

    1. Reviewer #1 (Public review):

      Summary:

      In the ecological interactions between wild plants and specialized herbivorous insects, structural innovation-based diversification of secondary metabolites often occurs. In this study, Agrawal et al. utilized two milkweed species (Asclepias curassavica and Asclepias incarnata) and the specialist Monarch butterfly (Danaus plexippus) as a model system to investigate the effects of two N,S-cardenolides - formed through structural diversification and innovation in A. curassavica-on the growth, feeding, and chemical sequestration of D. plexippus, compared to other conventional cardenolides. Additionally, the study examined how cardenolide diversification resulting from the formation of N,S-cardenolides influences the growth and sequestration of D. plexippus. On this basis, the research elucidates the ecophysiological impact of toxin diversity in wild plants on the detoxification and transport mechanisms of highly adapted herbivores.

      Strengths:

      The study is characterized by the use of milkweed plants and the specialist Monarch butterfly, which represent a well-established model in chemical ecology research. On one hand, these two organisms have undergone extensive co-evolutionary interactions; on the other hand, the butterfly has developed a remarkable capacity for toxin sequestration. The authors, building upon their substantial prior research in this field and earlier observations of structural evolutionary innovation in cardenolides in A. curassavica, proposed two novel ecological hypotheses. While experimentally validating these hypotheses, they introduced the intriguing concept of a "non-additive diversity effect" of trace plant secondary metabolites when mixed, contrasting with traditional synergistic perspectives, in their impact on herbivores.

      Weaknesses:

      The manuscript has two main weaknesses. First, as a study reliant on the control of compound concentrations, the authors did not provide sufficient or persuasive justification for their selection of the natural proportions (and concentrations) of cardenolides. The ratios of these compounds likely vary significantly across different environmental conditions, developmental stages, pre- and post-herbivory, and different plant tissues. The ecological relevance of the "natural proportions" emphasized by the authors remains questionable. Furthermore, the same compound may even exert different effects on herbivorous insects at different concentrations. The authors should address this issue in detail within the Introduction, Methods, or Discussion sections.

      Second, the study was conducted using leaf discs in an in vitro setting, which may not accurately reflect the responses of Monarch butterflies on living plants. This limitation undermines the foundation for the novel ecological theory proposed by the authors. If the observed phenomena could be validated using specifically engineered plant lines-such as those created through gene editing, knockdown, or overexpression of key enzymes involved in the synthesis of specific N,S-cardenolides - the findings would be substantially more compelling.

    2. Reviewer #2 (Public review):

      This study examined the effects of several cardenolides, including N,S-ring containing variants, on sequestration and performance metrics in monarch larvae. The authors confirm that some cardenolides, which are toxic to non-adapted herbivores, are sequestered by monarchs and enhance performance. Interestingly, N,S-ring-containing cardenolides did not have the same effects and were poorly sequestered, with minimal recovery in frass, suggesting an alternate detoxification or metabolic strategy. These N,S-containing compounds are also known to be less potent defences against non-adapted herbivores. The authors further report that mixtures of cardenolides reduce herbivore performance and sequestration compared to single compounds, highlighting the important role of phytochemical diversity in shaping plant-herbivore interactions.

      Overall, this study is clearly written, well-conducted and has the potential to make a valuable contribution to the field. However, I have one major concern regarding the interpretations of the mixture results. From what I understand of the methods, all tested mixtures contain all five compounds. As such, it is not possible to determine whether reduced performance and sequestration result from the complete mixture or from the presence of a single compound, such as voruscharin for performance and uscharin for sequestration. For instance, if all compounds except voruscharin (or uscharin) were combined, would the same pattern emerge? I suspect not, since the effects of the individual N,S-containing compounds alone are generally similar to those of the full mixture (Figure S3). By taking the average of all single compounds, the individual effects of the N,S-containing ones are being inflated by the non-N,S-containing ones (in the main text, Figure 4). In the mix, of course, they are not being 'diluted', as they are always present. This interpretation is further supported by the fact that in the equimolar mix, the relative proportion of voruscharin decreases (from 50% in the 'real mix'), and the target measurements of performance and sequestration tend to increase in the equimolar mix compared to the real mix.

      Despite this issue, the discussion of mixtures in the context of plant defence against both adapted and non-adapted herbivores is fascinating and convincing. The rationale that mixtures may serve as a chemical tool-kit that targets different sets of herbivores is compelling. The non-N,S cardenolides are effective against non-adapted herbivores and the N,S-containing cardenolides are effective against adapted herbivores. However, the current experiments focus exclusively on an adapted species. It would be especially interesting to test whether such mixtures reduce overall herbivory when both adapted and non-adapted species are present.

      It remains possible that mixtures, even in the absence of voruscharin or uscharin, genuinely reduce sequestration or performance; however, this would need to be tested directly to address the abovementioned concern.

    1. Reviewer #1 (Public review):

      Bajohr and colleagues propose a transcription factor-driven approach to generating bonafide oligodendrocyte lineage cells (OLCs) from primary mouse astrocytes. Ectopic expression of Olig2, Sox10, or Nkx6.2 in isolated astrocytes produced a range of OLC-like cell states, with Sox10 emerging from lineage tracing and single cell RNA sequencing experiments as the most successful transcription factor in driving direct lineage reprogramming. The authors strengthened their claims with an unbiased, deep learning perturbation model to predict genetic drivers of the astrocyte cluster to OLC cluster transition observed in their scRNA seq dataset. Here, Sox10 surfaced in the top ten correlated genes, and the top transcription factor, mediating this fate shift. Altogether, this paper presents an interesting approach to generate OLCs, a cell type historically difficult to procure, from primary mouse astrocytes to study this lineage in development and disease and perhaps repopulate it in dysmyelinating conditions. While this certainly addresses a technical gap in the field, authors defined iOLCs as ones with lineage-specific gene expression and morphological characteristics, lacking any functional analysis to assess the reprogrammed cells' capacity to myelinate. This comment and other critiques are discussed below.

      While Sox10 and Mbp expression in iOLCs, as confirmed by IHC, is a promising result suggesting that ectopic Sox10 instructs transduced cells to develop into cells of myelinating potential, functional confirmation is essential. As mentioned in the discussion, the absence of a substrate for myelination may have also contributed to the low DLR efficiency. Co-culturing Sox10 iOLCs with primary neurons and examining the cells' potential to engage and enwrap axons would greatly strengthen the authors' claim that this could be an effective therapeutic approach to myelin regeneration in vivo, or even a technical approach to studying myelin dynamics in vitro.

      In Figure 1B, it appears that Mbp expression in tdTomato+ cells decreases in Sox10 transduced iOLs during the observed time period. Can the authors elaborate on this result, given that MBP expression is crucial for myelination and should, if anything, increase with time?

      The authors acknowledge that there is a conversion of tdTomato- zsGreen+ cells with an astrocyte-like morphology to OLC cells expressing Mbp following Sox10 induction (Supplementary figure 5C,D). While they note the diversity of the astrocyte lineage in the discussion, further analysis should be applied to this subset of cells to confirm the subset of astrocyte or progenitor-like cell type that gives rise to their cell endpoint of interest (Sox10-driven Mbp+ iOLs).

      Finally, ectopic expression of Olig2 and Sox10 in primary astrocytes resulted in very different OLC subtypes, as evidenced by OLC marker expression seen in IHC and the subclustering of these cell types in scRNA seq. Although this diversity in OLC type and generation efficiency follows with previous reports showing that these two transcription factors vary in effect, might the authors further discuss this discrepancy given that the two transcription factors regulate one another (as mentioned in the introduction) and should theoretically give rise to more similar cells? Perhaps due to the lower specificity of Olig2 in marking a pure OLC population relative to Sox10?

    1. Reviewer #1 (Public review):

      This study explores the connectivity patterns that could lead to fast and slow undulating swim patterns in larval zebrafish using a simplified theoretical framework. The authors show that a pattern of connectivity based only on inhibition is sufficient to produce realistic patterns with a single frequency. Two such networks couple with inhibition but with distinct time constants can produce a range of frequencies. Adding excitatory connections further increases the range of obtainable frequencies, albeit at the expense of sudden transitions in mid-frequency range.

      Strengths:

      (1) This is an eloquent approach to answering the question of how spinal locomotor circuits generate coordinated activity using a theoretical approach based on moving bump models of brain activity.

      (2) The models make specific predictions on patterns of connectivity while discounting the role of connectivity strength or neuronal intrinsic properties in shaping the pattern.

      (3) The models also propose that there is an important association between cell-type-specific intersegmental patterns and the recruitment of speed-selective subpopulations of interneurons.

      (4) Having a hierarchy of models creates a compelling argument for explaining rhythmicity at the network level. Each model builds on the last and reveals a new perspective on how network dynamics can control rhythmicity. I liked that each model can be used to probe questions in the next/previous model.

      Comments on revisions:

      I am very happy to see the simplified biophysical model supporting the original findings. The authors have done an excellent job addressing my comments.

      Just a small note, please change C. Elegans to C. elegans.

    2. Reviewer #2 (Public review):

      Summary:

      The authors aimed to show that connectivity patterns within spinal circuits composed of specific excitatory and inhibitory connectivity and with varying degrees of modularity could achieve tail beats at various frequencies as well as proper left-right coordination and rostrocaudal propagation speeds.

      Strengths:

      The model is simple and the connectivity patterns explored are well supported by the literature

      The conclusions are intuitive and support many experimental studies on zebrafish spinal circuits for swimming. The simulations provide strong support for the sufficiency of connectivity patterns to produce and control many hallmark features of swimming in zebrafish

      Weaknesses:

      The authors have addressed my previous concerns well. I have no further concerns.

    3. Reviewer #3 (Public review):

      Summary:

      Central pattern generator (CPG) circuits underly rhythmic motor behaviors. Till date, it is thought that these CPG networks are rather local and multiple CPG circuits are serially connected to allow locomotion across the entire body. Distributed CPG networks that incorporate long-range connections have not been proposed although such connectivity has been experimentally shown for several different spinal populations. In this manuscript, the authors use this existing literature on long-range spinal interneuron connectivity to build a new computational model that reproduces basic features of locomotion like left-right alternation, rostrocaudal propagation and independent control of frequency and amplitude. Interestingly, the authors show that a model solely based on inhibitory neurons can recapitulate these basic locomotor features. Excitatory sources were then added that increased the dynamic range of frequencies generated. Finally, the authors were also able to reproduce experimentally observed consequences of cell-type-specific ablations showing that local and long range, cell-type-specific connectivity could be sufficient for generating locomotion.

      Strengths:

      This work is novel, providing an interesting alternative of distributed CPGs to the local networks traditionally predicted. It shows cell type-specific network connectivity is as important if not more than intrinsic cell properties for rhythmogenesis and that inhibition plays a crucial role in shaping locomotor features. Given the importance of local CPGs in understanding motor control, this alternative concept will be of broad interest to the larger motor control field including invertebrate and vertebrate species.

      Weaknesses:

      The main weaknesses were addressed in the revision.

    1. Reviewer #1 (Public review):

      The authors aim to predict ecological suitability for transmission of highly pathogenic avian influenza (HPAI) using ecological niche models. This class of models identify correlations between the locations of species or disease detections and the environment. These correlations are then used to predict habitat suitability (in this work, ecological suitability for disease transmission) in locations where surveillance of the species or disease has not been conducted. The authors fit separate models for HPAI detections in wild birds and farmed birds, for two strains of HPAI (H5N1 and H5Nx) and for two time periods, pre- and post-2020. The authors also validate models fitted to disease occurrence data from pre-2020 using post-2020 occurrence data.

    2. Reviewer #2 (Public review):

      Summary:

      The geographic range of highly pathogenic avian influenza cases changed substantially around the period 2020, and there is much interest in understanding why. Since 2020 the pathogen irrupted in the Americas and the distribution in Asia changed dramatically. This study aimed to determine which spatial factors (environmental, agronomic and socio-economic) explain the change in numbers and locations of cases reported since 2020 (2020--2023). That's a causal question which they address by applying correlative environmental niche modelling (ENM) approach to the avian influenza case data before (2015--2020) and after 2020 (2020--2023) and separately for confirmed cases in wild and domestic birds. To address their questions they compare the outputs of the respective models, and those of the first global model of the HPAI niche published by Dhingra et al 2016.

      ENM is a correlative approach useful for extrapolating understandings based on sparse geographically referenced observational data over un- or under-sampled areas with similar environmental characteristics in the form of a continuous map. In this case, because the selected covariates about land cover, use, population and environment are broadly available over the entire world, modelled associations between the response and those covariates can be projected (predicted) back to space in the form of a continuous map of the HPAI niche for the entire world.

      Strengths:

      The authors are clear about expected bias in the detection of cases, such geographic variation in surveillance effort (testing of symptomatic or dead wildlife, testing domestic flocks) and in general more detections near areas of higher human population density (because if a tree falls in a forest and there is no-one there, etc), and take steps to ameliorate those. The authors use boosted regression trees to implement the ENM, which typically feature among the best performing models for this application (also known as habitat suitability models). They ran replicate sets of the analysis for each of their model targets (wild/domestic x pathogen variant), which can help produce stable predictions. Their code and data is provided, though I did not verify that the work was reproducible.

      The paper can be read as a partial update to the first global model of H5Nx transmission by Dhingra and others published in 2016 and explicitly follows many methodological elements. Because they use the same covariate sets as used by Dhingra et al 2016 (including the comparisons of the performance of the sets in spatial cross-validation) and for both time periods of interest in the current work, comparison of model outputs is possible. The authors further facilitate those comparisons with clear graphics and supplementary analyses and presentation. The models can also be explored interactively at a weblink provided in text, though it would be good to see the model training data there too.

      The authors' comparison of ENM model outputs generated from the distinct HPAI case datasets is interesting and worthwhile, though for me, only as a response to differently framed research questions.

      Weaknesses:

      This well-presented and technically well-executed paper has one major weakness to my mind. I don't believe that ENM models were an appropriate tool to address their stated goal, which was to identify the factors that "explain" changing HPAI epidemiology.

      Comments on the revised version from the editors:

      We are extremely grateful to the authors for presenting a thoughtful and respectful point by point rebuttal to the prior reviewers' comments. After reading these comments carefully, we conclude that there is a straightforward strongly held disagreement between the authors and the reviewers as to the validity of the methods (Ecological Niche Modeling) for this particular dataset. Please note that the two reviewers have substantial expertise in the area of Ecologic Niche Modeling. We elected not to reach out to the reviewers for a third set of comments as we do not think their overall opinions will change, and wish to be respectful of their time.

      To allow readers a balanced assessment of the paper, we intend to publish your rebuttal comments in full. It is our hope that interested readers can weigh both sides of this respectful and interesting debate in order to reach their own conclusions about the strength of evidence presented in your manuscript.

    1. Reviewer #1 (Public review):

      Summary:

      Davis and co-authors used many mouse models to investigate mechanisms that regulate the contractility of mouse popliteal collecting vessels, primarily chronotropy. Many of the mechanisms studied were previously shown to regulate pressure-induced constriction in small arteries. The authors use prior literature from the vasculature as a framework to test similar concepts in lymphatic vessels. The mouse models used provide evidence for and against the involvement of multiple proteins in regulating chronotropy and other contractile properties in lymphatic vessels. They propose that mechano-activation of GNAQ/GNA11-coupled GPCRs generates IP3, which induces Ca2+ release through IP3R1 and drives depolarization through the activation of ANO1 Cl- channels. Major concerns include the author's major conclusion that GNAQ/GNA11-coupled GPCRs contribute to chronotropy. This conclusion is not supported by the data presented.

      Strengths:

      One major strength of the study lies in the vast number of mouse knockout models that were used to test the importance of ion channels and G protein signaling pathways in the regulation of lymphatic vessel contractility. In this regard, the study is a valiant effort. The authors achieved several objectives to find that ANO1 and IP3R1 regulate chronotropy, and many other potential proteins do not regulate chronotropy. This study will have a major impact on the field if additional support for G proteins is provided.

      Weaknesses:

      Major conclusions concerning the involvement of G proteins are drawn from the global Gna11 knockout mouse models. This conclusion is weak. Global Gna11 knockout mice are highly likely to have a multifactorial phenotype that could create significant differences in the data. Control experiments need to be performed on vessels from the global knockout mice if these major conclusions are to be made. Similarly, pharmacological tools or alternative approaches to manipulate G proteins should be used to support the data from these mouse models to draw these major conclusions.

      The Gnaq smKO mice are the most specific G protein model studied here. However, there is no phenotype. Do not discuss trends in the data. If the data are not significant, conclude so. If more experiments are required to reach significance, provide more data in the manuscript.

      The conclusions repeatedly refer to a signaling pathway wherein the upstream component is GPCRs, which activate G proteins. While this may be the case, no GPCRs were identified here, and the involvement of G proteins is questionable, as the authors outline in lines 693-695 and noted above. The conclusions should be tempered, including in the abstract, unless additional experiments are performed to support the involvement of G proteins. Perhaps then the authors may be able to infer that GPCRs are involved.

      Line 318. The point regarding the choice to use popliteal vessels versus IALVs will be unclear to the uninitiated, particularly as the authors previously used IALVs. Including additional justification in the text and/or data from IALVs in Figure 1, which compares IALVs to popliteal vessels, would better explain the logic.

      The conclusions drawn for TRPC6 and TRPC3 are less convincing. Germline global knockout mice, which are known to undergo compensation, were used, and high data variability is apparent. Using TRPC3 and TRPC6 blockers in the mouse models studied in Figure 4 would strengthen the arguments made regarding these proteins.

      Did you perform power analysis to ensure that experimental numbers were sufficient to conclude that no statistical difference exists between datasets? If not, this needs to be done. For example, data shown in Figure 5C for tone and 6C for frequency and tone appear to be significantly different, but are concluded not to be so.

      At the end of each result section, a concluding statement is made regarding the effects on pressure-induced chronotrophy. In many cases, there are additional effects of manipulating protein expression on other contractile properties. One example is for TRPC3 and TRPC6 (lines 414-416), but others are TRPV4, TRPV3, ENaC, Kir, Cav3.1/3.2, etc. Some interpretation is in the Discussion, but the concluding statements at the end of each result section should be expanded to summarize what the authors think the other significant differences in the data represent.

      Kv7.4 channels. You state you have data (not shown) with linopiridine and XE991. Why not show those results here to support the experiments with the Kcnq4 smKO mice? Otherwise, I suggest you remove the statement from the unpublished data.

      Figure 13A. Kcnj2 is modestly expressed in LECs, but very little is present in LMCs. This likely underlies the effect of barium. If you remove the endothelium, does the effect of barium disappear? While this is not the major focus of the study, the effects of barium are dramatic, and it should be made clear whether this is due to inhibition of Kir channels in smooth muscle or endothelial cells.

      Figure 18C tone. Several values for losartan look different but are not labelled as such. Please clarify and discuss if different.

      The manuscript should include raw data traces in figures that show the major pathways that you conclude regulate chronotropy.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, Davis et al. embarked on the quest for the molecular elements responsible for the regulation of lymphatic phasic contractile activity in response to variation of transmural pressure, a mechanism (termed pressure-induced lymphatic chronotropy by the authors) critical for drainage of interstitial fluid from the tissue and transport of lymph back to the blood circulation. Their aim was to investigate the mechanism(s) involved in the pressure-induced regulation of lymphatic pumping, and test whether activation of cation channels, shown in other systems to play mechanosensitive roles are directly at play, and/or whether mechano-activation of GNAQ/GNA11-coupled GPCRs is necessary to generate second messengers to activate those channels, as it has been suggested for the regulation of myogenic tone in arteries. To achieve their goal, the authors used their well-described, highly reliable protocols of mouse lymphatic vessel isolation, pressure myography, and data acquisition to obtain frequency-pressure relationships and other contractile function parameters from transgenic mice where specific channels or molecular elements of interest have been ablated. They combined these data with scRNAseq analysis of these gene targets to determine their respective role and levels of expression in lymphatic muscle cells. Their conclusion is that none of the exhaustive list of tested ion channels was critical, except ANO1 Cl channels, part of the contractile pacemaker mechanism, but that transmural pressure activates GNAQ/GNA11-coupled GPCRs, which generate IP3 to induce SR Ca2+ release through IP3R1 and activate ANO1-mediated depolarization.

      Strengths:

      The manuscript's strengths reside primarily in very robust, clean, and unequivocal pressure myography data and analysis. The research team is mastering these techniques they developed more than a decade ago and have implemented in mouse lymphatics to study their contractile properties, with consistent and convincing outcomes. They also provide data from an impressive list of transgenic mice in order to determine the role of the targeted gene in pressure-induced lymphatic chronotropy, relying on pharmacological small molecule inhibitors only when necessary. Finally, the use of scRNAseq analysis they gathered from previously published datasets brings novelty with respect to the expression of the genes of interest in all populations of cells comprising the lymphatic vessels, but more critically, to validate or contrast the potential impact of genetic alteration of the given gene on the ability of lymphatic muscles to respond to a change in pressure.

      Weaknesses:

      The main weakness may reside in the fact that while the authors provide a convincing demonstration that GNAQ/GNA11 are involved in the regulation of the F-P relationship, they give little evidence of the involvement of "upstream" receptors. Indeed, inhibition of AT1R, shown to be involved in myogenic regulation of arteries (a phenomenon the authors rightfully compare to pressure-induced lymphatic chronotropy), didn't lead toa similar effect (decrease in F-P) in lymphatic vessels. Arguably, other GPCRs might be involved in lymphatic vessels, but as such information is not provided in the manuscript, the author's conclusions should be dampened. More in-depth discussion would be required. In fact, it can be argued that the discussion is very restricted with respect to the amount of data and information the manuscript provides.

      Overall, the authors convincingly achieved their aim by performing an impressive number of technically challenging experiments, leading to solid datasets. While these support their main conclusions, a more elaborate discussion might be required to refine them.

      This study is likely to have an important impact on the field as it provides some answers to the lingering question of how lymphatic vessels regulate their contractile activity to variation in transmural pressure and certainly proposes an experimental means to further explore and address that question.

    3. Reviewer #3 (Public review):

      In this manuscript, Davis and colleagues aimed to identify the molecular sensors and signaling cascade that enable collecting lymphatic vessels to increase their spontaneous contraction frequency in response to intraluminal pressure (pressure-induced chronotropy). They tested whether the process is similar to blood vessel myogenic constriction by relying on cation channels (TRPC6, TRPM4, PKD2, PIEZO1, etc.) or instead require the activation of G-protein-coupled receptors (presumably mechanosensitive GNAQ/GNA11-coupled receptors), using ex vivo pressure myography of mouse popliteal lymphatics, smooth muscle-specific conditional knockouts, quantitative PCR validation, and single-cell RNA sequencing for target prioritization. The authors convincingly demonstrate that pressure-induced chronotropy does not require the cation channels implicated in arterial myogenic tone but is blunted by deletion of GNAQ/GNA11 or IP3 receptor 1, supporting a model of GPCR > IP3 > Ca2+ release > Cl⁻ channel activation > depolarization. The core conclusion is robust. The work redefines lymphatic pacemaking as G-protein-coupled receptor-dependent mechanotransduction, distinct from arterial mechanisms, and provides a genetically validated toolkit that is useful for studying lymphatic function and dysfunction.

      Strengths:

      (1) The data are of high quality and highly sensitive functional readouts

      (2) The systematic genetic targeting is a major strength that overcomes pharmacological artifacts

      (3) Careful quantitative analyses of frequency-pressure slopes

      Weaknesses:

      (1) The use of inguinal-axillary vessels for single-cell RNA sequencing rather than the popliteal segment studied functionally.

      (2) No direct testing of the specific G-protein-coupled receptor involved.

    1. Reviewer #1 (Public review):

      Summary:

      The work of Bechara Rahme and colleagues provides an explanation as to how bacterially infected flies eventually die. While widespread tissue and multiorgan damage are to be expected in the latest stages of a systemic infection, the mechanisms leading to the host's death remain unresolved. To this end, this work illustrates the role of PrtA, a metalloproteinase found within Outer Membrane Vesicles (OMVs) secreted by Serratia marcescens, in inducing neuronal apoptosis and paralysis before death. Another interesting aspect of the work is the compromise of blood blood-brain barrier (BBB) by OMVs. BBB is different between mammals and flies; however, it merits scientific attention.

      Strengths:

      The strength of evidence lies in a wealth of experiments involving disparate innate immune mechanisms that either contribute (Imd, PPO1/2, Nox, Duox, SOD2) or oppose (hemocytes and Hayan protease) host defense. Moreover, the role of neuronal JNK and apoptic signaling is shown to contribute to host death.

      Genetics is supported by experiments using chemical treatments (Vitamin C and mito-TEMPO) as host-protecting antioxidants, and the biochemical purification and quantification of OMVs and the PrtA protease.

      Weaknesses:

      However, the reliance on non-isogenised flies to provide quantitative data is unsafe, and at this point, the strength of the evidenceis apparently incomplete. The mutant flies used for the genes Key, Myd88, Hayan, and Nos are doubtfully comparable to the control fly strains used in terms of the general genetic background. The latter is of utmost importance in assessing quantitative traits.

      The general background difference between control and test flies is also an issue when using tissue-specific expression via GAL4/UAS, because the UAS lines used are only apparently but not truly isogenic to the w flies used as controls.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors investigate the mechanisms underlying the virulence of OMVs using a Drosophila model. They reveal a complex interplay between host defenses and OMV pathogenicity. Although the study enhances our understanding of Drosophila innate immunity, additional evidence is needed to strengthen the conclusions.

      Strengths:

      (1) In Figure 1, Toll pathway mutants infected with OMVs displayed three distinct phenotypic outcomes: mildly enhanced resistance to OMV infection, a response similar to that of the control, or increased susceptibility. Therefore, in addition to Imd and Kenny mutants from the Imd pathway, further mutants, such as Relish and PGRP-LC, should be examined to assess whether the Imd pathway is involved in host defense against OMVs.

      (2) Plasmatocytes clear particles via phagocytosis or endocytosis. However, flies lacking all hemocytes showed increased resistance to OMV challenge, raising the question of whether hemocytes actually aid the pathogen. To explore this hypothesis, the uptake of fluorescently tagged OMVs should be examined.

      (3) Hayan cleaves PPO into active PO. However, Hayan and PPO mutants exhibit opposite phenotypes upon OMV injection, raising the question of whether OMV-induced pathogenesis is linked to melanization.

      (4) Puckered mRNA levels were used as a read-out for JNK pathway activity. A transient induction of the JNK pathway was observed in head and thorax tissues. It would be beneficial if the authors could directly examine JNK activation in neuronal cells using immunostaining for pJNK.

      (5) In Figure 4B, the kayak was knocked down using the pan-neuronal driver elav-Gal4. To confirm the specificity and validity of this observation, the experiment should be repeated using another neural-specific driver.

      Weaknesses:

      It is unclear how many Serratia marcescens cells a 69 nL injection of 0.1 ng/nL OMVs corresponds to.

    3. Reviewer #3 (Public review):

      Summary:

      The authors investigate deficiencies in various immune responses, and also the prtA toxin's role in OMV toxicity. Some key interpretations are that the Imd pathway contributes to preventing OMV toxicity, but not Toll, and that Hayan and Eater somehow mediate OMV or PrtA toxicity. This descriptive effort is a solid set of experiments, although some experimental results may require further validation.

      Strengths:

      The breadth of experiments tests multiple immune parameters, providing a systematic effort that ensures a number of potentially relevant interactions can be recovered. Certain findings, such as the PrtA toxicity to flies, appear solid, and some interesting findings regarding Hayan and eater will be of interest to the fly immunity field.

      Weaknesses:

      It appears almost all results rely on the use of a single mutant representing the deletion of the gene. It's not clear if the mutations are always in the same genetic background, but this can be clarified. There are a couple of results that are confusing and may be internally contradicting, and should be additionally validated and clarified.

    1. Reviewer #1 (Public review):

      The investigators elegantly utilized a single-cell co-assay of RNA and ATAC seq to unveil the heterogeneous gene regulatory networks in Ewing sarcoma. The authors should be commended on their ability to identify multiple unique modules of gene regulation of Ewing sarcoma utilizing complex computational methods between numerous Ewing sarcoma cell lines. Additionally, they complemented their single-cell findings with xenografts as well as primary Ewing sarcoma patient tumors - validating the intratumoral heterogeneous gene regulatory networks of Ewing sarcoma. More importantly, they have revealed that exogenous TGF-β may modify these distinct epigenetic and transcriptional signatures within Ewing sarcoma tumors. Overall, the manuscript highlights an important discovery of the heterogenous gene regulatory programming of Ewing sarcoma and further highlights the role that TGFB plays within the tumor microenvironment of Ewing sarcoma. There are some areas of ambiguity that require clarification to increase the impact of the manuscript.

    2. Reviewer #2 (Public review):

      Summary:

      This work by Waltner et. al. provides a comprehensive single-cell multiomics analysis of plasticity in gene regulatory networks present in Ewing sarcoma using single-cell RNA-sequencing (scRNA-seq) and single-cell assay for transposase accessible chromatin with sequencing (scATAC-seq). They find that Ewing sarcoma cell line models have distinct patterns of chromatin accessibility compared to non-Ewing sarcoma models, and that there is significant variability across Ewing sarcoma cell lines, and sometimes within a single cell line. These differences across models are linked to 3 distinct gene regulatory modules, 2 of which are present across the range of model systems studied here. The first modules present across models are activated when the fusion is expressed and include genes enriched for the known EWSR1::FLI1 response element, GGAA microsatellites, along with other neural crest transcription factors. The other module primarily consists of genes repressed by EWSR1::FLI1, which are activated in EWSR1::FLI1-low states. Interestingly, EWSR1::FLI1-low cells have already been tied to more migratory and metastatic phenotypes, and the data here suggest these cells are more responsive to external signals from TGF-β, and this may be mediated through FOSL2-mediated gene regulation. While there are some minor additional validation studies that can be performed to strengthen a few individual analyses, this is a technically rigorous study, with a variety of different analytical techniques used to address similar questions, and this approach elevates confidence in the answers provided. This is further strengthened by the diverse set of model systems used, including patient-derived cell lines, cell line xenograft models, patient-derived xenografts, mining available single-cell data from patient samples, and validation of the gene modules identified in a larger set of patient microarray samples. In whole, this study provides a valuable resource for understanding heterogeneity, plasticity, and gene expression networks in Ewing sarcoma. This may be useful for future studies of metastatic disease and may also provide a framework for similar questions in other fusion-driven sarcomas.

      Strengths:

      There are a few core strengths in this study. First is the number and diversity of Ewing sarcoma models studied, spanning commonly used cell lines, patient-derived xenografts, and patient samples. The second is the large array of rigorous and orthogonal approaches used to uncover the identity and function of various gene modules. This includes an array of informatics techniques, as well as specific modulation of cell line models in culture. A third is confirmation that different gene expression programs are present in the same tumor using spatial transcriptomic analysis. Lastly, the authors have made all of their data and code accessible, enabling continued use of this dataset as a resource for others.

      Weaknesses:

      As highlighted by the authors, this study is somewhat limited by the small number of single-cell data from patient samples that are publicly available. Much of the analysis comes from cell lines. Additionally, they focus only on one type of signal that may modulate cell plasticity, and there are likely to be many others. Lastly, there are a few weak spots in the data. Some of this likely arises from the underlying complexity of the data, the generally sparse nature of scATAC data, and the biological heterogeneity present in the cell lines studied. The most pronounced weakness was in the analysis of transcription factors that dictate gene expression in the distinct modules, as well as the response to TGF-β. While some specific transcription factors showed module-specific expression consistent with the computational prediction in Figure 2, others did not likely due to additional factors not tested here. Likewise, the same transcription factors did not always show consistent enrichment in the gene modules that responded to TGF-β treatment when analyzed across cell lines. On the whole, these are relatively minor weaknesses and do not diminish the value of this study.

    1. Reviewer #1 (Public review):

      Summary:

      The authors investigate the effects of aging on auditory system performance in understanding temporal fine structure (TFS), using both behavioral assessments and physiological recordings from the auditory periphery, specifically at the level of the auditory nerve. This dual approach aims to enhance understanding of the mechanisms underlying observed behavioral outcomes. The results indicate that aged animals exhibit deficits in behavioral tasks for distinguishing between harmonic and inharmonic sounds, which is a standard test for TFS coding. However, neural responses at the auditory nerve level do not show significant differences when compared to those in young, normal-hearing animals. The authors suggest that these behavioral deficits in aged animals are likely attributable to dysfunctions in the central auditory system, potentially as a consequence of aging.To further investigate this hypothesis, the study includes an animal group with selective synaptic loss between inner hair cells and auditory nerve fibers, a condition known as cochlear synaptopathy (CS). CS is a pathology associated with aging and is thought to be an early indicator of hearing impairment. Interestingly, animals with selective CS showed physiological and behavioral TFS coding similar to that of the young normal-hearing group, contrasting with the aged group's deficits. Despite histological evidence of significant synaptic loss in the CS group, the study concludes that CS does not appear to affect TFS coding, either behaviorally or physiologically.

      Strengths:

      This study addresses a critical health concern, enhancing our understanding of mechanisms underlying age-related difficulties in speech intelligibility, even when audiometric thresholds are within normal limits. A major strength of this work is the comprehensive approach, integrating behavioral assessments, auditory nerve (AN) physiology, and histology within the same animal subjects. This approach enhances understanding of the mechanisms underlying the behavioral outcomes and provides confidence in the actual occurrence of synapse loss and its effects.The study carefully manages controlled conditions by including five distinct groups: young normal-hearing animals, aged animals, animals with CS induced through low and high doses, and a sham surgery group. This careful setup strengthens the study's reliability and allows for meaningful comparisons across conditions. Overall, the manuscript is well-structured, with clear and accessible writing that facilitates comprehension of complex concepts.

      Weakness:

      The stimulus and task employed in this study are very helpful for behavioral research, and using the same stimulus setup for physiology is advantageous for mechanistic comparisons. However, I have some concerns about the limitations in auditory nerve (AN) physiology. Due to practical constraints, it is not feasible to record from a large enough population of fibers that covers a full range of best frequencies (BFs) and spontaneous rates (SRs) within each animal. This raises questions about how representative the physiological data are for understanding the mechanism in behavioral data. I am curious about the authors' interpretation of how this stimulus setup might influence results compared to methods used by Kale and Heinz (2010), who adjusted harmonic frequencies based on the characteristic frequency (CF) of recorded units. While, the harmonic frequencies in this study are fixed across all CFs, meaning that many AN fibers may not be tuned closely to the stimulus frequencies. If units are not responsive to the stimulus further clarification on detecting mistuning and phase locking to TFS effects within this setup would be valuable. Given the limited number of units per condition-sometimes as few as three for certain conditions-I wonder if CF-dependent variability might impact the results of the AN data in this study and discussing this factor can help with better understanding the results. While the use of the same stimuli for both behavioral and physiological recordings is understandable, a discussion on how this choice affects interpretation would be beneficial. In addition a 60 dB stimulus could saturate high spontaneous rate (HSR) AN fibers, influencing neural coding and phase-locking to TFS. Potentially separating SR groups, could help address these issues and improve interpretive clarity.

      A deeper discussion on the role of fiber spontaneous rate could also enhance the study. How might considering SR groups affect AN results related to TFS coding? While some statistical measures are included in the supplement, a more detailed discussion in the main text could help in interpretation.

      Although Figure S2 indicates no change in median SR, the high-dose treatment group lacks LSR fibers, suggesting a different distribution based on SR for different animal groups, as seen in similar studies on other species. A histogram of these results would be informative, as LSR fiber loss with CS-whether induced by ouabain in gerbils or noise in other animals-is well documented (e.g., Furman et al., 2013).

      Although ouabain effects on gerbils have been explored in previous studies, since these data is already seems to be recorded for the animal in this study, a brief description of changes in auditory brainstem response (ABR) thresholds, wave 1 amplitudes, and tuning curves for animals with cochlear synaptopathy (CS) in this study would be beneficial. This would confirm that ouabain selectively affects synapses without impacting outer hair cells (OHCs). For aged animals, since ABR measurements were taken, comparing hearing differences between normal and aged groups could provide insights into the pathologies besides CS in aged animals. Additionally, examining subject variability in treatment effects on hearing and how this correlates with behavior and physiology would yield valuable insights. If limited space maybe a brief clarification or inclusion in supplementary could be good enough.

      Another suggestion is to discuss the potential role of MOC efferent system and effect of anesthesia in reducing efferent effects in AN recordings. This is particularly relevant for aged animals, as CS might affect LSR fibers, potentially disrupting the medial olivocochlear (MOC) efferent pathway. Anesthesia could lessen MOC activity in both young and aged animals, potentially masking efferent effects that might be present in behavioral tasks. Young gerbils with functional efferent systems might perform better behaviorally, while aged gerbils with impaired MOC function due to CS might lack this advantage. A brief discussion on this aspect could potentially enhance mechanistic insights.

      Lastly, although synapse counts did not differ between the low-dose treatment and NH I sham groups, separating these groups rather than combining them with the sham might reveal differences in behavior or AN results, particularly regarding the significance of differences between aged/treatment groups and the young normal-hearing group.

    2. Reviewer #2 (Public review):

      Summary:

      Using a gerbil model, the authors tested the hypothesis that loss of synapses between sensory hair cells and auditory nerve fibers (which may occur due to noise exposure or aging) affects behavioral discrimination of the rapid temporal fluctuations of sounds. In contrast to previous suggestions in the literature, their results do not support this hypothesis; young animals treated with a compound that reduces the number of synapses did not show impaired discrimination compared to controls. Additionally, their results from older animals showing impaired discrimination suggest that age-related changes aside from synaptopathy are responsible for the age-related decline in discrimination.

      Strengths:

      (1) The rationale and hypothesis are well-motivated and clearly presented.

      (2) The study was well conducted with strong methodology for the most part, and good experimental control. The combination of physiological and behavioral techniques is powerful and informative. Reducing synapse counts fairly directly using ouabain is a cleaner design than using noise exposure or age (as in other studies), since these latter modifiers have additional effects on auditory function.

      (3) The study may have a considerable impact on the field. The findings could have important implications for our understanding of cochlear synaptopathy, one of the most highly researched and potentially impactful developments in hearing science in the past fifteen years.

      Weaknesses:

      (1) I have concerns that the gerbils may not have been performing the behavioral task using temporal fine structure information.

      Human studies using the same task employed a filter center frequency that was (at least) 11 times the fundamental frequency (Marmel et al., 2015; Moore and Sek, 2009). Moore and Sek wrote: "the default (recommended) value of the centre frequency is 11F0." Here, the center frequency was only 4 or 8 times the fundamental frequency (4F0 or 8F0). Hence, relative to harmonic frequency, the harmonic spacing was considerably greater in the present study. However, gerbil auditory filters are thought to be broader than those in human. In the revised version of the manuscript, the authors provide modelling results suggesting that the excitation patterns were discriminable for the 4F0 conditions, but may not have been for the 8F0 conditions. These results provide some reassurance that the 8F0 discriminations were dependent on temporal cues, but the description of the model lacks detail. Also, the authors state that "thus, for these two conditions with harmonic number N of 8 the gerbils cannot rely on differences in the excitation patterns but must solve the task by comparing the temporal fine structure." This is too strong. Pulsed tone intensity difference limens (the reference used for establishing whether or not the excitation pattern cues were usable) may not be directly comparable to profile-analysis-like conditions, and it has been argued that frequency discrimination may be more sensitive to excitation pattern cues than predicted from a simple comparison to intensity difference limens (Micheyl et al. 2013, https://doi.org/10.1371/journal.pcbi.1003336).

      I'm also somewhat concerned that the masking noise used in the present study was too low in level to mask cochlear distortion products. Based on their excitation pattern modelling, the authors state (without citation) that "since the level of excitation produced by the pink noise is less than 30 dB below that produced by the complex tones, distortion products will be masked." The basis for this claim is not clear. In human, distortion products may be only ~20 dB below the levels of the primaries (referenced to an external sound masker / canceller, which is appropriate, assuming that the modelling reported in the present paper did not include middle-ear effects; see Norman-Haignere and McDermott, 2016, doi: 10.1016/j.neuroimage.2016.01.050). Oxenham et al. (2009, doi: 10.1121/1.3089220) provide further cautionary evidence on the potential use of distortion product cues when the background noise level is too low (in their case the relative level of the noise in the compromised condition was only a little below that used in the present study). The masking level used in the present study may have been sufficient, but it would be useful to have some further reassurance on this point.

      (2) The synapse reductions in the high ouabain and old groups were relatively small (mean of 19 synapses per hair cell compared to 23 in the young untreated group). In contrast, in some mouse models of the effects of noise exposure or age, a 50% reduction in synapses is observed, and in the human temporal bone study of Wu et al. (2021, https://doi.org/10.1523/JNEUROSCI.3238-20.2021) the age-related reduction in auditory nerve fibres was ~50% or greater for the highest age group across cochlear location. It could be simply that the synapse loss in the present study was too small to produce significant behavioral effects. Hence, although the authors provide evidence that in the gerbil model the age-related behavioral effects are not due to synaptopathy, this may not translate to other species (including human).

      (3) The study was not pre-registered, and there was no a priori power calculation, so there is less confidence in replicability than could have been the case. Only three old animals were used in the behavioral study, which raises concerns about the reliability of comparisons involving this group. Statistical analyses on very small samples can be unreliable due to problems of power, generalisability, and susceptibility to outliers.

    3. Reviewer #3 (Public review):

      This study is a part of the ongoing series of rigorous work from this group exploring neural coding deficits in the auditory nerve, and dissociating the effects of cochlear synaptopathy from other age-related deficits. They have previously shown no evidence of phase-locking deficits in the remaining auditory nerve fibers in quiet-aged gerbils. Here, they study the effects of aging on the perception and neural coding of temporal fine structure cues in the same Mongolian gerbil model.

      They measure TFS coding in the auditory nerve using the TFS1 task which uses a combination of harmonic and tone-shifted inharmonic tones which differ primarily in their TFS cues (and not the envelope). They then follow this up with a behavioral paradigm using the TFS1 task in these gerbils. They test young normal hearing gerbils, aged gerbils, and young gerbils with cochlear synaptopathy induced using the neurotoxin ouabain to mimic synapse losses seen with age.

      In the behavioral paradigm, they find that aging is associated with decreased performance compared to the young gerbils, whereas young gerbils with similar levels of synapse loss do not show these deficits. When looking at the auditory nerve responses, they find no differences in neural coding of TFS cues across any of the groups. However, aged gerbils show an increase in the representation of periodicity envelope cues (around f0) compared to young gerbils or those with induced synapse loss. The authors hence conclude that synapse loss by itself doesn't seem to be important for distinguishing TFS cues, and rather the behavioral deficits with age are likely having to do with the misrepresented envelope cues instead.

      The manuscript is well written, and the data presented are robust. Some of the points below will need to be considered while interpreting the results of the study, in its current form. These considerations are addressable if deemed necessary, with some additional analysis in future versions of the manuscript.

      Spontaneous rates - Figure S2 shows no differences in median spontaneous rates across groups. But taking the median glosses over some of the nuances there. Ouabain (in the Bourien study) famously affects low spont rates first, and at a higher degree than median or high spont rates. It seems to be the case (qualitatively) in figure S2 as well, with almost no units in the low spont region in the ouabain group, compared to the other groups. Looking at distributions within each spont rate category and comparing differences across the groups might reveal some of the underlying causes for these changes. Given that overall, the study reports that low-SR fibers had a higher ENV/TFS log-z-ratio, the distribution of these fibers across groups may reveal specific effects of TFS coding by group.

      [Update: The revised manuscript has addressed these issues]

      Threshold shifts - It is unclear from the current version if the older gerbils have changes in hearing thresholds, and whether those changes may be affecting behavioral thresholds. The behavioral stimuli appear to have been presented at a fixed sound level for both young and aged gerbils, similar to the single unit recordings. Hence, age-related differences in behavior may have been due to changes in relative sensation level. Approaches such as using hearing thresholds as covariates in the analysis will help explore if older gerbils still show behavioral deficits.

      [Update: The issue of threshold shifts with aging gerbils is still unresolved in my opinion. From the revised manuscript, it appears that aged gerbils have a 36dB shift in thresholds. While the revised manuscript provides convincing evidence that these threshold shifts do not affect the auditory nerve tuning properties, the behavioral paradigm was still presented at the same sound level for young and aged animals. But a potential 36 dB change in sensation level may affect behavioral results. The authors may consider adding thresholds as covariates in analyses or present any evidence that behavioral thresholds are plateaued along that 30dB range].

      Task learning in aged gerbils - It is unclear if the aged gerbils really learn the task well in two of the three TFS1 test conditions. The d' of 1 which is usually used as the criterion for learning was not reached in even the easiest condition for aged gerbils in all but one condition for the aged gerbils (Fig. 5H) and in that condition, there doesn't seem to be any age-related deficits in behavioral performance (Fig. 6B). Hence dissociating the inability to learn the task from the inability to perceive TFS 1 cues in those animals becomes challenging.

      [Update: The revised manuscript sufficiently addresses these issues, with the caveat of hearing threshold changes affecting behavioral thresholds mentioned above].

      Increased representation of periodicity envelope in the AN - the mechanisms for increased representation of periodicity envelope cues is unclear. The authors point to some potential central mechanisms but given that these are recordings from the auditory nerve what central mechanisms these may be is unclear. If the authors are suggesting some form of efferent modulation only at the f0 frequency, no evidence for this is presented. It appears more likely that the enhancement may be due to outer hair cell dysfunction (widened tuning, distorted tonotopy). Given this increased envelope coding, the potential change in sensation level for the behavior (from the comment above), and no change in neural coding of TFS cues across any of the groups, a simpler interpretation may be -TFS coding is not affected in remaining auditory nerve fibers after age-related or ouabain induced synapse loss, but behavioral performance is affected by altered outer hair cell dysfunction with age.

      [Update: The revised manuscript has addressed these issues]

      Emerging evidence seems to suggest that cochlear synaptopathy and/or TFS encoding abilities might be reflected in listening effort rather than behavioral performance. Measuring some proxy of listening effort in these gerbils (like reaction time) to see if that has changed with synapse loss, especially in the young animals with induced synaptopathy, would make an interesting addition to explore perceptual deficits of TFS coding with synapse loss.

      [Update: The revised manuscript has addressed these issues]

    1. Reviewer #1 (Public review):

      Summary:

      Grasper et al. present a combined analysis of the role of temporal mutagenesis in cancer, which includes both theoretical investigation and empirical analysis of point mutations in TCGA cancer patient cohorts. They find that temporal elevated mutation rates contribute to cancer fitness by allowing fast adaptation when the fitness drops (due to previous deleterious mutations). This may be relevant in the case of tumor suppressor genes (TSG), which follow the 2-hit hypothesis (i.e., biallelic 2 mutations are necessary to deactivate TS), and in cases where temporal mutagenesis occurs (e.g. high APOBEC, ROS). They provide evidence that this scenario is likely to occur in patients, in some cancer types. This is an interesting and potentially important result that merits the attention of the target audience. Nonetheless, I have some questions (detailed below) regarding the design of the study, the tools and parametrization of the theoretical analysis and the empirical analysis - that I think if addressed would make the paper more solid and the conclusion more substantiated.

      Strengths:

      Combined theoretical investigation with empirical analysis of cancer patients

      Weaknesses:

      Parametrization and systematic investigation of theoretical tools and their relevant to tumor evolution

      Comments on revisions:

      The authors have adequately addressed my suggestions. I think some of the details provided in some of the replies to my comments (specifically with regard to my points 1, 4, 6ii; minor point 6) could be integrated into relevant text in the introduction , discussion and methods, to help the readers follow better the model and its interpretation - but this is up to the authors to decide what to emphasize.

    2. Reviewer #2 (Public review):

      This work presents theoretical results concerning the effect of punctuated mutation on multistep adaptation along with empirical analysis of multistep adaptation in cancer. The empirical results are claimed to demonstrate the acceleration of multistep adaptation predicted theoretically. However, there is an important disconnect between the theoretical results and the empirical observations, such that it is not clear that punctuated mutation can produce the phenomena observed empirically. Furthermore, there are other plausible explanations for the empirical observations.

      The theoretical work emphasizes the positive effect of punctuated mutation on the rate of crossing a "fitness valley", i.e., multistep adaptation where the first mutation is deleterious. The empirical work, however, focuses on inactivation of both alleles of a tumor suppressor gene (TSG), for which the first mutation--inactivation of one gene copy--is expected to be neutral or slightly advantageous, not maladaptive as suggested by the authors. Pairs of genes with putative synergystic effects were also analyzed, but there is no indication that these generally involve fitness valleys either.

      This disconnect is most glaring in Figure 4, in which the simulations are supposed to confirm that punctuated mutation can produce the empirical phenomena reported for TSG inactivation. If this is the case, it should be possible to produce such results in simulations in which inactivation of just one allele is neutral. Instead, simulations assuming a substantial fitness penalty (0.05) for the first mutation are presented. Contrary to what is claimed in the text (line 212), this is not a "biologically realistic" parameter value for TSG inactivation. The insensitivity of results to the size the fitness penalty is irrelevant: a substantial fitness penalty is qualitatively different from no penalty at all.

      The paper does report a small (15%) effect of punctuation on the rate of multistep adaptation in the absence of a fitness valley. This effect is much smaller than the fourfold increase in the presence of a fitness valley. The results presented--a single stochastic run for each condition--are insufficient to establish that there is any effect at all: if we assume that the number of pairs of fixations (about 150-180 in each simulation) is Poisson distributed, the 15% difference is not statistically significant.

      Assuming that this effect is genuine, it is likely due to a mutation rate that is unrealisitcally high (considering that "rescue" requires inactivation of a particular gene). Theoretical considerations suggest that punctuated mutation has little or no effect in the absence of a fitness valley in the limit of low mutation rate:

      (A1) The authors' theoretical results for a Galton-Watson process (SI2) imply that there is no effect without a fitness valley in that limit. This is so because there is no effect in the "supercritical" regime. Cancer cells must be supercritical (otherwise there would be no net growth), and a neutral or advantangeous mutant would remain in the supercritical regime.

      (A2) Fig. S2D indicates, as far as I can tell from the colors, that punctuation makes little or no difference to the rate of adaptation in the absence of a fitness valley, i.e., for vertical axis values of 1 or more. I am not sure why the authors (line 129) point to this figure as evidence that punctuation speeds two-step adaptation when the first mutation is not maladaptive; the figure appears to say that it does not. The fraction of events due to "stochastic tunneling" of course increases with punctuation, but that does not change the fact that adaptation is no faster.

      (A3) The authors' verbal argument to the contrary (line 124ff) is flawed. Despite the fact that even a mildly advantageous mutant is likely to go extinct, its expected frequency only increases with time, and that of a neutral allele remains constant over time. Thus, the average number of opportunities for a second mutation does not decrease with time since the first mutation, as it does when the first muation is deleterious.

      (A4) I ran some simulations for a Wright-Fisher population, and they seem to confirm the lack of an effect in the low mutation rate limit.

      Thus, it is unclear whether punctuated mutation can explain the reported phenomena or should be expected to have major effects on the rate or nature of cancer cell adaptation.

      I would also note that routes to inactivation of both copies of a TSG that are not accelerated by punctuation will dilute any effects of punctuation. An example is a single somatic mutation followed by loss of heterozygosity. Such mechanisms are not included in the theoretical analysis nor assessed empirically. If, for example, 90% of double inactivations were the result of such mechanisms with a constant mutation rate, a factor of two effect of punctuated mutagenesis would increase the overall rate by only 10%. Consideration of the rate of apparent inactivation of just one TSG copy and of deletion of both copies would shed some light on the importance of this consideration.

      Several factors besides the effects of punctuated mutation might explain or contribute to the empirical observations. Though these are now mentioned in the paper, I will list them here for clarity:

      (B1) High APOBEC3 activity can select for inactivation of TSGs (references in Butler and Banday 2023, PMID 36978147). This could explain the empirical correlations.

      (B2) Without punctuation, the rate of multistep adaptation is expected to rise more than linearly with mutation rate. Thus, if APOBEC signatures are correlated with a high mutation rate due to the action of APOBEC, this alone could explain the correlation with TSG inactivation.

      (B3) The nature of mutations caused by APOBEC might explain the results. Notably, one of the two APOBEC mutation signatures, SBS13, is particularly likely to produce nonsense mutations. The authors count both nonsense and missense mutations, but nonsense mutations are more likely to inactivate the gene, and hence to be selected.

    1. Reviewer #1 (Public review):

      Summary:

      In this study, Jeong and Choi examine neural correlates of behavior during a naturalistic foraging task in which rats must dynamically balance resource acquisition (foraging) with the risk of threat. Rats first learn to forage for sucrose reward from a spout, and when a threat is introduced (an attack-like movement from a "LobsterBot"), they adjust their behavior to continue foraging while balancing exposure to the threat, adopting anticipatory withdraw behaviors to avoid encounter with the LobsterBot. Using electrode recordings targeting the medial prefrontal cortex (mPFC), they identify heterogenous encoding of task variables across prelimbic and infralimbic cortex neurons, including correlates of distance to the reward/threat zone and correlates of both anticipatory and reactionary avoidance behavior. Based on analysis of population responses, they show that prefrontal cortex switches between different regimes of population activity to process spatial information or behavioral responses to threat in a context-dependent manner. Characterization of the heterogenous coding scheme by which frontal cortex represents information in different goal states is an important contribution to our understanding of brain mechanisms underlying flexible behavior in ecological settings.

      Strengths:

      As many behavioral neuroscience studies employ highly controlled task designs, relatively less is generally known about how the brain organizes navigation and behavioral selection in naturalistic settings, where environment states and goals are more fluid. Here, the authors take advantage of a natural challenge faced by many animals - how to forage for resources in an unpredictable environment - to investigate neural correlates of behavior when goal states are dynamic. They investigate how prefrontal cortex (mPFC) activity is structured to support different functional "modes" (here, between a navigational mode and a threat-sensitive foraging mode) for flexible behavior. Overall, an important strength and real value of this study is the design of the behavioral experiment, which is trial-structured, permitting strong statistical methods for neural data analysis, yet still rich enough for unconstrained, natural behavior structured by the animal's volitional goals. The experiment is also phased to measure behavioral changes as animals first encounter a threat, and then learn to adapt their foraging strategy to its presence. Characterization of this adaptation process is itself quite interesting and sets a foundation for further study of threat learning and risk management in the foraging context. Finally, the characterization of single-neuron and population dynamics in mPFC in this naturalistic setting with fluid goal states is an important contribution to the field. Previous studies have identified neural correlates of spatial and behavioral variables in frontal cortex, but how these representations are structured, or how they are dynamically adjusted when animals shift their goals, has been less clear. The authors synthesize their main conclusions into a conceptual model for how mPFC could encode task variables in a context-dependent manner, and provide a useful framework for thinking about circuit-level mechanisms that may support mode switching.

      Weaknesses:

      The task design in this study is intentionally stimulus-rich and places minimal constraint on the animal to preserve naturalistic behavior, and this introduces some confounds that place some limits on the interpretability of neural responses. For example, some variables which are the target of neural correlation analysis, such as spatial/proximity coding and coding of threat and threat-related behaviors, are naturally entwined. In their revisions, the authors have included extensive analyses and control conditions to disambiguate these confounds. Within the limits of their task design, this provides compelling evidence that mPFC neurons encode threat, decision, and spatial information in a context-dependent manner. Future experiment designs, which intentionally separate task contexts (e.g. navigation vs. foraging), could serve to further clarify the structure of coding across contexts and/or goal states.

      While the study provides an important advance in our understanding of mPFC coding structure under naturalistic conditions, the study still lacks functional manipulations to establish any form of causality. This limitation is acknowledged in the text, and the report is careful not to over interpret suggestions of causal contribution, instead setting a foundation for future investigations.

    2. Reviewer #2 (Public review):

      Summary:

      Jeong & Choi (2023) use a semi-naturalistic paradigm to tackle the question of how the activity of neurons in the mPFC might continuously encode different functions. They offer two possibilities: either there are separate dedicated populations encoding each function, or cells alter their activity dependent on the current goal of the animal. In a threat-avoidance task rats procurred sucrose in an area of a chamber where, after remaining there for some amount of time, a 'Lobsterbot' robot attacked. In order to initiate the next trial rats had to move through the arena to another area before returning to the robot encounter zone. Therefore the task has two key components: threat avoidance and navigating through space. Recordings in the IL and PL of the mPFC revealed encoding that depended on what stage of the task the animal was currently engaged in. When animals were navigating, neuronal ensembles in these regions encoded distance from the threat. However, whilst animals were directly engaged with the threat and simultaneously consuming reward, it was possible to decode from a subset of the population whether animals would evade the threat. Therefore the authors claim that neurons in the mPFC switched between two functional modes: representing allocentric spatial information, and representing egocentric information pertaining to the reward and threat. Finally, the authors propose a conceptual model based on these data whereby this switching of population encoding is driven by either bottom-up sensory information or top-down arbitration.

      Strengths:

      Whilst these multiple functions of activity in the mPFC have generally been observed in tasks dedicated to the study of a singular function, less work has been done in contexts where animals continuously switch between different modes of behaviour in a more natural way. Being able to assess whether previous findings of mPFC function apply in natural contexts is very valuable to the field, even outside of those interested in the mPFC directly. This also speaks to the novelty of the work; although mixed selectivity encoding of threat assessment and action selection has been demonstrated in some contexts (e.g. Grunfeld & Likhtik, 2018) understanding the way in which encoding changes on-the-fly in a self-paced task is valuable both for verifying whether current understanding holds true and for extending our models of functional coding in the mPFC.

      The authors are also generally thoughtful in their analyses and use a variety of approaches to probe the information encoded in the recorded activity. In particular, they use relatively close analysis of behaviour as well as manipulating the task itself by removing the threat to verify their own results. The use of such a rich task also allows them to draw comparisons, e.g. in different zones of the arena or different types of responses to threat, that a more reduced task would not otherwise allow. Additional in-depth analyses in the updated version of the manuscript, particularly the feature importance analysis, as well as complimentary null findings (a lack of cohesive place cell encoding, and no difference in location coding dependent on direction of trajectory) further support the authors' conclusion that populations of cells in the mPFC are switching their functional coding based on task context rather than behaviour per se. Finally, the authors' updated model schematic proposes an intriguing and testable implementation of how this encoding switch may be manifested by looking at differentiable inputs to these populations.

      Weaknesses:

      The main existing weakness of this study is that its findings are correlational (as the authors highlight in the discussion). Future work might aim to verify and expand the authors' findings - for example, whether the elevated response of Type 2 neurons directly contributes to the decision-making process or just represents fear/anxiety motivation/threat level - through direct physiological manipulation. However, I appreciate the challenges of interpreting data even in the presence of such manipulations and some of the additional analyses of behaviour, for example the stability of animals' inter-lick intervals in the E-zone, go some way towards ruling out alternative behavioural explanations. Yet the most ideal version of this analysis is to use a pose estimation method such as DeepLabCut to more fully measure behavioural changes. This, in combination with direct physiological manipulation, would allow the authors to fully validate that the switching of encoding by this population of neurons in the mPFC has the functional attributes as claimed here.

    3. Reviewer #3 (Public review):

      Summary:

      This study investigates how various behavioral features are represented in the medial prefrontal cortex (mPFC) of rats engaged in a naturalistic foraging task. The authors recorded electrophysiological responses of individual neurons as animals transitioned between navigation, reward consumption, avoidance, and escape behaviors. Employing a range of computational and statistical methods, including artificial neural networks, dimensionality reduction, hierarchical clustering, and Bayesian classifiers, the authors sought to predict from neural activity distinct task variables (such as distance from the reward zone and the success or failure of avoidance behavior). The findings suggest that mPFC neurons alternate between at least two distinct functional modes, namely spatial encoding and threat evaluation, contingent on the specific location.

      Strengths:

      This study attempt to address an important question: understanding the role of mPFC across multiple dynamic behaviors. The authors highlight the diverse roles attributed to mPFC in previous literature and seek to explain this apparent heterogeneity. They designed an ethologically relevant foraging task that facilitated the examination of complex dynamic behavior, collecting comprehensive behavioral and neural data. The analyses conducted are both sound and rigorous.

      Weaknesses:

      Because the study still lacks experimental manipulation, the findings remain correlational. The authors have appropriately tempered their claims regarding the functional role of the mPFC in the task. The nature of the switch between functional modes encoding distinct task variables (i.e., distance to reward, and threat-avoidance behavior type) is not established. Moreover, the evidence presented to dissociate movement from these task variables is not fully convincing, particularly without single-session video analysis of movement. Specifically, while the new analyses in Figure 7 are informative, they may not fully account for all potential confounding variables arising from changes in context or behavior.

      Comments on revisions:

      The authors have addressed my previous recommendations.

    1. Reviewer #1 (Public review):

      In this manuscript, Chen et al. investigate the role of the membrane estrogen receptor GPR30 in spinal mechanisms of neuropathic pain. Using a wide variety of techniques, they first provide convincing evidence that GPR30 expression is restricted to neurons within the spinal cord, and that GPR30 neurons are well-positioned to receive descending input from the primary sensory cortex (S1). In addition, the authors put their findings in the context the previous knowledge in the field, presenting evidence demonstrating that GRP30 is expressed in the majority of CCK-expressing spinal neurons. Overall, this manuscript furthers our understanding of neural circuity that underlies neuropathic pain and will be of broad interest to neuroscientists, especially those interested in somatosensation. Nevertheless, the manuscript would be strengthened by additional analyses and clarification of data that is currently presented.

      Strengths:

      The authors present convincing evidence for expression of GPR30 in the spinal cord that is specific to spinal neurons. Similarly, complementary approaches including pharmacological inhibition and knockdown of GPR30 are used to demonstrate a role for the receptor in driving nerve injury-induced pain in rodent models.

      Weaknesses:

      Although steps were taken to put their data into the broader context of what is already known about the spinal circuitry of pain, more considerations and analyses would help the authors better achieve their goal. For instance, to determine whether GPR30 is expressed in excitatory or inhibitory neurons, more selective markers for these subtypes should be used over CamK2. Moreover, quantitative analysis of the extent of overlap between GPR30+ and CCK+ spinal neurons is needed to understand the potential heterogeneity of the GPR30 spinal neuron population, and to interpret experiments characterizing descending SI inputs onto GPR30 and CCK spinal neurons. Filling these gaps in knowledge would make their findings more solid.

      Revised Manuscript Update:

      In their revised manuscript, Chen et al. have added additional data that establishes GPR30 spinal neurons as a population of excitatory neurons, half of which express CCK. These data help to position GPR30 neurons in the existing framework of spinal neuron populations that contribute to neuropathic pain, strengthening the author's findings.

      I have no new recommendations to the author's following this round of revisions.

    2. Reviewer #3 (Public review):

      Summary:

      The authors convincingly demonstrate that a population of CCK+ spinal neurons in the deep dorsal horn express the G protein coupled estrogen receptor GPR30 to modulate pain sensitivity in the chronic constriction injury (CCI) model of neuropathic pain in mice. Using complementary pharmacological and genetic knockdown experiments they convincingly show that GPR30 inhibition or knockdown reverses mechanical, tactile and thermal hypersensitivity, conditioned place aversion, and c-fos staining in the spinal dorsal horn after CCI. They propose that GPR30 mediates an increase in postsynaptic AMPA receptors after CCI using slice electrophysiology which may underlie the increased behavioral sensitivity. They then use anterograde tracing approaches to show that CCK and GPR30 positive neurons in the deep dorsal horn may receive direct connections from primary somatosensory cortex. Chemogenetic activation of these dorsal horn neurons proposed to be connected to S1 increased nociceptive sensitivity in a GPR30 dependent manner. Overall, the data are very convincing and the experiments are well conducted and adequately controlled. The potential role of direct connections from S1 for descending modulation of pain and the endogenous mechanism(s) activating GPR30 will be interesting to test in future studies.

      Strengths:

      The experiments are very well executed and adequately controlled throughout the manuscript. The data are nicely presented and supportive of a role for GPR30 signaling in the spinal dorsal horn influencing nociceptive sensitivity following CCI. The authors also did an excellent job of using complementary approaches to rigorously test their hypothesis.

      Weaknesses:

      While the viral tracing demonstrates a potential connection between S1 and CCK+ or GPR30+ spinal neurons, no direct evidence is provided for S1 in facilitating any activity of these neurons in the dorsal horn.

      Comments on the latest version:

      The authors have done a good job addressing previous critiques and have appropriately revised the manuscript and conclusions.

    1. Reviewer #1 (Public review):

      In the Late Triassic (around 230 Ma ago), southern Wales and adjacent parts of England were a karst landscape. The caves and crevices accumulated remains of small vertebrates. These fossil-rich fissure fills are being exposed in limestone quarrying. In 2022 (reference 13 of the article), a partial articulated skeleton and numerous isolated bones from one fissure fill were named Cryptovaranoides microlanius and described as the oldest known squamate - the oldest known animal, by some 50 Ma, that is more closely related to snakes and some extant lizards than to other extant lizards. This would have considerable consequences for our understanding of the evolution of squamates and their closest relatives, especially for its speed and absolute timing, and was supported in the same paper by phylogenetic analyses based on different datasets.

      In 2023, the present authors published a rebuttal (ref. 18) to the 2022 paper, challenging anatomical interpretations and the irreproducible referral of some of the isolated bones to Cryptovaranoides. Modifying the datasets accordingly, they found Cryptovaranoides outside Squamata and presented evidence that it is far outside. In 2024 (ref. 19), the original authors defended most of their original interpretation and presented some new data, some of it from newly referred isolated bones. The present article discusses anatomical features and the referral of isolated bones in more detail, documents some clear misinterpretations, argues against the widespread but not justifiable practice of referring isolated bones to the same species as long as there is merely no known evidence to the contrary, further argues against comparing newly recognized fossils to lists of diagnostic characters from the literature as opposed to performing phylogenetic analyses and interpreting the results, and finds Cryptovaranoides outside Squamata again.

      Although a few of the character discussions can probably still be improved, I see no sign that the discussion is going in circles or otherwise becoming unproductive. I can even imagine that the present contribution will end it.

    2. Reviewer #2 (Public review):

      Congratulations on this revised manuscript on the phylogenetic affinities of Cryptovaranoides, and thank you for your modifications to this manuscript following review.

      This manuscript offers a careful review of the features used to hypothesize the placement of Cryptovaranoides within crown Squamata and instead suggests that this taxon represents an earlier-diverging reptile. This work therefore reconciles morphological and molecular data regarding lizard origins, which is an important contribution to the field of vertebrate paleontology.

      The authors have improved their manuscript following reviewer comments and now provide more thorough comparisons with other early reptiles and archosauromorphs, an improvement over early versions of this paper. Changes to these comparative descriptions provide important rationale concerning the absence of superficially squamate-like features in Cryptovaranoides.

      The evolutionary relationships of Cryptovaranoides among reptiles will certainly be a matter of debate until detailed anatomical descriptions of this taxon and other putative lepidosauromorphs are published. However, it can now be said with confidence that the presence of any crown squamate in the Permian or Triassic is unlikely and should be met with skepticism, the same sort of skepticism provided in this manuscript.

    3. Reviewer #3 (Public review):

      Summary:

      The study provides an interesting contribution to our understanding of Cryptovaranoides relationships, which is a matter of intensive debate among researchers. The authors have modified the manuscript according to most of my suggestions. My main concerns are about the wording of some statements but the authors have the right to put it as they want in the end. Overall the discussion and data are well prepared. I would recommend to publish the manuscript after very minor revisions.

      Strengths:

      Detailed analysis of the discussed characters. Illustrations of some comparative materials.

      Weaknesses:

      Abstract: "Our team challenged this identification and instead suggested †Cryptovaranoides had unclear affinities to living reptiles"

      Unfortunately I have to disagree again. "unclear affinities to living reptiles" can mean anything including a crown lizard. First, the 2023 paper clearly rejected the squamate hypothesis and presented some evidence that potentially places Cryptovaranoides among Archosauromorpha. In this context "unclear where it would belong within the latter" does not really matter. Second, we are not discussing here if Cryptovaranoides is a squamate or a stem-squamate. We have many more options on the table, so "unclear affinities" is too imprecise. Please change it to "could be an archosauromorph or an indeterminate neodiapsid" in the abstract to show the scale of conflicting evidence.

    1. Reviewer #1 (Public review):

      Summary and Strengths:

      The very well-written manuscript by Lövestam et al. from the Scheres/Goedert groups entitled "Twelve phosphomimetic mutations induce the assembly of recombinant full-length human tau into paired helical filaments" demonstrates the in vitro production of the so-called paired helical filament Alzheimer's disease (AD) polymorph fold of tau amyloids through the introduction of 12 point mutations that attempt to mimic the disease-associated hyper-phosphorylation of tau. The presented work is very important because it enables disease-related scientific work, including seeded amyloid replication in cells, to be performed in vitro using recombinant-expressed tau protein.

      Comments on revised version:

      The manuscript is significantly improved, as also indicated by Reviewer 2, with the 100% formation of the PHF and the additional experiments to elucidate on the potential mechanism by the PTMs. This is a great work.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript addresses an important impediment in the field of Alzheimer's disease (AD) and tauapathy research by showing that 12 specific phosphomimetic mutations in full-length tau allow the protein to aggregate into fibrils with the AD fold and the fold of chronic traumatic encephalopathy fibrils in vitro. The paper presents comprehensive structural and cell based seeding data indicating the improvement of their approach over previous in vitro attempts on non-full-length tau constructs. The main weaknesses of this work results from the fact that only up to 70% of the tau fibrils form the desired fibril polymorphs. In addition, some of the figures are of low quality and confusing.

      Strengths:

      This study provides significant progress towards a very important and timely topic in the amyloid community, namely the in vitro production of tau fibrils found in patients.

      The 12 specific phosphomimetic mutations presented in this work will have an immediate impact in the field since they can be easily reproduced.

      Multiple high-resolution structures support the success of the phosphomimetic mutation approach.

      Additional data show the seeding efficiency of the resulting fibrils, their reduced tendency to bundle, and their ability to be labeled without affecting core structure or seeding capability.

      Comments on revised version:

      Generally, I am satisfied with the revisions. Specifically, the new results showing 100% formation of PHF is a significant improvement.

    1. Reviewer #1 (Public review):

      Summary:

      Activation of thermogenesis by cold exposure and dietary protein restriction are two lifestyle changes that impact health in humans and lead to weight loss in model organisms, here the mouse. How these affect liver and adipose tissues has not been thoroughly investigated side by side. In mice, the authors show that the responses to methionine restriction and cold exposure are tissue-specific while the effects on beige adipose are somewhat similar.

      Strengths:

      The strength of the work is the comparative approach, using transcriptomics and bioinformatic analyses to investigate the tissue-specific impact. The work was performed in mouse models and is state-of-the-art. This represents an important resource for researchers in the field of protein restriction and thermogenesis.

      Weaknesses:

      The findings are descriptive and the conclusions remain associative. The work is limited to mouse physiology and the human implications have not been investigated yet.

    2. Reviewer #2 (Public review):

      Summary:

      This study provides a library of RNA sequencing analysis from brown fat, liver and white fat of mice treated with two stressors - cold challenge and methionine restriction - alone and in combination (interaction between diet and temperature). They characterize the physiologic response of the mice to the stressors, including effects on weight, food intake and metabolism. This paper provides evidence that while both stressors increase energy expenditure, there are complex tissue-specific responses in gene expression, with additive, synergistic and antagonistic responses seen in different tissues.

      Strengths:

      The study design and implementation is solid and well-controlled. Their writing is clear and concise. The authors do an admirable job of distilling the complex transcriptome data into digestible information for presentation in the paper. Most importantly, they do not over reach in their interpretation of their genomic data, keeping their conclusions appropriately tied to the data presented. The discussion is well thought out addresses some interesting points raised by their results.

      Weaknesses:

      The major weakness of the paper is the almost complete reliance on RNA sequencing data, but it is presented as a transcriptomic resource.

    3. Reviewer #3 (Public review):

      Summary:

      Ruppert et al. present a well-designed 2×2 factorial study directly comparing methionine restriction (MetR) and cold exposure (CE) across liver, iBAT, iWAT, and eWAT, integrating physiology with tissue-resolved RNA-seq. This approach allows a rigorous assessment of where dietary and environmental stimuli act additively, synergistically, or antagonistically. Physiologically, MetR progressively increases energy expenditure (EE) at 22{degree sign}C and lowers RER, indicating a lipid utilization bias. By contrast, a 24-hour 4 {degree sign}C challenge elevates EE across all groups and eliminates MetR-Ctrl differences. Notably, changes in food intake and activity do not explain the MetR effect at room temperature.

      Strengths:

      The data convincingly support the central claim: MetR enhances EE and shifts fuel preference to lipids at thermoneutrality, while CE drives robust EE increases regardless of diet and attenuates MetR-driven differences. Transcriptomic analysis reveals tissue-specific responses, with additive signatures in iWAT and CE-dominant effects in iBAT. The inclusion of explicit diet×temperature interaction modeling and GSEA provides a valuable transcriptomic resource for the field.

      Comments on revisions:

      The authors have addressed any concerns I had.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript describes a study examining the relationship between microsaccades and covert attention. This question has been widely investigated, with numerous studies showing that during sustained fixation, when subjects covertly attend to a peripheral stimulus, microsaccades tend to be biased toward the attended location. Here, the authors ask whether this microsaccade bias reflects a shift of covert attention or the maintenance of covert attention. They conclude that the bias is primarily driven by attention shifts, a finding that also helps reconcile the seemingly conflicting results of prior research, where the bias was questioned in paradigms that largely involved attention maintenance rather than shifting.

      Strengths:

      The paradigm and conclusions appear sound and supported by the results. A large sample size was used.

      Weaknesses:

      Weaknesses are mostly related to how the authors enforced fixation in the task, and clarifications are needed regarding some methodological details. A more direct comparison of the effect in the two experimental conditions is missing.

    2. Reviewer #2 (Public review):

      Summary:

      This study aims to test the hypothesis that microsaccades are linked to the shifting of spatial attention, rather than the maintenance of attention at the cued location. In two experiments, participants were required to judge an orientation change at either a validly cued location (80% of the time) or an invalidly cued location (20% of the time). This change was presented at varying intervals (ranging from 500 to 3,200 ms) after cue onset. Accuracy and reaction times both showed attentional benefits at the valid versus invalid location across the different cue-target intervals. In contrast, microsaccade biases were time-dependent. The authors report a directional bias primarily observed around 400 ms after the cue, with later intervals (particularly in Experiment 2) exhibiting no biases in microsaccade direction towards the cued location. The authors argue that this finding supports their initial hypothesis that microsaccade biases reflect shifts in attention, but that maintaining attention at the cued location after an attention shift is not correlated with microsaccade direction.

      Strengths:

      The results are straightforward given the chosen experimental design. The manuscript is clearly written, and the presentation of the study and its visualisations are both of a high standard.

      Weaknesses:

      The major weakness of this paper is its incremental contribution to a widely studied phenomenon. The link between attention and microsaccades has been the subject of extensive research over the past two decades. This study merely provides a limited overview of the key insights gained from these papers and discussions. In fact, it attempts to summarise previous work by stating that many experiments found a link, while others did not, and provides only a relatively small number of references. To make a significant contribution, I believe the authors should evaluate the field more thoroughly, rather than merely scratching the surface.

      The authors then present a potential solution to the conflicting past findings, arguing that attention should be considered a dynamic process that can be broken down into an attention shift and a sustained attention phase. Although the authors present this as a novel concept, I cannot think of anyone in the field who considers spatial attention to be a static entity. Nevertheless, I was curious to see how the authors would attempt to determine the precise timing of the attention shift and manipulate the different stages individually. However, the authors only varied the interval between the onset of the attention cue and the test stimulus, failing to further pinpoint their dynamic attention concept.

      The current version of the experiment, therefore, takes a correlational approach, similar to initial studies by Engbert and Kliegl (2003) and Hafed and Clark (2002). Meanwhile, we have learned a great deal about the link between microsaccades and attention. Below, I will list just a few of these findings to demonstrate how much we already know. It is important to note that, while the present study cites some of these papers, it does not provide a clear overview of how the current study goes beyond previous research.

      (1) Yuval-Greenberg and colleagues (2014) presented stimuli contingent on online-detected microsaccades. A postcue indicated the target for a visual task, and the target could be congruent or incongruent with the microsaccade direction. The authors showed higher visual accuracy in congruent trials. The authors cited that paper, but it is still important to emphasize how this study already tried to go beyond purely correlational links on a single trial level.

      (2) The Desimone lab (Lower et al., 2018) showed that firing rates in monkey V4 and IT were increased when a microsaccade was generated in the direction of the attended target.

      (3) However, attention can modulate responses in the superior colliculus even in the absence of microsaccades (Yu et al., 2022)

      (4) Similarly, Poletti, Rucci & Carrasco (2017) observed attentional modulations in the absence of microsaccades, or comparable attention effects irrespective of whether a microsaccade occurred or not (Roberts & Carrasco, 2019).

      Thus, in light of these insights, I believe the current study only adds incrementally to our understanding of the link between microsaccades and spatial attention.

      In general, it is important to have an independent measure of the dynamics of an attention shift. I think a shift of 200-600 ms is quite long, and defining this interval is rather arbitrary. Why consider such a long delay as the shift? Rather than taking a data-driven approach to defining an interval for an attention shift, it would be more convincing to derive an interval of interest based on past research or an independent measure.

      The present analyses report microsaccade statistics across all trials, but do not directly link single-trial microsaccades to accuracy. Similarly, reaction times and accuracy were analyzed only with respect to valid vs. invalid trials. Here, it would be important to link the findings between microsaccades and performance on a single-trial level. For instance, can the authors report reaction times and accuracy also separately for trials with vs. without microsaccades, and for trials with congruent vs. incongruent microsaccades?

      The study would benefit greatly from including a neutral condition to substantiate claims of attentional benefits and costs. It is highly probable that invalid trials would also demonstrate costs in terms of reaction times and accuracy. It would be interesting to observe whether directional biases in microsaccades are also evident when compared to a neutral condition.

    1. Reviewer #1 (Public review):

      Summary:

      The authors report intracranial EEG findings from 12 epilepsy patients performing an associative recognition memory task under the influence of scopolamine. They show that scopolamine administered before encoding disrupts hippocampal theta phenomena and reduces memory performance, and that scopolamine administered after encoding but before retrieval impairs hippocampal theta phenomena (theta power, theta phase reset) and neural reinstatement but does not impair memory performance. This is an important study with exciting, novel results and translational implications. The manuscript is well-written, the analyses are thorough and comprehensive, and the results seem robust.

      Strengths:

      (1) Very rare experimental design (intracranial neural recordings in humans coupled with pharmacological intervention).

      (2) Extensive analysis of different theta phenomena.

      (3) Well-established task with different conditions for familiarity versus recollection.

      (4) Clear presentation of findings and excellent figures.

      (5) Translational implications for diseases with cholinergic dysfunction (e.g., AD).

      (6) Findings challenge existing memory models, and the discussion presents interesting novel ideas.

      Weaknesses:

      (1) One of the most important results is the lack of memory impairment when scopolamine is administered after encoding but before retrieval (scopolamine block 2). The effect goes in the same direction as for scopolamine during encoding (p = 0.15). Could it be that this null effect is simply due to reduced statistical power (12 subjects with only one block per subject, while there are two blocks per subject for the condition with scopolamine during encoding), which may become significant with more patients? Is there actually an interaction effect indicating that memory impairment is significantly stronger when scopolamine is applied before encoding (Figure 1d)? Similar questions apply to familiarity versus recollection (lines 78-80). This is a very critical point that could alter major conclusions from this study, so more discussion/analysis of these aspects is needed. If there are no interaction effects, then the statements in lines 84-86 (and elsewhere) should be toned down.

      (2) Further, could it simply be that scopolamine hadn't reached its major impact during retrieval after administration in block 2? Figure 2e speaks in favor of this possibility. I believe this is a critical limitation of the experimental design that should be discussed.

      (3) It is not totally clear to me why slow theta was excluded from the reinstatement analysis. For example, despite an overall reduction in theta power, relative patterns may have been retained between encoding and recall. What are the results when using 1-128 Hz as input frequencies?

      (4) In what way are the results affected by epileptic artifacts occurring during the task (in particular, IEDs)?

    2. Reviewer #2 (Public review):

      Summary:

      In this study, performed in human patients, the authors aimed at dissecting out the role of cholinergic modulation in different types of memory (recollection-based vs familiarity and novelty-based) and during different memory phases (encoding and retrieval). Moreover, their goal was to obtain the electrophysiological signature of cholinergic modulation on network activity of the hippocampus and the entorhinal cortex.

      Strengths:

      The authors combined cognitive tasks and intracranial EEG recordings in neurosurgical epilepsy patients. The study confirms previous evidence regarding the deleterious effects of scopolamine, a muscarinic acetylcholine receptor antagonist, on memory performance when administered prior to the encoding phase of the task. During both encoding and retrieval phases, scopolamine disrupts the power of theta oscillations in terms of amplitude and phase synchronization. These results raise the question of the role of theta oscillations during retrieval and the meaning of scopolamine's effect on retrieval-associated theta rhythm without cognitive changes. The authors clearly discussed this issue in the discussion session.<br /> A major point is the finding that the scopolamine-mediated effect is selective for recollection-based memory and not for familiarity- and novelty-based memory.

      The methodology used is powerful, and the data underwent a detailed and rigorous analysis.

      Weaknesses:

      A limited cohort of patients; the age of the patients is not specified in the table.

    1. Joint Public Review:

      Summary

      Non-alcoholic fatty liver disease (NAFLD) is a widespread metabolic disease associated with obesity. Endoplasmic reticulum and calcium dysregulation are hallmarks of NAFLD. Here, the authors explore whether the secreted liver protein transthyretin (TTR), which has been previously shown to modulate calcium signaling in the context of insulin resistance, could also impact NAFLD. The study is motivated by a small cohort of NASH patients who show elevated TTR levels. The authors then overexpress TTR in two mouse obesogenic models, which leads to elevated liver lipid deposition. In contrast, liver-specific TTR knockdown improves some liver lipid levels, reduces inflammation markers, and improves glucose tolerance, overall improving the NAFLD markers. These phenotypic findings are overall convincing and largely consistent in two different diet models.

      Because of TTR's connection to calcium regulation, the authors then assess whether the knockdown affects ER stress and impacts SERCA2 expression. However, the direct mechanistic evidence supporting the central claim that TTR physically interacts with and inhibits the SERCA2 calcium pump is preliminary and requires further validation. Whether the broader effects on lipid accumulation, inflammation markers, and glucose tolerance are mechanistically connected remains to be determined.

      Strengths

      The premise of the study is built on prior work from the authors identifying a link between increased transthyretin secretion and the development of insulin resistance, a related obesity condition. The in vivo studies are comprehensive, using human NASH samples, two distinct diet-induced mouse models (HFD and GAN), and in vitro hepatocyte models. The phenotypic data showing that TTR knockdown alleviates steatosis, inflammation, and insulin resistance are robust and convincing across these systems.

      Weaknesses

      The mechanistic studies in Figures 6-9 are incomplete. There are several issues encompassing experimental design, rigor, and interpretation that, if properly addressed, would make the study much stronger.

      (1) Exogenous TTR that is endocytosed by cells is unlikely to ever find itself inside the lumen of the ER. Conversely, endogenous TTR that is produced in cells and that has not yet been secreted is almost certain to have an ER lumenal localization (as in Figures 7B and 9A, and where an apparent colocalization with SERCA is likely to be incidental). In a model where TTR, acting as a hepatokine, has inhibitory effects on SERCA, these would almost certainly be realized from the cytosolic side of the ER membrane-a region inaccessible to lumenal endogenous TTR. It is possible that the overexpression and knockdown of endogenous TTR have the effects seen due to its secretion and uptake (that is, cell-non-autonomous effects), but this possibility was not directly tested through Transwell or similar assays. Given the identity of TTR as a secretory pathway client protein, the only localization data for TTR that are unexpected are those suggesting an ER localization of exogenously added TTR (Figure 7A), but this localization seems to involve only a minor population of TTR, is hindered by a technical issue with cell permeabilization (see below), and lacks orthogonal approaches to convincingly demonstrate meaningful localization of exogenous TTR at the ER membrane.

      (2) The experimental logic in Figure 8 is problematic. The authors use Thapsigargin (Tg), a potent and specific SERCA inhibitor, to probe SERCA function. However, since both Tg and TTR are proposed to inhibit SERCA2, the design lacks a critical control to demonstrate that TTR's effects are indeed mediated through SERCA2. SERCA2 activity should, in principle, be fully and irreversibly inhibited by Tg treatment, especially using such a high concentration (5 µM). If TTR's effect on calcium flux is exclusively through SERCA2, then SERCA2 impairment by TTR should have no additional effect in the presence of Tg, as Tg would already be maximally inhibiting the pump. The current data (Figures 8G-H) showing an effect of TTR-KD even with Tg present is difficult to interpret and may suggest off-target or compensatory mechanisms.

      (3) The coIP data in Figure 9 need to be better controlled, including by overexpression of FLAG- and MYC-tagged irrelevant proteins, ideally also localized to the ER. The coIP of overexpressed TTR with endogenous SERCA in Figure 9D, in addition to requiring a more rigorous control, is itself of relatively low quality, with the appearance of a possible gel/blotting artifact.

      (4) The ER stress markers in Figure 6 are not convincing. Molecular weight markers and positive controls (for example, livers from animals injected with tunicamycin) are missing. In addition, the species of ATF6 that is purportedly being detected (cleaved or full-length) is not indicated, and this protein is also notoriously difficult to detect with convincing specificity in mouse tissues. As well, CHOP protein is usually not detectable in control normal diet mouse livers, raising questions of whether the band identified as CHOP is, in fact, CHOP. These issues, along with the observation that ER stress-regulated RNAs are not altered (Figure S5), raise the question of whether ER stress is involved at all. Likewise, the quantification of SERCA2 levels from Figure 6 requires more rigor. For all blots, it isn't clear that analyzing only 3 or 4 of the animals provides adequate and unbiased power to detect differences; in addition, in Figure 6C, at least the SERCA2 exposure (assuming SERCA2 is being specifically detected; see above) is well beyond the linear range of quantification.

      In addition, the following important issues were raised:

      (5) n=4 for overexpression might not provide adequate statistical power.

      (6) The error for human NASH samples and controls in Figure 1A is surprisingly small. Larger gene expression data sets from NASH cohorts exist and should be used to test the finding in a larger population.

      (7) For experiments involving two independent variables (e.g., diet and TTR manipulation, as in Figures 2, 3, 4, 5), a Two-way ANOVA must be used instead of One-way ANOVA or t-tests. Also, the ND-TTR-KD group is missing - these data are an essential control to show the specificity of the knockdown and its effects in a non-diseased state.

      (8) Figure 7A: The co-localization signal between TTR-Alexa488 and the ER marker is not strong or convincing, which could be due to the inappropriate immunofluorescence protocol used, of permeabilization prior to fixation. The standard and recommended order is fixation first (to preserve cellular architecture), followed by permeabilization.

    1. Reviewer #1 (Public review):

      In this paper, Stanojcic and colleagues attempt to map sites of DNA replication initiation in the genome of the African trypanosome, Trypanosoma brucei. Their approach to this mapping is to isolate 'short-nascent strands' (SNSs), a strategy adopted previously in other eukaryotes (including in the related parasite Leishmania major), which involves isolation of DNA molecules whose termini contain replication-priming RNA. By mapping the isolated and sequenced SNSs to the genome (SNS-seq), the authors suggest that they have identified origins, which they localise to intergenic (strictly, inter-CDS) regions within polycistronic transcription units and suggest display very extensive overlap with previously mapped R-loops in the same loci. Finally, having defined locations of SNS-seq mapping, they suggest they have identified G4 and nucleosome features of origins, again using previously generated data. Though there is merit in applying a new approach to understand DNA replication initiation in T. brucei, where previous work has used MFA-seq and ChIP of a subunit of the Origin Replication Complex (ORC), there are two significant deficiencies in the study that must be addressed to ensure rigour and accuracy.

      (1) The suggestion that the SNS-seq data is mapping DNA replication origins that are present in inter-CDS regions of the polycistronic transcription units of T. brucei is novel and does not agree with existing data on the localisation of ORC1/CDC6, and it is very unclear if it agrees with previous mapping of DNA replication by MFA-seq due to the way the authors have presented this correlation. For these reasons, the findings essentially rely on a single experimental approach, which must be further tested to ensure SNS-seq is truly detecting origins. Indeed, in this regard, the very extensive overlap of SNS-seq signal with RNA-DNA hybrids should be tested further to rule out the possibility that the approach is mapping these structures and not origins.

      (2) The authors' presentation of their SNS-seq data is too limited and therefore potentially provides a misleading view of DNA replication in the genome of T. brucei. The work is presented through a narrow focus on SNS-seq signal in the inter-CDS regions within polycistronic transcription units, which constitute only part of the genome, ignoring both the transcription start and stop sites at the ends of the units and the large subtelomeres, which are mainly transcriptionally silent. The authors must present a fuller and more balanced view of SNS-seq mapping across the whole genome to ensure full understanding and clarity.

    2. Reviewer #2 (Public review):

      Summary:

      Stanojcic et al. investigate the origins of DNA replication in the unicellular parasite Trypanosoma brucei. They perform two experiments, stranded SNS-seq and DNA molecular combing. Further, they integrate various publicly available datasets, such as G4-seq and DRIP-seq, into their extensive analysis. Using this data, they elucidate the structure of the origins of replication. In particular, they find various properties located at or around origins, such as polynucleotide stretches, G-quadruplex structures, regions of low and high nucleosome occupancy, R-loops, and that origins are mostly present in intergenic regions. Combining their population-level SNS-seq and their single-molecule DNA molecular combing data, they elucidate the total number of origins as well as the number of origins active in a single cell.

      Strengths:

      (1) A very strong part of this manuscript is that the authors integrate several other datasets and investigate a large number of properties around origins of replication. Data analysis clearly shows the enrichment of various properties at the origins, and the manuscript concludes with a very well-presented model that clearly explains the authors' understanding and interpretation of the data.

      (2) The DNA combing experiment is an excellent orthogonal approach to the SNS-seq data. The authors used the different properties of the two experiments (one giving location information, one giving single-molecule information) well to extract information and contrast the experiments.

      (3) The discussion is exemplary, as the authors openly discuss the strengths and weaknesses of the approaches used. Further, the discussion serves its purpose of putting the results in both an evolutionary and a trypanosome-focused context.

      Weaknesses:

      I have major concerns about the origin of replication sites determined from the SNS-seq data. As a caveat, I want to state that, before reading this manuscript, SNS-seq was unknown to me; hence, some of my concerns might be misplaced.

      (1) I do not understand why SNS-seq would create peaks. Replication should originate in one locus, then move outward in both directions until the replication fork moving outward from another origin is encountered. Hence, in an asynchronous population average measurement, I would expect SNS data to be broad regions of + and -, which, taken together, cover the whole genome. Why are there so many regions not covered at all by reads, and why are there such narrow peaks?

      (2) I am concerned that up to 96% percent of all peaks are filtered away. If there is so much noise in the data, how can one be sure that the peaks that remain are real? Specifically, if the authors placed the same number of peaks as was measured randomly in intergenic regions, would 4% of these peaks pass the filtering process by chance?

      (3) There are 3 previous studies that map origins of replication in T. brucei. Devlin et al. 2016, Tiengwe et al. 2012, and Krasiļņikova et al. 2025 (https://doi.org/10.1038/s41467-025-56087-3), all with a different technique: MFA-seq. All three previous studies mostly agree on the locations and number of origins. The authors compared their results to the first two, but not the last study; they found that their results are vastly different from the previous studies (see Supplementary Figure 8A). In their discussion, the authors defend this discrepancy mostly by stating that the discrepancy between these methods has been observed in other organisms. I believe that, given the situation that the other studies precede this manuscript, it is the authors' duty to investigate the differences more than by merely pointing to other organisms. A conclusion should be reached on why the results are different, e.g., by orthogonally validating origins absent in the previous studies.

      (4) Some patterns that were identified to be associated with origins of replication, such as G-quadruplexes and nucleosomes phasing, are known to be biases of SNS-seq (see Foulk et al. Characterizing and controlling intrinsic biases of lambda exonuclease in nascent strand sequencing reveals phasing between nucleosomes and G-quadruplex motifs around a subset of human replication origins. Genome Res. 2015;25(5):725-735. doi:10.1101/gr.183848.114).

      Are the claims well substantiated?:

      My opinion on whether the authors' results support their conclusions depends on whether my concerns about the sites determined from the SNS-seq data can be dismissed. In the case that these concerns can be dismissed, I do think that the claims are compelling.

      Impact:

      If the origins of replication prove to be distributed as claimed, this study has the potential to be important for two fields. Firstly, in research focused on T. brucei as a disease agent, where essential processes that function differently than in mammals are excellent drug targets. Secondly, this study would impact basic research analyzing DNA replication over the evolutionary tree, where T. brucei can be used as an early-divergent eukaryotic model organism.

    1. Reviewer #1 (Public review):

      Summary:

      The novel advance by Wang et al is in the demonstration that, relative to a standard extinction procedure, the retrieval-extinction procedure more effectively suppresses responses to a conditioned threat stimulus when testing occurs just minutes after extinction. The authors provide solid evidence to show that this "short-term" suppression of responding involves engagement of the dorsolateral prefrontal cortex.

      Strengths:

      Overall, the study is well-designed and the results are valuable. There are, however, a few issues in the way that it is introduced and discussed. It would have been useful if the authors could have more explicitly related the results to a theory - it would help the reader understand why the results should have come out the way that they did. More specific comments are presented below.

      Please note: The authors appear to have responded to my original review twice. It is not clear that they observed the public review that I edited after the first round of revisions. As part of these edits, I removed the entire section titled Clarifications, Elaborations and Edits

      Theory and Interpretation of Results

      (1) It is difficult to appreciate why the first trial of extinction in a standard protocol does NOT produce the retrieval-extinction effect. This applies to the present study as well as others that have purported to show a retrieval-extinction effect. The importance of this point comes through at several places in the paper. E.g., the two groups in study 1 experienced a different interval between the first and second CS extinction trials; and the results varied with this interval: a longer interval (10 min) ultimately resulted in less reinstatement of fear than a shorter interval. Even if the different pattern of results in these two groups was shown/known to imply two different processes, there is nothing in the present study that addresses what those processes might be. That is, while the authors talk about mechanisms of memory updating, there is little in the present study that permits any clear statement about mechanisms of memory. The references to a "short-term memory update" process do not help the reader to understand what is happening in the protocol.

      In reply to this point, the authors cite evidence to suggest that "an isolated presentation of the CS+ seems to be important in preventing the return of fear expression." They then note the following: "It has also been suggested that only when the old memory and new experience (through extinction) can be inferred to have been generated from the same underlying latent cause, the old memory can be successfully modified (Gershman et al., 2017). On the other hand, if the new experiences are believed to be generated by a different latent cause, then the old memory is less likely to be subject to modification. Therefore, the way the 1st and 2nd CS are temporally organized (retrieval-extinction or standard extinction) might affect how the latent cause is inferred and lead to different levels of fear expression from a theoretical perspective." This merely begs the question: why might an isolated presentation of the CS+ result in the subsequent extinction experiences being allocated to the same memory state as the initial conditioning experiences?<br /> This is not addressed in the paper. The study was not designed to address this question; and that the question did not need to be addressed for the set of results to be interesting. However, understanding how and why the retrieval-extinction protocol produces the effects that it does in the long-term test of fear expression would greatly inform our understanding of how and why the retrieval-extinction protocol has the effects that it does in the short-term tests of fear expression. To be clear; the results of the present study are very interesting - there is no denying that. I am not asking the authors to change anything in response to this point. It simply stands as a comment on the work that has been done in this paper and the area of research more generally.

      (2) The discussion of memory suppression is potentially interesting but raises many questions. That is, memory suppression is invoked to explain a particular pattern of results but I, as the reader, have no sense of why a fear memory would be better suppressed shortly after the retrieval-extinction protocol compared to the standard extinction protocol; and why this suppression is NOT specific to the cue that had been subjected to the retrieval-extinction protocol. I accept that the present study was not intended to examine aspects of memory suppression, and that it is a hypothesis proposed to explain the results collected in this study. I am not asking the authors to change anything in response to this point. Again, it simply stands as a comment on the work that has been done in this paper.

      (3) The authors have inserted the following text in the revised manuscript: "It should be noted that while our long-term amnesia results were consistent with the fear memory reconsolidation literatures, there were also studies that failed to observe fear prevention (Chalkia, Schroyens, et al., 2020; Chalkia, Van Oudenhove, et al., 2020; Schroyens et al., 2023). Although the memory reconsolidation framework provides a viable explanation for the long-term amnesia, more evidence is required to validate the presence of reconsolidation, especially at the neurobiological level (Elsey et al., 2018). While it is beyond the scope of the current study to discuss the discrepancies between these studies, one possibility to reconcile these results concerns the procedure for the retrieval-extinction training. It has been shown that the eligibility for old memory to be updated is contingent on whether the old memory and new observations can be inferred to have been generated by the same latent cause (Gershman et al., 2017; Gershman and Niv, 2012). For example, prevention of the return of fear memory can be achieved through gradual extinction paradigm, which is thought to reduce the size of prediction errors to inhibit the formation of new latent causes (Gershman, Jones, et al., 2013). Therefore, the effectiveness of the retrieval-extinction paradigm might depend on the reliability of such paradigm in inferring the same underlying latent cause." ***It is perfectly fine to state that "the effectiveness of the retrieval-extinction paradigm might depend on the reliability of such paradigm in inferring the same underlying latent cause..." This is not uninteresting; but it also isn't saying much. Ideally, the authors would have included some statement about factors that are likely to determine whether one is or isn't likely to see a retrieval-extinction effect, grounded in terms of the latent state theories that have been invoked here. Presumably, the retrieval-extinction protocol has variable effects because of procedural differences that affect whether subjects infer the same underlying latent cause when shifted into extinction. Surely, the clinical implications of any findings are seriously curtailed unless one understands when a protocol is likely to produce an effect; and why the effect occurs at all? This question is rhetorical. I am not asking the authors to change anything in response to this point. Again, it stands as a comment on the work that has been done in this paper; and remains a comment after insertion of the new text, which is acknowledged and appreciated.

      (4) The authors find different patterns of responses to CS1 and CS2 when they were tested 30 min after extinction versus 24 h after extinction. On this basis, they infer distinct memory update mechanisms. However, I still can't quite see why the different patterns of responses at these two time points after extinction need to be taken to infer different memory update mechanisms. That is, the different patterns of responses at the two time points could be indicative of the same "memory update mechanism" in the sense that the retrieval-extinction procedure induces a short-term memory suppression that serves as the basis for the longer-term memory suppression (i.e., the reconsolidation effect). My pushback on this point is based on the notion of what constitutes a memory update mechanism; and is motivated by what I take to be a rather loose use of language/terminology in the reconsolidation literature and this paper specifically (for examples, see the title of the paper and line 2 of the abstract).

      To be clear: I accept the authors' reply that "The focus of the current manuscript is to demonstrate that the retrieval-extinction paradigm can also facilitate a short-term fear memory deficit measured by SCR". However, I disagree with the claim that any short-term fear memory deficit must be indicative of "update mechanisms other than reconsolidation", which appears on Line 27 in the abstract and very much indicates the spirit of the paper. To make the point: the present study has examined the effectiveness of a retrieval-extinction procedure in suppressing fear responses 30 min, 6 hours and 24 hours after extinction. There are differences across the time points in terms of the level of suppression, its cue specificity, and its sensitivity to manipulation of activity in the dlPFC. This is perfectly interesting when not loaded with additional baggage re separable mechanisms of memory updating at the short and long time points: there is simply no evidence in this study or anywhere else that the short-term deficit in suppression of fear responses has anything whatsoever to do with memory updating. It can be exactly what is implied by the description: a short-term deficit in the suppression of fear responses. Again, this stands as a comment on the work that has been done; and remains a comment for the revised paper.

      (5) It is not clear why thought control ability ought to relate to any aspect of the suppression that was evident in the 30 min tests - that is, I accept the correlation between thought control ability and performance in the 30 min tests but would have liked to know why this was looked at in the first place and what, if anything, it means. The issue at hand is that, as best as I can tell, there is no theory to which the result from the short- and long-term tests can be related. The attempts to fill this gap with reference to phenomena like retrieval-induced forgetting are appreciated but raise more questions than answers. This is especially clear in the discussion, where it is acknowledged/stated: "Inspired by the similarities between our results and suppression-induced declarative memory amnesia (Gagnepain et al., 2017), we speculate that the retrieval-extinction procedure might facilitate a spontaneous memory suppression process and thus yield a short-term amnesia effect. Accordingly, the activated fear memory induced by the retrieval cue would be subjected to an automatic fear memory suppression through the extinction training (Anderson and Floresco, 2022)." There is nothing in the subsequent discussion to say why this should have been the case other than the similarity between results obtained in the present study and those in the literature on retrieval induced forgetting, where the nature of the testing is quite different. Again, this is simply a comment on the work that has been done - no change is required for the revised paper.

    2. Reviewer #2 (Public review):

      Summary

      The study investigated whether memory retrieval followed soon by extinction training results in a short-term memory deficit when tested - with a reinstatement test that results in recovery from extinction - soon after extinction training. Experiment 1 documents this phenomenon using a between-subjects design. Experiment 2 used a within-subject control and sees that the effect is also observed in a control condition. In addition, it also revealed that if testing is conducted 6 hours after extinction, there is not effect of retrieval prior to extinction as there is recovery from extinction independently of retrieval prior to extinction. A third Group also revealed that retrieval followed by extinction attenuates reinstatement when the test is conducted 24 hours later, consistent with previous literature. Finally, Experiment 3 used continuous theta-burst stimulation of the dorsolateral prefrontal cortex and assessed whether inhibition of that region (vs a control region) reversed the short-term effect revealed in Experiments 1 and 2. The results of control groups in Experiment 3 replicated the previous findings (short-term effect), and the experimental group revealed that these can be reversed by inhibition of the dorsolateral prefrontal cortex.

      Strengths

      The work is performed using standard procedures (fear conditioning and continuous theta-burst stimulation) and there is some justification of the sample sizes. The results replicate previous findings - some of which have been difficult to replicate and this needs to be acknowledged - and suggest that the effect can also be observed in a short-term reinstatement test.

      The study establishes links between the memory reconsolidation and retrieval-induced forgetting (or memory suppression) literatures. The explanations that have been developed for these are distinct and the current results integrate these, by revealing that the DLPFC activity involved in retrieval-extinction short-term effect. There is thus some novelty in the present results, but numerous questions remain unaddressed.

      Weakness

      The fear acquisition data is converted to a differential fear SCR and this is what is analysed (early vs late). However, the figure shows the raw SCR values for CS+ and CS- and therefore it is unclear whether acquisition was successful (despite there being an "early" vs "late" effect - no descriptives are provided).

      In Experiment 1 (Test results) it is unclear whether the main conclusion stems from a comparison of the test data relative to the last extinction trial ("we defined the fear recovery index as the SCR difference between the first test trial and the last extinction trial for a specific CS") or the difference relative to the CS- ("differential fear recovery index between CS+ and CS-"). It would help the reader assess the data if Fig 1e presents all the indexes (both CS+ and CS-). In addition, there is one sentence which I could not understand "there is no statistical difference between the differential fear recovery indexes between CS+ in the reminder and no reminder groups (P=0.048)". The p value suggests that there is a difference, yet it is not clear what is being compared here. Critically, any index taken as a difference relative to the CS- can indicate recovery of fear to the CS+ or absence of discrimination relative to the CS-, so ideally the authors would want to directly compare responses to the CS+ in the reminder and no-reminder groups. In the absence of such comparison, little can be concluded, in particular if SCR CS- data is different between groups. The latter issue is particularly relevant in Experiment 2, in which the CS- seems to vary between groups during the test and this can obscure the interpretation of the result.

      In experiment 1, the findings suggest that there is a benefit of retrieval followed by extinction in a short-term reinstatement test. In Experiment 2, the same effect is observed to a cue which did not undergo retrieval before extinction (CS2+), a result that is interpreted as resulting from cue-independence, rather than a failure to replicate in a within-subjects design the observations of Experiment 1 (between-subjects). Although retrieval-induced forgetting is cue-independent (the effect on items that are supressed [Rp-] can be observed with an independent probe), it is not clear that the current findings are similar, and thus that the strong parallels made are not warranted. Here, both cues have been extinguished and therefore been equally exposed during the critical stage.

      The findings in Experiment 2 suggest that the amnesia reported in experiment 1 is transient, in that no effect is observed when the test is delayed by 6 hours. The phenomena whereby reactivated memories transition to extinguished memories as a function of the amount of exposure (or number of trials) is completely different from the phenomena observed here. In the former, the manipulation has to do with the number of trials (or total amount of time) that the cues are exposed. In the current Experiment 2, the authors did not manipulate the number of trials but instead the retention interval between extinction and test. The finding reported here is closer to a "Kamin effect", that is the forgetting of learned information which is observed with intervals of intermediate length (Baum, 1968). Because the Kamin effect has been inferred to result from retrieval failure, it is unclear how this can be explained here. There needs to be much more clarity on the explanations to substantiate the conclusions.

      There are many results (Ryan et al., 2015) that challenge the framework that the authors base their predictions on (consolidation and reconsolidation theory), therefore these need to be acknowledged. These studies showed that memory can be expressed in the absence of the biological machinery thought to be needed for memory performance. The authors should be careful about statements such as "eliminate fear memores" for which there is little evidence.

      The parallels between the current findings and the memory suppression literature are speculated in the general discussion, and there is the conclusion that "the retrieval-extinction procedure might facilitate a spontaneous memory suppression process". Because one of the basic tenets of the memory suppression literature is that it reflects an "active suppression" process, there is no reason to believe that in the current paradigm the same phenomenon is in place, but instead it is "automatic". In other words, the conclusions make strong parallels with the memory suppression (and cognitive control) literature, yet the phenomena that they observed is thought to be passive (or spontaneous/automatic). Ultimately, it is unclear why 10 mins between the reminder and extinction learning will "automatically" supress fear memories. Further down in the discussion it is argued that "For example, in the well-known retrieval-induced forgetting (RIF) phenomenon, the recall of a stored memory can impair the retention of related long-term memory and this forgetting effect emerges as early as 20 minutes after the retrieval procedure, suggesting memory suppression or inhibition can occur in a more spontaneous and automatic manner". I did not follow with the time delay between manipulation and test (20 mins) would speak about whether the process is controlled or automatic. In addition, the links with the "latent cause" theoretical framework are weak if any. There is little reason to believe that one extinction trial, separated by 10 mins from the rest of extinction trials, may lead participants to learn that extinction and acquisition have been generated by the same latent cause.

      Among the many conclusions, one is that the current study uncovers the "mechanism" underlying the short-term effects of retrieval-extinction. There is little in the current report that uncovers the mechanism, even in the most psychological sense of the mechanism, so this needs to be clarified. The same applies to the use of "adaptive".

      Whilst I could access the data in the OFS site, I could not make sense of the Matlab files as there is no signposting indicating what data is being shown in the files. Thus, as it stands, there is no way of independently replicating the analyses reported.<br /> The supplemental material shows figures with all participants, but only some statistical analyses are provided, and sometimes these are different from those reported in the main manuscript. For example, the test data in Experiment 1 is analysed with a two-way ANOVA with main effects of group (reminder vs no-reminder) and time (last trial of extinction vs first trial of test) in the main report. The analyses with all participants in the sup mat used a mixed two-way ANOVA with group (reminder vs no reminder) and CS (CS+ vs CS-). This makes it difficult to assess the robustness of the results when including all participants. In addition, in the supplementary materials there are no figures and analyses for Experiment 3.

      One of the overarching conclusions is that the "mechanisms" underlying reconsolidation (long term) and memory suppression (short term) phenomena are distinct, but memory suppression phenomena can also be observed after a 7-day retention interval (Storm et al., 2012), which then questions the conclusions achieved by the current study.

      References:

      Baum, M. (1968). Reversal learning of an avoidance response and the Kamin effect. Journal of Comparative and Physiological Psychology, 66(2), 495.<br /> Chalkia, A., Schroyens, N., Leng, L., Vanhasbroeck, N., Zenses, A. K., Van Oudenhove, L., & Beckers, T. (2020). No persistent attenuation of fear memories in humans: A registered replication of the reactivation-extinction effect. Cortex, 129, 496-509.<br /> Ryan, T. J., Roy, D. S., Pignatelli, M., Arons, A., & Tonegawa, S. (2015). Engram cells retain memory under retrograde amnesia. Science, 348(6238), 1007-1013.<br /> Storm, B. C., Bjork, E. L., & Bjork, R. A. (2012). On the durability of retrieval-induced forgetting. Journal of Cognitive Psychology, 24(5), 617-629.

      Comments on revisions:

      Thanks to the authors for trying to address my concerns.

      (1 and 2) My point about evidence for learning relates to the fact that in none of the experiments an increase in SCR to the CSs+ is observed during training (in Experiment 1 CS+/CS- differences are even present from the outset), instead what happens is that participants learn to discriminate between the CS+ and CS- and decrease their SCR responding to the safe CS-. This begs the question as to what is being learned, given that the assumption is that the retrieval-extinction treatment is concerned with the excitatory memory (CS+) rather than the CS+/CS- discrimination. For example, Figures 6A and 6B have short/Long term amnesia in the right axes, but it is unclear from the data what memory is being targeted. In Figure 6C, the right panels depicting Suppression and Reconsolidation mechanisms suggest that it is the CS+ memory that is being targeted. Because the dependent measure (differential SCR) captures how well the discrimination was learned (this point relates to point 2 which the authors now acknowledge that there are differences between groups in responding to the CS-), then I struggle to see how the data supports these CS+ conclusions. The fact that influential papers have used this dependent measure (i.e., differential SCR) does not undermine the point that differences between groups at test are driven by differences in responding to the CS-.

      (3, 4 and 5) The authors have qualified some of the statements, yet I fail to see some of these parallels. Much of the discussion is speculative and ultimately left for future research to address.

      (6) I can now make more sense of the publicly available data, although the files would benefit from an additional column that distinguishes between participants that were included in the final analyses (passed the multiple criteria = 1) and those who did not (did not pass the criteria = 0). Otherwise, anyone who wants to replicate these analyses needs to decipher the multiple inclusion criteria and apply it to the dataset.

    1. Reviewer #1 (Public review):

      Summary:

      The study examines human biases in a regime-change task, in which participants have to report the probability of a regime change in the face of noisy data. The behavioral results indicate that humans display systematic biases, in particular, overreaction in stable but noisy environments and underreaction in volatile settings with more certain signals. fMRI results suggest that a frontoparietal brain network is selectively involved in representing subjective sensitivity to noise, while the vmPFC selectively represents sensitivity to the rate of change.

      Strengths:

      - The study relies on a task that measures regime-change detection primarily based on descriptive information about the noisiness and rate of change. This distinguishes the study from prior work using reversal-learning or change-point tasks in which participants are required to learn these parameters from experiences. The authors discuss these differences comprehensively.

      - The study uses a simple Bayes-optimal model combined with model fitting, which seems to describe the data well. The model is comprehensively validated.

      - The authors apply model-based fMRI analyses that provide a close link to behavioral results, offering an elegant way to examine individual biases.

      Weaknesses:

      The authors have adequately addressed most of my prior concerns.

      My only remaining comment concerns the z-test of the correlations. I agree with the non-parametric test based on bootstrapping at the subject level, providing evidence for significant differences in correlations within the left IFG and IPS.

      However, the parametric test seems inadequate to me. The equation presented is described as the Fisher z-test, but the numerator uses the raw correlation coefficients (r) rather than the Fisher-transformed values (z). To my understanding, the subtraction should involve the Fisher z-scores, not the raw correlations.

      More importantly, the Fisher z-test in its standard form assumes that the correlations come from independent samples, as reflected in the denominator (which uses the n of each independent sample). However, in my opinion, the two correlations are not independent but computed within-subject. In such cases, parametric tests should take into account the dependency. I believe one appropriate method for the current case (correlated correlation coefficients sharing a variable [behavioral slope]) is explained here:

      Meng, X.-l., Rosenthal, R., & Rubin, D. B. (1992). Comparing correlated correlation coefficients. Psychological Bulletin, 111(1), 172-175. https://doi.org/10.1037/0033-2909.111.1.172

      It should be implemented here:

      Diedenhofen B, Musch J (2015) cocor: A Comprehensive Solution for the Statistical Comparison of Correlations. PLoS ONE 10(4): e0121945. https://doi.org/10.1371/journal.pone.0121945

      My recommendation is to verify whether my assumptions hold, and if so, perform a test that takes correlated correlations into account. Or, to focus exclusively on the non-parametric test.

      In any case, I recommend a short discussion of these findings and how the authors interpret that some of the differences in correlations are not significant.

    2. Reviewer #3 (Public review):

      This study concerns how observers (human participants) detect changes in the statistics of their environment, termed regime shifts. To make this concrete, a series of 10 balls are drawn from an urn that contains mainly red or mainly blue balls. If there is a regime shift, the urn is changed over (from mainly red to mainly blue) at some point in the 10 trials. Participants report their belief that there has been a regime shift as a % probability. Their judgement should (mathematically) depend on the prior probability of a regime shift (which is set at one of three levels) and the strength of evidence (also one of three levels, operationalized as the proportion of red balls in the mostly-blue urn and vice versa). Participants are directly instructed of the prior probability of regime shift and proportion of red balls, which are presented on-screen as numerical probabilities. The task therefore differs from most previous work on this question in that probabilities are instructed rather than learned by observation, and beliefs are reported as numerical probabilities rather than being inferred from participants' choice behaviour (as in many bandit tasks, such as Behrens 2007 Nature Neurosci).

      The key behavioural finding is that participants over-estimate the prior probability of regime change when it is low, and under estimate it when it is high; and participants over-estimate the strength of evidence when it is low and under-estimate it when it is high. In other words participants make much less distinction between the different generative environments than an optimal observer would. This is termed 'system neglect'. A neuroeconomic-style mathematical model is presented and fit to data.

      Functional MRI results how that strength of evidence for a regime shift (roughly, the surprise associated with a blue ball from an apparently red urn) is associated with activity in the frontal-parietal orienting network. Meanwhile, at time-points where the probability of a regime shift is high, there is activity in another network including vmPFC. Both networks show individual differences effects, such that people who were more sensitive to strength of evidence and prior probability show more activity in the frontal-parietal and vmPFC-linked networks respectively.

      Strengths

      (1) The study provides a different task for looking at change-detection and how this depends on estimates of environmental volatility and sensory evidence strength, in which participants are directly and precisely informed of the environmental volatility and sensory evidence strength rather than inferring them through observation as in most previous studies<br /> (2) Participants directly provide belief estimates as probabilities rather than experimenters inferring them from choice behaviour as in most previous studies<br /> (3) The results are consistent with well-established findings that surprising sensory events activate the frontal-parietal orienting network whilst updating of beliefs about the word ('regime shift') activates vmPFC.

      Weaknesses

      (1) The use of numerical probabilities (both to describe the environments to participants, and for participants to report their beliefs) may be problematic because people are notoriously bad at interpreting probabilities presented in this way, and show poor ability to reason with this information (see Kahneman's classic work on probabilistic reasoning, and how it can be improved by using natural frequencies). Therefore the fact that, in the present study, people do not fully use this information, or use it inaccurately, may reflect the mode of information delivery.

      (2) Although a very precise model of 'system neglect' is presented, many other models could fit the data.

      For example, you would get similar effects due to attraction of parameter estimates towards a global mean - essentially application of a hyper-prior in which the parameters applied by each participant in each block are attracted towards the experiment-wise mean values of these parameters. For example, the prior probability of regime shift ground-truth values [0.01, 0.05, 0.10] are mapped to subjective values of [0.037, 0.052, 0.069]; this would occur if observers apply a hyper-prior that the probability of regime shift is about 0.05 (the average value over all blocks). This 'attraction to the mean' is a well-established phenomenon and cannot be ruled out with the current data (I suppose you could rule it out by comparing to another dataset in which the mean ground-truth value was different).

      More generally, any model in which participants don't fully use the numerical information they were given would produce apparent 'system neglect'. Four qualitatively different example reasons are: 1. Some individual participants completely ignored the probability values given. 2. Participants did not ignore the probability values given, but combined them with a hyperprior as above. 3. Participants had a reporting bias where their reported beliefs that a regime-change had occurred tend to be shifted towards 50% (rather than reporting 'confident' values such 5% or 95%). 4. Participants underweighted probability outliers resulting in underweighting of evidence in the 'high signal diagnosticity' environment (10.1016/j.neuron.2014.01.020 )

      In summary I agree that any model that fits the data would have to capture the idea that participants don't differentiate between the different environments as much as they should, but I think there are a number of qualitatively different reasons why they might do this - of which the above are only examples - hence I find it problematic that the authors present the behaviour as evidence for one extremely specific model.

      (3) Despite efforts to control confounds in the fMRI study, including two control experiments, I think some confounds remain.

      For example, a network of regions is presented as correlating with the cumulative probability that there has been a regime shift in this block of 10 samples (Pt). However, regardless of the exact samples shown, doesn't Pt always increase with sample number (as by the time of later samples, there have been more opportunities for a regime shift)? Unless this is completely linear, the effect won't be controlled by including trial number as a co-regressor (which was done).

      On the other hand, two additional fMRI experiments are done as control experiments and the effect of Pt in the main study is compared to Pt in these control experiments. Whilst I admire the effort in carrying out control studies, I can't understand how these particular experiment are useful controls. For example in experiment 3 participants simply type in numbers presented on the screen - how can we even have an estimate of Pt from this task?

      (4) The Discussion is very long, and whilst a lot of related literature is cited, I found it hard to pin down within the discussion, what the key contributions of this study are. In my opinion it would be better to have a short but incisive discussion highlighting the advances in understanding that arise from the current study, rather than reviewing the field so broadly.

      Editors’ note: Reviewer #2 was unavailable to re-review the manuscript. Reviewer #3 was added for this round of review to ensure two reviewers and because of their expertise in the computational and modelling aspects of the work.

    1. Reviewer #1 (Public review):

      Summary:

      Silbaugh, Koster and Hansel investigated how the cerebellar climbing fiber (CF) signals influence neuronal activity and plasticity in mouse primary somatosensory (S1) cortex. They found that optogenetic activation of CFs in the cerebellum modulates responses of cortical neurons to whisker stimulation in a cell-type-specific manner and suppresses potentiation of layer 2/3 pyramidal neurons induced by repeated whisker stimulation. This suppression of plasticity by CF activation is mediated through modulation of VIP- and SST-positive interneurons. Using transsynaptic tracing and chemogenetic approaches, the authors identified a pathway from the cerebellum through the zona incerta and the thalamic posterior medial (POm) nucleus to the S1 cortex, which underlies this functional modulation.

      The authors have addressed all the necessary points.

    2. Reviewer #2 (Public review):

      Summary:

      The authors examined long-distance influence of climbing fiber (CF) signaling in the somatosensory cortex by manipulating whiskers through stimulation. Also, they examined CF signaling using two-photon imaging and mapped projections from the cerebellum to somatosensory cortex using transsynaptic tracing. As a final manipulation, they used chemogenetics to perturb parvalbumin positive neurons in the zona incerta and recorded from climbing fibers.

      Strengths:

      There are several strengths to this paper. The recordings were carefully performed and AAVs used were selective and specific for the cell-types and pathways being analyzed. In addition, the authors used multiple approaches that support climbing fiber pathways to distal regions of the brain. This work will impact the field and describes nice methods to target difficult to reach brain regions, such as the inferior olive.

      No weaknesses noted.

    3. Reviewer #3 (Public review):

      Summary:

      The authors developed an interesting novel paradigm to probe the effects of cerebellar climbing fiber activation on short-term adaptation of somatosensory neocortical activity during repetitive whisker stimulation. Normally, RWS potentiated whisker responses in pyramidal cells and weakly suppressed them in interneruons, lasting for at least 1h. Crusii Optogenetic climbing fiber activation during RWS reduced or inverted these adaptive changes. This effect was generally mimicked or blocked with chemogenetic SST or VIP activation/suppression as predicted based on their "sign" in the circuit.

      Strengths:

      The central finding about CF modulation of S1 response adaptation is interesting, important, and convincing, and provides a jumping-off point for the field to start to think carefully about cerebellar modulation of neocortical plasticity.

      Weaknesses:

      The SST and VIP results appeared slightly weaker statistically, but I do not personally think this detracts from the importance of the initial finding (if there are multiple underlying mechanisms, modulating one may reproduce only a fraction of the effect size). I found the suggestion that zona incerta may be responsible for the cerebellar effects on S1 to be a more speculative result (it is not so easy with existing technology to effectively modulate this type of polysynaptic pathway), but this may be an interesting topic for the authors to follow up on in more detail in the future.

      Comments on revisions:

      The authors have appropriately addressed my comments.

    1. Reviewer #1 (Public review):

      The study aims to determine the role of Slit-Robo signaling in the development and patterning of cardiac innervation, a key process in heart development. Despite the well-studied roles of Slit axon guidance molecules in the development of the central nervous system, their roles in the peripheral nervous system are less clear. Thus, the present study addresses an important question. The study uses genetic knockout models to investigate how Slit2, Slit3, Robo1, and Robo2 contribute to cardiac innervation

      Using constitutive and cell type-specific knockout mouse models, they show that the loss of endothelial-derived Slit2 reduces cardiac innervation. Additionally, Robo1 knockout, but not Robo2 knockout, recapitulated the Slit2 knockout effect on cardiac innervation, leading to the conclusion that Slit2-Robo1 signaling drives sympathetic innervation in the heart. Finally, the authors also show a reduction in isoproterenol-stimulated heart rate but not basal heart rate in the absence of endothelial Slit2.

      The conclusions of this paper are mostly well supported by the data, but there are several limitations:

      (1) It is well established that Slit ligands undergo proteolytic cleavage, generating N- and C-terminal fragments with distinct biological functions. Full-length Slit proteins and their fragments differ in cell association, with the N-terminal fragment typically remaining membrane-bound, while the C-terminal fragment is more diffusible. This distinction is crucial when evaluating the role of Slit proteins secreted by different cell types in the heart. However, this study does not examine or discuss the specific contributions of different Slit2 fragments, limiting its mechanistic insight into how Slit2 regulates cardiac innervation. While these points are mentioned in the discussion, they are not incorporated into the interpretation of the data or the presented model.

      (2) The endothelial-specific deletion of Slit2 leads to its loss in endothelial cells across various organs and tissues in the developing embryo. Therefore, the phenotypes observed in the heart may be influenced by defects in other parts of the embryo, such as the CNS or sympathetic ganglia, and this possibility cannot be ruled out. The data presented in the manuscript does not dissect the relative contributions of endothelial Slit2 loss in the heart versus secondary effects arising from other organ systems. Without tissue-specific rescue or complementary conditional models, it remains unclear whether the observed cardiac phenotypes are a direct consequence of local endothelial Slit2 deficiency or an indirect outcome of broader developmental perturbations.

    2. Reviewer #2 (Public review):

      The aims of investigating Slit-Robo signaling in cardiac innervation were achieved by the experiments designed. The authors demonstrate that endothelial Slit2 signaling through Robo1 drives sympathetic innervation. While questions remain regarding signal regulation and interplay between established axon guidance signals and the further role of other Slit ligands and Robo expression in endothelium, the results strongly support the conclusions drawn.<br /> Writing and presentation are easy to follow and well structured. Appropriate controls are used, statistical analysis applied appropriately, and experiments directly test aims following a logical story.<br /> The authors demonstrate a novel mechanism for Slit-Robo signaling in cardiac sympathetic innervation. The data establishes a framework for future studies.

      The authors have updated their discussion to highlight the need for investigation of the role of proteolytic cleavage of Slit2 as well as the potential for defects in other tissues due to endothelial knockout of Slit2 influencing cardiac innervation.

    1. Reviewer #2 (Public review):

      Summary:

      This work presents a modality-agnostic decoder trained on a large fMRI dataset (SemReps-8K), in which subjects viewed natural images and corresponding captions. The decoder predicts stimulus content from brain activity irrespective of the input modality and performs on par with-or even outperforms-modality-specific decoders. Its success depends more on the diversity of brain data (multimodal vs. unimodal) than on whether the feature-extraction models are visual, linguistic, or multimodal. Particularly, the decoder shows strong performance in decoding imagery content. These results suggest that the modality-agnostic decoder effectively leverages shared brain information across image and caption tasks.

      Strengths:

      (1) The modality-agnostic decoder compellingly leverages multimodal brain information, improving decoding accuracy-particularly for non-sensory input such as captions-showing high methodological and application value.

      (2) The dataset is a substantial and well-controlled contribution, with >8,000 image-caption trials per subject and careful matching of stimuli across modalities-an essential resource for testing theories about different representational modalities.

      Weakness:

      In the searchlight analysis aimed at identifying modality-invariant representations, although the combined use of four decoding conditions represents a relatively strict approach, the underlying logic remains unclear. The modality-agnostic decoder has demonstrated strong sensitivity in decoding brain activity, as shown earlier in the paper, whereas the cross-decoding with modality-specific decoders is inherently more conservative. If, as the authors note, the modality-agnostic decoder might have learned to leverage different features to project stimuli from different modalities, then taking the union of conditions would seem more appropriate. Conversely, if the goal is to obtain a more conservative result, why not focus solely on the cross-decoding conditions? The relationships among the four decoding conditions are not clearly delineated, and the contrasts between them might themselves yield valuable insights. As it stands, however, the logic of the current approach is not straightforward.

    2. Reviewer #3 (Public review):

      Summary:

      The authors recorded brain responses while participants viewed images and captions. The images and captions were taken from the COCO dataset, so each image has a corresponding caption and each caption has a corresponding image. This enabled the authors to extract features from either the presented stimulus or the corresponding stimulus in the other modality. The authors trained linear decoders to take brain responses and predict stimulus features. "Modality-specific" decoders were trained on brain responses to either images or captions while "modality-agnostic" decoders were trained on brain responses to both stimulus modalities. The decoders were evaluated on brain responses while the participants viewed and imagined new stimuli, and prediction performance was quantified using pairwise accuracy. The authors reported the following results:

      (1) Decoders trained on brain responses to both images and captions can predict new brain responses to either modality.

      (2) Decoders trained on brain responses to both images and captions outperform decoders trained on brain responses to a single modality.

      (3) Many cortical regions represent the same concepts in vision and language.

      (4) Decoders trained on brain responses to both images and captions can decode brain responses to imagined scenes.

      Strengths:

      This is an interesting study that addresses important questions about modality-agnostic representations. Previous work has shown that decoders trained on brain responses to one modality can be used to decode brain responses to another modality. The authors build on these findings by collecting a new multimodal dataset and training decoders on brain responses to both modalities.

      To my knowledge, SemReps-8K is the first dataset of brain responses to vision and language where each stimulus item has a corresponding stimulus item in the other modality. This means that brain responses to a stimulus item can be modeled using visual features of the image, linguistic features of the caption, or multimodal features derived from both the image and the caption. The authors also employed a multimodal one-back matching task which forces the participants to activate modality-agnostic representations. Overall, SemReps-8K is a valuable resource that will help researchers answer more questions about modality-agnostic representations.

      The analyses are also very comprehensive. The authors trained decoders on brain responses to images, captions, and both modalities, and they tested the decoders on brain responses to images, caption, and imagined scenes. They extracted stimulus features using a range of visual, linguistic, and multimodal models. The modeling framework appears rigorous and the results offer new insights into the relationship between vision, language, and imagery. In particular, the authors found that decoders trained on brain responses to both images and captions were more effective at decoding brain responses to imagined scenes than decoders trained on brain responses to either modality in isolation. The authors also found that imagined scenes can be decoded from a broad network of cortical regions.

      Weaknesses:

      The characterization of "modality-agnostic" and "modality-specific" decoders seems a bit contradictory. There are three major choices when fitting a decoder: the modality of the training stimuli, the modality of the testing stimuli, and the model used to extract stimulus features. However, the authors characterize their decoders based on only the first choice-"modality-specific" decoders were trained on brain responses to either images or captions while "modality-agnostic" decoders were trained on brain responses to both stimulus modalities. I think that this leads to some instances where the conclusions are inconsistent with the methods and results.

      First, the authors suggest that "modality-specific decoders are not explicitly encouraged to pick up on modality-agnostic features during training" (line 137) while "modality-agnostic decoders may be more likely to leverage representations that are modality-agnostic" (line 140). However, whether a decoder is required to learn modality-agnostic representations depends on both the training responses and the stimulus features. Consider the case where the stimuli are represented using linguistic features of the captions. When you train a "modality-specific" decoder on image responses, the decoder is forced to rely on modality-agnostic information that is shared between the image responses and the caption features. On the other hand, when you train a "modality-agnostic" decoder on both image responses and caption responses, the decoder has access to the modality-specific information that is shared by the caption responses and the caption features, so it is not explicitly required to learn modality-agnostic features. As a result, while the authors show that "modality-agnostic" decoders outperform "modality-specific" decoders in most conditions, I am not convinced that this is because they are forced to learn more modality-agnostic features.

      Second, the authors claim that "modality-specific decoders can be applied only in the modality that they were trained on" while "modality-agnostic decoders can be applied to decode stimuli from multiple modalities, even without knowing a priori the modality the stimulus was presented in" (line 47). While "modality-agnostic" decoders do outperform "modality-specific" decoders in the cross-modality conditions, it is important to note that "modality-specific" decoders still perform better than expected by chance (figure 5). It is also important to note that knowing about the input modality still improves decoding performance even for "modality-agnostic" decoders, since it determines the optimal feature space-it is better to decode brain responses to images using decoders trained on image features, and it is better to decode brain responses to captions using decoders trained on caption features.

      Comments on revised version:

      The revised version benefits from clearer claims and more precise terminology (i.e. classifying the decoders as "modality-agnostic" or "modality-specific" while classifying the representations as "modality-invariant" or "modality-dependent").

      While the modality-agnostic decoders outperform the modality-specific decoders, I am still not convinced that this is because they are "explicitly trained to leverage the shared information in modality-invariant patterns of the brain activity". On one hand, the high-level feature spaces may each contain some amount of modality-invariant information, so even modality-specific decoders can capture some modality-invariant information. On the other hand, I do not see how training the modality-agnostic decoders on responses to both modalities necessitates that they learn modality-invariant representations beyond those that are learned by the modality-specific decoders.

    1. Reviewer #1 (Public review):

      This work compiles a comprehensive atlas of ncORFs across mammalian tissues and cell types, derived from reanalysis of ~400 public ribosome profiling datasets. The authors then evaluate cross-species conservation and functional signatures, proposing that evolutionarily ancient ncORFs tend to have higher translation potential, stronger expression, and closer relationships with canonical coding sequences.

      Strengths:

      In general, the study provides a large-scale and timely resource of annotated ncORFs, which could be broadly useful for the community. The authors collected ~400 public ribosome profiling datasets for annotations of ncORFs, which, to my best knowledge, is the largest collection of data for such a purpose. The catalog could facilitate future investigations into ncORF biology and broaden understanding of the coding potential of the "non-coding" genome.

      Weaknesses:

      Based on the ncORF catalog, some of the analyses were not properly done. Some of the results are descriptive.

      (1) Bias and representations of the data source. Public ribo-seq datasets are unevenly distributed across tissues and cell lines, raising concerns about heterogeneity and underrepresentation of certain contexts. This may limit the generalizability of the catalog.

      (2) The discussion on modular domains of ncORFs is unclear, and the claim that they may originate via TE-related mechanisms is not well supported. Stronger evidence or clearer reasoning is needed.

      (3) The conservation comparisons are not fully convincing. Figure S7 shows only mild differences between ncORFs and CDS, and statistical significance is not clearly demonstrated.<br /> Comparisons with other non-coding RNAs should be added, and overlapping sequences between ncORFs and CDS should be excluded to avoid bias.

      (4) Figure 3 indicates that some ncORFs are subject to evolutionary constraints. This is not surprising. The authors should provide further analyses on more detailed features of these "conserved" ncORFs vs. the "non-conserved" ones. Some pretty informative works have been done in Drosophila, worms, mice, and humans. Figure 3 suggests some ncORFs are under evolutionary constraint, but this is not unexpected. More granular analyses contrasting "conserved" versus "non-conserved" ncORFs would be informative. In fact, small ORFs, especially uORFs, have been extensively studied for their functions and cross-species conservation. The authors should explicitly show what is new here in their analyses.

      (5) Translation levels are reported using RPF counts. However, translation efficiency (normalized by RNA expression) is a more appropriate measure to account for expression heterogeneity.

      (6) The correlation analyses between ncORF translation levels and PhyloCSF are confusing and largely descriptive. These sections need sharper framing and clearer conclusions.

      (7) Public ribo-seq datasets, generated by different research labs, are known for their strong batch effects. Representations of tissues and cells are also very unbalanced. Therefore, the co-translation analysis between ncORFs and canonical CDS is not well controlled. This should be done by referring to a recent large-scale ribo-seq meta-analysis (Nat Biotechnol. 2025. doi: 10.1038/s41587-025-02718-5).

    2. Reviewer #2 (Public review):

      Summary:

      Chang et al. attempted to analyze a large number of ribo-seq datasets through a standardized pipeline, identifying novel non-canonical ORFs and elucidating their evolutionary and expression characteristics.

      Strengths:

      (1) The datasets analyzed by the authors are sufficiently comprehensive, and the use of standardized pipelines ensures excellent analytical consistency.

      (2) Their analyses of ORF evolution and co-expression further deepen our understanding of these ORFs.

      Weaknesses:

      (1) The authors primarily conducted analyses through bioinformatics, lacking sufficient wet-lab experimental evidence.

      (2) Regarding the evolution of non-canonical ORFs, a considerable amount of prior work already exists. The authors need to further clarify what new insights and discoveries they have made based on the analysis of such a large dataset.

    1. Reviewer #1 (Public review):

      Summary:

      RNA modification has emerged as an important modulator of protein synthesis. Recent studies found that mRNA can be acetylated (ac4c), which can alter mRNA stability and translation efficiency. The role of ac4c mRNA in the brain has not been studied. In this paper, the authors convincingly show that ac4c occurs selectively on mRNAs localized at synapses, but not cell-wide. The ac4c "writer" NAT10 is highly expressed in hippocampal excitatory neurons. Using NAT10 conditional KO mice, decreasing levels of NAT10 resulted in decreases in ac4c of mRNAs and also showed deficits in LTP and spatial memory. These results reveal a potential role for ac4c mRNA in memory consolidation.

      This is a new type of mRNA regulation that seems to act specifically at synapses, which may help elucidate the mechanisms of local protein synthesis in memory consolidation. Overall, the studies are well carried out and presented. There is some confusion over training/learning vs memory, and the precise mRNAs that require ac4c to carry out memory consolidation are not clear. The specificity of changes occurring only at the end of training, rather than after each day of training, is interesting and warrants some investigation. This timeframe is puzzling because the authors show that ac4c can dynamically increase within 1 hour after cLTP.

      Strengths:

      (1) The studies show that mRNA acetylation (ac4c) occurs selectively at mRNAs localized to synaptic compartments (using synaptoneurosome preps).

      (2) The authors identify a few key mRNAs acetylated and involved in plasticity and memory - e.g., Arc.

      (3) The authors show that Ac4c is induced by learning and neuronal activity (cLTP).

      (4) The studies show that the ac4c "writer" NAT10 is expressed in hippocampal excitatory neurons and may be relocated to synapses after cLTP/learning induction.

      (5) The authors used floxed NAT10 mice injected with AAV-Cre in the hippocampus (NAT10 cKO) to show that NAT10 may play a role in LTP maintenance and memory consolidation (using the Morris Water Maze).

      Weaknesses:

      (1) The authors use a confusing timeline for their behavioral experiments, i.e, day 1 is the first day of training in the MWM, and day 6 is the probe trial, but in reality, day 6 is the first day after the last training day. So this is really day 1 post-training, and day 20 is 14 days post-training.

      (2) The authors inaccurately use memory as a term. During the training period in the MWM, the animals are learning, while memory is only probed on day 6 (after learning). Thus, day 6 reflects memory consolidation processes after learning has taken place.

      (3) The NAT10 cKO mice are useful to test the causal role of NAT10 in ac4a and plasticity/memory, but all the experiments used AAV-CRE injections in the dorsal hippocampus that showed somewhat modest decreases in total NAT10 protein levels. For these experiments, it would be better to cross the NAT10 floxed animals to CRE lines where a better knockdown of NAT10 can be achieved, with less variability.

      (4) Because knockdown is only modest (~50%), it is not clear if the remaining ac4c on mRNAs is due to remaining NAT10 protein or due to an alternative writer (as the authors pose).

    2. Reviewer #2 (Public review):

      This is an interesting study that shows that mRNA acetylation at synapses is dynamically regulated at synapses by spatial memory in the mouse hippocampus. The dynamic changes of ac4C-mRNAs regulated by memory were validated by methods including ac4C dot-blot and liquid 13 chromatography-tandem mass spectrometry (LC-MS/MS).

      Here are some comments for consideration by readers and authors:

      (1) It is known that synaptosomes are contaminated with glial tissue. In the study, the authors also show that NAT0 is expressed in glia. So the candidate mRNAs identified by acRIP-seq might also be mixed with glial mRNAs. Are the GO BP terms shown in Figure 3A specifically chosen, or unbiasedly listed for all top ones?

      (2) Where does NAT10-mediated mRNA acetylation take place within cells generally? Is there evidence that NAT10 can catalyze mRNA acetylation in the cytoplasm?

      (3) "The NAT10 proteins were significantly reduced in the cytoplasm (S2 fraction) but increased in the PSD fraction at day 6 after memory (Figures 5J and 5K)." The authors argue that the translocation of NAT10 from soma to synapses accounts for these changes. The increase of NAT10 protein in the PSD fraction can be understood. However, it is quite surprising that the NAT10 proteins were significantly reduced in the cytoplasm (S2 fraction), considering the amount of NAT10 in soma is much more abundant in synapses. The small increase in synaptic NAT10 might not be enough to cause a decrease in soma NAT10 protein level.

      (4) It is difficult to separate the effect on mRNA acetylation and protein mRNA acetylation when doing the loss of function of NAT10.

    1. Reviewer #1 (Public review):

      Summary:

      In their manuscript, Richter and colleagues comprehensively investigate the cell wall recycling pathway in the model alphaproteobacterium Caulobacter crescentus using biochemical, imaging, and genetic approaches. They clearly demonstrate that this organism encodes a functional peptidoglycan recycling pathway and demonstrate the activities of many enzymes and transporters within this pathway. They leverage imaging and growth assays to demonstrate that mutants in peptidoglycan recycling have varying degrees of beta-lactam sensitivity as well as morphological and cell division defects. They propose that, rather than impacting the levels or activity of the major beta-lactamase, BlaA, defects in PG recycling lead to beta-lactam sensitivity by limiting the availability of new cell wall precursors. The findings will be of interest to those in the field of bacterial cell wall biochemistry, antibiotics and antibiotic resistance, and bacterial morphogenesis.

      Strengths:

      Overall, the manuscript is laid out logically, and the data are comprehensive, quantitative, and rigorous. The mutants and their phenotypes will be a valuable resource for Caulobacter researchers.

      Weaknesses:

      The only major missing piece is the complementation of mutants to demonstrate that loss of the targeted gene is responsible for the observed phenotypes.

    2. Reviewer #2 (Public review):

      Summary:

      Pia Richter et al. investigated the peptidoglycan (PG) recycling metabolism in the alpha-proteobacterium Caulobacter crescentus. The authors first identified a functional recycling pathway in this organism, which is similar to the Pseudomonas route, and they characterized two key enzymes (NagZ, AmiR) of this pathway, showing that AmiR differs in specificity from the AmpD counterpart of E. coli. Further, they studied the effects of deletions within the PG recycling pathway (ampG, amiR, nagZ, sdpA, blaA, nagA1, nagA2, amgK, nagK mutants), showing filamentation and cell widening, thereby revealing a link between PG recycling and cell division. Finally, they provide a link between PG recycling and beta-lactam sensitivity in C. crescents that is not caused by activation of a beta-lactamase, but rather is a result of reduced supply of PG building blocks increasing the sensitivity of penicillin-binding proteins.

      Strengths:

      This work adds to the understanding of the role of PG recycling in alpha-proteobacteria, which significantly differ in their mode of cell wall growth from the better studied gamma-proteobacteria.

      Weaknesses:

      The findings are not entirely novel as recent studies by Modi et al. 2025 mBio (studying C. crescentus) and Gilmore & Cava 2022 Nat. Commun. (studying Agrobacterium tumefaciens) came to similar conclusions.

    1. Reviewer #1 (Public review):

      The paper by Chen et al describes the role of neuronal themo-TRPV3 channels in the firing of cortical neurons at fever temperature range. The authors began by demonstrating that exposure to infrared light increasing ambient temperature causes body temperature rise to fever level above 38{degree sign}C. Subsequently, they showed that at the fever temperature of 39{degree sign}C, the increased spike threshold (ST) increased in both populations (P12-14 and P7-8) of cortical excitatory pyramidal neurons (PNs). However, the spike number only decreased in P7-8 PNs, while it remained stable in P12-14 PNs at 39{degree sign}C. In addition, the fever temperature also reduced the late peak postsynaptic potential (PSP) in P12-14 PNs. The authors further characterized the firing properties of cortical P12-14 PNs, identifying two types: STAY PNs that retained spiking at 30{degree sign}C, 36{degree sign}C and 39{degree sign}C, and STOP PNs that stopped spiking upon temperature change. They further extended their and analysis and characterization to striatal medium spiny neurons (MSNs) and found that STAY MSNs and PNs shared same ST temperature sensitivity. Using small molecule tools, they further identified that themo-TRPV3 currents in cortical PNs increased in response to temperature elevation, but not TRPV4 currents. The authors concluded that during fever, neuronal firing stability is largely maintained by sensory STAY PNs and MSNs that express functional TRPV3 channels. Overall, this study is well designed and executed with substantial controls, some interesting findings and quality of data.

      Comments on revisions:

      My previous concerns have been addressed in this revised manuscript.

    2. Reviewer #2 (Public review):

      Summary:

      The authors studied the excitability of layer 2/3 pyramidal neurons in response to layer four stimulation at temperatures ranging from 30 to 39{degree sign}C in P7-8, P12-P14, and P22-P24 animals. They also measure brain temperature and spiking in vivo in response to externally applied heat. Some pyramidal neurons continue to fire action potentials in response to stimulation at 39{degree sign}C and are referred to as "stay neurons." Stay neurons have unique properties, aided by the expression of the TRPV3 channel.

      Strengths:

      The authors focused on layer 2/3 neuronal excitability at three developmental stages: during the window of susceptibility to febrile seizures, before the window opens, and after it closes.

      Electrophysiological experiments are rigorously performed and carefully interpreted.

      The cellular electrophysiology is further confirmed. The authors compared the seizure susceptibility of TRPV3 knockout, heterozygous, and wild-type mice. EEG recording would have strengthened the study, but they are challenging in this age group.

      Finally, the authors studied TRPV3 expression with immunohistochemistry.

    3. Reviewer #3 (Public review):

      Summary:

      This important study combines in vitro and in vivo recording to determine how the firing of cortical and striatal neurons changes during a fever range temperature rise (37-40 oC). The authors found that certain neurons will start, stop, or maintain firing during these body temperature changes. The authors further suggested that the TRPV3 channel plays a role in maintaining cortical activity during fever.

      Strengths:

      The topic of how the firing pattern of neurons changes during fever is unique and interesting. The authors carefully used in vitro electrophysiology assays to study this interesting topic.

      Weaknesses:

      (1) In vivo recording is a strength of this study. However, data from in vivo recording is only shown in Fig 5A,B. This reviewer suggests the authors further expand on the analysis of the in vivo Neuropixels recording. For example, to show single spike waveforms and raster plots to provide more information on the recording. The authors can also separate the recording based on brain regions (cortex vs striatum) using the depth of the probe as a landmark to study the specific firing of cortical neurons and striatal neurons. It is also possible to use published parameters to separate the recording based on spike waveform to identify regular principal neurons vs fast-spiking interneurons. Since the authors studied E/I balance in brain slices, it would be very interesting to see whether the "E/I balance" based on the firing of excitatory neurons vs fast-spiking interneurons might be changed or not in the in vivo condition.

      (2) The author should propose a potential mechanism for how TRPV3 helps to maintain cortical activity during fever. Would calcium influx-mediated change of membrane potential be the possible reason? Making a summary figure to put all the findings into perspective and propose a possible mechanism would also be appreciated.

      (3) The author studied P7-8, P12-14, and P20-26 mice. How do these ages correspond to the human ages? it would be nice to provide a comparison to help the reader understand the context better.

      Comments on revisions:

      In this revised version, the authors nicely addressed my critiques. I have no more comments to make.

    1. Reviewer #1 (Public review):

      Summary:

      In the study by Roeder and colleagues, the authors aim to identify the psychophysiological markers of trust during the evaluation of matching or mismatching AI decision-making. Specifically, they aim to characterize through brain activity how the decision made by an AI can be monitored throughout time in a two-step decision-making task. The objective of this study is to unfold, through continuous brain activity recording, the general information processing sequence while interacting with an artificial agent, and how internal as well as external information interact and modify this processing. Additionally, the authors provide a subset of factors affecting this information processing for both decisions.

      Strengths:

      The study addresses a wide and important topic of the value attributed to AI decisions and their impact on our own confidence in decision-making. It especially questions some of the factors modulating the dynamical adaptation of trust in AI decisions. Factors such as perceived reliability, type of image, mismatch, or participants' bias toward one response or the other are very relevant to the question in human-AI interactions.

      Interestingly, the authors also question the processing of more ambiguous stimuli, with no real ground truth. This gets closer to everyday life situations where people have to make decisions in uncertain environments. Having a better understanding of how those decisions are made is very relevant in many domains.

      Also, the method for processing behavioral and especially EEG data is overall very robust and is what is currently recommended for statistical analyses for group studies. Additionally, authors provide complete figures with all robustness evaluation information. The results and statistics are very detailed. This promotes confidence, but also replicability of results.

      An additional interesting method aspect is that it is addressing a large window of analysis and the interaction between three timeframes (evidence accumulation pre-decision, decision-making, post-AI decision processing) within the same trials. This type of analysis is quite innovative in the sense that it is not yet a standard in complex experimental designs. It moves forward from classical short-time windows and baseline ERP analysis.

      Weaknesses:

      This manuscript raises several conceptual and theoretical considerations that are not necessarily answered by the methods (especially the task) used. Even though the authors propose to assess trust dynamics and violations in cooperative human-AI teaming decision-making, I don't believe their task resolves such a question. Indeed, there is no direct link between the human decision and the AI decision. They do not cooperate per se, and the AI decision doesn't seem, from what I understood to have an impact on the participants' decision making. The authors make several assumptions regarding trust, feedback, response expectation, and "classification" (i.e., match vs. mismatch) which seem far stretched when considering the scientific literature on these topics.

      Unlike what is done for the data processing, the authors have not managed to take the big picture of the theoretical implications of their results. A big part of this study's interpretation aims to have their results fit into the theoretical box of the neural markers of performance monitoring.

      Overall, the analysis method was very robust and well-managed, but the experimental task they have set up does not allow to support their claim. Here, they seem to be assessing the impact of a mismatch between two independent decisions.

      Nevertheless, this type of work is very important to various communities. First, it addresses topical concerns associated with the introduction of AI in our daily life and decisions, but it also addresses methodological difficulties that the EEG community has been having to move slowly away from the static event-based short-timeframe analyses onto a more dynamic evaluation of the unfolding of cognitive processes and their interactions. The topic of trust toward AI in cooperative decision making has also been raised by many communities, and understanding the dynamics of trust, as well as the factors modulating it, is of concern to many high-risk environments, or even everyday life contexts. Policy makers are especially interested in this kind of research output.

    2. Reviewer #2 (Public review):

      Summary:

      The authors investigated how "AI-agent" feedback is perceived in an ambiguous classification task, and categorised the neural responses to this. They asked participants to classify real or fake faces, and presented an AI-agent's feedback afterwards, where the AI-feedback disagreed with the participants' response on a random 25% of trials (called mismatches). Pre-response ERP was sensitive to participants' classification as real or fake, while ERPs after the AI-feedback were sensitive to AI-mismatches, with stronger N2 and P3a&b components. There was an interaction of these effects, with mismatches after a "Fake" response affecting the N2 and those after "Real" responses affecting P3a&b. The ERPs were also sensitive to the participants' response biases, and their subjective ratings of the AI agent's reliability.

      Strengths:

      The researchers address an interesting question, and extend the AI-feedback paradigm to ambiguous tasks without veridical feedback, which is closer to many real-world tasks. The in-depth analysis of ERPs provides a detailed categorisation of several ERPs, as well as whole-brain responses, to AI-feedback, and how this interacts with internal beliefs, response biases, and trust in the AI-agent.

      Weaknesses:

      There is little discussion of how the poor performance (close to 50% chance) may have affected performance on the task, such as by leading to entirely random guessing or overreliance on response biases. This can change how error-monitoring signals presented, as they are affected by participants' accuracy, as well as affecting how the AI feedback is perceived.

      The task design and performance make it hard to assess how much it was truly measuring "trust" in an AI agent's feedback. The AI-feedback is yoked to the participants' performance, agreeing on 75% of trials and disagreeing on 25% (randomly), which is an important difference from the framing provided of human-AI partnerships, where AI-agents usually act independently from the humans and thus disagreements offer information about the human's own performance. In this task, disagreements are uninformative, and coupled with the at-chance performance on an ambiguous task, it is not clear how participants should be interpreting disagreements, and whether they treat it like receiving feedback about the accuracy of their choices, or whether they realise it is uninformative. Much greater discussion and justification are needed about the behaviour in the task, how participants did/should treat the feedback, and how these affect the trust/reliability ratings, as these are all central to the claims of the paper.

      There are a lot of EEG results presented here, including whole-brain and window-free analyses, so greater clarity on which results were a priori hypothesised should be given, along with details on how electrodes were selected for ERPs and follow-up tests.

    3. Reviewer #3 (Public review):

      The current paper investigates neural correlates of trust development in human-AI interaction, looking at EEG signatures locked to the moment that AI advice is presented. The key finding is that both human-response-locked EEG signatures (the CPP) and post-AI-advice signatures (N2, P3) are modulated by trust ratings. The study is interesting, however, it does have some clear and sometimes problematic weaknesses:

      (1) The authors did not include "AI-advice". Instead, a manikin turned green or blue, which was framed as AI advice. It is unclear whether participants viewed this as actual AI advice.

      (2) The authors did not include a "non-AI" control condition in their experiment, such that we cannot know how specific all of these effects are to AI, or just generic uncertain feedback processing.

      (3) Participants perform the task at chance level. This makes it unclear to what extent they even tried to perform the task or just randomly pressed buttons. These situations likely differ substantially from a real-life scenario where humans perform an actual task (which is not impossible) and receive actual AI advice.

      (4) Many of the conclusions in the paper are overstated or very generic.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript provides a comprehensive systematic analysis of envelope-containing Ty3/gypsy retrotransposons (errantiviruses) across metazoan genomes, including both invertebrates and ancient animal lineages. Using iterative tBLASTn mining of over 1,900 genomes, the authors catalog 1,512 intact retrotransposons with uninterrupted gag, pol, and env open reading frames. They show that these elements are widespread-present in most metazoan phyla, including cnidarians, ctenophores, and tunicates-with active proliferation indicated by their multicopy status. Phylogenetic analyses distinguish "ancient" and "insect" errantivirus clades, while structural characterization (including AlphaFold2 modeling) reveals two major env types: paramyxovirus F-like and herpesvirus gB-like proteins. Although bot envelope types were identified in previous analyses two decades ago, the evolutionary provenance of these envelope genes was almost rudimentary and anecdotal (I can say this because I authored one of these studies). The results in the present study support an ancient origin for env acquisition in metazoan Ty3/gypsy elements, with subsequent vertical inheritance and limited recombination between env and pol domains. The paper also proposes an expanded definition of 'errantivirus' for env-carrying Ty3/gypsy elements outside Drosophila.

      Strengths:

      (1) Comprehensive Genomic Survey:<br /> The breadth of the genome search across non-model metazoan phyla yields an impressive dataset covering evolutionary breadth, with clear documentation of search iterations and validation criteria for intact elements.

      (2) Robust Phylogenetic Inference:<br /> The use of maximum likelihood trees on both pol and env domains, with thorough congruence analysis, convincingly separates ancient from lineage-specific elements and demonstrates co-evolution of env and pol within clades.

      (3) Structural Insights:<br /> AlphaFold2-based predictions provide high-confidence structural evidence that both env types have retained fusion-competent architectures, supporting the hypothesis of preserved functional potential.

      (4) Novelty and Scope:<br /> The study challenges previous assumptions of insect-centric or recent env acquisition and makes a compelling case for a Pre-Cambrian origin, significantly advancing our understanding of animal retroelement diversity and evolution. THIS IS A MAJOR ADVANCE.

      (5) Data Transparency:<br /> I appreciate that all data, code, and predicted structures are made openly available, facilitating reproducibility and future comparative analyses.

      Major Weaknesses

      (1) Functional Evidence Gaps:<br /> The work rests largely on sequence and structure prediction. No direct expression or experimental validation of envelope gene function or infectivity outside Drosophila is attempted, which would be valuable to corroborate the inferred roles of these glycoproteins in non-insect lineages. At least for some of these species, there are RNA-seq datasets that could be leveraged.

      (2) Horizontal Transfer vs. Loss Hypotheses:<br /> The discussion argues primarily for vertical inheritance, but the somewhat sporadic phylogenetic distributions and long-branch effects suggest that loss and possibly rare horizontal events may contribute more than acknowledged. Explicit quantitative tests for horizontal transfer, or reconciliation analyses, would strengthen this conclusion. It's also worth pointing out that, unlike retrotransposons that can be found in genomes, any potential related viral envelopes must, by definition, have a spottier distribution due to sampling. I don't think this challenges any of the conclusions, but it must be acknowledged as something that could affect the strength of this conclusion

      (3) Limited Taxon Sampling for Certain Phyla:<br /> Despite the impressive breadth, some ancient lineages (e.g., Porifera, Echinodermata) are negative, but the manuscript does not fully explore whether this reflects real biological absence, assembly quality, or insufficient sampling. A more systematic treatment of negative findings would clarify claims of ubiquity. However, I also believe this falls beyond the scope of this study.

      (4) Mechanistic Ambiguity:<br /> The proposed model that env-containing elements exploit ovarian somatic niches is plausible but extrapolated from Drosophila data; for most taxa, actual tissue specificity, lifecycle, or host interaction mechanisms remain speculative and, to me, a bit unreasonable.

      Minor Weaknesses:

      (1) Terminology and Nomenclature:<br /> The paper introduces and then generalizes the term "errantivirus" to non-insect elements. While this is logical, it may confuse readers familiar with the established, Drosophila-centric definition if not more explicitly clarified throughout. I also worry about changes being made without any input from the ICTV nomenclature committee, which just went through a thorough reclassification. Nevertheless, change is expected, and calling them all errantiviruses is entirely reasonable.

      (2) Figures and Supplementary Data Navigation:<br /> Some key phylogenies and domain alignments are found only in supplementary figures, occasionally hindering readability for non-expert audiences. Selected main-text inclusion of representative trees would benefit accessibility.

      (3) ORF Integrity Thresholds:<br /> The cutoff choices for defining "intact" elements (e.g., numbers/placement of stop codons, length ranges) are reasonable but only lightly justified. More rationale or sensitivity analysis would improve confidence in the inclusion criteria. For example, how did changing these criteria change the number of intact elements?

      (4) Minor Typos/Formatting:<br /> The paper contains sporadic typographical errors and formatting glitches (e.g., misaligned figure labels, unrendered symbols) that should be addressed.

    2. Reviewer #2 (Public review):

      Summary:

      The authors first surveyed metazoan genomes to identify homologs of Drosophila errantiviruses and classified them into two groups, "insect" and "ancient" elements, supporting the hypothesis of an early evolutionary origin for these retrotransposons. They subsequently identified two distinct types of envelope proteins, one resembling the glycoprotein F of paramyxoviruses and the other akin to the glycoprotein B of herpesviruses. Despite differences in their primary amino acid sequences, these proteins display notable structural similarity in their predicted domain architectures. The congruence between the phylogenies of the envelope and pol genes further supports the ancient origin of the envelope genes, challenging earlier hypotheses that proposed recent recombination events with baculoviruses. Additional analysis of the Pol "bridge region" corroborated the divergence among these elements, consistent with a pattern of limited cross-species recombination. Finally, by comparing these elements with non-envelope-containing Gypsy retrotransposons, the authors concluded that errantiviruses originated from multiple elements independently.

      Strengths:

      The conclusions of this study are based on a comprehensive collection of errantiviruses identified across a wide range of metazoan genomes. These findings are further supported by multiple lines of evidence, including phylogenetic congruence and the diverse evolutionary origins of envelope genes. AlphaFold2-assisted protein domain structure analyses also provided key insights into the characterization of these elements. Together, these results present a compelling case that errantiviruses arose independently through multiple evolutionary events, extending well beyond previous hypotheses.

      Weaknesses:

      It would be beneficial to emphasize in the Abstract the potential impact of this work by more clearly articulating the current knowledge gap in the field. While the second paragraph of the Introduction briefly touches on this point, highlighting the broader significance in the Abstract would better capture readers' interest. Additionally, some methodological choices would benefit from clearer justification and explanation. For instance, in Figure 6, the selection of the bridge region/RNase H domain is not explicitly explained, leaving the rationale for its choice unclear. As a minor point, some figure labels and texts are too small and difficult to read, and improving their legibility would enhance overall clarity.

    3. Reviewer #3 (Public review):

      Summary and Significance:

      In this work, Cary and Hayashi address the important question of when, in evolution, certain mobile genetic elements (Ty3/gypsy-like non-LTR retrotransposons) associated with certain membrane fusion proteins (viral glycoprotein F or B-like proteins), which could allow these mobile genetic elements to be transferred between individual cells of a given host. It is debated in the literature whether the acquisition of membrane fusion proteins by non-LTR retrotransposons is a rather recent phenomenon that separately occurred in the ancestors of certain host species or whether the association with membrane fusion proteins is a much more ancient one, pre-dating the Cambrian explosion. Obviously, this question also touches upon the origin of the retroviruses, which can spread between individuals of a given host but seem restricted to vertebrates. Based on convincing data, Cary and Hayashi argue that an ancient association of non-LTR retrotransposons with membrane fusion proteins is most probable.

      Strengths:

      The authors take the smart approach to systematically retrieve apparently complete, intact, and recently functional Ty3/gypsy-like non-LTR retrotransposons that, next to their characteristic gag and pol genes, additionally carry sequences that are homologous to viral glycoprotein F (env-F) or viral glycoprotein B (env-B). They then construct and compare phylogenetic trees of the host species and individual encoded proteins and protein domains, where 3D-structure calculations and other features explain and corroborate the clustering within the phylogenetic trees. Congruence of phylogenetic trees and correlation of structural features is then taken as evidence for an infrequent recombination and a long-term co-evolution of the reverse transcriptase (encoded by the pol gene) and its respective putative membrane fusion gene (encoded by env-F or env-B). Importantly, the env-F and env-B containing retrotransposons do not form a monophyletic group among the Ty3/gypsy-like non-LTR retrotransposons, but are scattered throughout, supporting the idea of an originally ancient association followed by a random loss of env-F/env-B in individual branches of the tree (and rather rare re-associations via more recent recombinations).

      Overall, this is valuable, stimulating, and important work of general and fundamental interest, but still also somewhat incompletely explored, imprecisely explained, and insufficiently put into context for a more general audience.

      Weaknesses:

      Some points that might be considered and clarified:

      (1) Imprecise explanations, terms, and definitions:

      It might help to add a 'definitions box' or similar to precisely explain how the authors decided to use certain terms in this manuscript, and then use these terms consistently and with precision.

      a) In particular, these are terms such as 'vertebrate retrovirus' vs 'retrovirus' vs 'endogenized retrovirus' vs 'endogenous retrovirus' vs 'non-LTR retrotransposon' and 'Ty3/gypsi-like retrotransposon' vs 'Ty3/gypsy retrotransposon' vs 'errantivirus'.

      b) The comment also applies to the term 'env' used for both 'env-F' and 'env-B', where often it remains unclear which of the two protein types the authors refer to. This is confusing, particularly in the methods, where the search for the respective homologs is described.

      c) Other examples are the use of the entire pol gene vs. pol-RT for the definition of the Ty3/gypsy clade and for the generation of phylogenetic trees (Methods and Figure S1), and the names for various portions of pol that appear without prior definition or explanation (e.g., 'pro' in Figure 1A, 'bridge' in Figure S1C, 'the chromodomain' in the text and Figure 7).

      d) It is unclear from the main text which portions of pol were chosen to define pol-RT and why. The methods name the 'palm-and-fingers', 'thumb', and 'connections' domains to define RT. In the main text, the 'connection' domain is called 'tether' and is instead defined as part of the 'bridge' region following RT, which is not part of RT.

      (2) Insufficient broader context:

      a) The introduction does not state what defines Ty3/gypsy non-LTR retrotransposons as compared to their closest relatives (Ty1/copia retrotransposons, BEL/pao retrotransposons, vertebrate retroviruses). This makes it difficult to judge the significance and generality of the findings.

      b) The various known compositions of Ty3/gypsi-like retrotransposons are not mentioned and explained in the introduction (open reading frames, (poly-)proteins and protein domains, and their variable arrangement, enzymatic activities, and putative functions), and the distribution of Ty3/gypsi-like retrotransposons among eukaryotes remains unclear. The introduction does not mention that Ty3/gypsi-like retrotransposons apparently are absent from vertebrates, and Figure 7 is not very clear about whether or not it includes sequences from plants ('Chromoviridae').

      c) The known association of Ty3/gypsi-like retrotransposons from different metazoan phyla with putative membrane fusion proteins (env-like) genes is mentioned in the introduction, but literature information, whether such associations also occur in the context of other retrotransposons (e.g., Ty1/ copia or BEL/pao), is not provided. The abstract is somewhat misleading in this respect. Finally, the different known types of env-like genes are not mentioned and explained as part of the introduction ('env-f', 'env-B', 'retroviral env', others?)

      d) Some key references and reviews might be added:

      - Pelisson, A. et al. (1994) https://www.embopress.org/doi/abs/10.1002/j.1460-2075.1994.tb06760.x<br /> (next to Song et al. (1994), for the identification of env in Ty3/gypsy)

      - Boeke, J.D. et al. (1999)<br /> In Virus Taxonomy: ICTV VIIth report. (ed. F.A. Murphy),. Springer-Verlag, New York.<br /> (cited by Malik et al. (2000) - for the definition and first use of the term 'errantivirus')

      - Eickbush, T.H. and Jamburuthugoda, V.K. (2008) https://doi.org/10.1016/j.virusres.2007.12.010<br /> (on the classification of retrotransposons and their env-like genes)

      - Hayward, A. (2017) https://doi.org/10.1016/j.coviro.2017.06.006<br /> (on scenarios of env acquisition)

      (3) Incomplete analysis:

      a) Mobile genetic elements are sometimes difficult to assemble correctly from short-read sequencing data. Did the authors confirm some of their newly identified elements by e.g., PCR analysis or re-identification in long-read sequencing data?

      b) The authors mention somewhat on the side that there are Ty3/gypsy elements with a different arrangement (gag-env-pol instead of gag-pol-env). Why was this important feature apparently not used and correlated in the analysis? How does it map on the RT phylogenetic tree? Which type of env is found with either arrangement? Is there evidence for a loss of env also in the case of gag-env-pol elements?

      c) Sankey plots are insufficiently explained. How would inconsistencies between trees (recombinations) show up here? Why is there no Sankey plot for the analysis of env-B in Figure 5?

      d) Why are there no trees generated for env-F and env-B like proteins, including closely related homologous sequences that do NOT come from Ty3/gypsy retrotransposons (e.g., from the eukaryotic hosts, from other types of retrotransposons (Ty1/copia or BEL/pao), from viruses such as Herpesvirus and Baculovirus)? It would be informative whether the sequences from Ty3/gypsy cluster together in this case.

      e) Did the authors identify any other env-like ORFs (apart from env-F and env-B) among Ty3/gypsy retrotransposons? Did they identify other, non-env-like ORFs that might help in the analysis? It is not quite clear from the methods if the searches for env-F and env-B - containing Ty3/gypsy elements were done separately and consecutively or somehow combined (the authors generally use 'env', and it is not clear which type of protein this refers to).

      f) Why was the gag protein apparently not used to support the analysis? Are there different, unrelated types of gag among non-LTR retrotransposons? Does gag follow or break the pattern of co-evolution between RT and env-F/env-B?

      g) Data availability. The link given in the paper does not seem to work (https://github.com/RippeiHayashi/errantiviruses_2025/tree/main). It would be useful for the community to have the sequences of the newly identified Ty3/gypsy retrotransposons listed readily available (not just genome coordinates as in table S1), together with the respective annotations of ORFs and features.

  2. Nov 2025
    1. Reviewer #1 (Public review):

      Summary:

      From a forward genetic mosaic mutant screen using EMS, the authors identify mutations in glucosylceramide synthase (GlcT), a rate-limiting enzyme for glycosphingolipid (GSL) production, that result in ee tumors. Multiple genetic experiments strongly support the model that the mutant phenotype caused by GlcT loss is due to by failure of conversion of ceramide into glucosylceramide. Further genetic evidence suggests that Notch signaling is comprised in the ISC lineage and may affect endocytosis of Delta. Loss of GlcT does not affect wing development or oogenesis, suggesting tissue-specific roles for GlcT. Finally, an increase in goblet cells in UGCG knockout mice, not previously reported, suggests a conserved role for GlcT in Notch signaling in intestinal cell lineage specification.

      Strengths:

      Overall, this is a well-written paper with multiple well-designed and executed genetic experiments that support a role for GlcT in Notch signaling in the fly and mammalian intestine. The authors have addressed my concerns from the prior review.

    2. Reviewer #2 (Public review):

      Summary:

      This study genetically identifies two key enzymes involved in the biosynthesis of glycosphingolipids, GlcT and Egh, act as tumor suppressors in the adult fly gut. Detailed genetic analysis indicates that a deficiency in Mactosyl-ceramide (Mac-Cer) is causing tumor formation. Analysis of a Notch transcriptional reporter further indicates that the lack of Mac-Ser is associated with reduced Notch activity in the gut, but not in other tissues.

      Addressing how a change in the lipid composition of the membranes might lead to defective Notch receptor activation, the authors studied the endocytic trafficking of Delta and claimed that internalized Delta appeared to accumulate faster into endosomes in the absence of Mac-Cer. Further analysis of Delta steady state accumulation in fixed samples suggested a delay in the endosomal trafficking of Delta from Rab5+ to Rab7+ endosomes, which was interpreted to suggest that the inefficient, or delayed, recycling of Delta might cause a loss in Notch receptor activation.

      Finally, the histological analysis of mouse guts following the conditional knock-out of the GlcT gene suggested that Mac-Cer might also be important for proper Notch signaling activity in that context.

      Strengths:

      The genetic analysis is of high quality. The finding that a Mac-Cer deficiency results in reduced Notch activity in the fly gut is important and fully convincing.

      The mouse data, although preliminary, raised the possibility that the role of this specific lipid may be conserved across species.

    1. Reviewer #1 (Public review):

      The study analyzes the gastric fluid DNA content identified as a potential biomarker for human gastric cancer. However, the study lacks overall logicality, and several key issues require improvement and clarification. In the opinion of this reviewer, some major revisions are needed:

      (1) This manuscript lacks a comparison of gastric cancer patients' stages with PN and N+PD patients, especially T0-T2 patients.

      (2) The comparison between gastric cancer stages seems only to reveal the difference between T3 patients and early-stage gastric cancer patients, which raises doubts about the authenticity of the previous differences between gastric cancer patients and normal patients, whether it is only due to the higher number of T3 patients.

      (3) The prognosis evaluation is too simplistic, only considering staging factors, without taking into account other factors such as tumor pathology and the time from onset to tumor detection.

      (4) The comparison between gfDNA and conventional pathological examination methods should be mentioned, reflecting advantages such as accuracy and patient comfort.

      (5) There are many questions in the figures and tables. Please match the Title, Figure legends, Footnote, Alphabetic order, etc.

      (6) The overall logicality of the manuscript is not rigorous enough, with few discussion factors, and cannot represent the conclusions drawn

    2. Reviewer #2 (Public review):

      Summary:

      The authors investigated whether the total DNA concentration in gastric fluid (gfDNA), collected via routine esophagogastroduodenoscopy (EGD), could serve as a diagnostic and prognostic biomarker for gastric cancer. In a large patient cohort (initial n=1,056; analyzed n=941), they found that gfDNA levels were significantly higher in gastric cancer patients compared to non-cancer, gastritis, and precancerous lesion groups. Unexpectedly, higher gfDNA concentrations were also significantly associated with better survival prognosis and positively correlated with immune cell infiltration. The authors proposed that gfDNA may reflect both tumor burden and immune activity, potentially serving as a cost-effective and convenient liquid biopsy tool to assist in gastric cancer diagnosis, staging, and follow-up.

      Strengths:

      This study is supported by a robust sample size (n=941) with clear patient classification, enabling reliable statistical analysis. It employs a simple, low-threshold method for measuring total gfDNA, making it suitable for large-scale clinical use. Clinical confounders, including age, sex, BMI, gastric fluid pH, and PPI use, were systematically controlled. The findings demonstrate both diagnostic and prognostic value of gfDNA, as its concentration can help distinguish gastric cancer patients and correlates with tumor progression and survival. Additionally, preliminary mechanistic data reveal a significant association between elevated gfDNA levels and increased immune cell infiltration in tumors (p=0.001).

      Weaknesses:

      The study has several notable weaknesses. The association between high gfDNA levels and better survival contradicts conventional expectations and raises concerns about the biological interpretation of the findings. The diagnostic performance of gfDNA alone was only moderate, and the study did not explore potential improvements through combination with established biomarkers. Methodological limitations include a lack of control for pre-analytical variables, the absence of longitudinal data, and imbalanced group sizes, which may affect the robustness and generalizability of the results. Additionally, key methodological details were insufficiently reported, and the ROC analysis lacked comprehensive performance metrics, limiting the study's clinical applicability.

    1. Reviewer #1 (Public review):

      Summary:

      Most studies in sensory neuroscience investigate how individual sensory stimuli are represented in the brain (e.g., the motion or color of a single object). This study starts tackling the more difficult question of how the brain represents multiple stimuli simultaneously and how these representations help to segregate objects from cluttered scenes with overlapping objects.

      Strengths:

      The authors first document the ability of humans to segregate two motion patterns based on differences in speed. Then they show that a monkey's performance is largely similar; thus establishing the monkey as a good model to study the underlying neural representations.

      Careful quantification of the neural responses in the middle temporal area during the simultaneous presentation of fast and slow speeds leads to the surprising finding that, at low average speeds, many neurons respond as if the slowest speed is not present, while they show averaged responses at high speeds. This unexpected complexity of the integration of multiple stimuli is key to the model developed in this paper.

      One experiment in which attention is drawn away from the receptive field supports the claim that this is not due to the involuntary capture of attention by fast speeds.

      A classifier using the neuronal response and trained to distinguish single speed from bi-speed stimuli shows a similar overall performance and dependence on the mean speed as the monkey. This supports the claim that these neurons may indeed underlie the animal's decision process.

      The authors expand the well-established divisive normalization model to capture the responses to bi-speed stimuli. The incremental modeling (eq 9 and 10) clarifies which aspects of the tuning curves are captured by the parameters.

    2. Reviewer #3 (Public review):

      Summary:

      This study concerns how macaque visual cortical area MT represents stimuli composed of more than one speed of motion.

      Strengths:

      The study is valuable because little is known about how the visual pathway segments and preserves information about multiple stimuli. The study presents compelling evidence that (on average) MT neurons shift from faster-speed-takes-all at low speeds to representing the average of the two speeds at higher speeds. An additional strength of the study is the inclusion of perceptual reports from both humans and one monkey participant performing a task in which they judged whether the stimuli involved one vs two different speeds. Ultimately, this study raises intriguing questions about how exactly the response patterns in visual cortical area MT might preserve information about each speed, since such information is potentially lost in an average response as described here.

      Reviewing Editor comment on revised version:

      The remaining concern was resolved.

    1. Reviewer #1 (Public review):

      The authors focus on the molecular mechanisms by which EMT cells confer resistance to cancer cells. The authors use a wide range of methods to reveal that overexpression of Snail in EMT cells induces cholesterol/sphingomyelin imbalance via transcriptional repression of biosynthetic enzymes involved in sphingomyelin synthesis. The study also revealed that ABCA1 is important for cholesterol efflux and thus for counterbalancing the excess of intracellular free cholesterol in these snail-EMT cells. Inhibition of ACAT, an enzyme catalyzing cholesterol esterification, also seems essential to inhibit the growth of snail-expressing cancer cells.

      Overall, the provided data are convincing and enhance our knowledge on cancer biology.

    2. Reviewer #2 (Public review):

      Summary:

      This revised study provides a clearer and more mechanistically grounded explanation of how lipid metabolic imbalance contributes to EMT-associated chemoresistance in renal cancer. In this study, the authors discovered that chemoresistance in RCC cell lines correlates with the expression levels of ABCA1 and the EMT-related transcription factor Snail. They demonstrate that Snail induces ABCA1 expression and chemoresistance, and that inhibition of ABCA1-associated pathways can counteract this resistance. The study also suggests that Snail disrupts the cholesterol-sphingomyelin balance by repressing enzymes involved in VLCFA-sphingomyelin synthesis, leading to excess free cholesterol and activation of the LXR-ABCA1 axis. Importantly, inhibiting cholesterol esterification, which renders free cholesterol inert, selectively suppresses growth of a xenograft model of Snail-positive kidney cancer. These findings provide potential lipid metabolism-targeting strategies for cancer therapy. The revised version includes additional quantitative analyses and new experiments addressing lipid balance and ABCA1 localization, further strengthening the overall mechanistic model.

      Strengths:

      This revised manuscript provides a more comprehensive and convincing mechanistic explanation for how Snail-driven EMT induces chemoresistance through altered lipid homeostasis. The study presents a novel concept in which the Chol/SM balance, rather than individual lipid levels, shapes therapeutic vulnerability. The potential for targeting cholesterol detoxification pathways in Snail-positive cancer cells remains a significant therapeutic implication. In the revised version, the authors provide additional quantitative analyses and complementary experiments - including ABCA1 localization, restoration of VLCFA-SM levels by supplementation with C22:0 ceramide, and membrane-order assays - which further strengthen the mechanistic interpretation and address key concerns raised in earlier reviews.

      Weaknesses:

      The revised version includes new experiments showing that restoring sphingomyelin levels suppresses ABCA1 expression, thereby strengthening the causal link between altered lipid balance and ABCA1 induction. However, the evidence that ABCA1 is directly required for chemoresistance remains somewhat limited, as the phenotype was not reproduced by ABCA1 knockout or knockdown, and CsA may affect additional targets beyond ABCA1.

    1. Reviewer #1 (Public review):

      Summary:

      This study builds off prior work that focused on the molecule AA147 and its role as an activator of the ATF6 arm of the unfolded protein response. In prior manuscripts, AA147 was shown to enter the ER, covalently modify a subset of protein disulfide isomerases (PDIs), and improve ER quality control for the disease-associated mutants of AAT and GABAA. Unsuccessful attempts to improve the potency of AA147 have led the authors to characterize a second hit from the screen in this study: the phenylhydrazone compound AA263. The focus of this study on enhancing biological activity of the AA147 molecule is compelling, and overcomes a hurdle of the prior AA147 drug that proved difficult to modify. The study successfully identifies PDIs as a shared cellular target of AA263 and its analogs. The authors infer, based on the similar target hits previously characterized for AA147, that PDI modification likely accounts for a mechanism of action for AA263.

      Strengths:

      The work establishes the ability to modify the AA263 molecule to create analogs with more potency and efficacy for ATF6 activation. The "next generation" analogs are able to enhance the levels of functional AAT and GABAA receptors in cellular models expressing the Z-variant of AAT or an epilepsy-associated variant of the GABAA receptor, outlining the therapeutic potential for this molecule and laying the foundation for future organism-based studies.

      The authors are able to establish that like AA147, AA263 covalently targets ER PDIs. While it is a likely mechanism that AA263 works through the PDIs, the authors are careful to discuss that this is a potential mechanism that remains to be explicitly proven. The study provides the foundation for future work to further define a role for the PDIs in the actions of AA263.

    2. Reviewer #2 (Public review):

      Modulating the UPR by pharmacological targeting of its sensors (or regulators) provides mostly uncharted opportunities in diseases associated with protein misfolding in the secretory pathway. Spearheaded by the Kelly and Wiseman labs, ATF6 modulators were developed in previous years that act on ER PDIs as regulators of ATF6. However, hurdles in their medicinal chemistry have hampered further developments. In this study, the authors provide evidence that the small molecule AA263 also targets and covalently modifies ER PDIs with the effect of activating ATF6. Importantly, AA263 turned out to be amenable to chemical optimization while maintaining its desired activity. Building on this, the authors show that AA263 derivatives can improve aggregation, trafficking and function of two disease-associated mutants of secretory pathway proteins. Together, this study provides compelling evidence for AA263 (and its derivatives) being interesting modulators of ER proteostasis. Mechanistic details of its mode of action will need more attention in future studies that can now build on this.

      In detail, the authors provide strong evidence that AA263 covalently binds to ER PDIs, which will inhibit the protein disulfide isomerase activity. ER PDIs regulate ATF6, and thus their finding provides a mechanistic interpretation of AA263 activating the UPR. It should be noted, however, that AA263 shows broad protein labeling (Fig. 1G) which may suggest additional targets, beyond the ones defined as MS hits in this study. Also, a further direct analysis of the IRE1 and PERK pathways (activated or not by AA263) may be an interesting future directions, as e.g. PDIA1, a target of AA263, directly regulates IRE1 (Yu et al., EMBOJ, 2020) and other PDIs also act on PERK and IRE1. The authors interpret modest activation of IRE1/PERK target genes (Fig. 2C) as an effect on target gene overlap, indeed the most likely explanation based on their selective analyses on IRE1 (ERdj4) and PERK (CHOP) downstream genes, but direct activation due to the targeting of their PDI regulators is also a possible explanation. Further key findings of this paper are the observed improvement of AAT behavior and GABAA trafficking and function. Further strength to the mechanistic conclusion that ATF6 activation causes this could be obtained by using ATF6 inhibitors/knockouts in the presence of AA263 (as the target PDIs may directly modulate behavior of AAT and/or GABAA). Along the same line, it also warrants further investigation in future studies why the different compounds, even if all were used at concentrations above their EC50, had different rescuing capacities on the clients.

      Together, the study now provides a strong basis for such in-depth mechanistic analyses.

    3. Reviewer #3 (Public review):

      Summary:

      This study aims to develop and characterize phenylhydrazone-based small molecules that selectively activate the ATF6 arm of the unfolded protein response by covalently modifying a subset of ER-resident PDIs. The authors identify AA263 as a lead scaffold and optimize its structure to generate analogs with improved potency and ATF6 selectivity, notably AA263-20. These compounds are shown to restore proteostasis and functional expression of disease-associated misfolded proteins in cellular models involving both secretory (AAT-Z) and membrane (GABAA receptor) proteins. The findings provide valuable chemical tools for modulating ER proteostasis and may serve as promising leads for therapeutic development targeting protein misfolding diseases.

      Strengths:

      The study presents a well-defined chemical biology framework integrating proteomics, transcriptomics, and disease-relevant functional assays.

      Identification and optimization of a new electrophilic scaffold (AA263) that selectively activates ATF6 represents a valuable advance in UPR-targeted pharmacology.

      SAR studies are comprehensive and logically drive the development of more potent and selective analogs such as AA263-20.

      Functional rescue is demonstrated in two mechanistically distinct disease models of protein misfolding-one involving a secretory protein and the other a membrane protein-underscoring the translational relevance of the approach.

      Weaknesses:

      ATF6 activation is primarily inferred from reporter assays and transcriptional profiling; direct biochemical evidence of ATF6 cleavage or nuclear translocation remains missing. However, the authors have added supporting data showing that co-treatment with the ATF6 inhibitor CP7 suppresses target gene induction, which partially strengthens the evidence for ATF6-dependent activity.

      Although the proposed mechanism involving PDI modification and ATF6 activation is plausible, it is still not experimentally demonstrated and remains incompletely characterized.

      In vivo validation is absent, and thus the pharmacological feasibility, selectivity, and bioavailability of these compounds in physiological systems remain untested.

      Comments on revisions:

      The authors have generally addressed my comments.

    1. Reviewer #2 (Public review):

      Summary:

      This paper formulates an individual-based model to understand the evolution of division of labor in vertebrates. The model considers a population subdivided in groups, each group has a single asexually-reproducing breeder, other group members (subordinates) can perform two types of tasks called "work" or "defense", individuals have different ages, individuals can disperse between groups, each individual has a dominance rank that increases with age, and upon death of the breeder a new breeder is chosen among group members depending on their dominance. "Workers" pay a reproduction cost by having their dominance decreased, and "defenders" pay a survival cost. Every group member receives a survival benefit with increasing group size. There are 6 genetic traits, each controlled by a single locus, that control propensities to help and disperse, and how task choice and dispersal relate to dominance. To study the effect of group augmentation without kin selection, the authors cross-foster individuals to eliminate relatedness. The paper allows for the evolution of the 6 genetic traits under some different parameter values to study the conditions under which division of labour evolves, defined as the occurrence of different subordinates performing "work" and "defense" tasks. The authors envision the model as one of vertebrate division of labor.

      The main conclusion of the paper is that group augmentation is the primary factor causing the evolution of vertebrate division of labor, rather than kin selection. This conclusion is drawn because, for the parameter values considered, when the benefit of group augmentation is set to zero, no division of labor evolves and all subordinates perform "work" tasks but no "defense" tasks.

      Strengths:

      The model incorporates various biologically realistic details, including the possibility to evolve age polytheism where individuals switch from "work" to "defence" tasks as they age or vice versa, as well as the possibility of comparing the action of group augmentation alone with that of kin selection alone.

      Weaknesses:

      The model and its analysis are limited, which in my view makes the results insufficient to reach the main conclusion that group augmentation and not kin selection is the primary cause of the evolution of vertebrate division of labour. There are several reasons.

      First, although the main claim that group augmentation drives the evolution of division of labour in vertebrates, the model is rather conceptual in that it doesn't use quantitative empirical data that applies to all/most vertebrates and vertebrates only. So, I think the approach has a conceptual reach rather than being able to achieve such a conclusion about a real taxon.

      Second, I think that the model strongly restricts the possibility that kin selection is relevant. The two tasks considered essentially differ only by whether they are costly for reproduction or survival. "Work" tasks are those costly for reproduction and "defense" tasks are those costly for survival. The two tasks provide the same benefits for reproduction (eqs. 4, 5) and survival (through group augmentation, eq. 3.1). So, whether one, the other, or both helper types evolve presumably only depends on which task is less costly, not really on which benefits it provides. As the two tasks give the same benefits, there is no possibility that the two tasks act synergistically, where performing one task increases a benefit (e.g., increasing someone's survival) that is going to be compounded by someone else performing the other task (e.g., increasing that someone's reproduction). So, there is very little scope for kin selection to cause the evolution of labour in this model. Note synergy between tasks is not something unusual in division of labour models, but is in fact a basic element in them, so excluding it from the start in the model and then making general claims about division of labour is unwarranted. In their reply, the authors point out that they only consider fertility benefits as this, according to them, is what happens in cooperative breeders with alloparental care; however, alloparental care entails that workers can increase other's survival *without group augmentation*, such as via workers feeding young or defenders reducing predator-caused mortality, as a mentioned in my previous review but these potentially kin-selected benefits are not allowed here.

      Third, the parameter space is understandably little explored. This is necessarily an issue when trying to make general claims from an individual-based model where only a very narrow parameter region of a necessarily particular model can be feasibly explored. As in this model the two tasks ultimately only differ by their costs, the parameter values specifying their costs should be varied to determine their effects. In the main results, the model sets a very low survival cost for work (yh=0.1) and a very high survival cost for defense (xh=3), the latter of which can be compensated by the benefit of group augmentation (xn=3). Some limited variation of xh and xn is explored, always for very high values, effectively making defense unevolvable except if there is group augmentation. In this revision, additional runs have been included varying yh and keeping xh and xn constant (Fig. S6), so without addressing my comment as xn remains very high. Consequently, the main conclusion that "division of labor" needs group augmentation seems essentially enforced by the limited parameter exploration, in addition to the second reason above.

      Fourth, my view is that what is called "division of labor" here is an overinterpretation. When the two helper types evolve, what exists in the model is some individuals that do reproduction-costly tasks (so-called "work") and survival-costly tasks (so-called "defense"). However, there are really no two tasks that are being completed, in the sense that completing both tasks (e.g., work and defense) is not necessary to achieve a goal (e.g., reproduction). In this model there is only one task (reproduction, equation 4,5) to which both helper types contribute equally and so one task doesn't need to be completed if completing the other task compensates for it; instead, it seems more fitting to say that there are two types of helpers, one that pays a fertility cost and another one a survival cost, for doing the same task. So, this model does not actually consider division of labor but the evolution of different helper types where both helper types are just as good at doing the single task but perhaps do it differently and so pay different types of costs. In this revision, the authors introduced a modified model where "work" and "defense" must be performed to a similar extent. Although I appreciate their effort, this model modification is rather unnatural and forces the evolution of different helper types if any help is to evolve.

      I should end by saying that these comments don't aim to discourage the authors, who have worked hard to put together a worthwhile model and have patiently attended to my reviews. My hope is that these comments can be helpful to build upon what has been done to address the question posed.

    1. Reviewer #1 (Public review):

      Summary:

      The authors investigate how the Drosophila TNF receptor-associated factor Traf4 - a multifunctional adaptor protein with potential E3 ubiquitin ligase activity - regulates JNK signaling and adherens junctions (AJs) in wing disc epithelium. When they overexpress Traf4 in the posterior compartment of the wing disc, many posterior cells express the JNK target gene puckered (puc), apoptose, and are basally extruded from the epithelium. The authors term this process "delamination", but I think that this is an inaccurate description, especially since they can suppress the "delamination" by blocking programmed cell death (by concomitantly overexpressing p35). Through Y2H assays using Traf4 as a bait, they identified the Bearded family proteins E(spl)m4 (and to a lesser extent E(spl)m2), as Traf4 interactors. They use Alphafold to model computationally the interaction between Traf4 and E(spl)m4. They show that co-overexpression of Traf4 with E(spl)m4 in the posterior domain of the wing disc reduces death of posterior cells. They generate a new, weaker hypomorphic allele of Traf4 that is viable (as opposed to the homozygous lethality of null Traf4 alleles). There is some effect of these mutations on wing margin bristles; fewer wing margin bristle defects are seen when E(spl)m4 is overexpressed, suggesting opposite effects of Traf4 and E(spl)m4. Finally, they use the Minute model of cell competition to show that Rp/+ loser clones have greater clone area (indicating increased survival) when they are depleted for Traf4 or when they overexpress E(spl)m4. Only the cell competition results are quantified. Because most of the data in the preprint are not quantified, it is impossible to know how penetrant the phenotypes are. The authors conclude that E(spl)m4 binds the Traf4 MATH/TRAF domain, disrupts Traf4 trimerization, and selectively suppresses Traf4-mediated JNK and caspase activation without affecting its role in AJ destabilization. However, I believe that this is an overstatement. First, there is no biochemical evidence showing that Traf4 binds E(spl)m4 and that E(spl)m4 disrupts Traf4 trimerization. Second, the data on AJs is weak and not quantified; additionally, cells that are being basally extruded lose contact with neighboring cells, hence changes in adhesion proteins. Related to this, the authors, in my opinion, inaccurately describe basal extrusion of dying cells from the wing disc epithelium as delamination.

      Strengths:

      (1) The authors use multiple approaches to test the model that overexpressed E(spl)m4 inhibits Traf4, including genetics, cell biological imaging, yeast two-hybrid assays, and molecular modeling.

      (2) The authors generate a new Traf4 hypomorphic mutant and use this mutant in cell competition studies, which supports the concept that E(spl)m4 (when overexpressed) can antagonize Traf4.

      Weaknesses:

      (1) Conflation of "delamination" with "basal extrusion of apoptotic cells": Over-expression of Traf4 causes apoptosis in wing disc cells, and this is a distinct process from delamination of viable cells from an epithelium. However, the two processes are conflated by the authors, and this weakens the premise of the paper.

      (2) Dependence on overexpression: The conclusions rely heavily on ectopic expression of Traf4 and E(spl)m4. Thus, the physiological relevance of the interaction remains inferred rather than demonstrated.

      (3) Lack of quantitative rigor: Except for the cell competition studies, phenotypic descriptions (e.g., number of apoptotic cells, puc-LacZ intensity) are qualitative; additional quantification, inclusion of sample size, and statistical testing would strengthen the conclusions.

      (4) Limited biochemical validation: The Traf4-E(spl)m4 binding is inferred from Y2H and in silico models, but no co-immunoprecipitation or in vitro binding assays confirm direct interaction or the predicted disruption of trimerization.

      (5) Specificity within the Bearded family: While E(spl)m2 shows partial binding and Tom shows none, the mechanistic basis for this selectivity is not deeply explored experimentally, leaving questions about motif-context contributions unresolved.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript analyzes the contribution of Traf4 to the fate of epithelial cells in the developing wing imaginal disc tissue. The manuscript is direct and concise and suggests an interesting and valuable hypothesis with dual functions of Traf4 in JNK pathway activation and cell delamination. However, the text is partially speculative, and the evidence is incomplete as the main claims are only partially supported. Some results require validation to support the conclusions.

      Strengths:

      (1) The manuscript is direct and concise, with a well-written and precise introduction.

      (2) It presents an interesting and valuable hypothesis regarding the dual role of Traf4 in JNK pathway activation and cell delamination.

      (3) The study addresses a relevant biological question in epithelial tissue development using a genetically tractable model.

      (4) The use of newly generated Traf4 mutants adds novelty to the experimental approach.

      (5) The manuscript includes multiple experimental strategies, such as genetic manipulation and imaging, to explore Traf4 function.

      Weaknesses:

      (1) The evidence supporting key claims is incomplete, and some conclusions are speculative.

      (2) The use of GFP-tagged Traf4 lacks validation regarding its functional integrity.

      (3) Orthogonal views and additional imaging data are needed to confirm changes in apicobasal localization and cell delamination.

      (4) Experimental conditions and additional methods should be further detailed.

      (5) The interaction between Traf4 and E(spl)m4 remains speculative in Drosophila.

      (6) New mutants require deeper analysis and validation.

      (7) The elimination of Traf4 mutant clones may be due to cell competition, which requires further experimental clarification.

      (8) The role of Traf4 in cell competition is contradictory and needs to be resolved.

    3. Reviewer #3 (Public review):

      Summary:

      This is an important and well-conceived study that identifies the Bearded-type small protein E(spl)m4 as a physical and genetic interactor of TRAF4 in Drosophila. By combining classical genetics, yeast two-hybrid assays, and AlphaFold in silico modeling, the authors convincingly demonstrate that E(spl)m4 acts as an inhibitor of TRAF4-mediated induction of JNK-driven apoptosis in developing larval imaginal wing discs, while not affecting TRAF4's role in adherence junction remodeling.

      Based primarily on modeling, the authors propose that the specificity of E(spl)m4 towards TRAF4-mediated signaling arises from its interference with TRAF4 trimerization, which is likely required for the activation of the JNK signaling arm but not for the maintenance of adherence junctions and stability of E-cadherin/β-catenin complex.

      Overall, this study is of broad interest to cell and developmental biologists. It also holds potential biomedical relevance, particularly for strategies aimed at modulating TRAF protein activities to dissect and modulate canonical versus non-canonical signaling functions.

      Strengths:

      (1) The work identifies the Bearded-type small protein E(spl)m4 as a physical and genetic interactor of TRAF4 in Drosophila, extending the understanding of E(spl)m4 beyond its established functions in Notch signaling.

      (2) The study is experimentally solid, well-executed, and written, combining classical genetics with protein-protein interaction assays and modeling to reveal E(spl)m4 as a new regulator of TRAF4 signaling.

      (3) The genetic and biochemical data convincingly show the ability of E(spl)m4 overexpression to inhibit TRAF4-induced JNK-dependent apoptosis, while leaving the TRAF4 role in adherens junction remodeling unaffected.

      (4) The findings have important implications for the regulation of cell signaling and apoptosis and may guide pharmacological targeting of TRAF proteins.

      Weaknesses:

      The study is overall strong; however, several aspects could be clarified or expanded to strengthen the proposed mechanism and data presentation:

      (1) The proposed mechanism that E(spl)m4 inhibits TRAF4 activation of JNK signaling by affecting TRAF4 trimerization relies mainly on modeling. Experimental evidence would strengthen this claim. For example, a native or non-denaturing SDS-PAGE could be used to assess TRAF4 oligomerization states in the absence or presence of E(spl)m4 overexpression, testing whether E(spl)m4 interferes with high-molecular-weight TRAF4 assemblies.

      (2) The study depends largely on E(spl)m4 overexpression, which may not reflect physiological conditions. It would be valuable to test, or at least discuss, whether loss-of-function or knockdown of E(spl)m4 modulates the strength or duration of JNK-mediated signaling, potentially accelerating apoptosis. Such data would reinforce the model that E(spl)m4 acts as a physiological modulator of TRAF4-JNK signaling in vivo.

      (3) The authors initially identify both E(spl)m4 and E(spl)m2 as TRAF4 interactions, but subsequently focus on E(spl)m4. It would be helpful to clarify or discuss the rationale for prioritizing E(spl)m4 for detailed functional analysis.

      (4) E(spl)m4 overexpression appears to protect RpS3 loser clones (Figure 6H-K), yet caspase-3-positive cells are still visible in mosaic wing discs. Please comment on the nature of these Caspase 3-positive cells, whether they are cell-autonomous to the clone or non-autonomous (Figure 6K)?

      (5) This is a clear, well-executed, and conceptually strong study that significantly advances understanding of TRAF4 signaling specificity and its modulation by the Bearded-type protein E(spl)m4.

    1. Reviewer #1 (Public review):

      Nielsen et al have identified a new disease mechanism underlying hypoplastic left heart syndrome due to variants in ribosomal protein genes that lead to impaired cardiomyocyte proliferation. This detailed study starts with an elegant screen in stem cell derived cardiomyocytes and whole genome sequencing of human patients and extends to careful functional analysis of RP gene variants in fly and fish models. Striking phenotypic rescue is seen by modulating known regulators of proliferation including the p53 and Hippo pathways. Additional experiments suggest that cell type specificity of the variants in these ubiquitously expressed genes may result from genetic interactions with cardiac transcription factors. This work positions RPs as important regulators of cardiomyocyte proliferation and differentiation involved in the etiology of HLHS, and point to potential downstream mechanisms.

      The revised manuscript has been extended, facilitating interpretation and reinforcing the authors' conclusions.

    2. Reviewer #2 (Public review):

      Tanja Nielsen et al. presents a novel strategy for identification of candidate genes in Congenital Heart Disease (CHD). Their methodology, which is based on comprehensive experiments across cell models, drosophila and zebrafish models, represents an innovative, refreshing and very useful set of tools for identification of disease genes, in a field which are struggling with exactly this problem.

      The authors have applied their methodology to investigate the pathomechanisms of Hypoplastic Left Heart Syndrome (HLHS) - a severe and rare subphenotype in the large spectrum of CHD malformations. Their data convincingly implicates ribosomal proteins (RPs) in growth and proliferation defects of cardiomyocytes, a mechanism which is suspected to be associated with HLHS.

      By whole genome sequencing analysis of a small cohort of trios (25 HLHS patients and their parents) the authors investigated a possible association between RP encoding genes and HLHS.

      Although the possible association between defective RPs and HLHS needs to be verified, the results suggest a novel disease mechanism in HLHS, which is a potentially substantial advance in our understanding of HLHS and CHD. The conclusions of the paper are based on solid experimental evidence from appropriate high- to medium-throughput models, while additional genetic results from an independent patient cohort is needed to verify an association between RP encoding genes and HLHS in patients.

    1. Reviewer #1 (Public review):

      The study analyzes the gastric fluid DNA content identified as a potential biomarker for human gastric cancer. However, the study lacks overall logicality, and several key issues require improvement and clarification. In the opinion of this reviewer, some major revisions are needed:

      (1) This manuscript lacks a comparison of gastric cancer patients' stages with PN and N+PD patients, especially T0-T2 patients.

      (2) The comparison between gastric cancer stages seems only to reveal the difference between T3 patients and early-stage gastric cancer patients, which raises doubts about the authenticity of the previous differences between gastric cancer patients and normal patients, whether it is only due to the higher number of T3 patients.

      (3) The prognosis evaluation is too simplistic, only considering staging factors, without taking into account other factors such as tumor pathology and the time from onset to tumor detection.

      (4) The comparison between gfDNA and conventional pathological examination methods should be mentioned, reflecting advantages such as accuracy and patient comfort.

      (5) There are many questions in the figures and tables. Please match the Title, Figure legends, Footnote, Alphabetic order, etc.

      (6) The overall logicality of the manuscript is not rigorous enough, with few discussion factors, and cannot represent the conclusions drawn.

      Comments on revisions:

      The authors have addressed all concerns in the revision.

    2. Reviewer #2 (Public review):

      Summary

      The authors aimed to evaluate whether total DNA concentration in gastric fluid (gfDNA) collected during routine endoscopy could serve as a diagnostic and prognostic biomarker for gastric cancer. Using a large cohort (n=941), they reported elevated gfDNA in gastric cancer patients, an unexpected association with improved survival, and a positive correlation with immune cell infiltration.

      Strengths

      The study benefits from a substantial sample size, clear patient stratification, and control of key clinical confounders. The method is simple and clinically feasible, with preliminary evidence linking gfDNA to immune infiltration.

      Weaknesses

      (1) While the study identifies gfDNA as a potential prognostic tool, the evidence remains preliminary. Unexplained survival associations and methodological gaps weaken support for the conclusions.

      (2) The paradoxical association between high gfDNA and better survival lacks mechanistic validation. The authors acknowledge but do not experimentally distinguish tumor vs. immune-derived DNA, leaving the biological basis speculative.

      (3) Pre-analytical variables were noted but not systematically analyzed for their impact on gfDNA stability.

      Comments on revisions:

      To enhance the completeness and credibility of this research, it is essential to clarify the biological origin of gastric fluid DNA and validate these preliminary findings through a prospective, longitudinal study design.

    1. Reviewer #1 (Public review):

      The authors of this study set out to address a central question in the psycholinguistics literature: does the human brain's ability to predict upcoming language come at a cognitive cost, or is it an automatic, "free" process? To investigate this, they employed a dual-task paradigm where participants read texts word-by-word while simultaneously performing a secondary task (an n-back task on font color) designed to manipulate cognitive load. The study examines how this external cognitive load, along with the effects of aging, modulates the impact of word predictability (measured by surprisal and entropy) on reading times. The central finding is that increased cognitive load diminishes the effects of word predictability, supporting the conclusion that language prediction is a resource-dependent process.

      A major strength of the revised manuscript is its comprehensive and parallel analysis of both word surprisal and entropy. The initial submission focused almost exclusively on surprisal, which primarily reflects the cost of integrating a word into its context after it has been perceived. The new analysis now thoroughly investigates entropy as well, which reflects the uncertainty and cognitive effort involved in predicting the next word before it appears. This addition provides a much more complete and theoretically nuanced picture, allowing the authors to address how cognitive load affects both predictive and integrative stages of language processing. This is a significant improvement and substantially increases the paper's contribution to the field.

      Furthermore, the authors have commendably addressed the initial concerns regarding the robustness of their replication findings. The first version of the manuscript presented replication results that were inconsistent, particularly for key interaction effects. In the revision, the authors have adopted a more focused and appropriately powered modeling approach for the replication analysis. This revised analysis now demonstrates a consistent effect of cognitive load on the processing of predictable words across both the original and replication datasets. This strengthens the evidence for the paper's primary claim.

      The initial review also raised concerns that the results could be explained by general cognitive factors, such as task-switching costs, rather than the specific demands on the language prediction system. While the complexity of cognitive load in a dual-task paradigm remains a challenge, the authors have provided sufficient justification in their revisions and rebuttal to support their interpretation that the observed effects are genuinely tied to the process of language prediction.

    2. Reviewer #2 (Public review):

      Summary:

      This paper considers the effects of cognitive load (using an n-back task related to font color), predictability, and age on reading times in two experiments. There were main effects of all predictors, but more interesting effects of load and age on predictability. The effect of load is very interesting, but the manipulation of age is problematic, because we don't know what is predictable for different participants (in relation to their age). There are some theoretical concerns about prediction and predictability, and a need to address literature (reading time, visual world, ERP studies).

      There is a major concern about the effects of age. See the results (155-190): this depends what is meant by word predictability. It's correct if it means the predictability in the corpus. But it may or may not be correct if it refers to how predictable a word is to an individual participant. The texts are unlikely to be equally predictable to different participants, and in particular to younger vs. older participants, because of their different experience. To put it informally, the newspaper articles may be more geared to the expectations of younger people. But there is also another problem: the LLM may have learned on the basis of language that has largely been produced by young people and so its predictions are based on what young people are likely to say. Both of these possibilities strike me as extremely likely. So it may be that older adults are affected more by words that they find surprising, but it is also possible that the texts are not what they expect, or the LLM predictions from the text are not the ones that they would make. In sum, I am not convinced that the authors can say anything about the effects of age unless they can determine what is predictable for different ages of participants. I suspect that this failure to control is an endemic problem in the literature on aging and language processing and needs to be systematically addressed.

      Overall, I think the paper makes enough of a contribution with respect to load to be useful to the literature. But for discussion of age, we would need something like evidence of how younger and older adults would complete these texts (on a word-by-word basis) and that they were equally predictable for different ages. I assume there are ways to get LLMs to emulate different participant groups, but I doubt if we could be confident about their accuracy without a lot of testing. But without something like this, I think making claims about age would be quite misleading.

      The authors respond to my summary comment by saying that prediction is individual and that they account for age-related effects in their models. But these aren't my concerns. Rather:

      (1) The texts (these edited newspaper articles) could be more predictable for younger than older adults. If so, effects with older adults could simply be because people are less likely to predict less than more predictable words.

      (2) The GPT-2 generated surprisal scores may correspond more closely to younger than older adult responses -- that is, its next word predictions may be more younger- than older-adult-like.

      In my view, the authors have two choices: they could remove the discussion of age-related effects, or they could try to address BOTH (1) and (2).

      As an aside, consider what we would conclude if we drew similar conclusions from a study in which children and adults read the same (children's) texts, but we didn't test what was predictable to each of them separately.

      The paper is really strong in other respects and if my concern is not addressed, the conclusions about age might be generally accepted.

    1. Reviewer #1 (Public review):

      Summary:

      Wojnowska et al. report structural and functional studies of the interaction of Streptococcus pyogenes M3 protein with collagen. They show through X-ray crystallographic studies that the N-terminal hypervariable region of M3 protein forms a T-like structure, and that the T-like structure binds a three-stranded collagen-mimetic peptide. They indicate that the T-like structure is predicted by AlphaFold3 with moderate confidence level in other M proteins that have sequence similarity to M3 protein and M-like proteins from group C and G streptococci. For some, but not all, of these related M and M-like proteins, AlphaFold3 predicts, with moderate confidence level, complexes similar to the one observed for M3-collagen. Functionally, the authors show that emm3 strains form biofilms with more mass when surfaces are coated with collagen, and this effect can be blocked by an M3 protein fragment that contains the T-structure. They also show the co-occurrence of emm3 strains and collagen in patient biopsies and a skin tissue organoid. Puzzlingly, M1 protein has been reported to bind collagen, but collagen inhibits biofilm in a particular emm1 strain but that same emm1 strain colocalizes with collagen in a patient biopsy sample. The implications of the variable actions of collagen on biofilm formation are not clear.

      Strengths:

      The paper is well written and the results are presented in a logical fashion.

      Weaknesses:

      A major limitation of the paper is that it is almost entirely observational and lacks detailed molecular investigation. Insufficient details or controls are provided to establish the robustness of the data.

      Comments on revisions:

      The authors' response to this reviewer's Major issue #1 is inadequate. Their argument is essentially that if they denature the protein, then there is no activity. This does not address the specificity of the structure or its interactions.

      They went only part way to addressing this reviewer's Major issue #2. While Figure 8 - supplement 3 shows 1D NMR spectra for M3 protein (what temperature?), it does not establish that stability is unaltered (to a significant degree).

      This reviewer's Major issue #3 is one of the major reasons for considering this study to be observational. This reviewer agrees that structural biology is by its nature observational, but modern standards require validation of structural observations. The authors' response is that a mechanistic investigation involving mutant bacterial strains and validation involving mutated proteins is beyond their scope. Therefore, the study remains observational.

      Major issue 4 was addressed suitably, but brings up the problematic point that the emm1 2006 strain colocalizes quite well with collagen in a patient biopsy sample but not in other assays. This calls into question the overall interpretability of the patient biopsy data.

      The authors have not provided a point-by-point response. Issues that were indicated to be minor previously were deemed to be minor because this reviewer thought that they could easily be addressed in a revision. It appears that the authors have ignored many of these comments, and these issues are therefore now considered to be major issues. For example, no errors are given for Kd measurements, Table 2 is sloppy and lacks the requested information, negative controls are missing (Figure 10 - figure supplement 1), and there is no indication of how many independent times each experiment was done.

      And "C4-binding protein" should be corrected to "C4b-binding protein."

    2. Reviewer #2 (Public review):

      Streptococcus pyogenes, or group A streptococci (GAS) can cause diseases ranging skin and mucosal infections, plasma invasion, and post-infection autoimmune syndromes. M proteins are essential GAS virulence factors that include an N-terminal hypervariable region (HVR). M proteins are known to bind to numerous human proteins; a small subset of M proteins were reported to bind collagen, which is thought to promote tissue adherence. In this paper, authors characterize M3 interactions with collagen and its role in biofilm formation. Specifically, they screened different collagen type II and III variants for full-length M3 protein binding using an ELISA-like method, detecting anti-GST antibody signal. By statistical analysis, hydrophobic amino acids and hydroxyproline found to positively support binding, whereas acidic residues and proline negatively impacted binding. The authors applied X-ray crystallography to determine the structure of the N-terminal domain (42-151 amino acids) of M3 protein (M3-NTD). M3-NTD dimmer (PDB 8P6K) forms a T-shaped structure with three helices (H1, H2, H3), which are stabilized by a hydrophobic core, inter-chain salt bridges and hydrogen bonds on H1, H2 helices, and H3 coiled coil. The conserved Gly113 serves as the turning point between H2 and H3. The M3-NTD is co-crystalized with a 24-residue peptide, JDM238, to determine the structure of M3-collagen binding. The structure (PDB 8P6J) shows that two copies of collagen in parallel bind to H1 and H2 of M3-NTD. Among the residues involved binding, conserved Try96 is shown to play a critical role supported by structure and isothermal titration calorimetry (ITC). The authors also apply a crystal-violet assay and fluorescence microscopy to determine that M3 is involved in collagen type I binding, but not M1 or M28. Tissue biopsy staining indicates that M3 strains co-localize with collagen IV-containing tissue, while M1 strains do not. The authors provide generally compelling evidence to show that GAS M3 protein binds to collagen, and plays a critical role in forming biofilms, which contribute to disease pathology. This is a very well-executed study and a well-written report relevant to understanding GAS pathogenesis and approaches to combatting disease; data are also applicable to emerging human pathogen Streptococcus dysgalactiae. One caveat that was not entirely resolved is if/how different collagen types might impact M3 binding and function. Due to the technical constrains, the in vitro structure and other binding assays use type II collagen whereas in vivo, biofilm formation assays and tissue biopsy staining use type I and IV collagen; it was unclear if this difference is significant. One possibility is that M3 has an unbiased binding to all types of collagens, only the distribution of collagens leads to the finding that M3 binds to type IV (basement membrane) and type I (varies of tissue including skin), rather than type II (cartilage).

      Comments on revisions:

      We are glad to see that the authors addressed our prior comments on M3 binding to different types of collagens in discussion section; adding a prediction of M3 binding to type I collagen (Figure 8-figure supplement 1B and 1C) is helpful to fill in the gap. Although it would be nice to experimentally fill in the gap by putting all types of collagens into one experiment (For example, like Figure 9A, use different types of human collagens to test biofilm formation; or Figure 10, use different types of human collagens to compete for biofilm formation), this appears to be beyond the scope of this paper. Meanwhile, the changes they have made are constructive.

      The authors have addressed the majority of our prior comments.

    1. Reviewer #3 (Public review):

      Summary:

      In this well-written manuscript, Unitt and colleagues propose a new, hierarchical nomenclature system for the pathogen Neisseria gonorrhoeae. The proposed nomenclature addresses a longstanding problem in N. gonorrhoeae genomics, namely that the highly recombinant population complicates typing schemes based on only a few loci and that previous typing systems, even those based on the core genome, group strains at only one level of genomic divergence without a system for clustering sequence types together. In this work, the authors have revised the core genome MLST scheme for N. gonorrhoeae and devised life identification numbers (LIN) codes to describe the N. gonorrhoeae population structure.

      Strengths:

      The LIN codes proposed in this manuscript are congruent with previous typing methods for Neisseria gonorrhoeae like cgMLST groups, Ng-STAR, and NG-MAST. Importantly, they improve upon many of these methods as the LIN codes are also congruent with the phylogeny and represent monophyletic lineages/sublineages. Additionally, LIN code cluster assignment is fixed, and clusters are not fused as is common in other typing schemes.

      The LIN code assignment has been implemented in PubMLST allowing other researchers to assign LIN codes to new assemblies and put genomes of interest in context with global datasets, including in private datasets.

      Weaknesses:

      The authors have defined higher resolution thresholds for the LIN code scheme. However, they do not investigate how these levels correspond to previously identified transmission clusters from genomic epidemiology studies. This will be an important focus of future work, but it may be beyond the scope of the current manuscript.

      Comments on revisions:

      The authors have addressed my previous comments. I have no additional recommendations.

    1. Reviewer #1 (Public review):

      Summary:

      The authors set out to evaluate the regulation of interferon (IFN) gene expression in fish, using mainly zebrafish as a model system. Similar to more widely characterized mammalian systems, fish IFN is induced during viral infection through the action of the transcription factor IRF3 which is activated by phosphorylation by the kinase TBK1. It has been previously shown in many systems that TBK1 is subjected to both positive and negative regulation to control IFN production. In this work, the authors find that the cell cycle kinase CDK2 functions as a TBK1 inhibitor by decreasing its abundance through recruitment of the ubiquitinylation ligase, Dtx4, which has been similarly implicated in the regulation of mammalian TBK1. Experimental data are presented showing that CDK2 interacts with both TBK1 and Dtx4, leading to TBK1 K48 ubiqutinylation on K567 and its subsequent degradation by the proteasome.

      Strengths:

      The strengths of this manuscript are its novel demonstration of the involvement of CDK2 in a process in fish that is controlled by different factors in other vertebrates and its clear and supportive experimental data.

      Weaknesses:

      The weaknesses of the study include the following. 1) It remains unclear how CDK is regulated during viral infection and how it specifically recruits E3 ligase to TBK1. The authors find that its abundance increases during viral infection, an unusual finding given that CDK2 levels are often found to be stable. How this change in abundance might affect cell cycle control was not explored. 2) The implications and mechanisms for a relationship between the cell cycle and IFN production will be a fascinating topic for future studies. In particular, it will be critical to determine if CDK2 catalytic activity is required. An experiment with an inhibitor suggests that this novel action of CDK2 is kinase independent, but the lack of controls showing the efficacy of the inhibitor prevents a firm conclusion. It will also be critical to determine if there is a role for cyclins in this process or if there is competition for binding between TBK1 and cyclin and, if so, if this has an impact on the cell cycle. Likewise, an impact of CDK2 induction by virus infection on normal cell cycling will be important to investigate.

    2. Reviewer #2 (Public review):

      Summary:

      In this paper, the authors describe a novel function involving the cell cycle protein kinase CDK2, which binds to TBK1 (an essential component of the innate immune response) leading to its degradation in a ubiquitin/proteasome-dependent manner. Moreover, the E3 ubiquitin ligase, Dtx4, is implicated in the process by which CDK2 increases the K48-linked ubiquitination of TBK1. This paper presents intriguing findings on the function of CDK2 in lower vertebrates, particularly its regulation of IFN expression and antiviral immunity.

      Strengths:

      (1) The research employs a variety of experimental approaches to address a single question. The data are largely convincing and appear to be well executed.

      (2) The evidence is strong and includes a combination of in vivo and in vitro experiments, including knockout models, protein interaction studies, and ubiquitination analyses.

      (3) This study significantly impacts the field of immunology and virology, particularly concerning the antiviral mechanisms in lower vertebrates. The findings provide new insights into the regulation of IFN expression and the broader role of CDK2 in immune responses. The methods and data presented in this paper are highly valuable for the scientific community, offering new avenues for research into antiviral strategies and the development of therapeutic interventions targeting CDK2 and its associated pathways.

    1. Reviewer #1 (Public review):

      The authors investigated the potential role of IgG N-glycosylation in Haemorrhagic Fever with Renal Syndrome (HFRS), which may offer significant insights for understanding molecular mechanisms and for the development of therapeutic strategies for this infectious disease.

    2. Reviewer #2 (Public review):

      This work sought to explore antibody responses in the context of hemorrhagic fever with renal syndrome (HFRS) - a severe disease caused by Hantaan virus infection. Little is known about the characteristics or functional relevance of IgG Fc glycosylation in HFRS. To address this gap, the authors analyzed samples from 65 patients with HFRS spanning the acute and convalescent phases of disease via IgG Fc glycan analysis, scRNAseq, and flow cytometry. The authors observed changes in Fc glycosylation (increased fucosylation and decreased bisection) coinciding with a 4-fold or greater increased in Haantan virus-specific antibody titer. The study also includes exploratory analyses linking IgG glycan profiles to glycosylation-related gene expression in distinct B cell subsets, using single-cell transcriptomics. Overall, this is an interesting study that combines serological profiling with transcriptomic data to shed light on humoral immune responses in an underexplored infectious disease. The integration of Fc glycosylation data with single-cell transcriptomic data is a strength.

    1. Reviewer #1 (Public review):

      Summary:

      The microbiota of Dactylorhiza traunsteineri, an endangered marsh orchid, forms complex root associations that support plant health. Using 16S rRNA sequencing, we identified dominant bacterial phyla in its rhizosphere, including Proteobacteria, Actinobacteria, and Bacteroidota. Deep shotgun metagenomics revealed high-quality MAGs with rich metabolic and biosynthetic potential. This study provides key insights into root-associated bacteria and highlights the rhizosphere as a promising source of bioactive compounds, supporting both microbial ecology research and orchid conservation.

      Strengths:

      The manuscript presents an investigation of the bacterial communities in the rhizosphere of D. traunsteineri using advanced metagenomic approaches. The topic is relevant, and the techniques are up-to-date; however, the study has several critical weaknesses.

      Weaknesses:

      (1) Title: The current title is misleading. Given that fungi are the primary symbionts in orchids and were not analyzed in this study (nor were they included among other microbial groups), the use of the term "microbiome" is not appropriate. I recommend replacing it with "bacteriome" to better reflect the scope of the work.

      (2) Line 124: The phrase "D. traunsteineri individuals were isolated" seems misleading. A more accurate description would be "individuals were collected", as also mentioned in line 128.

      (3) Experimental design: The major limitation of this study lies in its experimental design. The number of plant individuals and soil samples analyzed is unclear, making it difficult to assess the statistical robustness of the findings. It is also not well explained why the orchids were collected two years before the rhizosphere soil samples. Was the rhizosphere soil collected from the same site and from remnants of the previously sampled individuals in 2018? This temporal gap raises serious concerns about the validity of the biological associations being inferred.

      (4) Low sample size: In lines 249-251 (Results section), the authors mention that only one plant individual was used for identifying rhizosphere bacteria. This is insufficient to produce scientifically robust or generalizable conclusions.

      (5) Contextual limitations: Numerous studies have shown that plant-microbe interactions are influenced by external biotic and abiotic factors, as well as by plant age and population structure. These elements are not discussed or controlled for in the manuscript. Furthermore, the ecological and environmental conditions of the site where the plants and soil were collected are poorly described. The number of biological and technical replicates is also not clearly stated.

      (6) Terminology: Throughout the manuscript, the authors refer to the "microbiome," though only bacterial communities were analyzed. This terminology is inaccurate and should be corrected consistently.

      Considering the issues addressed, particularly regarding experimental design and data interpretation, significant improvements to the study are needed.

    2. Reviewer #2 (Public review):

      Summary:

      The authors aim to provide an overview of the D. traunsteineri rhizosphere microbiome on a taxonomic and functional level, through 16S rRNA amplicon analysis and shotgun metagenome analysis. The amplicon sequencing shows that the major phyla present in the microbiome belong to phyla with members previously found to be enriched in rhizospheres and bulk soils. Their shotgun metagenome analysis focused on producing metagenome assembled genomes (MAGs), of which one satisfies the MIMAG quality criteria for high-quality MAGs and three those for medium-quality MAGs. These MAGs were subjected to functional annotations focusing on metabolic pathway enrichment and secondary metabolic pathway biosynthetic gene cluster analysis. They find 1741 BGCs of various categories in the MAGs that were analyzed, with the high-quality MAG being claimed to contain 181 SM BGCs. The authors provide a useful, albeit superficial, overview of the taxonomic composition of the microbiome, and their dataset can be used for further analysis.

      The conclusions of this paper are not well-supported by the data, as the paper only superficially discusses the results, and the functional interpretation based on taxonomic evidence or generic functional annotations does not allow drawing any conclusions on the functional roles of the orchid microbiota.

      Weaknesses:

      The authors only used one individual plant to take samples. This makes it hard to generalize about the natural orchid microbiome.

      The authors use both 16S amplicon sequencing and shotgun metagenomics to analyse the microbiome. However, the authors barely discuss the similarities and differences between the results of these two methods, even though comparing these results may be able to provide further insights into the conclusions of the authors. For example, the relative abundance of the ASVs from the amplicon analysis is not linked to the relative abundances of the MAGs.

      Furthermore, the authors discuss that phyla present in the orchid microbiome are also found in other microbiomes and are linked to important ecological functions. However, their results reach further than the phylum level, and a discussion of genera or even species is lacking. The phyla that were found have very large within-phylum functional variability, and reliable functional conclusions cannot be drawn based on taxonomic assignment at this level, or even the genus level (Yan et al. 2017).

      Additionally, although the authors mention their techniques used, their method section is sometimes not clear about how samples or replicates were defined. There are also inconsistencies between the methods and the results section, for example, regarding the prediction of secondary metabolite biosynthetic gene clusters (BGCs).

      The BGC prediction was done with several tools, and the unusually high number of found BGCs (181 in their high-quality MAG) is likely due to false positives or fragmented BGCs. The numbers are much higher than any numbers ever reported in literature supported by functional evidence (Amos et al, 2017), even in a prolific genus like Streptomyces (Belknap et al., 2020). This caveat is not discussed by the authors.

      The authors have generated one high-quality MAG and three medium-quality MAGs. In the discussion, they present all four of these as high-quality, which could be misleading. The authors discuss what was found in the literature about the role of the bacterial genera/phyla linked to these MAGs in plant rhizospheres, but they do not sufficiently link their own analysis results (metabolic pathway enrichment and biosynthetic gene cluster prediction) to this discussion. The results of these analyses are only presented in tables without further explanation in either the results section or the discussion, even though there may be interesting findings. For example, the authors only discuss the class of the BGCs that were found, but don't search for experimentally verified homologs in databases, which could shed more light on the possible functional roles of BGCs in this microbiome.

      In the conclusions, the authors state: "These analyses uncovered potential metabolic capabilities and biosynthetic potentials that are integral to the rhizosphere's ecological dynamics." I don't see any support for this. Mentioning that certain classes of BGCs are present is not enough to make this claim, in my opinion. Any BGC is likely important for the ecological niche the bacteria live in. The fact that rhizosphere bacteria harbour BGCs is not surprising, and it doesn't tell us more than is already known.

      References:

      Belknap, Kaitlyn C., et al. "Genome mining of biosynthetic and chemotherapeutic gene clusters in Streptomyces bacteria." Scientific reports 10.1 (2020): 2003

      Amos GCA, Awakawa T, Tuttle RN, Letzel AC, Kim MC, Kudo Y, Fenical W, Moore BS, Jensen PR. Comparative transcriptomics as a guide to natural product discovery and biosynthetic gene cluster functionality. Proc Natl Acad Sci U S A. 2017 Dec 26;114(52):E11121-E11130.

      References:

      Belknap, Kaitlyn C., et al. "Genome mining of biosynthetic and chemotherapeutic gene clusters in Streptomyces bacteria." Scientific reports 10.1 (2020): 2003

      Amos GCA, Awakawa T, Tuttle RN, Letzel AC, Kim MC, Kudo Y, Fenical W, Moore BS, Jensen PR. Comparative transcriptomics as a guide to natural product discovery and biosynthetic gene cluster functionality. Proc Natl Acad Sci U S A. 2017 Dec 26;114(52):E11121-E11130.

      Yan Yan, Eiko E Kuramae, Mattias de Hollander, Peter G L Klinkhamer, Johannes A van Veen, Functional traits dominate the diversity-related selection of bacterial communities in the rhizosphere, The ISME Journal, Volume 11, Issue 1, January 2017, Pages 56-66

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript addresses an important methodological issue - the fragility of meta-analytic findings - by extending fragility concepts beyond trial-level analysis. The proposed EOIMETA framework provides a generalizable and analytically tractable approach that complements existing methods such as the traditional Fragility Index and Atal et al.'s algorithm. The findings are significant in showing that even large meta-analyses can be highly fragile, with results overturned by very small numbers of event recodings or additions. The evidence is clearly presented, supported by applications to vitamin D supplementation trials, and contributes meaningfully to ongoing debates about the robustness of meta-analytic evidence. Overall, the strength of evidence is moderate to strong, though some clarifications would further enhance interpretability.

      Strengths:

      (1) The manuscript tackles a highly relevant methodological question on the robustness of meta-analytic evidence.

      (2) EOIMETA represents an innovative extension of fragility concepts from single trials to meta-analyses.

      (3) The applications are clearly presented and highlight the potential importance of fragility considerations for evidence synthesis.

      Weaknesses:

      (1) The rationale and mathematical details behind the proposed EOI and ROAR methods are insufficiently explained. Readers are asked to rely on external sources (Grimes, 2022; 2024b) without adequate exposition here. At a minimum, the definitions, intuition, and key formulas should be summarized in the manuscript to ensure comprehensibility.

      (2) EOIMETA is described as being applicable when heterogeneity is low, but guidance is missing on how to interpret results when heterogeneity is high (e.g., large I²). Clarification in the Results/Discussion is needed, and ideally, a simulation or illustrative example could be added.

      (3) The manuscript would benefit from side-by-side comparisons between the traditional FI at the trial level and EOIMETA at the meta-analytic level. This would contextualize the proposed approach and underscore the added value of EOIMETA.

      (4) Scope of FI: The statement that FI applies only to binary outcomes is inaccurate. While originally developed for dichotomous endpoints, extensions exist (e.g., Continuous Fragility Index, CFI). The manuscript should clarify that EOIMETA focuses on binary outcomes, but FI, as a concept, has been generalized.

    2. Reviewer #2 (Public review):

      Summary:

      The study expands existing analytical tools originally developed for randomized controlled trials with dichotomous outcomes to assess the potential impact of missing data, adapting them for meta-analytical contexts. These tools evaluate how missing data may influence meta-analyses where p-value distributions cluster around significance thresholds, often leading to conflicting meta-analyses addressing the same research question. The approach quantifies the number of recodings (adding events to the experimental group and/or removing events from the control group) required for a meta-analysis to lose or gain statistical significance. The author developed an R package to perform fragility and redaction analyses and to compare these methods with a previously established approach by Atal et al. (2019), also integrated into the package. Overall, the study provides valuable insights by applying existing analytical tools from randomized controlled trials to meta-analytical contexts.

      Strengths:

      The author's results support his claims. Analyzing the fragility of a given meta-analysis could be a valuable approach for identifying early signs of fragility within a specific topic or body of evidence. If fragility is detected alongside results that hover around the significance threshold, adjusting the significance cutoff as a function of sample size should be considered before making any binary decision regarding statistical significance for that body of evidence. Although the primary goal of meta-analysis is effect estimation, conclusions often still rely on threshold-based interpretations, which is understandable. In some of the examples presented by Atal et al. (2019), the event recoding required to shift a meta-analysis from significant to non-significant (or vice versa) produced only minimal changes in the effect size estimation. Therefore, in bodies of evidence where meta-analyses are fragile or where results cluster near the null, it may be appropriate to adjust the cutoff. Conducting such analyses-identifying fragility early and adapting thresholds accordingly-could help flag fragile bodies of evidence and prevent future conflicting meta-analyses on the same question, thereby reducing research waste and improving reproducibility.

      Weaknesses:

      It would be valuable to include additional bodies of conflicting literature in which meta-analyses have demonstrated fragility. This would allow for a more thorough assessment of the consistency of these analytical tools, their differences, and whether this particular body of literature favored one methodology over another. The method proposed by Atal et al. was applied to numerous meta-analyses and demonstrated consistent performance. I believe there is room for improvement, as both the EOI and ROAR appear to be very promising tools for identifying fragility in meta-analytical contexts.

      I believe the manuscript should be improved in terms of reporting, with clearer statements of the study's and methods' limitations, and by incorporating additional bodies of evidence to strengthen its claims.

    3. Reviewer #3 (Public review):

      Summary and strengths:

      In this manuscript, Grimes presents an extension of the Ellipse of Insignificant (EOI) and Region of Attainable Redaction (ROAR) metrics to the meta-analysis setting as metrics for fragility and robustness evaluation of meta-analysis. The author applies these metrics to three meta-analyses of Vitamin D and cancer mortality, finding substantial fragility in their conclusions. Overall, I think extension/adaptation is a conceptually valuable addition to meta-analysis evaluation, and the manuscript is generally well-written.

      Specific comments:

      (1) The manuscript would benefit from a clearer explanation of in what sense EOIMETA is generalizable. The author mentions this several times, but without a clear explanation of what they mean here.

      (2) The authors mentioned the proposed tools assume low between-study heterogeneity. Could the author illustrate mathematically in the paper how the between-study heterogeneity would influence the proposed measures? Moreover, the between-study heterogeneity is high in Zhang et al's 2022 study. It would be a good place to comment on the influence of such high heterogeneity on the results, and specifying a practical heterogeneity cutoff would better guide future users.

      (3) I think clarifying the concepts of "small effect", "fragile result", and "unreliable result" would be helpful for preventing misinterpretation by future users. I am concerned that the audience may be confusing these concepts. A small effect may be related to a fragile meta-analysis result. A fragile meta-analysis doesn't necessarily mean wrong/untrustworthy results. A fragile but precise estimate can still reflect a true effect, but whether that size of true effect is clinically meaningful is another question. Clarifying the effect magnitude, fragility, and reliability in the discussion would be helpful.

    1. Reviewer #1 (Public review):

      The authors used fluorescence microscopy, image analysis, and mathematical modeling to study the effects of membrane affinity and diffusion rates of MinD monomer and dimer states on MinD gradient formation in B. subtilis. To test these effects, the authors experimentally examined MinD mutants that lock the protein in specific states, including Apo monomer (K16A), ATP-bound monomer (G12V) and ATP-bound dimer (D40A, hydrolysis defective), and compared to wild-type MinD. Overall, the experimental results support the conclusions that reversible membrane binding of MinD is critical for the formation of the MinD gradient, but the binding affinities between monomers and dimers are similar.

      The modeling part is a new attempt to use the Monte Carlo method to test the conditions for the formation of the MinD gradient in B. subtilis. The modeling results provide good support for the observations and find that the MinD gradient is sensitive to different diffusion rates between monomers and dimers. This simulation is based on several assumptions and predictions, which raises new questions that need to be addressed experimentally in the future.

    2. Reviewer #3 (Public review):

      This important study by Bohorquez et al examines the determinants necessary for concentrating the spatial modulator of cell division, MinD, at the future site of division and the cell poles. Proper localization of MinD is necessary to bring the division inhibitor, MinC, in proximity to the cell membrane and cell poles where it prevents aberrant assembly of the division machinery. In contrast to E. coli, in which MinD oscillates from pole-to-pole courtesy of a third protein MinE, how MinD localization is achieved in B. subtilis-which does not encode a MinE analog-has remained largely a mystery. The authors present compelling data indicating that MinD dimerization is dispensable for membrane localization but required for concentration at the cell poles. Dimerization is also important for interactions between MinD and MinC, leading to the formation of large protein complexes. Computational modeling, specifically a Monte Carlo simulation, supports a model in which differences in diffusion rates between MinD monomers and dimers lead to concentration of MinD at cell poles. Once there, interaction with MinC increases the size of the complex, further reinforcing diffusion differences. Notably, interactions with MinJ-which has previously been implicated in MinCD localization, are dispensable for concentrating MinD at cell poles although MinJ may help stabilize the MinCD complex at those locations.

      Comments on revisions:

      I believe the authors put respectable effort into revisions and addressing reviewer comments, particularly those that focused on the strengths of the original conclusions. The language in the current version of the manuscript is more precise and the overall product is stronger.

    1. Reviewer #1 (Public review):

      Summary:

      Outstanding fundamental phenomenon (migrasomes) en route to become transitionally highly significant.

      Strengths:

      Innovative approach at several levels: Migrasomes, discovered by DR. Yu's group, are an outstanding biological phenomenon of fundamental interest and now of potentially practical value.

      Weaknesses:

      I feel that the overemphasis on practical aspects (vaccine), however important, eclipses some of the fundamental aspects that may be just as important and actually more interesting. If this can be expanded, the study would be outstanding.

      Comments on revisions: This reviewer feels that the authors have addressed all issues.

    2. Reviewer #2 (Public review):

      Summary:

      The authors report describes a novel vaccine platform derived from a newly discovered organelle called a migrasome. First, the authors address a technical hurdle for using migrasomes as a vaccine platform. Natural migrasome formation occurs at low levels and is labor intensive, however, by understanding the molecular underpinning of migrasome formation, the authors have designed a method to make engineered migrasomes from cultures cells at higher yields utilizing a robust process. These engineered migrasomes behave like natural migrasomes. Next, the authors immunized mice with migrasomes that either expressed a model peptide or the SARS-CoV-2 spike protein. Antibodies against the spike protein were raised that could be boosted by a 2nd vaccination and these antibodies were functional as assessed by an in vitro pseudoviral assay. This new vaccine platform has the potential to overcome obstacles such as cold chain issues for vaccines like messenger RNA that require very stringent storage conditions.

      Strengths:

      The authors present very robust studies detailing the biology behind migrasome formation and this fundamental understanding was used to from engineered migrasomes, which makes it possible to utilize migrasomes as a vaccine platform. The characterization of engineered migrasomes is thorough and establishes comparability with naturally occurring migrasomes. The biophysical characterization of the migrasomes is well done, including thermal stability and characterization of the particle size (important characterizations for a good vaccine).

      Weaknesses:

      With a new vaccine platform technology, it would be nice to compare them head-to-head against a proven technology. The authors would improve the manuscript if they made some comparisons to other vaccine platforms such as a SARS-CoV-2 mRNA vaccine or even an adjuvanted recombinant spike protein. This would demonstrate a migrasome based vaccine could elicit responses comparable to a proven vaccine technology. Additionally, understanding the integrity of the antigens expressed in their migrasomes could be useful. This could be done by looking at functional monoclonal antibody binding to their migrasomes in a confocal microscopy experiment.

      Updates after revision:

      The revised manuscript has additional experiments that I believe improve the strength of evidence presented in the manuscript and address the weaknesses of the first draft. First, they provide a comparison to the antibody responses induced by their migrasome based platform to recombinant protein formulated in an adjuvant and show the response is comparable. Second, they provide evidence that the spike protein incorporated into their migrasomes retains structural integrity by preserving binding to monoclonal antibodies. Together, these results strengthen the paper significantly and support the claims that the novel migrasome based vaccine platform could be a useful in the vaccine development field.

    1. Reviewer #1 (Public review):

      Summary

      This work performed Raman spectral microscopy for E. coli cells with 15 different culture conditions. The author developed a theoretical framework to construct a regression matrix which predicts proteome composition by Raman data. Specifically, this regression matrix is obtained by statistical inference from various experimental conditions. With this model, the authors categorized co-expressed genes and illustrate how proteome stoichiometry is regulated among different culture conditions. Co-expressed gene clusters were investigated and identified as homeostasis core, carbon-source dependent, and stationary phase dependent genes. Overall, the author demonstrates a strong and comprehensive data analysis scheme for the joint analysis of Raman and proteome datasets.

      Strengths and major contributions

      Major contributions: (1) Experimentally, the authors contributed Raman datasets of E. coli with various growth conditions. (2) In data analysis, the authors developed a scheme to compare proteome and Raman datasets. Protein co-expression clusters were identified, and their biological meaning were investigated.

      Discussion and impact for the field

      Raman signature contains both proteomic and metabolomic information and is an orthogonal method to infer the composition biomolecules. This work is a strong initiative for introducing the powerful technique to systems biology and provide a rigorous pipeline for future data analysis. The regression matrix can be used for cross-comparison among future experimental results on proteome-Raman datasets.

      Comments on revisions:

      The authors addressed all my questions nicely. In particular, the subsampling test demonstrated that with enough "distinct" physiological condition (even for m=5) one could already explore the major mode of proteome regulation and Raman signature. The main text has been streamlined and the clarity is improved. I have a minor suggestion:

      (i) For equation (1), it is important to emphasize that the formula works for every j=1,...,15, and the regression matrix B is obtained by statistical inference by summarizing data from all 15 conditions.

    1. Reviewer #1 (Public review):

      Summary:

      The authors recorded neural activity using laminar probes while mice engaged in a global/local visual oddball paradigm. The focus of the article is on oscillatory activity, and found activity differences in theta, alpha/beta, and gamma bands related to predictability and prediction error.

      I think this is an important paper, providing more direct evidence for the role of signals in different frequency bands related to predictability and surprise in the sensory cortex.

      Comments:

      Below are some comments that may hopefully help further improve the quality of this already very interesting manuscript.

      (1) Introduction:

      The authors write in their introduction: "H1 further suggests a role for θ oscillations in prediction error processing as well." Without being fleshed out further, it is unclear what role this would be, or why. Could the authors expand this statement?

      (2) Limited propagation of gamma band signals:

      Some recent work (e.g. https://www.cell.com/cell-reports/fulltext/S2211-1247(23)00503-X) suggests that gamma-band signals reflect mainly entrainment of the fast-spiking interneurons, and don't propagate from V1 to downstream areas. Could the authors connect their findings to these emerging findings, suggesting no role in gamma-band activity in communication outside of the cortical column?

      (3) Paradigm:

      While I agree that the paradigm tests whether a specific type of temporal prediction can be formed, it is not a type of prediction that one would easily observe in mice, or even humans. The regularity that must be learned, in order to be able to see a reflection of predictability, integrates over 4 stimuli, each shown for 500 ms with a 500 ms blank in between (and a 1000 ms interval separating the 4th stimulus from the 1st stimulus of the next sequence). In other words, the mouse must keep in working memory three stimuli, which partly occurred more than a second ago, in order to correctly predict the fourth stimulus (and signal a 1000 ms interval as evidence for starting a new sequence).

      A problem with this paradigm is that positive findings are easier to interpret than negative findings. If mice do not show a modulation to the global oddball, is it because "predictive coding" is the wrong hypothesis, or simply because the authors generated a design that operates outside of the boundary conditions of the theory? I think the latter is more plausible. Even in more complex animals, (eg monkeys or humans), I suspect that participants would have trouble picking up this regularity and sequence, unless it is directly task-relevant (which it is not, in the current setting). Previous experiments often used simple pairs (where transitional probability was varied, eg, Meyer and Olson, PNAS 2012) of stimuli that were presented within an intervening blank period. Clearly, these regularities would be a lot simpler to learn than the highly complex and temporally spread-out regularity used here, facilitating the interpretation of negative findings (especially in early cortical areas, which are known to have relatively small temporal receptive fields).

      I am, of course, not asking the authors to redesign their study. I would like to ask them to discuss this caveat more clearly, in the Introduction and Discussion, and situate their design in the broader literature. For example, Jeff Gavornik has used much more rapid stimulus designs and observed clear modulations of spiking activity in early visual regions. I realize that this caveat may be more relevant for the spiking paper (which does not show any spiking activity modulation in V1 by global predictability) than for the current paper, but I still think it is an important general caveat to point out.

      (4) Reporting of results:

      I did not see any quantification of the strength of evidence of any of the results, beyond a general statement that all reported results pass significance at an alpha=0.01 threshold. It would be informative to know, for all reported results, what exactly the p-value of the significant cluster is; as well as for which performed tests there was no significant difference.

      (5) Cluster test:

      The authors use a three-dimensional cluster test, clustering across time, frequency, and location/channel. I am wondering how meaningful this analytical approach is. For example, there could be clusters that show an early difference at some location in low frequencies, and then a later difference in a different frequency band at another (adjacent) location. It seems a priori illogical to me to want to cluster across all these dimensions together, given that this kind of clustering does not appear neurophysiologically implausible/not meaningful. Can the authors motivate their choice of three-dimensional clustering, or better, facilitating interpretability, cluster eg at space and time within specific frequency bands (2d clustering)?

    2. Reviewer #2 (Public review):

      Summary:

      Sennesh and colleagues analyzed LFP data from 6 regions of rodents while they were habituated to a stimulus sequence containing a local oddball (xxxy) and later exposed to either the same (xxxY) or a deviant global oddball (xxxX). Subsequently, they were exposed to a controlled random sequence (XXXY) or a controlled deterministic sequence (xxxx or yyyy). From these, the authors looked for differences in spectral properties (both oscillatory and aperiodic) between three contrasts (only for the last stimulus of the sequence).

      (1) Deviance detection: unpredictable random (XXXY) versus predictable habituation (xxxy)

      (2) Global oddball: unpredictable global oddball (xxxX) versus predictable deterministic (xxxx), and

      (3) "Stimulus-specific adaptation:" locally unpredictable oddball (xxxY) versus predictable deterministic (yyyy).

      They found evidence for an increase in gamma (and theta in some cases) for unpredictable versus predictable stimuli, and a reduction in alpha/beta, which they consider evidence towards the "predictive routing" scheme.

      While the dataset and analyses are well-suited to test evidence for predictive coding versus alternative hypotheses, I felt that the formulation was ambiguous, and the results were not very clear. My major concerns are as follows:

      (1) The authors set up three competing hypotheses, in which H1 and H2 make directly opposite predictions. However, it must be noted that H2 is proposed for spatial prediction, where the predictability is computed from the part of the image outside the RF. This is different from the temporal prediction that is tested here. Evidence in favor of H2 is readily observed when large gratings are presented, for which there is substantially more gamma than in small images. Actually, there are multiple features in the spectral domain that should not be conflated, namely (i) the transient broadband response, which includes all frequencies, (ii) contribution from the evoked response (ERP), which is often in frequencies below 30 Hz, (iii) narrow-band gamma oscillations which are produced by large and continuous stimuli (which happen to be highly predictive), and (iv) sustained low-frequency rhythms in theta and alpha/beta bands which are prominent before stimulus onset and reduce after ~200 ms of stimulus onset. The authors should be careful to incorporate these in their formulation of PC, and in particular should not conflate narrow-band and broadband gamma.

      (2) My understanding is that any aspect of predictive coding must be present before the onset of stimulus (expected or unexpected). So, I was surprised to see that the authors have shown the results only after stimulus onset. For all figures, the authors should show results from -500 ms to 500 ms instead of zero to 500 ms.

      (3) In many cases, some change is observed in the initial ~100 ms of stimulus onset, especially for the alpha/beta and theta ranges. However, the evoked response contributes substantially in the transient period in these frequencies, and this evoked response could be different for different conditions. The authors should show the evoked responses to confirm the same, and if the claim really is that predictions are carried by genuine "oscillatory" activity, show the results after removing the ERP (as they had done for the CSD analysis).

      (4) I was surprised by the statistics used in the plots. Anything that is even slightly positive or negative is turning out to be significant. Perhaps the authors could use a more stringent criterion for multiple comparisons?

      (5) Since the design is blocked, there might be changes in global arousal levels. This is particularly important because the more predictive stimuli in the controlled deterministic stimuli were presented towards the end of the session, when the animal is likely less motivated. One idea to check for this is to do the analysis on the 3rd stimulus instead of the 4th? Any general effect of arousal/attention will be reflected in this stimulus.

      (6) The authors should also acknowledge/discuss that typical stimulus presentation/attention modulation involves both (i) an increase in broadband power early on and (ii) a reduction in low-frequency alpha/beta power. This could be just a sensory response, without having a role in sending prediction signals per se. So the predictive routing hypothesis should involve testing for signatures of prediction while ruling out other confounds related to stimulus/cognition. It is, of course, very difficult to do so, but at the same time, simply showing a reduction in low-frequency power coupled with an increase in high-frequency power is not sufficient to prove PR.

      (7) The CSD results need to be explained better - you should explain on what basis they are being called feedforward/feedback. Was LFP taken from Layer 4 LFP (as was done by van Kerkoerle et al, 2014)? The nice ">" and "<" CSD patterns (Figure 3B and 3F of their paper) in that paper are barely observed in this case, especially for the alpha/beta range.

      (8) Figure 4a-c, I don't see a reduction in the broadband signal in a compared to b in the initial segment. Maybe change the clim to make this clearer?

      (9) Figure 5 - please show the same for all three frequency ranges, show all bars (including the non-significant ones), and indicate the significance (p-values or by *, **, ***, etc) as done usually for bar plots.

      (10) Their claim of alpha/beta oscillations being suppressed for unpredictable conditions is not as evident. A figure akin to Figure 5 would be helpful to see if this assertion holds.

      (11) To investigate the prediction and violation or confirmation of expectation, it would help to look at both the baseline and stimulus periods in the analyses.

    3. Reviewer #3 (Public review):

      Summary:

      In their manuscript entitled "Ubiquitous predictive processing in the spectral domain of sensory cortex", Sennesh and colleagues perform spectral analysis across multiple layers and areas in the visual system of mice. Their results are timely and interesting as they provide a complement to a study from the same lab focussed on firing rates, instead of oscillations. Together, the present study argues for a hypothesis called predictive routing, which argues that non-predictable stimuli are gated by Gamma oscillations, while alpha/beta oscillations are related to predictions.

      Strengths:

      (1) The study contains a clear introduction, which provides a clear contrast between a number of relevant theories in the field, including their hypotheses in relation to the present data set.

      (2) The study provides a systematic analysis across multiple areas and layers of the visual cortex.

      Weaknesses:

      (1) It is claimed in the abstract that the present study supports predictive routing over predictive coding; however, this claim is nowhere in the manuscript directly substantiated. Not even the differences are clearly laid out, much less tested explicitly. While this might be obvious to the authors, it remains completely opaque to the reader, e.g., as it is also not part of the different hypotheses addressed. I guess this result is meant in contrast to reference 17, by some of the same authors, which argues against predictive coding, while the present work finds differences in the results, which they relate to spectral vs firing rate analysis (although without direct comparison).

      (2) Most of the claims about a direction of propagation of certain frequency-related activities (made in the context of Figures 2-4) are - to the eyes of the reviewer - not supported by actual analysis but glimpsed from the pictures, sometimes, with very little evidence/very small time differences to go on. To keep these claims, proper statistical testing should be performed.

      (3) Results from different areas are barely presented. While I can see that presenting them in the same format as Figures 2-4 would be quite lengthy, it might be a good idea to contrast the right columns (difference plots) across areas, rather than just the overall averages.

      (4) Statistical testing is treated very generally, which can help to improve the readability of the text; however, in the present case, this is a bit extreme, with even obvious tests not reported or not even performed (in particular in Figure 5).

      (5) The description of the analysis in the methods is rather short and, to my eye, was missing one of the key descriptions, i.e., how the CSD plots were baselined (which was hinted at in the results, but, as far as I know, not clearly described in the analysis methods). Maybe the authors could section the methods more to point out where this is discussed.

      (6) While I appreciate the efforts of the authors to formulate their hypotheses and test them clearly, the text is quite dense at times. Partly this is due to the compared conditions in this paradigm; however, it would help a lot to show a visualization of what is being compared in Figures 2-4, rather than just showing the results.

    1. Reviewer #1 (Public review):

      Summary:

      This study develops and validates a neural subspace similarity analysis for testing whether neural representations of graph structures generalize across graph size and stimulus sets. The authors show the method works in rat grid and place cell data, finding that grid but not place cells generalize across different environments, as expected. The authors then perform additional analyses and simulations to show that this method should also work on fMRI data. Finally, the authors test their method on fMRI responses from entorhinal cortex (EC) in a task that involves graphs that vary in size (and stimulus set) and statistical structure (hexagonal and community). They find neural representations of stimulus sets in lateral occipital complex (LOC) generalize across statistical structure and that EC activity generalizes across stimulus sets/graph size, but only for the hexagonal structures.

      Strengths:

      (1) The overall topic is very interesting and timely and the manuscript is well written.

      (2) The method is clever and powerful. It could be important for future research testing whether neural representations are aligned across problems with different state manifestations.

      (3) The findings provide new insights into generalizable neural representations of abstract task states in entorhinal cortex.

      Weaknesses:

      (1) There are two design confounds that are not sufficiently discussed.

      (1.1) First, hexagonal and community structures are confounded by training order. All subjects learned the hexagonal graph always before the community graph. As such, any differences between the two graphs could be explained (in theory) by order effects (although this is unlikely). However, because community and hexagonal structures shared the same stimuli, it is possible that subjects had to find ways to represent the community structures separately from the hexagonal structures. This could potentially explain why there was no generalization across graph size for community structures.

      (1.2) Second, subjects had more experience with the hexagonal and community structures before and during fMRI scanning. This is another possible reason why there was no generalization for the community structure.

      (2) The authors include the results from a searchlight analysis to show specificity of the effects for EC. A more convincing way (in my opinion) to show specificity would be to test for (and report the results) of a double dissociation between the visual and structural contrast in two independently defined regions (e.g., anatomical ROIs of LOC and EC). This would substantiate the point that EC activity generalizes across structural similarity while sensory regions like LOC generalize across visual similarity.

    2. Reviewer #2 (Public review):

      Summary:

      Mark and colleagues test the hypothesis that entorhinal cortical representations may contain abstract structural information that facilitates generalization across structurally similar contexts. To do so, they use a method called "subspace generalization" designed to measure abstraction of representations across different settings. The authors validate the method using hippocampal place cells and entorhinal grid cells recorded in a spatial task, then show perform simulations that support that it might be useful in aggregated responses such as those measured with fMRI. Then the method is applied to an fMRI data that required participants to learn relationships between images in one of two structural motifs (hexagonal grids versus community structure). They show that the BOLD signal within an entorhinal ROI shows increased measures of subspace generalization across different tasks with the same hexagonal structure (as compared to tasks with different structures) but that there was not evidence for the complementary result (ie. increased generalization across tasks that share community structure, as compared to those with different structures). Taken together, this manuscript describes and validates a method for identifying fMRI representations that generalize across conditions and applies it to reveal that entorhinal representations that emerge across specific shared structural conditions.

      Strengths:

      I found this paper interesting both in terms of its methods and its motivating questions. The question asked is novel and the methods employed are new - and I believe this is the first time that they have been applied to fMRI data. I also found the iterative validation of the methodology to be interesting and important - showing persuasively that the method could detect a target representation - even in the face of random combination of tuning and with the addition of noise, both being major hurdles to investigating representations using fMRI.

      Weaknesses:

      The primary weakness of the paper in terms of empirical results is that the representations identified in EC had no clear relationship to behavior, raising questions about their functional importance.

      The method developed is a clearly valuable tool that can serve as part of a larger battery of analysis techniques, but a small weakness on the methodological side is that for a given dataset, it might be hard to determine whether the method developed here would be better or worse than alternative methods.

    3. Reviewer #3 (Public review):

      Summary:

      The article explores the brain's ability to generalize information, with a specific focus on the entorhinal cortex (EC) and its role in learning and representing structural regularities that define relationships between entities in networks. The research provides empirical support for the longstanding theoretical and computational neuroscience hypothesis that the EC is crucial for structure generalization. It demonstrates that EC codes can generalize across non-spatial tasks that share common structural regularities, regardless of the similarity of sensory stimuli and network size.

      Strengths:

      At first glance, a potential limitation of this study appears to be its application of analytical methods originally developed for high-resolution animal electrophysiology (Samborska et al., 2022) to the relatively coarse and noisy signals of human fMRI. Rather than sidestepping this issue, however, the authors embrace it as a methodological challenge. They provide compelling empirical evidence and biologically grounded simulations to show that key generalization properties of entorhinal cortex representations can still be robustly detected. This not only validates their approach but also demonstrates how far non-invasive human neuroimaging can be pushed. The use of multiple independent datasets and carefully controlled permutation tests further underscores the reliability of their findings, making a strong case that structural generalization across diverse task environments can be meaningfully studied even in abstract, non-spatial domains that are otherwise difficult to investigate in animal models.

      Weaknesses:

      While this study provides compelling evidence for structural generalization in the entorhinal cortex (EC), several limitations remain that pave the way for promising future research. One issue is that the generalization effect was statistically robust in only one task condition, with weaker effects observed in the "community" condition. This raises the question of whether the null result genuinely reflects a lack of EC involvement, or whether it might be attributable to other factors such as task complexity, training order, or insufficient exposure possibilities that the authors acknowledge as open questions. Moreover, although the study leverages fMRI to examine EC representations in humans, it does not clarify which specific components of EC coding-such as grid cells versus other spatially tuned but non-grid codes-underlie the observed generalization. While electrophysiological data in animals have begun to address this, the human experiments do not disentangle the contributions of these different coding types. This leaves unresolved the important question of what makes EC representations uniquely suited for generalization, particularly given that similar effects were not observed in other regions known to contain grid cells, such as the medial prefrontal cortex (mPFC) or posterior cingulate cortex (PCC). These limitations point to important future directions for better characterizing the computational role of the EC and its distinctiveness within the broader network supporting learning and decision making based on cognitive maps.

    1. Reviewer #1 (Public review):

      The authors present exciting new experimental data on the antigenic recognition of 78 H3N2 strains (from the beginning of the 2023 Northern Hemisphere season) against a set of 150 serum samples. The authors compare protection profiles of individual sera and find that the antigenic effect of amino acid substitutions at specific sites depends on the immune class of the sera, differentiating between children and adults. Person-to-person heterogeneity in the measured titers is strong, specifically in the group of children's sera. The authors find that the fraction of sera with low titers correlates with the inferred growth rate using maximum likelihood regression (MLR), a correlation that does not hold for pooled sera. The authors then measure the protection profile of the sera against historical vaccine strains and find that it can be explained by birth cohort for children. Finally, the authors present data comparing pre- and post- vaccination protection profiles for 39 (USA) and 8 (Australia) adults. The data shows a cohort-specific vaccination effect as measured by the average titer increase, and also a virus-specific vaccination effect for the historical vaccine strains. The generated data is shared by the authors and they also note that these methods can be applied to inform the bi-annual vaccine composition meetings, which could be highly valuable.

      Thanks to the authors for the revised version of the manuscript. A few concerns remain after the revision:

      (1) We appreciate the additional computational analysis the authors have performed on normalizing the titers with the geometric mean titer for each individual, as shown in the new Supplemental Figure 6. We agree with the authors statement that, after averaging again within specific age groups, "there are no obvious age group-specific patterns." A discussion of this should be added to the revised manuscript, for example in the section "Pooled sera fail to capture the heterogeneity of individual sera," referring to the new Supplemental Figure 6.

      However, we also suggested that after this normalization, patterns might emerge that are not necessarily defined by birth cohort. This possibility remains unexplored and could provide an interesting addition to support potential effects of substitutions at sites 145 and 275/276 in individuals with specific titer profiles, which as stated above do not necessarily follow birth cohort patterns.

      (2) Thank you for elaborating further on the method used to estimate growth rates in your reply to the reviewers. To clarify: the reason that we infer from Fig. 5a that A/Massachusetts has a higher fitness than A/Sydney is not because it reaches a higher maximum frequency, but because it seems to have a higher slope. The discrepancy between this plot and the MLR inferred fitness could be clarified by plotting the frequency trajectories on a log-scale.

      For the MLR, we understand that the initial frequency matters in assessing a variant's growth. However, when starting points of two clades differ in time (i.e., in different contexts of competing clades), this affects comparability, particularly between A/Massachusetts and A/Ontario, as well as for other strains. We still think that mentioning these time-dependent effects, which are not captured by the MLR analysis, would be appropriate. To support this, it could be helpful to include the MLR fits as an appendix figure, showing the different starting and/or time points used.

      (3) Regarding my previous suggestion to test an older vaccine strain than A/Texas/50/2012 to assess whether the observed peak in titer measurements is virus-specific: We understand that the authors want to focus the scope of this paper on the relative fitness of contemporary strains, and that this additional experimental effort would go beyond the main objectives outlined in this manuscript. However, the authors explicitly note that "Adults across age groups also have their highest titers to the oldest vaccine strain tested, consistent with the fact that these adults were first imprinted by exposure to an older strain." This statement gives the impression that imprinting effects increase titers for older strains, whereas this does not seem to be true from their results, but only true for A/Texas. It should be modified accordingly.

    2. Reviewer #2 (Public review):

      This is an excellent paper. The ability to measure the immune response to multiple viruses in parallel is a major advancement for the field, that will be relevant across pathogens (assuming the assay can be appropriately adapted). I only had a few comments, focused on maximising the information provided by the sera. These concerns were all addressed in the revised paper.

    3. Reviewer #3 (Public review):

      The authors use high throughput neutralisation data to explore how different summary statistics for population immune responses relate to strain success, as measured by growth rate during the 2023 season. The question of how serological measurements relate to epidemic growth is an important one, and I thought the authors present a thoughtful analysis tackling this question, with some clear figures. In particular, they found that stratifying the population based on the magnitude of their antibody titres correlates more with strain growth than using measurements derived from pooled serum data. The updated manuscript has a stronger motivation, and there is substantial potential to build on this work in future research.

      Comments on revisions:

      I have no additional recommendations. There are several areas where the work could be further developed, which were not addressed in detail in the responses, but given this is a strong manuscript as it stands, it is fine that these aspects are for consideration only at this point.

    1. Reviewer #1 (Public review):

      Summary:

      This study provides evidence that neuropeptide signaling, particularly via the CRH-CRHBP pathway, plays a key role in regulating the precision of vocal motor output in songbirds. By integrating gene expression profiling with targeted manipulations in the song vocal motor nucleus RA, the authors demonstrate that altering CRH and CRHBP levels bidirectionally modulate song variability. These findings reveal a previously unrecognized neuropeptidergic mechanism underlying motor performance control, supported by molecular and functional evidence.

      Strengths:

      Neural circuit mechanisms underlying motor variability have been intensively studied, yet the molecular bases of such variability remain poorly understood. The authors address this important gap using the songbird (Bengalese finch) as a model system for motor learning, providing experimental evidence that neuropeptide signaling contributes to vocal motor variability. They comprehensively characterize the expression patterns of neuropeptide-related genes in brain regions involved in song vocal learning and production, revealing distinct regulatory profiles compared to non-vocal related regions, as well as developmental, revealing distinct regulatory profiles compared to non-vocal regions, as well as developmental and behavioral dependencies, including altered expression following deafening and correlations with singing activity over the two days preceding sampling. Through these multi-level analyses spanning anatomy, development, and behavior, the authors identify the CRH-CRHBP pathway in the vocal motor nucleus RA as a candidate regulator of song variability. Functional manipulations further demonstrate that modulation of this pathway bidirectionally alters song variability.

      Overall, this work represents an effective use of songbirds, though a well-established neuroethological framework uncovers how previously uncharacterized molecular pathways shape behavioral output at the individual level.

      Weaknesses:

      (1) This study uses Bengalese finches (BFs) for all experiments-bulk RNA-seq, in situ hybridization across developmental stages, deafening, gene manipulation, and CRH microinfusion-except for the sc/snRNA-seq analysis. BFs differ from zebra finches (ZFs) in several important ways, including faster song degradation after deafening and greater syllable sequence complexity. This study makes effective use of these unique BF characteristics and should be commended for doing so.

      However, the major concern lies in the use of the single-cell/single-nucleus RNA-seq dataset from Colquitt et al. (2021), which combines data from both ZFs and BFs for cell-type classification. Based on our reanalysis of the publicly available dataset used in both Colquitt et al. (2021) and the present study, my lab identified two major issues:

      (a) The first concern is that the quality of the single-cell RNA-seq data from BFs is extremely poor, and the number of BF-derived cells is very limited. In other words, most of the gene expression information at the single-cell (or "subcellular type") level in this study likely reflects ZF rather than BF profiles. In our verification of the authors' publicly annotated data, we found that in the song nucleus RA, only about 18 glutamatergic cells (2.3%) of a total of 787 RA_Glut (RA_Glut1+2+3) cells were derived from BFs. Similarly, in HVC, only 53 cells (4.1%) out of 1,278 Glut1+Glut4 cells were BF-derived. This clearly indicates that the cell-subtype-level expression data discussed in this study are predominantly based on ZF, not BF, expression profiles.

      Recent studies have begun to report interspecies differences in the expression of many genes in the song control nuclei. It is therefore highly plausible that the expression patterns of CRHBP and other neuropeptide-signaling-related genes differ between ZFs and BFs. Yet, the current study does not appear to take this potential species difference into account. As a result, analyses such as the CellChat results (Fig. 2F and G) and the model proposed in Fig. 6G are based on ZF-derived transcriptomic information, even though the rest of the experimental data are derived from BF, which raises a critical methodological inconsistency.

      (b) The second major concern involves the definition of "subcellular types" in the sc/snRNA-seq dataset. Specifically, the RA_Glut1, 2, and 3 and HVC_Glu1 and 4 clusters-classified as glutamatergic projection neuron subtypes-may in fact represent inter-individual variation within the same cell type rather than true subtypes. Following Colquitt et al. (2021), Toji et al. (PNAS, 2024) demonstrated clear individual differences in the gene expression profiles of glutamatergic projection neurons in RA.

      In our reanalysis of the same dataset, we also observed multiple clusters representing the same glutamatergic projection neurons in UMAP space. This likely occurs because Seurat integration (anchor-based mutual nearest neighbor integration) was not applied, and because cells were not classified based on individual SNP information using tools such as Souporcell. When classified by individual SNPs, we confirmed that the RA_Glut1-3 and HVC_Glu1 and 4 clusters correspond simply to cells from different individuals rather than distinct subcellular types. (Although images cannot be attached in this review system, we can provide our analysis results if necessary.)

      This distinction is crucial, as subsequent analyses and interpretations throughout the manuscript depend on this classification. In particular, Figure 6G presents a model based on this questionable subcellular classification. Similarly, the ligand-receptor relationships shown in Figure 2G - such as the absence of SST-SSTR1 signaling in RA_Glut3 but its presence in RA_Glut1 and 2-are more plausibly explained by inter-individual variation rather than subcellular-type specificity.

      Whether these differences are interpreted as individual variation within a single cell type or as differences in projection targets among glutamatergic neurons has major implications for understanding the biological meaning of neuropeptide-related gene expression in this system.

      (2) Based on the important finding that "CRHBP expression in the song motor pathway is correlated with singing," it is necessary to provide data showing that the observed changes in CRHBP and other neuropeptide-related gene expression during the song learning period or after deafening are not merely due to differences in singing amount over the two days preceding brain sampling.

      Without such data, the following statement cannot be justified: "Regarding CRHBP expression in the song motor pathway increases during song acquisition and decreases following deafening."

      (3) In Figure 5B, the authors should clearly distinguish between intact and deafened birds and show the singing amount for each group. In practice, deafening often leads to a reduction in both the number of song bouts and the total singing time. If, in this experiment, deafened birds also exhibited reduced singing compared to intact birds, then the decreased CRHBP expression observed in HVC and RA (Figures 3 and 4) may not reflect song deterioration, but rather a simple reduction in singing activity.

      As a similar viewpoint, the authors report that CRHBP expression levels in RA and HVC increase with age during the song learning period. However, this change may not be directly related to age or the decline in vocal plasticity. Instead, it could correlate with the singing amount during the one to two days preceding brain sampling. The authors should provide data on the singing activity of the birds used for in situ hybridization during the two days prior to sampling.

    2. Reviewer #2 (Public review):

      Summary:

      The results presented here are a useful extension of two of their previous papers (Colquitt et al 2021, Colquitt et al 2023), where they used single-cell transcriptomics to characterize the inhibitory and excitatory cell types and gene expression patterns of the song circuit, comparing them to mammalian and reptilian brains, and characterized the effect of deafening on these gene expression patterns. In this paper, they focus on the differential expression of various neuropeptidergic systems in the songbird brain. They discover a role for the CRHBP gene in song performance and causally show its influence on song variability.

      Strengths:

      The authors leverage the advantages of the 'nucleated' structure of the songbird neural circuitry and use a robust approach to compare neuropeptidergic gene expression patterns in these circuits. Their analysis of the expression patterns of the CRHBP gene in different cell types supports their conclusion that interneurons are particularly amenable to this modulation. Their use of a knockdown strategy along with pharmacological manipulation provides strong support for a causal role of neuropeptidergic modulation on song behaviour. These results have important implications as they bring into focus neuropeptide modulation of the song-motor circuit and pave the way for future studies focussing on how this signalling pathway regulates plasticity during song learning and maintenance.

      Weaknesses:

      While the results demonstrating the bidirectional modulation of CRH and CRHBP on song performance shed light on their role in song plasticity, it would be important to show this in juvenile finches during sensorimotor learning. We also don't get a clear picture of the 'causal' role of this signalling pathway on the song pre-motor area, HVC, as the knockdown and pharmacological manipulation studies were done in RA, whereas we see a modulation of CRHBP expression during deafening and song learning in both RA and HVC. Given the role of interneurons in the HVC in song acquisition (e.g., Vallentin et al. 2016, Science), it would have been interesting to see the results of HVC-specific manipulation of this neuropeptidergic pathway and/or how it affects the song learning process. Perhaps a short discussion of this would help to give the readers some perspective. Finally, a more direct demonstration of the neurophysiological effect of the signalling pathway would also strengthen our understanding of precisely how these modulate the song circuit plasticity, which I understand might be beyond the scope of this study.

      Technical/minor:

      In the Methods section, several clarifications would be beneficial. For instance, the description of the design matrices would benefit from being presented in a more general statistical form (e.g., linear model equations) rather than using R syntax. This would make the modeling approach more accessible to readers unfamiliar with software-specific syntax. In addition, while some variables (e.g., cdr_scale, frac_mito_scale) are briefly defined, others (e.g., tags, cut3,nsongs_last_two_days_cut3) could be more clearly described. This applies to the descriptions of both the gene set enrichment analysis and the neuropeptide-receptor analysis, which rely heavily on package-specific terminology (e.g., fgseaMultilevel, computeCommunProb), making it difficult for readers to understand the conceptual or statistical basis of the analyses. It would improve clarity if the authors provided a complete list of variable definitions, types (categorical or continuous), and any scaling/transformations applied would enhance clarity and reproducibility.

    3. Reviewer #3 (Public review):

      Summary:

      The stable production of learned vocalizations like human language and birdsong requires auditory feedback. What happens in the brain areas that generate stable vocalizations as performance deteriorates is not well understood. Using a species of songbird, the current study investigates individual cells within the evolutionarily-conserved brain regions that generate learned vocalizations to describe that the complement of neuropeptide (short proteins) signals may be a key feature of behavioral change. Because neuropeptides are important across species, these findings may help explain diminishing stability in learned behaviors even in humans.

      Strengths:

      The experiments are solid and follow a strong progression from description through manipulation. The songbird model is appropriate and powerful to inform on generalizable biological mechanisms of precisely learned behaviors, including human speech.

      Weaknesses:

      While it is always possible to perform more experiments, most of the weaknesses are in the presentation of the project, not in the evidence or analysis, which are leading-edge and appropriate. Generally, the ability to follow the findings and to independently assess rigor would be enhanced with increased explicit mention of the statistical thresholds and subjective descriptions. In addition, two prior pieces of relevant work seem to be omitted, including one performing deafening, gene expression measures, and behavioral assessment in zebra finches, and another describing neuropeptide complements in zebra finch singing nuclei based largely on mass spectrometry. The former in particular should be related to the current findings.

    1. Reviewer #1 (Public review):

      Summary:

      The authors aim to investigate the mechanisms underlying Kupffer cell death in metabolic-associated steatotic liver disease (MASLD). The authors propose that KCs undergo massive cell death in MASLD and that glycolysis drives this process. However, there appears to be a discrepancy between the reported high rates of KC death and the apparent maintenance of KC homeostasis and replacement capacity.

      Strengths:

      This is an in vivo study.

      Weaknesses:

      There are discrepancies between the authors' observations and previous reports, as well as inconsistencies among their own findings.

      Before presenting the percentage of CLEC4F⁺TUNEL⁺ cells, the authors should have first shown the number of CLEC4F⁺ cells per unit area in Figure 1. At 16 weeks of age, the proportion of TUNEL⁺ KCs is extremely high (~60%), yet the flow cytometry data indicate that nearly all F4/80⁺ KCs are TIMD4⁺, suggesting an embryonic origin. If such extensive KC death occurred, the proportion of embryonically derived TIMD4⁺ KCs would be expected to decrease substantially. Surprisingly, the proportion of TIMD4⁺ KCs is comparable between chow-fed and 16-week HFHC-fed animals. Thus, the immunostaining and flow cytometry data are inconsistent, making it difficult to explain how massive KC death does not lead to their replacement by monocyte-derived cells.

      These data suggest that despite the reported high rate of cell death among CLEC4F⁺TIMD4⁺ KCs, the population appears to self-maintain, with no evidence of monocyte-derived KC generation in this model, which contradicts several recent studies in the field.

      Moreover, there is no evidence that TIMD4⁺CLEC4F⁺ KCs increase their proliferation rate to compensate for such extensive cell death. If approximately 60% of KCs are dying and no monocyte-derived KCs are recruited, one would expect a much greater decrease in total KC numbers than what is reported.

      It is also unexpected that the maximal rate of KC death occurs at early time points (8 weeks), when the mice have not yet gained substantial weight (Figure 1B). Previous studies have shown that longer feeding periods are typically required to observe the loss of embryo-derived KCs.

      Furthermore, it is surprising that the HFD induces as much KC death as the HFHC and MCD diets. Earlier studies suggested that HFD alone is far less effective than MASH-inducing diets at promoting the replacement of embryonic KCs by monocyte-derived macrophages.

      In Figure 2D, TIMD4 staining appears extremely faint, making the results difficult to interpret. In contrast, the TUNEL signal is strikingly intense and encompasses a large proportion of liver cells (approximately 60% of KCs, 15% of hepatocytes, 20% of hepatic stellate cells, 30% of non-KC macrophages, and a proportion of endothelial cells is also likely affected). This pattern closely resembles that typically observed in mouse models of acute liver failure. Given this apparent extent of cell death, it is unexpected that ALT and AST levels remain low in MASH mice, which is highly unusual.

      No statistical analysis is provided for Figure 5D, and it is unclear which metabolites show statistically significant changes in Figure 5C.

      In addition, there is no evaluation of liver pathology in Clec4f-Cre × Chil1flox/flox mice. It remains possible that the observed effects on KC death result from aggravated liver injury in these animals. There is also no evidence that Chil1 deficiency affects glucose metabolism in KCs in vivo.

      Finally, the authors should include a more direct experimental approach to modulate glycolysis in KCs and assess its causal role in KC death in MASH.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, He et al. set out to investigate the mechanisms behind Kupffer Cell death in MASLD. As has been previously shown, they demonstrate a loss of resident KCs in MASLD in different mouse models. They then go on to show that this correlates with alterations in genes/metabolites associated with glucose metabolism in KCs. To investigate the role of glucose metabolism further, they subject isolated KCs in vitro to different metabolic treatments and assess cleaved caspase 3 staining, demonstrating that KCs show increased Cl. Casp 3 staining upon stimulation of glycolysis. Finally, they use a genetic mouse model (Chil1KO) where they have previously reported that loss of this gene leads to increased glycolysis and validate this finding in BMDMs (KO). They then remove this gene specifically from KCs (Clec4fCre) and show that this leads to increased macrophage death compared with controls.

      Strengths:

      As we do not yet understand why KCs die in MASLD, this manuscript provides some explanation for this finding. The metabolomics is novel and provides insight into KC biology. It could also lead to further investigation; here, it will be important that the full dataset is made available.

      Weaknesses:

      Different diets are known to induce different amounts of KC loss, yet here, all models examined appear to result in 60% KC death. One small field of view of liver tissue is shown as representative to make these claims, but this is not sufficient, as anything can be claimed based on one field of view. Rather, a full tissue slice should be included to allow readers to really assess the level of death. Additionally, there is no consistency between the markers used to define KCs and moMFs, with CLEC4F being used in microscopy, TIM4 in flow, while the authors themselves acknowledge that moKCs are CLEC4F+TIM4-. As moKCs are induced in MASLD, this limits interpretation. Additionally, Iba1 is referred to as a moMF marker but is also expressed by KCs, which again prevents an accurate interpretation of the data. Indeed, the authors show 60% of KCs are dying but only 30% of IBA1+ moMFs, as KCs are also IBA1+, this would mean that KCs die much more than moMFs, which would then limit the relevance of the BMDM studies performed if the phenotype is KC specific. Therefore, this needs to be clarified. The claim that periportal KCs die preferentially is not supported, given that the majority of KCs are peri-portal. Rather, these results would need to be normalised to KC numbers in PP vs PC regions to make meaningful conclusions. Additionally, KCs are known to be notoriously difficult to keep alive in vitro, and for these studies, the authors only examine cl. Casp 3 staining. To fully understand that data, a full analysis of the viability of the cells and whether they retain the KC phenotype in all conditions is required. Finally, in the Cre-driven KO model, there does not seem to be any death of KCs in the controls (rather numbers trend towards an increase with time on diet, Figure 6E), contrary to what had been claimed in the rest of the paper, again making it difficult to interpret the overall results. Additionally, there is no validation that the increased death observed in vivo in KCs is due to further promotion of glycolysis.

    3. Reviewer #3 (Public review):

      This manuscript provides novel insights into altered glucose metabolism and KC status during early MASLD. The authors propose that hyperactivated glycolysis drives a spatially patterned KC depletion that is more pronounced than the loss of hepatocytes or hepatic stellate cells. This concept significantly enhances our understanding of early MASLD progression and KC metabolic phenotype.

      Through a combination of TUNEL staining and MS-based metabolomic analyses of KCs from HFHC-fed mice, the authors show increased KC apoptosis alongside dysregulation of glycolysis and the pentose phosphate pathway. Using in vitro culture systems and KC-specific ablation of Chil1, a regulator of glycolytic flux, they further show that elevated glycolysis can promote KC apoptosis.

      However, it remains unclear whether the observed metabolic dysregulation directly causes KC death or whether secondary factors, such as low-grade inflammation or macrophage activation, also contribute significantly. Nonetheless, the results, particularly those derived from the Chil1-ablated model, point to a new potential target for the early prevention of KC death during MASLD progression.

      The manuscript is clearly written and thoughtfully addresses key limitations in the field, especially the focus on glycolytic intermediates rather than fatty acid oxidation. The authors acknowledge the missing mechanistic link between increased glycolysis and KC death. Still, several interpretations require moderation to avoid overstatement, and certain experimental details, particularly those concerning flow cytometry and population gating, need further clarification.

      Strengths:

      (1) The study presents the novel observation of profound metabolic dysregulation in KCs during early MASLD and identifies these cells as undergoing apoptosis. The finding that Chil1 ablation aggravates this phenotype opens new avenues for exploring therapeutic strategies to mitigate or reverse MASLD progression.

      (2) The authors provide a comprehensive metabolic profile of KCs following HFHC diet exposure, including quantification of individual metabolites. They further delineate alterations in glycolysis and the pentose phosphate pathway in Chil1-deficient cells, substantiating enhanced glycolytic flux through 13C-glucose tracing experiments.

      (3) The data underscore the critical importance of maintaining balanced glucose metabolism in both in vitro and in vivo contexts to prevent KC apoptosis, emphasizing the high metabolic specialization of these cells.

      (4) The observed increase in KC death in Chil1-deficient KCs demonstrates their dependence on tightly regulated glycolysis, particularly under pathological conditions such as early MASLD.

      Weaknesses:

      (1) The novelty is questionable. The presented work has considerable overlap with a study by the same lab, which is currently under review (citation 17), and it should be considered whether the data should not be presented in one paper.

      (2) The authors report that 60% of KCs are TUNEL-positive after 16 weeks of HFHC diet and confirm this by cleaved caspase-3 staining. Given that such marker positivity typically indicates imminent cell death within hours, it is unexpected that more extensive KC depletion or monocyte infiltration is not observed. Since Timd4 expression on monocyte-derived macrophages takes roughly one month to establish, the authors should consider whether these TUNEL-positive KCs persist in a pre-apoptotic state longer than anticipated. Alternatively, fate-mapping experiments could clarify the dynamics of KC death and replacement.

      (3) The mechanistic link between elevated glycolytic flux and KC death remains unclear.

      (4) The study does not address the polarization or ontogeny of KCs during early MASLD. Given that pro-inflammatory macrophages preferentially utilize glycolysis, such data could provide valuable insight into the reason for increased KC death beyond the presented hyperreliance on glycolysis.

      (5) The gating strategy for monocyte-derived macrophages (moMFs) appears suboptimal and may include monocytes. A more rigorous characterization of myeloid populations by including additional markers would strengthen the study's conclusions.

      (6) While BMDMs from Chil1 knockout mice are used to demonstrate enhanced glycolytic flux, it remains unclear whether Chil1 deficiency affects macrophage differentiation itself.

      (7) The authors use the PDK activator PS48 and the ATP synthase inhibitor oligomycin to argue that increased glycolytic flux at the expense of OXPHOS promotes KC death. However, given the high energy demands of KCs and the fact that OXPHOS yields 15-16 times more ATP per glucose molecule than glycolysis, the increased apoptosis observed in Figure 4C-F could primarily reflect energy deprivation rather than a glycolysis-specific mechanism.

      (8) In Figure 1C, KC numbers are significantly reduced after 4 and 16 weeks of HFHC diet in WT male mice, yet no comparable reduction is seen in Clec4Cre control mice, which should theoretically exhibit similar behavior under identical conditions.

    1. Reviewer #1 (Public review):

      Summary:

      This study addresses the emerging role of fungal pathogens in colorectal cancer and provides mechanistic insights into how Candida albicans may influence tumor-promoting pathways. While the work is potentially impactful and the experiments are carefully executed, the strength of evidence is limited by reliance on in vitro models, small patient sample size, and the absence of in vivo validation, which reduces the translational significance of the findings.

      Strengths:

      (1) Comprehensive mechanistic dissection of intracellular signaling pathways.

      (2) Broad use of pharmacological inhibitors and cell line models.

      (3) Inclusion of patient-derived organoids, which increases relevance to human disease.

      (4) Focus on an emerging and underexplored aspect of the tumor microenvironment, namely fungal pathogens.

      Weaknesses:

      (1) Clinical association data are inconsistent and based on very small sample numbers.

      (2) No in vivo validation, which limits the translational significance.

      (3) Species- and cell type-specificity claims are not well supported by the presented controls.

      (4) Reliance on colorectal cancer cell lines alone makes it difficult to judge whether findings are specific or general epithelial responses.

    2. Reviewer #2 (Public review):

      The authors in this manuscript studied the role of Candida albicans in Colorectal cancer progression. The authors have undertaken a thorough investigation and used several methods to investigate the role of Candida albicans in Colorectal cancer progression. The topic is highly relevant, given the increasing burden of colon cancer globally and the urgent need for innovative treatment options.

      However, there are some inconsistencies in the figures and some missing details in the figures, including:

      (1) The authors should clearly explain in the results section which patient samples are shown in Figure 1B.

      (2) What do a, ab, b, b written above the bars in Figure 1F represent? Maybe authors should consider removing them, because they create confusion. Also, there is no explanation for those letters in the figure legend.

      (3) The authors should submit all the raw images of Western blot with appropriate labels to indicate the bands of protein of interest along with molecular weight markers.

      (4) The authors should do the quantification of data in Figure 2d and include it in the figure.

      (5) In Figure 2h, the authors should indicate if the quantification represents VEGF expression after 6h or 12h of C. albicans co-culture with cells.

      (6) In Figure 2i, quantification of VEGF should be done and data from three independent experiments should be submitted. The authors should also mention the time point.

    1. Reviewer #1 (Public review):

      This study presents an exploration of PPGL tumour bulk transcriptomics and identifies three clusters of samples (labeled as subtypes C1-C3). Each subtype is then investigated for the presence of somatic mutations, metabolism-associated pathway and inflammation correlates, and disease progression.

      The proposed subtype descriptions are presented as an exploratory study. The proposed potential biomarkers from this subtype are suitably caveated and will require further validation in PPGL cohorts together with mechanistic study.

      The first section uses WGCNA (a method to identify clusters of samples based on gene expression correlations) to discover three transcriptome-based clusters of PPGL tumours using a new cohort of n=87 PPGL samples from various locations in the body.

      The second section inspects a previously published snRNAseq dataset, assigning the published samples to subtypes C1-C3 using a pseudo-bulk approach.

      The tumour samples are obtained from multiple locations in the body, summarised in Fig1A. It will be important to see further investigation of how the sample origin is distributed among the C1-C3 clusters, and whether there is a sample-origin association with mutational drivers and disease progression.

      Comments on revisions:

      In SupplFile3 (pdf) - please correct the table format. The contents are obscured due to the narrowness of the table columns.

      Deposit the new RNAseq data (N=87 cases, N=5 controls) in an appropriate repository; see "Data on human genotypes and phenotypes" at https://elife-rp.msubmit.net/html/elife-rp_author_instructions.html#dataavailability

    2. Reviewer #2 (Public review):

      Summary:

      A study that furthers the molecular definition of PPGL (where prognosis is variable) and provides a wide range of sub-experiments to back up the findings. One of the key premises of the study is that identification of driver mutations in PPGL is incomplete and that compromises characterisation for prognostic purposes. This is a reasonable starting point on which to base some characterisation based on different methods.

      Strengths:

      The cohort is a reasonable size, and a useful validation cohort in the form of TCGA is used. Whilst it would be resource-intensive (though plausible given the rarity of the tumour type) to perform RNAseq on all PPGL samples in clinical practice, some potential proxies are proposed.

      Weaknesses:

      Performance of some of the proxy markers for transcriptional subtype is not presented.

      Limited prognostic information available.

      Comments on revisions:

      Having reviewed the responses to my comments and associated revisions, I am satisfied that they have been addressed.

    1. Reviewer #1 (Public review):

      This paper examines how geometric regularities in abstract shapes (e.g., parallelograms, kites) are perceived and processed in the human brain. The manuscript contains multimodal data (behavior, fMRI, MEG) from adults and additional fMRI data from 6-year-old children. The key findings show that (1) processing geometric shapes lead to reduced activity in ventral areas in comparison to complex stimuli and increased activity in intraparietal and inferior temporal regions, (2) the degree of geometric regularity modulates activity in intraparietal and inferior temporal regions, (3) similarity in neural representation of geometric shapes can be captured early by using CNN models and later by models of geometric regularity. In addition to these novel findings, the paper also includes a replication of behavioral data, showing that the perceptual similarity structure amongst the geometric stimuli used can be explained by a combination of visual similarities (as indexed by feedforward CNN model of ventral visual pathway) and geometric features. The paper comes with openly accessible code in a well-documented GitHub repository and the data will be published with the paper on OpenNeuro.

      In the revised version of this manuscript, the authors clarified certain aspects of the task design, added critical detail to the description of the methods, and updated the figures to show unsmoothed data and variability across participants. Importantly, the authors thoroughly discussed potential task effects (for the fMRI data only) and added additional analyses that indicate that the effects are unlikely to be driven by linguistic labels/name availability of the stimuli.

      Comments on the revision:

      Thank you for carefully addressing all my concerns and especially for clarifying the task design.

    2. Reviewer #2 (Public review):

      Summary

      The current study seeks to understand the neural mechanisms underlying geometric reasoning. Using fMRI with both children and adults, the authors found that contrasting simple geometric shapes with naturalistic images (faces, tools, houses) led to responses in the dorsal visual stream, rather than ventral regions that are generally thought to represent shape properties. The author's followed up on this result using computational modeling and MEG to show that geometric properties explain distinct variance in the neural response than what is captured by a CNN.

      Strengths

      These findings contribute much-needed neural and developmental data to the ongoing debate regarding shape processing in the brain and offer additional insights into why CNNs may have difficulty with shape processing. The motivation and discussion for the study is appropriately measured, and I appreciate the authors' use of multiple populations, neuroimaging modalities, and computational models in explore this question.

      Weaknesses

      The presence of activation in aIPS led the authors to interpret their results to mean that geometric reasoning draws on the same processes as mathematical thinking. However, there is only weak and indirect evidence in the current study that geometric reasoning, as its tested here, draws on the same circuits as math.