10,000 Matching Annotations
  1. Apr 2026
    1. Reviewer #3 (Public review):

      This manuscript investigates computational mechanisms underlying increased risk-taking behavior in adolescent patients with suicidal thoughts and behaviors. Using a well-established gambling task that incorporates momentary mood ratings and previously established computational modeling approaches, the authors identify particular aspects of choice behavior (which they term approach bias) and mood responsivity (to certain rewards) that differ as a function of suicidality. The authors replicate their findings on both clinical and large-scale non-clinical samples.

      The main problem, however, is that the results do not seem to support a specific conclusion with regard to suicidality. The S+ and S- groups differ substantially in the severity of symptoms, as can be seen by all symptom questionnaires and the baseline and mean mood, where S- is closer to HC than it is to S+. The main analyses control for illness duration and medication but not for symptom severity. The supplementary analysis in Figure S11 is insufficient as it mistakes the absence of evidence (i.e., p > 0.05) for evidence of absence. Therefore, the results do not adequately deconfound suicidality from general symptom severity.

      The second main issue is that the relationship between an increased approach bias and decreased mood response to CR is conceptually unclear. In this respect, it would be natural to test whether mood responses influence subsequent gambling choices. This could be done either within the model by having mood moderate the approach bias or outside the model using model-agnostic analyses.

      Additionally, there is a conceptual inconsistency between the choice and mood findings that partly results from the analytic strategy. The approach bias is implemented in choice as a categorical value-independent effect, whereas the mood responses always scale linearly with the magnitude of outcomes. One way to make the models more conceptually related would be to include a categorical value-independent mood response to choosing to gamble/not to gamble.

      The manuscript requires editing to improve clarity and precision. The use of terms such as "mood" and "approach motivation" is often inaccurate or not sufficiently specific. There are also many grammatical errors throughout the text.

      Claims of clinical relevance should be toned down, given that the findings are based on noisy parameter estimates whose clinical utility for the treatment of an individual patient is doubtful at best.

      Comments on revisions:'

      The authors adequately addressed my comments and I find the manuscript substantially strengthened.

    1. Reviewer #1 (Public review):

      Summary

      From transcriptomic comparisons of adult mouse cochlear and vestibular hair cells, Xu et al. provide a broad and well-organized overview of differences across 4 established hair cell types (2 cochlear and 2 vestibular). They go on to demonstrate the power of such analyses to provide functional insights by focusing on the differentiated expression of ciliary genes, building to the hypothesis that kinociliary motility occurs in adult vestibular hair cells.

      Background

      Cilia are prominent in sensory receptors, including vertebrate photoreceptors, olfactory neurons and mechanosensitive hair cells of the inner ear and lateral line. Cilia can be motile or nonmotile depending on their axonemal structure: motile cilia require dynein and the inner 2 singlet microtubules of the 9+2 array. Primary cilia, present early in development, are considered to have sensory functions and to be nonmotile (Mill et al., Nature Rev Gen 2023).

      In hair cells, the kinocilium anchors and polarizes the mechanosensitive hair bundle of specialized microvilli. The kinocilium matures from the primary cilium of a newborn hair cell; behind it the bundle of mechanosensory microvilli rises in a descending staircase of rows. During maturation of the mammalian cochlea, all hair cells lose the kinocilium, though not the associated basal body. The consensus for many years has been that most vertebrate kinocilia, and especially mammalian kinocilia, are nonmotile, based largely on the lack of spontaneous motility in excised mammalian vestibular organs, but also on the impression that the rare examples of spontaneous beating motility even in non-mammalian hair cells are associated with deterioration of the preparation (Rüsch & Thurm 1990).

      Strengths

      In comparing RNA expression across the 4 major types of mouse hair cells - 2 cochlear and 2 vestibular - Xu et al. provide rich data sets for exploration of structure-function differences between these highly specialized cell types. The revised paper significantly improves the organization, interpretation and readability of the presentation of overall findings. smFISH and immuno-staining back up key RNA data, and comparisons are made with published data.

      The ciliary motility focus of the rest of the paper is creative and highly interesting. The authors curated the ciliary genes into types associated with different aspects of beating motility, and also investigated the expression of genes typical of primary cilia, which are considered to have sensory and cell signaling functions and to be nonmotile. Their data justify suggesting a role for kinociliary motility (or force generation) in adult mammalian vestibular hair cells, in opposition to a long-held assumption. The results should stimulate investigation of the implications for mechanosensitivity.

      Weaknesses

      Data

      Functional data on kinocilia motility: The technical difficulty in making such measurements in small mouse hair bundles led the authors to work with bullfrog crista bundles. Though not extensively studied here, the ciliary motility shown is convincing. Mouse hair bundle motions are also shown but the evidence connecting the data to kinociliary motion are more suggestive than convincing. But the authors are not dogmatic about these data, and it is reasonable to show them.

      Interpretation

      The authors take the view that kinociliary motility is likely to be normally present but is rare in their observations because conditions are not right. But while others have described some (rare) kinociliary motility in fish organs (Rusch & Thurm 1990), they interpreted its occurrence as a sign of pathology. Indeed, in this paper, it is not clear what role kinociliary motility would play in mature hair bundles. The authors have added a discussion of this question in the revision.

      An underlying rationale for the hypothesis that ciliary motility manifests in mammalian vestibular hair cells seems to rest on the presence of the necessary mRNA and its contrasting absence in cochlear hair cells. Another way to look at this difference could be that evolution acted on cochlear hair cells to shed kinocilia as one of many changes to improve mechanosensitivity at much higher sound frequencies. In vestibular hair cells, kinociliary motion might be useful to enhance mechanostimulation in the developing vestibule (as suggested in this revision) and not so active in maturity. Nevertheless, with their scholarly analysis of the expression of ciliary genes, the authors make a significant argument for further investigation of when and why hair cell kinocilia show active motility.

    2. Reviewer #2 (Public review):

      Summary:

      In this study the authors compared the transcriptomes of the various different types of hair cells contained in the sensory epithelia of the cochlea and vestibular organs of the mouse inner ear. The analysis of their transcriptomic data lead to novel insights into the potential function of the kinocilium.

      Strengths:

      The novel findings for the kinocilium gene expression along with the demonstration that some kinocilia demonstrate rhythmic beating as would be seen for known motile cilia is fascinating. It is possible that perhaps the kinocilium known to play a very important role in the orientation of the stereocilia, may have a gene expression pattern that is more like a primary cilium early in development and later in mature hair cells more like a motile cilium. Since the kinocilium is retained in vestibular hair cells it makes sense that it is playing a different role in these mature cells than its role in the cochlea.

      Another major strength of this study which cannot be overstated is that for the transcriptome analysis they are using mature mice. To date there is a lot of data from many labs for embryonic and neonatal hair cells but very little transcriptomic data on the mature hair cells. They do a nice job in presenting the differences in marker gene expression between the 4 hair cell types. This information is very useful to those labs studying regeneration or generation of hair cells from ES cell cultures. One of the biggest questions these labs confront is what type of hair cell develop in these systems. The more markers available the better. These data will also allow researchers in the field to compare developing hair cells with mature hair cell to see what genes are only required during development and not in later functioning hair cells.

      Comments on revision:

      I am satisfied with the revision, the authors made an effort to incorporate the changes requested.

    1. Reviewer #1 (Public review):

      This revised manuscript by Qin and colleagues delineates an important neural mechanism that suppresses the intake of sugar solution in response to internal glucose level (the "brake" mechanism for sugar consumption). They identified a three-step neuropeptidergic system that downregulates the sensitivity of sweet-sensing gustatory sensory neurons, primarily in response to elevated level of circulating glucose. First, neurons that release a neuropeptide Hugin (which is an insect homolog of vertebrate Neuromedin U (NMU)) are activated by a high concentration of hemolymph glucose, which is directly sensed by Hugin-releasing neurons in a cell-autonomous mechanism. Next, Hugin neuropeptides activate Allatostatin A (AstA)-releasing neurons via one of Hugin receptors, PK2-R1. Finally, the released AstA neuropeptide suppresses sugar response in sweet-sensing Gr5a-expressing gustatory sensory neurons through the AstA-R1 receptor. Suppression of sugar response in Gr5a-expressing neurons reduces fly's sugar intake motivation. They also found that NMU-expressing neurons in the ventromedial hypothalamus (VMH) of mice (which project to the rostal nucleus of the solitary tract (rNST)) are also activated by high concentration of circulating glucose, independent of synaptic transmission, and that injection of NMU reduces the glucose-induced activity in the downstream of NMU-expressing neurons in rNST. These data suggest that the function of Hugin neuropeptides in the fly is analogous to the function of NMU in the mouse.

      The authors have provided multiple lines of compelling evidence generated through rigorous and comprehensive experiments, which spans genetic abrogation, neuronal manipulation, pharmacology, and functional imaging. The authors are also receptive to the critiques and reframed the central message, such that their conclusions are soundly supported by the presented data. Importantly, the parallel study in mice adds a unique comparative perspective that makes the paper of interest to a wide range of readers.

    2. Reviewer #2 (Public review):

      Summary:

      The question of how caloric and taste information interact and consolidate remains both active and highly relevant to human health and cognition. The authors of this work sought to understand how nutrient sensing of glucose modulates sweet sensation. They found that glucose intake activates hugin signaling to AstA neurons to suppress feeding, which contributes to our mechanistic understanding of nutrient sensation. They did this by leveraging the genetic tools of Drosophila to carry out nuanced experimental manipulations, and confirmed the conservation of their main mechanism in a mammalian model. This work builds on previous studies examining sugar taste and caloric sensing, enhancing the resolution of our understanding.

      Strengths:

      Fully discovering neural circuits that connect body state with perception remains central to understanding homeostasis and behavior. This study expands our understanding of sugar sensing, providing mechanistic evidence for a hugin/AstA circuit that is responsive to sugar intake and suppresses feeding. In addition to effectively leveraging the genetic tools of Drosophila, this study further extends their findings into a mammalian model with the discovery that NMU neural signaling is also responsive to sugar intake.

      Weaknesses:

      The effect of Glut1 knockdown on PER in hugin neurons is modest in both fed and starved flies, suggesting that glucose intake through Glut1 may only be part of the mechanism. The authors address this in their discussion.

    1. Reviewer #2 (Public review):

      Summary:

      The authors goals is to be develop a more accurate system that reports TDP-43 activity as a splicing regulator. Prior to this, most methods employed western blotting or QPCR based assays to determine whether targets of TDP-43 were up or down regulated. The problem with that is the sensitivity. This approach uses an ectopic delivered construct containing splicing elements from CFTR and UNC13A (two known splicing targets) fused to a GFP reporter. Not only does it report TDP-43 function well, but it operates at extremely sensitive TDP-43 levels, requiring only picomolar TDP-43 knockdown for detection. This reporter should supersede the use of current TDP-43 activity assays, its cost-effective, its rapid and reliable.

      Strengths:

      In general, the experiments are convincing and well designed. The rigor, number of samples and statistics, and gradient of TDP-43 knockdown were all viewed as strengths. In addition, the use of multiple assays to confirm the splicing changes were viewed as complimentary (ie PCR and GFP-fluorescence) adding additional rigor. The final major strength i'll add is the very clever approach to tether TDP-43 to the loss of function cassette such that when TDP-43 is inactive it would autoregulate and induce wild-type TDP-43. This has many implications for the use of other genes, not just TDP-43, but also other protective factors that may need to be re-established upon TDP-43 loss of function.

      Weaknesses:

      Admittedly, one needs to initially characterize the sensor and the use of cell lines is an obvious advantage, but it begs the question of whether this will work in neurons. Additional future experiments in primary neurons will be needed. The bulk analysis of GFP-positive cells is a bit crude. As mentioned in the manuscript, flow sorting would be an easy and obvious approach to get more accurate homogenous data. This is especially relevant since the GFP signal is quite heterogenous in the image panels, for example Figure 1C, meaning the siRNA is not fully penetrant. Therefore, stating that 1% TDP-43 knockdown achieves the desired sensor regulation might be misleading. Flow sorting would provide a much more accurate quantification of how subtle changes in TDP-43 protein levels track with GFP fluorescence.

      Some panels in the manuscript would benefit from additional clarity to make the data easier to visualize. For example, Figure 2D and 2G could be presented in a more clear manner, possibly split into additional graphs since there are too many outputs. Sup Figure 2A image panels would benefit from being labeled, its difficult to tell what antibodies or fluorophores were used. Same with Figure 4B.

      Figure 3 is an important addition to this manuscript and in general is convincing showing that TDP-43 loss of function mutants can alter the sensor. However, there is still wild-type endogenous TDP-43 in these cells, and its unclear whether the 5FL mutant is acting as a dominant negative to deplete the total TDP-43 pool, which is what the data would suggest. This could have been clarified. Additional treatment with stressors that inactivate TDP-43 could be tested in future studies.

      Overall, the authors definitely achieved their goals by developing a very sensitive readout for TDP-43 function. The results are convincing, rigorous, and support their main conclusions. There are some minor weaknesses listed above, chief of which is the use of flow sorting to improve the data analysis. But regardless, this study will have an immediate impact for those who need a rapid, reliable, and sensitive assessment of TDP-43 activity, and it will be particularly impactful once this reporter can be used in isolated primary cells (ie neurons) and in vivo in animal models. Since TDP-43 loss of function is thought to be a dominant pathological mechanism in ALS/FTD and likely many others disorders, having these type of sensors is a major boost to field and will change our ability to see sub-threshold changes in TDP-43 function that might otherwise not be possible with current approaches.

      Comments on revisions:

      In the revised version, most of the reviewer's comments have been appropriately addressed with the exception of 1) the use of flow sorting to improve the data analysis and 2) testing this sensor in primary neurons. The latter is the focus of an ongoing separate study. Though flow sorting would significantly strengthen this study and help others in the field to use this sensor, it is still an impactful and innovative study without it.

    2. Reviewer #3 (Public review):

      The DNA and RNA binding protein TDP-43 has been pathologically implicated in a number of neurodegenerative diseases including ALS, FTD, and AD. Normally residing in the nucleus, in TDP-43 proteinopathies, TDP-43 mislocalizes to the cytoplasm where it is found in cytoplasmic aggregates. It is thought that both loss of nuclear function and cytoplasmic gain of toxic function are contributors to disease pathogenesis in TDP-43 proteinopathies. Recent studies have demonstrated that depletion of nuclear TDP-43 leads to loss of its nuclear function characterized by changes in gene expression and splicing of target mRNAs. However, to date, most readouts of TDP-43 loss of function events are dependent upon PCR based assays for single mRNA targets. Thus, reliable and robust assays for detection of global changes in TDP-43 splicing events are lacking. In this manuscript, Xie, Merjane, Bergmann and colleagues describe a biosensor that reports on TDP-43 splicing function in real time. Overall, this is a well-described unique resource that would be of high interest and utility to a number of researchers validated in multiple cell types as a sensitive readout of TDP-43 loss of function. Future studies validating the utility of this biosensor in models of TDP-43 loss of function (e.g. disease iPSNs) that do not rely on TDP-43 knockdown will be of further interest.

    1. Reviewer #1 (Public review):

      Summary:

      Planar cell polarity core proteins Frizzled (Fz)/Dishevelled (Dvl) and Van Gogh-like (Vangl)/Prickle (Pk) are localized on opposite sides of the cell and engage in reciprocal repression to modulate cellular polarity within the plane of static epithelium. In this interesting manuscript, the authors explore how the anterior core proteins (Vangl/Pk) inhibit the posterior core protein (Dvl). The authors propose that Pk assists Vangl2 in sequestering both Dvl2 and Ror2, while Ror2 is essential for Dvl to transition from Vangl to Fz in response to non-canonical Wnt signaling.

      Strengths:

      The strengths of the manuscript are found in the very interesting and new concept along with supportive data for a model of how non-canonical Wnt induces Dvl to transition from Vangl to Fz with an opposing role for PK and Vangl2 to suppress Dvl during convergent extension movements. Ror is key player required for the transition and antagonizes Vangl.

      Weaknesses:

      In addition to general whole embryo morphology that is used as evidence for CE defects, two forms of data are presented: co-expression and IP, as well as IF of exogenously expressed proteins. The microscopy would benefit from super-resolution microscopy since in many cases the differences in protein localization are not very pronounced, and Western analysis data often show relatively subtle differences. Thus, future work will determine the strength of the interactions of the model.

      Major points.

      Overexpression conditions

      A possible concern is that most analyses were performed with overexpression conditions. PCP core proteins (Vangl2, Pk, Dvl, and Fz receptors) are known to display polarized subcellular localization in both the neural epithelium and DMZ explants (Ref: PCP and Septins govern the polarized organization of the actin cytoskeleton during convergent extension, Current Biology, 2024). However, in this study, overexpressed PCP core proteins failed to show polarized localization. Thus, one must be careful in interpreting data.

      Subtle effects

      Several of the reported results show quite modest changes in imaging and immunoprecipitation analyses, which are supportive of the proposed molecular model, but future experiments will be needed to robustly test the model.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript presents an end-to-end pipeline, intended to accelerate EM-based connectomics by combining low-resolution imaging for large volumes with synapse-level imaging only in selected regions of interest. In principle, this strategy can substantially reduce imaging time, computational demands, analysis time, and overall cost.

      General note:

      Overall, I found the manuscript interesting and valuable, particularly as a description of how one laboratory has assembled and applied a practical workflow to reconstruct and analyze the central complex across multiple insect species. In that sense, the work is compelling as an account of a real, functioning strategy for comparative connectomics, and I appreciated reading it. My main reservation is not about the relevance of the biological problem or the utility of the pipeline in the authors' own hands, but about whether the manuscript, in its current form, fully meets the expectations of a paper that is focused on tools and resources. The expectation would be that this paper would be a venue for sharing new techniques, software tools, datasets, and other resources intended to be usable by the community. Here, because much of the pipeline appears to build on existing methods and software, the key value added should be a particularly clear demonstration of how these components were adapted, integrated, validated, and documented for this specific use case in a way that others could realistically reproduce and adopt. At present, that translational and reproducibility-oriented component does not yet seem sufficiently developed, despite the clear promise of the overall approach.

      Major comments:

      (1) The work is valuable as a practical integration and application of multiple existing tools into a coherent pipeline, together with a new multi-resolution imaging strategy. However, the manuscript at times reads as though it introduces an entirely novel workflow. I would encourage the authors to clarify the contribution more explicitly: which components are genuinely new (for example, the acquisition strategy and the end-to-end integration/validation), and which are adaptations of already established methods or software. This would make the scope and novelty of the paper easier to assess.

      (2) The most distinctive element is the multi-resolution acquisition strategy. However, as described, the selection of high-resolution regions seems to be decided a priori based on anatomy (guided by xCT localization of the CX), rather than being determined automatically from the data (i.e., ROI placement is anatomy-driven rather than data-driven). A more data-driven or machine learning-guided ROI strategy would strengthen the methodological contribution and the adaptability to new scenarios, along the lines of approaches such as SmartEM [1].

      (3) The manuscript emphasizes open-source availability and reduced barriers to entry, but the current software release, as referenced, does not yet appear to support straightforward external reuse. Since much of the pipeline builds on existing methods, the main added value lies in how these technologies were adapted, combined, and validated for the present problem. A clear and complete explanation of this adaptation is therefore essential, but is currently missing. I would suggest the following concrete improvements:<br /> a) Provide a single landing page or umbrella repository that links each pipeline step in the paper to the corresponding codebase, including version tags/commits and expected inputs/outputs for each step.<br /> b) Include step-by-step tutorials for each component.<br /> c) Provide an example dataset together with a full reproduction walkthrough in a controlled environment.<br /> d) Clearly explain the required parameters and configuration for each step, including how they should be adjusted for other datasets or scenarios.<br /> e) Follow packaging and distribution best practices (for example, PyPI/conda releases, Docker containers, and version pinning).

      (4) In my own attempt to set up and run parts of the released code, I encountered issues that currently limit reproducibility. For example, when creating an environment for EMalign (https://github.com/Heinze-lab/EMalign), the required Python version is not specified, and installation did not succeed under Python 3.12 due to dependency constraints. Additionally, synful_312 (https://github.com/Heinze-lab/synful_312) and SegToPCG (https://github.com/Heinze-lab/SegToPCG) appear to be empty despite being referenced in the manuscript. These are fixable issues, but addressing them is important if the paper is to deliver on its "low entry cost" claim.

      (5) Table 1 reports acquisition times, which is helpful. However, the multi-resolution approach adds essential processing steps that appear due to the strategy followed (e.g., "XY alignment high-res" and "high-res to low-res alignment"). Please include registration/alignment (and other major post-processing) runtimes and resource requirements, such as storage, in a comparable table so readers can assess true end-to-end cost.

      References:

      [1] Meirovitch, Y., et al. "SmartEM: machine learning-guided electron microscopy." Nature Methods (2025).

    2. Reviewer #2 (Public review):

      Summary:

      The paper proposes a workflow to accelerate EM connectomics by combining multi-scale imaging with image processing and analysis (image alignment, registration, neuron tracing, automated segmentation and synapse prediction, proof-reading) to derive a brain region connectome. The paper argues and (partially) demonstrates that this approach facilitates comparative connectomics.

      The data acquisition pipeline uses a well-established sample preparation protocol, uCT guided acquisition, and SBEM imaging at cellular and synaptic resolution.

      Data processing and analysis combine existing state-of-the-art components and focus on the alignment and complementary analysis of the two SBEM resolution levels. The paper applies the workflow to the central complex of six different insects and performs some preliminary analysis based on this (which is acceptable for a resource/tool).

      Disclaimer for the rest of the review: I am an expert in image analysis and segmentation, so I have mainly focused on these aspects as I am not qualified to analyze the details of image acquisition.

      Strengths:

      The paper addresses an important problem and promises an acceleration and democratization of comparable connectomics. The time savings of the imaging approach are well-motivated and derived. The methods used for image alignment, segmentation, synapse detection, and proofreading are state-of-the-art.

      Weaknesses:

      I see two major weaknesses in the paper:

      (1) The paper introduces the (approximate) equivalence of the projectome and connectome in the insect brain very prominently in the introduction and uses this as a central motivation for the multi-resolution image acquisition protocol. But - to me - it is unclear how this principle is really used in the analysis presented in the last results and if this assumption is evaluated at all. Specifically, Figure 4 a shows the anatomical neuron reconstructions (from cellular resolution SBEM), d-g show connectome-level analysis from the synaptic resolution data. The only link I can see between the two is that the neural processes in the synapse-resolution data can be mapped to the neurons from the cellular resolution data, thanks to the image alignment. This is certainly important, BUT it is only tangentially related to the projectome vs. connectome claim from the introduction. This claim implies that a tentative connectome is derived from projectome-level data (e.g. by assuming a uniform probability of synapse-formation given surface or distance between projections) that is then validated by the "true" connectome data from synaptic resolution. Instead, what is actually solved - to my understanding - is mapping the local connectome to the projectome. While related, these are different things and the current framing of the paper and the quite brief description of the section on comparative connectomics (also no corresponding Methods section) make this claim inadequately supported.

      (2) Reporting on segmentation and proofreading is purely qualitative. Given that this is claimed as a core contribution of the paper (e.g. statement in line 497 and following), I would expect substantially more reporting and evaluation of this claim:<br /> a) Report the actual time needed for proofreading the segmentations in CAVE. I could not find any numbers on this.<br /> b) Report the initial segmentation quality of the model: How many errors does it make? Note: There is a brief mention of VoI-based quantification in Methods (around line 1060), but the results are not reported.

      What should be done: Report the error rates (with an accurate measure such as skeleton VoI) independently for all 6 volumes. Given that the authors have the proofread versions, this is feasible. Only then can the claims be made here be evaluated. Note that the F1-score of synapse prediction is quantified. This is a good starting point, but could also be extended to further species in order to assess the actual transferability. Furthermore, none of the data from the study seems to be available. The training data of the network has to be made available. If possible, high-resolution data should be proofread too.

      Further points:

      (1) Why isn't reconstruction at the cellular level addressed with ML? This is surely possible and should be easier than the full connectome analysis. Similar to before, the actual times needed for tracing with CATMAID are not reported; the manuscript only states that this can be done in minutes for a neuron, but it's unclear if this is the best or average case. It would help to have quantitative numbers to assess whether automation would bring any benefits.

      (2) Finally, regarding the underlying software. I did not try this myself due to time constraints, but did check the repositories. They seem to be in an ok state with some documentation in a README. However, given the central role of the software contribution, I would expect a centralized doc page that explains how to use the different parts of the software, including a full example with sample data. Without this, application by other labs - a central claim - will be difficult.

    1. Reviewer #1 (Public review):

      Summary:

      The authors aimed to determine the neural networks involved in updating behaviour by training mice on a 'go / no go' odour discrimination task, and measuring their brain activity using functional MRI.

      Strengths:

      The use of the translationally relevant 'go / no go' task is a major strength, as this is a task that can be used as readily in humans as in animals such as mice. The use of fMRI in awake, behaving mice is also a major strength, as this allows the activation of multiple brain regions to be measured while behaviour is ongoing, and also facilitates comparison to human studies. The computational modelling approaches further support these translational aims, again being as readily applied to human data as to animal data.

      Weaknesses:

      The major weakness of the paper - and one that is potentially addressable - is that the key analysis of the paper, showing that the periaqueductal gray (PAG) is recruited for reversal learning, is only partially supported by the data presented in the paper as it stands. The authors have used a sophisticated way of analysing the behavioural data using 'signal detection theory', in which they collected behavioural data showing correct 'go' responses ('hits'), correct 'no go' responses ('correct rejections'), missed 'go' responses ('misses') and go responses when mice should have withheld a response ('false alarms'). The data presented showing a double dissociation in the activation of the nucleus accumbens for 'hits' but not 'correct rejections' and the PAG for 'correct rejections' but not 'hits' is very interesting; however, it is confounded by the fact that the nucleus accumbens may activate when the animal makes a response, and the PAG when the animal withholds a response. If the authors also included the analysis of nucleus accumbens and PAG activation for 'misses' and 'false alarms', this would allow them to determine whether the activation of these regions reflects the behavioural response or the expectation of reinforcement from the response.

      Thus, the paper includes very interesting data and is impressive in its approach to analysing behaviour in a manner that is highly translatable between species. The additional analyses would markedly strengthen the paper and would add depth to the finding that the PAG appears to be involved in behavioural flexibility.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors test the hypothesis that whole-brain functional magnetic resonance imaging in behaving mice, coupled with reinforcement-learning modeling, can dissociate neural substrates of initial cue-reward acquisition versus contingency reversal, and potentially reveal underappreciated contributors to cognitive flexibility. Using a head-fixed go/no-go odor discrimination task with subsequent rule reversal in a subset of mice, they model trial-by-trial state-action values with a model-free Q-learning algorithm (hierarchical Bayesian fit) and use the model-derived decision variable as a parametric regressor in whole-brain analyses. They report that acquisition-related signals prominently involve ventral and dorsal striatal regions, whereas reversal learning additionally recruits the periaqueductal gray (negative correlation with the decision variable) and shows an apparent double dissociation between nucleus accumbens and periaqueductal gray responses for hit versus correct-rejection outcomes during reversal.

      Strengths:

      (1) The reversal manipulation is implemented without explicit punishment, targeting suppression of previously rewarded actions under reward omission - an underexplored regime for midbrain contributions beyond canonical threat/pain framing.

      (2) The manuscript provides a credible MR-compatible olfactory/licking platform with synchronized sniff/lick/valve/reward timing and high-field imaging, supporting feasibility and broader utility for mesoscale systems neuroscience in rodents.

      (3) Trial-by-trial value estimates from a Q-learning variant are fit via hierarchical Bayesian inference and explicitly integrated into subject-level general linear models with a mouse hemodynamic response function, which is appropriate for leveraging within-subject dynamics in small-N rodent fMRI.

      (4) The decision-variable maps during acquisition recover expected basal ganglia involvement (including nucleus accumbens and dorsal striatum), providing face validity; the reversal-stage map yields an interpretable set of cortical/striatal/pallidal regions plus periaqueductal gray/hippocampus.

      (5) The finite impulse response analysis stratified by behavioral outcomes (hit, false alarm, correct rejection, miss) adds interpretability beyond the model regressor alone, and the reported crossover interaction between nucleus accumbens and periaqueductal gray is potentially impactful if robust.

      Weaknesses:

      (1) The core claim regarding selective periaqueductal gray engagement rests on a subset of n = 6 mice for reversal. With permutation-based whole-brain inference and very small cluster sizes, the robustness of the periaqueductal gray effect to reasonable analytic perturbations is not yet convincing. I would suggest providing leave-one-animal-out analyses for the periaqueductal gray cluster/ROI effects and reporting how often the key findings survive.

      (2) The authors note that due to temporal resolution and hemodynamics, they cannot separate stimulus, choice, and feedback and therefore model "whole trials." This limitation creates ambiguity about whether periaqueductal gray signals reflect value updating, action inhibition (no-lick), reward omission, autonomic arousal, or motor preparation/withholding, especially given the strong hit versus correct-rejection opponency. I would suggest adding targeted analyses that disambiguate "withholding" from "reversal-related updating".

      (3) ROIs are defined from the whole-brain decision-variable maps and then interrogated by outcome types; the manuscript acknowledges non-independence. This can inflate apparent dissociations. It would be better if the authors define ROIs independently (anatomical periaqueductal gray/nucleus accumbens masks, or split-half ROI definition with held-out data) and repeat the key ROI conclusions.

      (4) The reversal group is a subset of the acquisition cohort and also experiences a different task phase structure and additional sessions; the paper attempts to address exposure differences descriptively. I would suggest that the authors formally test whether periaqueductal gray effects are explained by session count, time-in-scanner, or learning rate differences (e.g., include these as covariates, or match sessions more strictly).

      (5) The platform records sniffing and licking, but the imaging models described include motion, global, and ventricle regressors and do not clearly include trialwise lick/sniff covariates. Given the periaqueductal gray's known autonomic and defensive coordination roles, physiological state confounding is a major concern. Could the authors incorporate sniff and lick metrics (and their derivatives) as nuisance regressors and show whether the periaqueductal gray effects persist?

    1. Reviewer #1 (Public review):

      Summary:

      In their manuscript, Zhou and colleagues present a detailed look at how the JSP functions differently in the various cells of a breast tumor. The authors have effectively shown that the JSP acts as a double-edged sword, as it helps T cells fight cancer but also allows tumor cells to grow and avoid ferroptosis. These findings are important because they identify a useful biomarker to predict how TNBC patients might respond to PD-1 inhibitors.

      Strengths:

      This work is important because it provides a clear explanation for the conflicting roles of the JSP in the tumor environment. The evidence is solid, as it combines data from thousands of patients with single-cell analysis and lab experiments to confirm the role of STAT4 in cancer progression and immunity.

      Weaknesses:

      However, there are areas for improvement in the scope of the review, the depth of analysis, and the potential for broader clinical implications. The authors are encouraged to address these issues to enhance the scientific and clinical impact of the study.

      Major Issues:

      (1) The authors demonstrate that STAT4 upregulates SLC47A1, but this is currently supported only by expression correlation and western blot data. To confirm a direct link, the authors are encouraged to perform ChIP-qPCR or luciferase reporter assays to show that STAT4 binds directly to the SLC47A1 promoter.

      (2) The conclusion that the MIF-CD74 axis drives immunosuppression is based on computational inference. To support this, the authors could consider mining publicly available breast cancer spatial transcriptomics data to show the co-localization of MIF and CD74. Alternatively, performing simple dual-color immunofluorescence staining on a few clinical sections would effectively demonstrate the physical proximity of these cells.

      (3) TNBC is highly heterogeneous and includes subtypes like mesenchymal and immunomodulatory groups. The authors should analyze whether the JSP score or STAT4 levels vary significantly between these subtypes, as this could further refine the selection of patients for JAK1 inhibitors.

      (4) While the JSP score works well in the current datasets, the authors should consider validating its predictive accuracy in additional independent immunotherapy cohorts, such as the TONIC trial, to ensure the biomarker is robust across different treatment settings.

      Minor Issue:

      The manuscript mentions a U-shaped trajectory of JSP activity during tumor transition. A more detailed biological explanation of why the pathway activity initially drops and then rises would add depth to the discussion.

    2. Reviewer #2 (Public review):

      Summary:

      The JAK-STAT pathway (JSP) exhibits cell-type-specific functional heterogeneity in breast cancer. This study investigates the JSP in breast cancer and its response to anti-PD‑1 immunotherapy. JSP displays distinct cell‑type heterogeneity: it promotes malignant phenotypes and immunosuppression in tumor cells, while enhancing cytotoxicity and reducing exhaustion in T cells. Elevated JSP expression correlates with improved immunotherapy responses, especially in triple‑negative breast cancer. These findings highlight the paradoxical roles of JSP, indicating that broad inhibition may compromise anti‑tumor immunity.

      Strengths:

      The major strengths of this study include the comprehensive characterization of JSP heterogeneity across epithelial, tumor, and T cells in breast cancer. The identification of JSP and STAT4 as predictive biomarkers for immunotherapy response, particularly in triple‑negative breast cancer, provides clinically relevant insights for patient stratification.

      Weaknesses:

      The findings rely heavily on public dataset analyses.

    3. Reviewer #3 (Public review):

      Summary:

      This multi-omics study by Zhou et al elucidates the context-dependent roles of the Janus kinase-signal transducer and activator of transcription (JAK-STAT) pathway (JSP) across different cellular compartments in the breast cancer tumor microenvironment. While bulk JSP activity is associated with a favorable prognosis, single-cell analysis reveals a paradoxical landscape: high JSP in T cells drives anti-tumor cytotoxicity and reduces exhaustion, whereas high activity in tumor epithelial cells promotes malignancy and immunosuppression via the MIF-CD74 signaling axis. The JSP score (immune-related) serves as a robust predictive biomarker for response to anti-PD-1 immunotherapy, particularly in triple-negative breast cancer (TNBC). Furthermore, the study identifies the STAT4/SLC47A1 axis as a critical mechanism through which tumor cells resist ferroptosis, facilitating disease progression. These findings suggest that broad JAK-STAT inhibition may be counterproductive in cancer therapeutics; instead, therapeutic success depends on precise modulation and carefully timed interventions to preserve its T-cell-associated functions. This study may inspire future studies to explore specific factors that selectively modulate JAK-STAT activity in immune cells to achieve favorable therapeutic outcomes.

      Strengths:

      Significant therapeutic implications.

      Weaknesses:

      Limited molecular mechanisms.

    1. Reviewer #1 (Public review):

      Summary:

      Fields et al. investigated the heterogeneity and kinetics of human antibody secreting cell (ASC) differentiation by analyzing ex vivo tonsil samples and using in vitro differentiation modeling. They discovered that a CD30+ intermediate subset emerges in transition from B cell to ASC in both contexts, but not from germinal centers, and they identified cytokines that promote this state. They also identified an isoform of CD44, CD44v9, that is expressed on some ASCs.

      Strengths:

      The strengths are the novelty of the findings and the identification of two new markers that may be useful for tracking ASC heterogeneity.

      Weaknesses:

      However, some of this work seems preliminary and would need to be further validated. Some of the data presented was only representative, with limited controls and biological repeats, limiting the interpretation. For example, the role of Mef2c for CD30 expression was not robustly demonstrated. It was not clear if Figure 1 scRNAseq/ATACseq was from multiple donors or just one. Future studies may extend these novel findings and determine the functional relevance of these factors, CD30, and CD44v9 for ASC differentiation and physiology.

    2. Reviewer #2 (Public review):

      Summary:

      Bhattacharya and colleagues here use cell culture, single-cell RNA and ATACseq sequencing of such in vitro cultures and of ex vivo isolated B-lineage cells to infer an ontogeny for extra-germinal centre B cell differentiation. The manuscript presents a useful potential ontogeny for plasma cells, wherein in vitro cultured naïve human B cells enter a CD30+ intermediate state before moving in subsequent days through a CD44v9+ state before ultimately obtaining a 'mature' antibody-secreting plasma cell phenotype. Ex vivo isolated germinal centre B cells obtain the plasma cell state without expressing CD30 in their development. Phenotype analysis of tonsillar B-lineage cells supports the same phenotype conversion in vivo, although the intermediate cell population was smaller in vivo. The link to CD44v9 expression on developing plasma cells is inferred to be for extra-GC (T-independent) responses, but the data presented leave this equivocal, and the functional importance of developing via a CD30+CD44v9+ intermediate is not investigated.

      Strengths:

      The article presents a solid potential ontogeny for PC development, wherein some differentiating B cells acquire a CD30+ state, transition through a CD44v9+CD30+ state, then downmodulate CD30 before obtaining canonical CD38+ 'PC' status. A strength is the integration of in vitro cultured B cell results with tonsillar B-lineage cell data sets, and careful flow cytometry of the in vitro cultures over several days to infer lineage. The data provide reasonable support for the concept. CD30+ cells are shown to develop readily from naïve B cells in culture, but uncommonly from GC B cell cultures. A nice piece of data is Figure 6B, which shows reasonably strong correlative changes in phenotype through the assumed ontogeny, and this fits with the expected trajectory of maturation.

      Weaknesses:

      The most important weakness throughout is the non-absolute nature of the relationship. An example is seen in that the sorted ex vivo GC B cells also give rise to the 'extra-GC' phenotype of plasma cell, suggesting that while the profile is enriched, it is not absolute. There is a further weakness, as while cultures are run for several days, division-associated shifts in PC phenotype are not mapped; such would greatly strengthen the weight of the argument, and show conditional shifts in phenotype associated with division, an uncontrolled parameter in the mix. For example, for the MEF2C A388 inhibition experiments, it would be strong evidence of the pathway/process contributing if a by-division peak increase in CD30+ population was demonstrated in the early days of culture.

      There are some basic sort experiments performed (e.g. 3C-3F), which show that the CD30+ cells do give rise to PC preferentially, but what is missing is the step-wise phenotype shifts in these sorted populations, which should support the trajectory shown in Figure 3B and (the in vitro equivalent of) 6B. It would emphatically support the trajectory to show the cellular phenotypes on the PC with sorting based on CD30, CD44v9, CD27, and CD20 expression, and following outcome phenotypes 24-48 hours later, if the inferred maturation trajectory is true.

      There are also specific weaknesses with the bioinformatics, in that, while the analyses are likely appropriate, unpresented data is necessarily used to shape the argument. For example, Figure 1C shows bubble plots for two plasma cell sets, yet, of archetypal PC-expressed genes, only IRF4 is demonstrated to confirm they are true PC, and the gene is not universally expressed in cells in the clusters. For this figure, it would help to expand the bubble plot to show J-CHAIN, XBP-1, CIITA and PRDM1 or other appropriate PC demarcating molecules. Similarly, in Fig 2B, more evidence of a bifurcation in state is needed than that CD44v9 distinguishes PC1 from PC2 clusters-this is the stated conclusion, but 2A depicts that 50% of PC1 relatively weakly express CD44, while <25% of PC2 express it. Demonstrating additional molecules or genes distinguishing the clusters would improve veracity. Figure 2F shows clonal lineages, but it would be helpful to see somatic hypermutation burdens and learn if they differ between the demarcated subsets. I also find the pseudotime analyses of limited value, as some of the branches follow trajectories that are unrealistic biologically, so less weight should be placed on the pathways to which they do or do not point (i.e., the notion that GC B cells do or do not give rise to particular PC subsets).

      Statistically, some of the experiments are single wells from single donors, so there is a low level of confidence and no reproducibility demonstrated for some aspects of the study, which is a weakness.

      Paradoxical to the argument that it is the TI response process being modelled, it is presented that CpG stimulation, plus proxy T cell help (CD40L), drives the CD30+ phenotype best with the addition of the GC-associated cytokine IL-21. This should be carefully considered and discussed.

      Overall, in addition to presenting more contextual information from the bioinformatics, the best way to solidify the data set, in my vie,w would be to revisit the hypothesis with two additional experimental approaches: (1) to incorporate division tracing into the ontogeny studies and (2) to perform lineage tracing on sort-purified populations at different stages of the maturation process.

    1. Reviewer #1 (Public review):

      Summary:

      Spinal projection neurons in the anterolateral tract transmit diverse somatosensory signals to the brain, including touch, temperature, itch, and pain. This group of spinal projection neurons is heterogeneous in their molecular identities, projection targets in the brain, and response properties. While most anterolateral tract projection neurons are multimodal (responding to more than one somatosensory modality), it has been shown that cold-selective projection neurons exist in lamina I of the spinal cord dorsal horn. Using a combination of anatomical and physiological approaches, the authors discovered that the cold-selective lamina I projection neurons are heavily innervated by Trpm8+ sensory neuron axons, with calb1+ spinal projection neurons primarily capturing these cold-selective lamina I projection neurons. These neurons project to specific brain targets, including the PBNrel and cPAG. This study adds to the ongoing effort in the field to identify and characterize spinal projection neuron subtypes, their physiology, and functions.

      Strengths:

      (1) The combination of anatomical and physiological analyses is powerful and offers a comprehensive understanding of the cold-selective lamina I projection neurons in the spinal cord dorsal horn. For example, the authors used detailed anatomical methods, including EM imaging of Trpm8+ axon terminals contacting the Phox2a+ lamina I projection neurons. Additionally, they recorded stimulus-evoked activity in Trpm8-recipient neurons, carefully selected by visual confirmation of tdTomato and GFP juxtaposition, which is technically challenging.

      (2) This study identifies, for the first time, a molecular marker (calb1) that labels cold-selective lamina I projection neurons. Although calb1+ projection neurons are not entirely specific to cold-selective neurons, using an intersectional strategy combined with other genes enriched in this ALS group or cold-induced FosTRAP may further enhance specificity in the future.

      (3) This study shows that cold-selective lamina I projection neurons specifically innervate certain brain targets of the anterolateral tract, including the NTS, PBNrel, and cPAG. This connectivity provides insights into the role of these neurons in cold sensation, which will be an exciting area for future research.

      Weaknesses:

      (1) The sample size for the ex vivo electrophysiology conducted on the calb1+ lamina I projection neurons (Figure 5) is limited to a total of six recorded neurons. Given the difficulty and complexity of the preparation, this is understandable. Notably, since approximately 87% of lamina I projection neurons heavily innervated by Trpm8+ terminals are calb1+, these six recordings of such neurons in Figure 4E could also be calb1+.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, the authors took advantage of a semi-intact ex vivo somatosensory preparation that includes hindlimb skin to characterize the response of projection neurons in the dorsal horn of the spinal cord to peripheral stimulation, including cold thermal stimuli. The main aim was to characterize the connectivity between peripheral afferents expressing the cold sensing receptor TRPM8 and a set of genetically tagged neurons of the anterolateral system (ALS). These ALS neurons expressed high levels of the calcium binding protein calbindin 1.

      In addition, combining different viral tracing methods, the authors could identify the anatomical targets of this specific subset of projection neurons within the brainstem and diencephalon.

      Strengths:

      The use of a relatively new (seldom used previously) transgenic line to label TRPM8-expressing afferents, combined with the genetic characterization of a previously identified subset of projections neurons add specificity to the characterization. The transgenic line appears to capture well the subpopulation of Trpm8-expressing neurons.

      In addition, the use of electron microscopy techniques makes the interpretation of the structural contacts more compelling

      The writing is clear and the presentation of findings follows a logical flow.

      Overall, this study provides solid, novel information about the brain circuits involved in cold thermosensation.

      Weaknesses:

      In the characterization of recorded neurons in close contact or in the absence of this contact with TRPM8 afferents, the number of recordedd neurons is relatively low. In addition, the strength of thermal stimuli is not very well controlled, preventing a more precise characterization of the connectivity.

      The authors acknowledge that, technically, this is a very difficult preparation with very low yield as far as obtaining successful recordings. Moreover, the tissue needs to be maintained at room temperature which is obviously not ideal when characterizing cold thermoreceptors due to the unavoidable effects of low temperature on cold-activated receptors.

    3. Reviewer #3 (Public review):

      Summary:

      Razlan and colleagues provide a detailed anatomical characterization of lamina I projection neurons in the mouse spinal cord that are densely innervated by primary afferents activated by cooling of the skin. The authors validate a Trpm8-Flp mouse line, show synaptic contacts between Trpm8⁺ boutons and projection neurons at the ultrastructural level, and demonstrate at the physiological level that these neurons specifically respond to cooling stimuli. Next, by taking advantage of previous transcriptomic analysis of ALS neurons, the authors identify calbindin as a marker for cold activatetd lamina I projection neurons and map their ascending projections to the rostral lateral parabrachial area, caudal periaqueductal gray, and ventral posterolateral thalamus, well-known thermosensory and thermoregulatory centers. Altogether, these findings provide strong anatomical and functional evidence for a direct line of transmission from Trpm8⁺ sensory afferents through Calb1⁺ lamina I neurons to key supraspinal centers controlling perception of cold and thermoregulatory responses.

      Strengths:

      The combination of mouse genetics, electron microscopy, ex-vivo physiology, optogenetics and viral tracing provides convincing evidence for a direct cold pathway. The work validates the Trpm8-Flp line by extensive anatomical and molecular characterization. Integration with previous transcriptomic and anatomical data, neatly links the cold-selective lamina I neurons to a molecularly defined cluster of ALS neurons, strengthening the bridge between molecular identity, anatomy, and physiological function.

      Weaknesses:

      The main limitation remains the relatively small number of neurons that could be recorded electrophysiologically. While understandable given the complexity of the preparation, this necessarily limits generalization.

    1. Reviewer #1 (Public review):

      Summary:

      Spinal projection neurons in the anterolateral tract transmit diverse somatosensory signals to the brain, including touch, temperature, itch, and pain. This group of spinal projection neurons is heterogeneous in their molecular identities, projection targets in the brain, and response properties. While most anterolateral tract projection neurons are multimodal (responding to more than one somatosensory modality), it has been shown that cold-selective projection neurons exist in lamina I of the spinal cord dorsal horn. Using a combination of anatomical and physiological approaches, the authors discovered that the cold-selective lamina I projection neurons are heavily innervated by Trpm8+ sensory neuron axons, with calb1+ spinal projection neurons primarily capturing these cold-selective lamina I projection neurons. These neurons project to specific brain targets, including the PBNrel and cPAG. This study adds to the ongoing effort in the field to identify and characterize spinal projection neuron subtypes, their physiology, and functions.

      Strengths:

      (1) The combination of anatomical and physiological analyses is powerful and offers a comprehensive understanding of the cold-selective lamina I projection neurons in the spinal cord dorsal horn. For example, the authors used detailed anatomical methods, including EM imaging of Trpm8+ axon terminals contacting the Phox2a+ lamina I projection neurons. Additionally, they recorded stimulus-evoked activity in Trpm8-recipient neurons, carefully selected by visual confirmation of tdTomato and GFP juxtaposition, which is technically challenging.

      (2) This study identifies, for the first time, a molecular marker (calb1) that labels cold-selective lamina I projection neurons. Although calb1+ projection neurons are not entirely specific to cold-selective neurons, using an intersectional strategy combined with other genes enriched in this ALS group or cold-induced FosTRAP may further enhance specificity in the future.

      (3) This study shows that cold-selective lamina I projection neurons specifically innervate certain brain targets of the anterolateral tract, including the NTS, PBNrel, and cPAG. This connectivity provides insights into the role of these neurons in cold sensation, which will be an exciting area for future research.

      Weaknesses:

      (1) The sample size for the ex vivo electrophysiology is small. Given the difficulty and complexity of the preparation, this is understandable. However, a larger sample size would have strengthened the authors' conclusions.

      (2) The authors used tdTomato expression to identify brain targets innervated by these cold-selective lamina I projection neurons. Since tdTomato is a soluble fluorescent protein that fills the entire cell, using synaptophysin reporters (e.g., synaptophysin-GFP) would have been more convincing in revealing the synaptic targets of these projection neurons.

      (3) The summary cartoon shown in Figure 7 can be misleading because this study did not determine whether these cold-selective lamina I projection neurons have collateral branches to multiple brain targets or if there are anatomical subtypes that may project exclusively to specific targets. For example, a recent study (Ding et al., Neuron, 2025) demonstrated that there are PBN-projecting spinal neurons that do not project to other rostral brain areas. Furthermore, based on the authors' bulk labeling experiments, the three main brain targets are NTS, PBNrel, and cPAG. The VPL projection is very sparse and almost negligible.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, the authors took advantage of a semi-intact ex vivo somatosensory preparation that includes hindlimb skin to characterize the response of projection neurons in the dorsal horn of the spinal cord to peripheral stimulation, including cold thermal stimuli. The main aim was to characterize the connectivity between peripheral afferents expressing the cold-sensing receptor TRPM8 and a set of genetically tagged neurons of the anterolateral system (ALS). These ALS neurons expressed high levels of the calcium-binding protein calbindin 1.

      In addition, combining different viral tracing methods, the authors could identify the anatomical targets of this specific subset of projection neurons within the brainstem and diencephalon.

      Strengths:

      The use of a relatively new (seldom used previously) transgenic line to label TRPM8-expressing afferents, combined with the genetic characterization of a previously identified subset of projection neurons, adds a specificity to the characterization. The transgenic line appears to capture well the subpopulation of Trpm8-expressing neurons

      In addition, the use of electron microscopy techniques makes the interpretation of the structural contacts more compelling.

      The writing is clear, and the presentation of findings follows a logical flow.

      Overall, this study provides solid, novel information about the brain circuits involved in cold thermosensation.

      Weaknesses:

      In the characterization of recorded neurons in close contact or in the absence of this contact with TRPM8 afferents, the number of recorded neurons is relatively low. In addition, the strength of thermal stimuli is not very well controlled, preventing a more precise characterization of the connectivity.

      The authors could provide some sense of the effort needed to record from the 6 cold-activated neurons described. How many preparations were needed, etc?

    3. Reviewer #3 (Public review):

      Summary:

      Razlan and colleagues provide a detailed anatomical characterization of lamina I projection neurons in the mouse spinal cord that are densely innervated by primary afferents activated by cooling of the skin. The authors, building on their previous anatomical work, validate a Trpm8-Flp mouse line, show synaptic contacts between Trpm8⁺ boutons and projection neurons at the ultrastructural level, and demonstrate at the physiological level that these neurons specifically respond to cooling stimuli. Next, by taking advantage of their previous transcriptomic analysis of ALS neurons, they identify calbindin as a marker for cold-activated lamina I projection neurons and map their ascending projections to the rostral lateral parabrachial area, caudal periaqueductal gray, and ventral posterolateral thalamus, well-known thermosensory and thermoregulatory centers. Altogether, these findings provide strong anatomical and functional evidence for a direct line of transmission from Trpm8⁺ sensory afferents through Calb1⁺ lamina I neurons to key supraspinal centers controlling perception of cold and thermoregulatory responses.

      Strengths:

      The combination of mouse genetics, electron microscopy, ex vivo physiology, and viral tracing provides convincing evidence for a direct cold pathway. The work validates the Trpm8-Flp line by extensive anatomical and molecular characterization. Integration with previous transcriptomic and anatomical data neatly links the cold-selective lamina I neurons to a molecularly defined cluster of ALS neurons, strengthening the bridge between molecular identity, anatomy, and physiological function.

      Weaknesses:

      While anatomical evidence for direct synaptic connectivity between Trpm8+ afferents and lamina I projection neurons is compelling, a physiological demonstration of strict monosynaptic transmission is not shown. The conclusion that these inputs are exclusively monosynaptic should be toned down. Similarly, the statement that "Lamina I ALS neurons that are surrounded by Trpm8 afferents are cold-selective" should also be toned down as only a few neurons have been tested and it cannot be excluded that other neurons with similar characteristics may be polymodal.

    1. Reviewer #1 (Public review):

      Summary:

      CCK is the most abundant neuropeptide in the brain, and many studies have investigated the role of CCK and inhibitory CCK interneurons in modulating neural circuits, especially in the hippocampus. The manuscript presents interesting questions regarding the role of excitatory CCK+ neurons in the hippocampus, which has been much less studied compared to the well-known roles of inhibitory CCK neurons in regulating network function. The authors adopt several methods including transgenic mice and viruses, optogenetics, chemogenetics, RNAi, and behavioral tasks to explore these less-studied roles of excitatory CCK neurons in CA3. They find that the excitatory CCK neurons are involved in hippocampal-dependent tasks such as spatial learning and memory formation, and that CCK-knockdown impairs these tasks.

      However, these questions are very dependent on ensuring that the study is properly targeting excitatory CCK neurons (and thus their specific contributions to behavior).

      There needs to be much more characterization of the CCK transgenic mice and viruses to confirm the targeting. Without this, it is unclear whether the study is looking at excitatory CCK neurons or a more general heterogeneous CCK neuron population.

      Strengths:

      This field has focused mainly on inhibitory CCK+ interneurons and their role in network function and activity, and thus this manuscript raises interesting questions regarding the role of excitatory CCK+ neurons, which have been much less studied.

      Weaknesses:

      (1a) This manuscript is dependent on ensuring that the study is indeed investigating the role of excitatory CCK-expressing neurons themselves and their specific contribution to behavior. There needs to be much more characterization of the CCK-expressing mice (crossed with Ai14 or transduced with various viruses) to confirm the excitatory-cell targeting. Without this, it is unclear whether the study is looking at excitatory CCK neurons or a more general heterogeneous CCK neuron population.

      (2) The methods and figure legends are still extremely sparse, still leading to many questions regarding methodology and accuracy. More details would be useful in evaluating the tools and data, and the lack of proper quantification is still prevalent throughout the paper. In many places, only % values are noted, or only images are presented, and the number of cells counted is almost never reported.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, the authors have demonstrated, through a comprehensive approach combining electrophysiology, chemogenetics, fiber photometry, RNA interference, and multiple behavioral tasks, the necessity of projections from CCK+ CAMKIIergic neurons in the hippocampal CA3 region to the CA1 region for regulating spatial memory in mice. Specifically, authors have shown that CA3-CCK CAMKIIergic neurons are selectively activated by novel locations during a spatial memory task. Furthermore, authors have identified the CA3-CA1 pathway as crucial for this spatial working memory function, thereby suggesting a pivotal role for CA3 excitatory CCK neurons in influencing CA1 LTP. The data presented appear to be well-organized and comprehensive.

      Strengths:

      (1) This work combined various methods to validate the excitatory CCK neurons in the CA3 area; these data are convincing and solid.

      (2) This study demonstrated that the CA3-CCK CAMKIIergic neurons are involved in the spatial memory tasks; these are interesting findings, which suggest that these neurons are important targets for manipulating the memory-related diseases.

      (3) This manuscript also measured the endogenous CCK from the CA3-CCK CAMKIIergic neurons; this means that CCK can be released under certain conditions.

      Weaknesses:

      In summary, this work can be formally accepted after the revision. For the limitations of the revision, the distinct neural effects of cholecystokinin (CCK) receptors (CCK-1R, CCK-2R, and CCK-3R) on hippocampal function have not been fully elucidated. Recent studies indicate that CCK-2R can modulate hippocampal activity at CA3-Schaffer collateral synapses; however, the roles of CCK-1R and CCK-3R in hippocampal function remain poorly characterized, with limited experimental evidence supporting their involvement. Overall, this study provides an interesting and novel perspective on the role of excitatory CCK signaling in hippocampus-dependent navigation learning.

    3. Reviewer #3 (Public review):

      Summary:

      Fengwen Huang et al. used multiple neuroscience techniques (transgenetic mouse, immunochemistry, bulk calcium recording, neural sensor, hippocampal-dependent task, optogenetics, chemogenetics, and interfer RNA technique) to elucidate the role of the excitatory cholecystokinin-positive pyramidal neurons in the hippocampus in regulating the hippocampal functions, including navigation and neuroplasticity.

      Strengths:

      (i) The authors provided the distribution profiles of excitatory cholecystokinin in the dorsal hippocampus via the transgenetic mice (Ai14::CCK Cre mice), immunochemistry, and retrograde AAV.

      (ii) The authors used the neural sensor and light stimulation to monitor the CCK release from the CA3 area, indicating that CCK can be secreted by activation of the excitatory CCK neurons.

      (iii) The authors showed that the activity of the excitatory CCK neurons in CA3 is necessary for navigation learning

      (iv) The authors demonstrated that inhibition of the excitatory CCK neurons and knockdown of the CCK gene expression in CA3 impaired the navigation learning and the neuroplasticity of CA3-CA1 projections.

      Weaknesses:

      (i) The causal relationship between navigation learning and CCK secretion remains nebulous; answering this question will require a more sensitive CCK-BR sensor in future work.

    1. Reviewer #1 (Public review):

      Summary:

      The authors test whether the frog buccal ventilatory rhythm generator behaves as a discrete, anatomically localized oscillator or as a distributed, state-dependent network. They combine reduced preparations (segment/subsegment work), systematic extracellular unit surveys over a defined grid, and local AMPA/GABA microinjections in a hemisected brainstem preparation. Based on these approaches, the authors conclude that mild global excitation (bath AMPA) broadens the distribution of rhythmically active units and renders a previously defined "buccal area" functionally non-identifiable as a unique necessary/sufficient locus.

      The central idea is plausible, and the overall experimental strategy is appropriate for the question being asked. However, in its current form, the manuscript overstates the strength of inference supporting the "expansion" and "loss of necessity/sufficiency" conclusions. This is primarily due to (a) statistical treatment of unit-mapping data that does not respect clustering by preparation/animal, (b) inconsistent statistical reporting across sections, and (c) limited interpretability of focal inhibitory perturbations under a globally excited state.

      Strengths:

      (1) The manuscript addresses a clear mechanistic question with broader relevance: whether rhythm generation is best conceptualized as a localized kernel or as an emergent distributed property that changes with excitatory state.

      (2) The authors use convergent approaches (reduced preparations, mapping, and necessity/sufficiency-style pharmacological perturbations), which is appropriate for circuit-level inference.

      (3) A strong element is the within-unit analysis supporting state-dependent changes in phase coupling for a subset of units ("lung" units adopting a buccal-like pattern). The authors' offline PCA-based spike sorting (with cluster-quality selection via silhouette score) provides some reassurance that the reported pre/post injection changes are not simply driven by unit misidentification.

      Weaknesses:

      (1) Pseudoreplication in unit-survey statistics undermines the main mapping inference. The Methods state that "Units were pooled from multiple preparations" and that chi-squared tests were used to compare proportions across conditions (baseline vs 60 nM AMPA). The Results similarly report proportion changes (e.g., 110 units pooled from three preparations vs 137 units pooled from three additional animals) analyzed with chi-squared tests. Because many units come from the same preparation/animal, independence is unlikely to hold; therefore, inference about state-dependent reorganization at the systems level should be made at the preparation/animal level or via hierarchical models that explicitly account for clustering.

      (2) Statistical methods are inconsistently described and need harmonization. In the segment dose-response "Analysis," values are described as compared to zero using a "One-sample t-test." Yet Table 1 is titled as using a "Wilcoxon One-sample Test." These discrepancies must be resolved throughout (Methods, Results, figure legends, and tables), including clear reporting of the unit of n and exact test statistics.

      (3) Unit classification and operational definitions raise interpretational concerns. The unit classification scheme defines "buccal units" as those firing during buccal bursts as well as lung bursts, and explicitly notes that "no units were found which fired only during buccal bursts." This is a consequential result, and it currently reads more like a limitation of detection/classification (or state-space sampled) than a robust biological conclusion. Without additional evidence, it weakens claims about a distinct buccal rhythmogenic module and complicates the interpretation of "buccal identity" changes under excitation.

      (4) Microinjection mapping: high exclusion rate and alternative explanations for 'loss of necessity' under excitation. The manuscript reports that 15 experiments were conducted, but 9 were excluded because the buccal area was not found or the preparation was "overdriven." This exclusion rate is too high to leave implicit; it raises concerns about selection bias and demands transparent accounting. Moreover, under baseline conditions, GABA (or AMPA-GABA) microinjections reliably reduce/abolish buccal bursts, but under bath 60 nM AMPA, the same injections produce no significant change in instantaneous frequency. This pattern can be interpreted as network redistribution, but it can also reflect state-dependent changes in gain, dynamic range, or local pharmacological impact (e.g., inhibition being comparatively underpowered in the globally excited state). Additional controls/analyses are required to distinguish these explanations.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors investigate the response of the amphibian respiratory rhythm generator under varying excitability conditions. They use pharmacological agents to increase and/ or decrease synaptic excitability and demonstrate the resilience of buccal rhythms under different conditions. They employ these results to formulate their primary thesis, that there is no obligatory locus of the buccal respiratory rhythm in the frog, and that their respiratory rhythmogenic mechanisms should be considered diffuse and anatomically distributed across a larger brainstem region.

      Strengths:

      This manuscript is well written, with a sufficiently large number of experiments, for which the authors should be congratulated.

      Weaknesses:

      The presented results don't support the authors' main conclusions, and the interpretation of the data is heavily biased toward their hypothesis. This impregnates an unsubstantiated narrative in the Abstract, Introduction, and Discussion of this manuscript, which must be reexamined with the following points in consideration:

      (1) The authors seem to confuse degeneracy with redundancy. For instance, at line 54, they state, "These findings support the broader hypothesis that respiratory rhythm-generating circuits can switch to being diffuse and redundant, with discrete oscillators quickly drowning in a sea of excitations."

      Redundancy means having the same component repeated multiple times to buffer the failure of any single component, whereas degeneracy means different functional components that compensate for one another under perturbations (Goaillard and Marder, ARN 2021)

      Since the premotor-lung units get converted to buccal units under high excitability, this suggests a degenerate mechanism for respiratory rhythm generation- rather than a redundant mechanism, where there should be multiple buccal units that get recruited under different excitability conditions.

      (2) Line 83, "but the essential requirement for a discrete, rudimentary buccal oscillator is also lost".

      This statement is not supported by the data presented in this study. How does the expansion of the buccal unit imply that the essential requirement for discreteness is lost? Under increased excitability, does the burst/rhythm initiation zone also expand? Or does it still remain centered around the location of buccal units under physiological conditions? Increased excitability can lead to recruitment of a larger area, without a change in the location of the rhythmogenic kernel.

      (3) Line 86, "... oscillators should be viewed as promiscuous flexible functional entities that expand or contract...".

      Oscillators can be regarded as promiscuous only if, under physiological conditions, they switch positions. Under high excitability, only the flexibility argument holds, which has been established in mammals before (e.g., CA Del Negro, K Kam, JA Hayes, JL Feldman, The Journal of physiology 587 (6), 1217-1231; CA Del Negro, C Morgado-Valle, JL Feldman,Neuron 34 (5), 821-830; NA Baertsch, LJ Severs, TM Anderson, JM Ramirez, Proceedings of the National Academy of Sciences 116 (15), 7493-7502; NA Baertsch, HC Baertsch, JM Ramirez Nature communications 9 (1), 843).

      Results:

      (4) Interpretation of data in Figure 6.

      How does the Buccal activity and L2 Power stroke change with 60nm AMPA (in CN5)? Does the increase in the Buccal neurons and decrease in power stroke neurons also reflect in the CN5 activity? Also see comments on Figure 9 data below.

      (5) Interpretation of data in Figure 7.

      Here, classifying buccal neurons solely by spiking may obscure the fact that the 'silent' neurons under baseline conditions were part of the rhythmic network but could not spike due to subthreshold inputs. 60 nM AMPA increased their firing in response to previously subthreshold synchronous inputs during the buccal burst. Intracellular recordings are required to negate this possibility and establish that the neuronal classification is robust.

      (6) Interpretation of data in Figure 8.

      "Lung units can transform into buccal units under excitation".<br /> CN5 buccal and lung bursts need to be compared before and after AMPA injection. From Figure 8 A-D, it is apparent that the example Unit2's activity increases during the buccal bursts, after AMPA injection. However, they are also present in buccal burst pre-AMPA, albeit with less frequency.

      It is striking that the pre-AMPA epoch (panel A) is less than half of the post-AMPA epoch. This would, in itself, lead to a biased estimate of lung units that are active under the baseline condition during the buccal bursts.

      Figure 8G, meta-analysis of lung units spiking during the baseline buccal bursts is warranted to interpret the main claim of this figure. Similarly, analysis of spiking per lung burst for the post-AMPA condition is essential for comparing the lung unit's contribution under high excitability.

      (7) Interpretation of data in Figure 9

      "Buccal area loses importance under increased excitation."

      This interpretation is not fully supported by the data presented in this manuscript. Under 60 nm AMPA, does the ratio of lung burst to buccal burst change in CN5? This analysis is crucial for determining whether the lung units are indeed converted into buccal bursts at the expense of lung activity or whether their appearance during buccal bursts is incidental due to increased excitability. In the baseline, there are 4-5 buccal bursts per lung burst, whereas under high excitability, there are 2-3 buccal bursts per lung burst (Figure 9 A-B). This seems inconsistent with the conclusion that increased excitability converts lung units into buccal units (Figures 6 &7).

      Could the authors comment on the connectivity between the lung and the buccal units? Results in Figure 9A-B indicate that lung units may receive an efference copy of buccal units, and under high excitability, their spikes may generate negative feedback onto the buccal units, terminating their bursts. This could explain the decrease in the buccal-to-lung burst in high-AMPA conditions. This type of circuit interaction resembles the mammalian breathing CPG, in which the parafacial/RTN (which controls the abdominal muscles) and preBötC (which controls the diaphragm) interact and cross-inhibit each other.

      (8) Line 382.

      "Buccal-like bursting produced from two independent slices".

      The two "independent" slices have portions of the same anatomical kernel, the buccal rhythm generator. This experiment is like the sandwich slice preparation of preBötC (Del Negro Lab), in which two thinner slices exhibit rhythmic activity. Thus, the two slices are not independent; they are anatomically adjacent and functionally overlapping.

    3. Reviewer #3 (Public review):

      Summary:

      This study uses isolated frog brainstem preparations to test whether inspiratory rhythm generation is confined to a narrowly defined neural center or instead reflects the activity of a distributed and adaptable network. Building on prior rodent work, the authors examine structural and functional parallels between the frog Buccal Area and the mammalian preBötzinger complex. By increasing excitatory drive, they assess whether a localized rhythmogenic region can expand into a broader network that participates in buccal rhythm generation, providing insight into how respiratory circuits are dynamically reconfigured across physiological states.

      Strengths:

      The work presents compelling evidence that ventilatory rhythm generation is supported by a flexible, state-dependent network rather than a fixed anatomical locus. The experimental preparation is well-suited to address these questions, and the data are generally of high quality. The demonstration that increased excitation recruits a more distributed network parallels observations in mammalian systems and strengthens the translational relevance of the findings. Overall, the analyses are thoughtful, and the interpretations are largely well supported by the results.

      Weaknesses:

      Some issues limit the strength of the conclusions. First, the study does not address the transition from eupnea to gasping in mammals, which could provide important physiological context for the observed AMPA-induced network reorganization. Second, the reported transformation of lung-active neurons into buccal-active neurons would benefit from additional analyses to clarify whether neurons switch identities or acquire dual activity. Finally, the necessity and sufficiency experiments in Figure 9 require further support, particularly through AMPA dose-response analyses and more comprehensive GABA manipulations, to confirm that network expansion does not obscure the continued functional importance of the core buccal region.

    1. Reviewer #1 (Public review):

      Summary:

      Choucri and Treiber have reassessed their previous study on TE-gene chimeric transcripts in neural genes in response to Azad et al (2024). Azad and colleagues argued that, contrary to Choucri and Treiber's findings, chimeric TE-mRNAs are relatively infrequent, and they cautioned that further optimization of bioinformatics pipelines is needed to detect TE insertions from RNAseq accurately. In this short response, Choucri and Treiber clearly demonstrate that differences in the tools used between their study and that of Azad et al. likely account for the contrasting results, along with RT-PCR failure in designing primers that would match the chimeric transcript, and the use of different Drosophila lines. The authors emphasize the need for uniform, standardized criteria in such analysis, which would ultimately strengthen and advance the field.

      Strengths:

      The addition of a ratio to compute the number of splice reads specific to the chimeric transcript and compare to the exon-exon splice reads is really interesting because it opens the door to finally quantify the contribution of chimeric TEs to the overall gene expression, although this is not the scope of the present article. The clear dissection of chimeric transcripts, along with the results from Azad et al, allows us to understand the differences between the two studies confidently. Finally, the discussion on Drosophila lines is indeed essential, given that the lines and even individuals have high TE polymorphism.

      Weaknesses:

      I think it is necessary to add more detail to this article, for instance, the differences between TEchim and Tidal could be laid out more precisely. Regarding the roo example, one of the caveats of this family, along with others, is the presence of simple repeats. It would be important to show that the simple repeats are not interfering with the read mapping. Regarding the experiments, if we are looking for a standardized protocol, then we should have a detailed material and methods section, with every experiment, replicate, and PCR temperature clearly defined. Finally, and in my opinion, more importantly, the use of RT negative controls on the RT PCRs, along with DNA PCRs to show insertion presence, is mandatory for testing the presence of chimeric genes. Of course, water negative PCR controls are also needed, and unfortunately, absent from Figure 3.

    2. Reviewer #2 (Public review):

      Summary:

      This study by Choucri and Treiber aims to directly address a recent critique regarding the role of transposable elements (TEs) in diversifying the neural transcriptome of Drosophila. The authors seek to demonstrate that TEs are not merely genomic "noise" but are frequently and reliably "exonized" into brain-specific mRNA. By introducing an upgraded computational pipeline, TEChim, and conducting precise experimental validations, the authors set out to show that TE-mediated splicing represents a genuine biological phenomenon that expands the molecular repertoire of the nervous system.

      Strengths:

      The study's primary strength lies in its rigorous technical "forensic" analysis of previous failed replication attempts. The authors convincingly demonstrate that the lack of signal in the opposing study stemmed from a fundamental methodological mismatch: the software used by the critics (TIDAL) is logically incapable of detecting splice sites located within TE sequences. Importantly, the authors complement this computational clarification with definitive experimental evidence through an effective "experimental rescue." By employing correctly designed primers and matching the genetic backgrounds of the fly strains, thereby accounting for genomic polymorphisms, they successfully validated all seven loci that were previously reported as undetectable. This dual-pronged strategy, addressing both algorithmic bias and experimental design, establishes a more robust technical benchmark for the detection and validation of TE-derived exons in neural tissues.

      Weaknesses:

      While the technical rebuttal is highly convincing, the scope of the study remains primarily defensive. As a response to a prior critique, the work focuses on establishing the existence and detectability of chimeric TE-derived transcripts rather than exploring their broader functional consequences. As a result, there is limited new insight into how these TE-modified isoforms influence neural circuit function or organismal behavior. In addition, the detection and validation of these events remain technically demanding, requiring deep sequencing and specialized bioinformatic expertise, which may limit broader adoption by laboratories without dedicated computational resources.

    3. Reviewer #3 (Public review):

      Summary:

      This manuscript by Choucri and Treiber responds to a recent paper by Azad et al., which responds to a paper by Treiber and Wadell (Genome Research, 2020). The controversy relates to the detection of transcripts with transposable elements (TEs) spliced into them in the Drosophila brain.

      Strengths:

      The authors now argue convincingly that these transcripts exist using an improved, updated version of their pipeline. They also validate some of their findings using RT-PCR and explain why Azad et al. failed to detect these transcripts due to methodological errors. Overall, I am convinced that these transcripts exist and that the TE-derived transcripts described by Choucri and Treiber are real.

      Weaknesses:

      The authors should mention that combining PCR-amplified cDNA generation with short-read sequencing is suboptimal for detecting TE-fusion transcripts. Recently, direct long-read ONT RNA sequencing, which does not require amplification and spans the entire transcript, has been used to detect similar transcripts in human stem cells and the human brain (PMID: 40848716 & Garza et al, BioRxiv). Had the authors used this technology to validate their findings, there would be no question about these transcripts. If not doing such experiments, then they should at least discuss the possibility and the advantage of the approach.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the minor comments raised in the previous round of review.]

      Summary:

      In this manuscript, Chengjian Zhao et al. focused on the interactions between vascular, biliary, and neural networks in the liver microenvironment, addressing the critical bottleneck that the lack of high-resolution 3D visualization has hindered understanding of these interactions in liver disease.

      Strengths:

      This study developed a high-resolution multiplex 3D imaging method that integrates multicolor metallic compound nanoparticle (MCNP) perfusion with optimized CUBIC tissue clearing. This method enables the simultaneous 3D visualization of spatial networks of the portal vein, hepatic artery, bile ducts, and central vein in the mouse liver. The authors reported a perivascular structure termed the Periportal Lamellar Complex (PLC), which is identified along the portal vein axis. This study clarifies that the PLC comprises CD34⁺Sca-1⁺ dual-positive endothelial cells with a distinct gene expression profile, and reveals its colocalization with terminal bile duct branches and sympathetic nerve fibers under physiological conditions.

      Comments on revisions:

      The authors very nicely addressed all concerns from this reviewer. There are no further concerns and comments.

    2. Reviewer #3 (Public review):

      Xu, Cao and colleagues aimed to overcome the obstacles of high-resolution imaging of intact liver tissue. They report successful modification of the existing CUBIC protocol into Liver-CUBIC, a high-resolution multiplex 3D imaging method that integrates multicolor metallic compound nanoparticle (MCNP) perfusion with optimized liver tissue clearing, significantly reducing clearing time and enabling simultaneous 3D visualization of the portal vein, hepatic artery, bile ducts, and central vein spatial networks in the mouse liver. Using this novel platform, the researchers describe a previously unrecognized perivascular structure they termed Periportal Lamellar Complex (PLC), regularly distributed along the adult liver portal veins.<br /> Using available scRNAseq data, the authors assessed the CD34⁺Sca-1⁺ cells' expression profile, highlighting mRNA presence of genes linked to neurodevelopment, bile acid transport, and hematopoietic niche potential. Different aspects of this analysis were then addressed by protein staining of selected marker proteins in the mouse liver tissue. Next, the authors addressed how the PLC and biliary system react to CCL4-induced liver fibrosis, implying PLC dynamically extends, acting as a scaffold that guides the migration and expansion of terminal bile ducts and sympathetic nerve fibers into the hepatic parenchyma upon injury.

      The work clearly demonstrates the usefulness of the Liver-CUBIC technique and the improvement of both resolution and complexity of the information, gained by simultaneous visualization of multiple vascular and biliary systems of the liver. The identification of PLC and the interpretation of its function represent an intriguing set of observations that will surely attract the attention of liver biologists as well as hepatologists. The importance of the CD34+/Sca1+ endothelial cell population and claims based on transcriptomic re-analysis require future assessment by functional experimental approaches to decipher the functional molecules involved in PLC formation, maintenance, and the involvement in injury response before establishing their role in biliary, arterial, and neural liver systems.

      Strengths:

      The authors clearly demonstrate an improved technique tailored to the visualization of the liver vasulo-biliary architecture in unprecedented resolution.<br /> This work proposes a new morphological feature of adult liver facilitating interaction between the portal vein, hepatic arteries, biliary tree, and intrahepatic innervation, centered at previously underappreciated protrusions of the portal veins - PLCs.

      Weaknesses:

      The importance of CD34+Sca1+ endothelial cell sub-population for PLC formation and function was not tested and warrants further validation.

    1. Reviewer #1 (Public review):

      This manuscript investigates how dentate gyrus (DG) granule cell subregions, specifically suprapyramidal (SB) and infrapyramidal (IB) blades, are differentially recruited during a high cognitive demand pattern separation task. The authors combine TRAP2 activity labeling, touchscreen-based TUNL behavior, and chemogenetic inhibition of adult-born dentate granule cells (abDGCs) or mature granule cells (mGCs) to dissect circuit contributions.

      This manuscript presents an interesting and well-designed investigation into DG activity patterns under varying cognitive demands and the role of abDGCs in shaping mGC activity. The integration of TRAP2-based activity labeling, chemogenetic manipulation, and behavioral assays provides valuable insight into DG subregional organization and functional recruitment. However, several methodological and quantitative issues limit the interpretability of the findings. Addressing the concerns below will greatly strengthen the rigor and clarity of the study.

      Major points:

      (1) Quantification methods for TRAP+ cells are not applied consistently across panels in Figure 1, making interpretation difficult. Specifically, Figure 1F reports TRAP+ mGCs as density, whereas Figure 1G reports TRAP+ abDGCs as a percentage, hindering direct comparison. Additionally, Figure 1H presents reactivation analysis only for mGCs; a parallel analysis for abDGCs is needed for comparison across cell types.

      (2) The anatomical distribution of TRAP+ cells is different between low- and high-cognitive demand conditions (Figure 2). Are these sections from dorsal or ventral DG? Is this specific to dorsal DG, as itis preferentially involved in cognitive function? What happens in ventral DG?

      (3) The activity manipulation using chemogenetic inhibition of abDGCs in AsclCreER; hM4 mice was performed; however, because tamoxifen chow was administered for 4 or 7 weeks, the labeled abDGC population was not properly birth-dated. Instead, it consisted of a heterogeneous cohort of cells ranging from 0 to 5-7 weeks old. Thus, caution should be taken when interpreting these results, and the limitations of this approach should be acknowledged.

      (4) There is a major issue related to the quantification of the DREADD experiments in Figure 4, Figure 5, Figure 6, and Figure 7. The hM4 mouse line used in this study should be quantified using HA, rather than mCitrine, to reliably identify cells derived from the Ascl lineage. mCitrine expression in this mouse line is not specific to adult-born neurons (off-targets), and its expression does not accurately reflect hM4 expression.

      (5) Key markers needed to assess the maturation state of abDGCs are missing from the quantification. Incorporating DCX and NeuN into the analysis would provide essential information about the developmental stage of these cells.

      Minor points:

      (1) The labeling (Distance from the hilus) in Figure 2B is misleading. Is that the same location as the subgranular zone (SGZ)? If so, it's better to use the term SGZ to avoid confusion.

      (2) Cell number information is missing from Figures 2B and 2C; please include this data.

      (3) Sample DG images should clearly delineate the borders between the dentate gyrus and the hilus. In several images, this boundary is difficult to discern.

      (4) In Figure 6, it is not clear how tamoxifen was administered to selectively inhibit the more mature 6-7-week-old abDGC population, nor how this paradigm differs from the chow-based approach. Please clarify the tamoxifen administration protocol and the rationale for its specificity.

      Comments on revisions:

      I appreciate the authors' careful and thorough revisions. They have addressed all of my previous concerns satisfactorily, and the manuscript is now significantly strengthened. I have no further concerns.

    2. Reviewer #2 (Public review):

      In this study, the authors investigate how increasing cognitive demand shapes activity patterns in the dorsal dentate gyrus (DG). Using a touchscreen-based TUNL task combined with TRAP/c-Fos tagging, birth-dating of adult-born granule cells (abDGCs), and chemogenetic inhibition, they show that higher task demand increases mature granule cell (mGC) recruitment and enhances suprapyramidal (SB) versus infrapyramidal (IB) blade bias. Functionally, mGC inhibition reduces overall activity and impairs performance without disrupting blade bias, whereas inhibition of {less than or equal to}7-week-old abDGCs increases mGC activity, abolishes blade bias, and impairs discrimination under high-demand conditions. These findings suggest that effective pattern separation depends not only on overall DG activity levels but also on the spatial organization of recruited ensembles.

      The integration of touchscreen TUNL with temporally controlled activity tagging and birth-dated cohorts is technically strong. Quantification of SB-IB bias and radial/apical distributions adds anatomical precision beyond bulk activity measures. The comparison between mGC and abDGC inhibition is conceptually compelling and supports dissociable functional roles. Overall, the data convincingly demonstrate that increasing cognitive demand amplifies blade-biased DG recruitment and that mGCs and abDGCs differentially contribute to both behavioral performance and network organization.

      However, how abDGCs are integrated into the mGC network under high cognitive demand remains unresolved. Additional experiments are needed to clarify how abDGCs shape spatial recruitment patterns and whether they directly inhibit or indirectly regulate mGC activity to maintain high performance.

      Furthermore, the authors frame "high cognitive demand" as a multidimensional construct encompassing broad behavioral challenge. It would strengthen the work to delineate how local abDGC-mGC circuit interactions regulate specific task components in real time. This will require higher temporal resolution approaches, as TRAP and c-Fos labeling integrate activity over prolonged windows and primarily reflect sustained engagement rather than moment-to-moment computations.<br /> The central conclusion that dentate function depends on coordinated spatial recruitment rather than total activity magnitude is supported by the data, although mechanistic interpretations should be tempered given methodological limitations.<br /> Overall, this work advances models of adult neurogenesis by emphasizing a critical-period modulatory role of abDGCs in organizing DG network activity during high-demand discrimination. The combined behavioral and circuit-level framework is likely to be influential in the field.

    3. Reviewer #3 (Public review):

      This study examines the role of dentate gyrus neuronal populations, reflecting neurogenesis and anatomical location (suprapyramidal vs infrapyramidal blade), in a mnemonic discrimination task that taxes the pattern separation functions of the dentate. The authors measure dentate gyrus activity resulting from cognitive training and test whether adult neurogenesis is required for both the anatomical patterns of activity and performance in the cognitive task. The authors find that more cognitively challenging variants of the task evoked more dentate activity, but also distinct patterns of activity (more activity in the suprapyramidal blade, less in the infdrapyramidal blade). Using chemogenetic approaches they silence mature vs immature dentate gyrus neurons and find that only mature neurons (either the general population or specifically mature adult-born neurons), and not immature adult-born neurons, are required for the difficult version of the task. Inhibition of mature adult-born neurons furthermore increased overall activity in the dentate and reduced the biased pattern of activity across the blades, consistent with evidence that adult-born neurons broadly regulate dentate gyrus activity.

      Comments on revisions:

      I appreciate the efforts the authors have taken to revise this manuscript. I have only minor concerns with this revised version of the manuscript:

      Methods state that significance is defined as P<0.05 but some results are interpreted as significant when P=0.05. Either the alpha value needs to change or the interpretation needs to change.

      I believe the statistical results for group and blade effects for the ANOVAs, in Figs 2,3 & 4, appear to be switched (blade should be significant, not group).

      I appreciate that sometimes there is not a perfect overlap between immunohistochemical signals, but I continue to believe that the spatially-non-overlapping TRAP and EDU signals in Fig 3 is caused by these 2 markers being in different cells. A Z-stack or orthogonal projection could verify/disprove this concern.

    1. Reviewer #1 (Public review):

      Kong et al.'s work describes a new approach that does exactly what the title states, "Correction of local beam-induced sample motion in cryo-EM images using a 3D spline model." It is, therefore, a more elaborate approach than current methods in the field for the "movie alignment" stage. Additionally, the work uses 2DTM (2D Template Matching)-related measurements to quantify the improvement of the new method compared to other methods in the field. I find both parts very compelling (the new method and the testing approach)

      On a "focused" view, the strengths of the work rest on presenting a better approach for motion correction and on measuring their performance very well at the 2D level in a compelling manner

      On a more "general" view, the authors introduce the important notion that even one of the most worked-out steps in the processing workflow can still be done better in a measurable way, and that this could lead to better results beyond the 2DTM metrics used for testing, reflecting in better results along the processing pipeline (although the manuscript does not explore further this notion)

      On the "usability" side, the method is still CPU-based and is slower than standards in the field. This may pose significant limitations in practical work, although the authors are aware of this issue and are working on it.

    2. Reviewer #2 (Public review):

      Summary:

      The authors present a new method, Unbend, for measuring motion in cryo-EM images, with a particular emphasis on more challenging in situ samples such as lamella and whole cells (that can be more prone to overall motion and/or variability in motion across a field of view). Building on their previous approach of full-frame alignment (Unblur), they now perform full-frame alignment followed by patch alignment, and then use these outputs to generate a 3D model of the motion. This model allows them to estimate a continuous, per-pixel shift field for each movie frame that aims to better describe complex motions and so ultimately generate improved motion-corrected micrographs. Performance of Unbend is evaluated using the 2D template matching (2DTM) method developed previously by the lab, and results are compared to using full-frame correction alone and to the leading local motion correction methods. Several different in situ samples are used for evaluation covering a broad range that will be of interest to the rapidly growing in situ cryo-EM community.

      Strengths:

      The method appears an elegant way of describing complex motions in cryo-EM samples and the authors present sound data that Unbend generally improves SNR of aligned micrographs as well as increases detection of particles matching the 60S ribosome template when compared to using full-frame correction alone and since review to the leading local motion correction methods. The authors also give interesting insights into how different areas of a lamella behave with respect to motion by using Unbend on a montage dataset collected previously by the group. There is growing interest in imaging larger areas of in situ samples at high resolution and these insights contribute valuable knowledge. Additionally, the availability of data collected in this study through the EMPIAR repository will be much appreciated by the field.

      Weaknesses:

      A major weakness was comparing this method to full-frame approaches only but this has since been addressed by the authors during review and Unbend is compared to MotionCor2, 3, CryoSPARC and Warp. The improvements here are smaller, generally it seems to perform on par with the above methods, but there are significant gains for certain samples (e.g. the M. pneumoniae sample). A comment from this reviewer about using an adaptive approach to decide if/when to proceed to the full Unbend pipeline, over full-frame alone, has been addressed by the authors.

    3. Reviewer #3 (Public review):

      Summary

      Kong and coauthors describe and implement a method to correct local deformations due to beam induced motion in cryo-EM movie frames. This is done by fitting a 3D spline model to a stack of micrograph frames using cross-correlation-based local patch alignment to describe the deformations across the micrograph in each frame, and then computing the value of the deformed micrograph at each pixel by interpolating the undeformed micrograph at the displacement positions given by the spline model. A graphical interface in cisTEM allows the user to visualise the deformations in the sample, and the method is proved to be successful by showing improvements in 2D template matching (2DTM) results on the corrected micrographs using five in situ samples.

      Impact

      This method has great potential to further streamline the cryo-EM single particle analysis pipeline by shortening the required processing time as a result of obtaining higher quality particles early in the pipeline, and is applicable to both old and new datasets, therefore being relevant to all cryo-EM users.

      Strengths

      (1) The key idea of the paper is that local beam induced motion affects frames continuously in space (in the image plane) as well as in time (along the frame stack), so one can obtain improvements in the image quality by correcting such deformations in a continuous way (deformations vary continuously from pixel to pixel and from frame to frame) rather than based on local discrete patches only. 3D splines are used to model the deformations: they are initialised using local patch alignments and further refined using cross-correlation between individual patch frames and the average of the other frames in the same patch stack.

      (2) Another strength of the paper is using 2DTM to show that correcting such deformations continuously using the proposed method does indeed lead to improvements, as evidenced by the number of particles found and the quality of the detections (measured using 2DTM SNR). This is shown using five in situ datasets, where local motion is quantified using statistics based on the estimated motions of ribosomes. The same analysis is performed using other deformation correction tools, with Unbend showing superior performance in terms of particle detected or 2DTM SNR of the detections.

    1. Reviewer #2 (Public review):

      Summary:

      In this study, Bansal et al examine and characterize feeding behaviour in Anopheles stephensi mosquitoes. While sharing some similarities to the well-studied Aedes aegypti mosquito, the authors demonstrate that mated-females, but not unmated (virgin) females, exhibit suppression in their blood-feeding behaviour after imbibing an initial bloodmeal. Using brain transcriptomic analysis comparing sugar fed, blood fed and starved mosquitoes, several candidate genes potentially responsible for influencing blood-feeding behaviour were identified, including two neuropeptides (short NPF and RYamide) that are known to modulate feeding behaviour in other mosquito species. Using molecular tools including in situ hybridization, the authors map the distribution of cells producing these neuropeptides in the nervous system and in the gut. Further, by implementing systemic RNA interference (RNAi), the study suggests that both neuropeptides (particularly in the brain, but not in the abdomen since knockdown outside the brain did not affect feeding behaviour) appear to promote blood-feeding while having no impact on sugar feeding. Interestingly, when either of these two neuropeptide gene transcripts were reduced independently by RNAi, the proportion of females acquiring a blood meal was not affected, whereas simultaneous knockdown of both sNPF and RYa led to a reduction in blood feeding behaviour but did not impact sugar feeding.

      Given that the expression of both neuropeptide genes was found in mostly in non-overlapping brain neurons, this suggests that these two neuropeptides may elicit at least partially complementary actions promoting blood feeding in A. stephensi. Indeed, their putative receptors appear to be colocalized within several neurons within the brain, which could explain why knockdown of both sNPF and RYa transcripts was required to affect blood feeding behaviour (although authors could not confirm if either of these neuropeptides act independently as only partial knockdown was achieved in the brain). Finally, while sNPF was mapped to brain neurons and midgut enteroendocrine cells, the authors mapped RYa only in the brain while reporting expression in the abdomen by qPCR, but that was not localized to the midgut EECs (like sNPF). Therefore, the source of RYamide in the abdomen remains unknown in this mosquito species, but could involve the abdominal ganglia where this neuropeptide has been localized in Ae. aegypti.

      Strengths and/or weaknesses:

      Overall, the manuscript was effectively communicated. Previous concerns and requested clarifications have been addressed in the revised manuscript. While advanced cell-specific tools are lacking in this mosquito species, one weakness here is that peptides could have been applied ectopically in attempts to rescue the deficit in blood feeding behaviour following knockdown by RNAi. Further insight in this regard may be provided in future studies by this and other research groups.

      Reviewing editor comment:

      Inclusion of a schematic in Supplementary Figure S9B addresses the point raised by reviewer 1 in the previous round.

    1. Reviewer #1 (Public review):

      Summary:

      The authors aimed to investigate how short-term visual deprivation influences tactile processing in the primary somatosensory cortex (S1) of sighted rats. They justify the study based on previous studies that have shown that long-term blindness can enhance tactile perception, and aim to investigate the change in neural representations underlying rapid, short-term cross-modal effects. The authors recorded local field potentials from S1 as rats encountered different tactile textures (smooth and rough sandpaper) under light and dark conditions. They used deep learning techniques to decode the neural signals and assess how tactile representations changed across the four different conditions. Their goal was to uncover whether the absence of visual cues leads to a rapid reorganization of tactile encoding in the brain.

      Strengths:

      The study effectively integrates high-density local field potential (LFP) recordings with convolutional neural network (CNN) analysis. This combination allows for decoding high-dimensional population-level signals, revealing changes in neural representations that traditional analyses (e.g., amplitude measures) failed to detect. The custom treadmill paradigm permits independent manipulation of visual and tactile inputs under stable locomotion conditions. Gait analysis confirms that motor behavior was consistent across conditions, strengthening the conclusion that neural changes are due to sensory input rather than movement artifacts.

      Weaknesses:

      (1) While the study interprets the emergence of more distinct texture representations in the dark as evidence of rapid cross-modal plasticity, the claim rests on correlational data from a short-term manipulation and decoding analysis. The authors show that CNN-derived feature embeddings cluster more clearly by texture in the dark, but this does not directly demonstrate plasticity in the classical sense (e.g., synaptic or circuit-level reorganization). The authors have noted this as a limitation and have clarified that the observed changes reflect functional reorganization rather than structural plasticity.

      (2) Although gait was controlled, changes in arousal or exploratory behavior in light versus dark conditions might play a role in the observed neural differences. The authors have controlled for various factors in relation to locomotion, but future studies would benefit from more direct behavioural readouts of arousal states (e.g., via pupillometry or cortical state indicators).

      (3) It should be noted that the time course of the observed changes (within 10 minutes) is quite rapid, and while intriguing, the study does not include direct evidence that the underlying circuits were reorganized-only that population-level signals become more discriminable. The authors have adequately discussed this as an avenue for more mechanistic future research.

      (4) The authors have adequately discussed that, while these findings are consistent with somatotopy and context-dependent dynamics, they do not provide strong independent evidence for novel spatial or temporal organization.

      (5) The authors have also discussed that, while the neural data suggest enhanced tactile representations, the study does not assess whether rats' actual tactile perception improved. Future studies including an assessment of a behavioral readout (e.g., discrimination accuracy), would be insightful.

      (6) The authors' discussion about the implications for sensory rehabilitation, including Braille training and haptic feedback enhancement was a bit premature, but they have amended this, and it remains an interesting translational potential to be explored in future studies.

      (7) While the CNN showed good performance, more transparent models (e.g., linear classifiers or dimensionality reduction) appear to not exceed chance level. The implications of this are that there is an underlying complex structure in the LFPs that has yet to be fully uncovered, on the mechanistic level. This would be important to push the findings forward in future studies.

      Therefore, while the authors raise interesting hypotheses around rapid plasticity, somatotopic dynamics, and rehabilitation, the evidence for each is indirect. Stronger claims will require future causal experiments, behavioral readouts, and mechanistic specificity beyond what the current data provides. However, the work represents an interesting starting point to a more mechanistic understanding in the future.

    2. Reviewer #2 (Public review):

      Summary:

      Yamashiro et al. investigated how transient absence of visual input (i.e. darkness) impacts tactile neural encoding in the rat primary somatosensory cortex (S1). They recorded local field potentials (LFPs) using a 32-channel array implanted in forelimb and hindlimb primary somatosensory cortex while rats walked on smooth or rough textures under illuminated and dark conditions. Employing a convolutional neural network (CNN), they successfully decoded both texture and lighting conditions from the LFPs. The authors conclude that the subtle differences in LFP patterns underlie tactile representation surface roughness and become more distinct in darkness, suggesting a rapid cross-modal reorganization of the neural code for this sensory feature.

      Strengths:

      • The manuscript addresses a valuable question regarding how sensory cortices dynamically adapt to changes in sensory context.<br /> • The use of machine learning (CNNs) enables the analysis to go beyond conventional amplitude-based metrics, potentially uncovering subtle but meaningful effects.<br /> • The authors have substantially improved the manuscript with clearer figures, additional statistical analyses (including permutation tests and cross-validation), and greater methodological transparency.

      Weaknesses:

      • The new analyses (grand-average LFPs, correlation maps, wavelet decompositions, attribution-score correlations) improve transparency but do not yet clarify which specific neural features the CNN exploits, leaving the central interpretability question unresolved.<br /> • A plausible alternative explanation for the increased discriminability in darkness remains insufficiently ruled out: visually driven activity in the light condition (e.g., ambient illumination changes or self-motion-induced visual input) could contaminate S1 LFPs and account for the effect without reflecting a true neural representational change.<br /> • Behavioural and order controls have been improved but remain somewhat limited in sample size.

      Overall assessment:

      The revised manuscript is clearer, more transparent, and technically strengthened. However, the true nature of the signal changes underlying the observed differences in discriminability remains unclear, limiting the scientific strength of the conclusions. The possibility that visual interference contributes to the observed effects remains a plausible and untested alternative interpretation. Additional experiments or analyses quantifying visually evoked activity in S1 would be required to confirm the claim of genuine reorganization of neural representation depending on the illumination condition.

    1. Reviewer #2 (Public review):

      Summary:

      An abundant literature documents molecular changes in the rodent hypothalamus that occur during the transition from prepubertal to mature reproductive physiology. Equally well documented is the role of sex steroids and their receptors during this important period of reproductive development, as well as the importance of GABAergic and glutamatergic neurons. The medial preoptic area (MPOA) is known to play a central role in expression of sexually dimorphic reproductive function and previously reported sexually dimorphic patterns of gene expression are consistent with this role. The present manuscript extends this knowledge base and reports the results of a detailed evaluation of transcriptional dynamics in the MPOA during the adolescent transition to maturity with a particular focus on the role of the estrogen receptor gene (Esr1). Both single cell RNA sequencing (scRNseq) and multiplex in situ hybridization methods were employed and the results subjected to detailed computational analyses to demonstrate that the transcriptomic structure of MPOA neurons displays both sex and cell type specific expression profiles. In addition, both hormonal and genetic manipulations of Esr1 signaling during puberty altered the transcriptional profiles of MPOA neurons, and these changes aligned with maturation of hormone-dependent reproductive function. The authors provide this evidence to illustrate Esr1-dependent control of gene regulatory networks required for normal expression of reproductive behaviors expressed during the transition from adolescence to adulthood. The results presented in this manuscript are extensive and represent the most comprehensive evaluation of transcriptomic changes during reproductive maturation to date. The methods appear strong and the results provide a rich data set that will support a good deal of future analysis.

      Strengths:

      (1) The major strength of this manuscript is the extensive set of images and graphs that illustrate molecular changes that occur in MPOA neurons during adolescence, although additional spatial detail as to locations of the source neurons would be welcome in order to place the changes in the proper circuitry context.

      (2) Targeting Esr1 deletion to MPOA GABA neurons is a good choice, given how these cells have been implicated in sexual differentiation of reproductive behavior previously, and the lack of comparable responses in glutamatergic neurons is convincing. The AAV-frtFlex-Cre virus created by the investigators is a most useful tool for such studies. Profiling distinct transcriptomic trajectories in GABA and glutamatergic neurons during reproductive maturation is impressive and leads to some of the best supported conclusions in this paper.

      (3) Cellular and molecular resolution of the transcriptomics data appears excellent, however, because the source tissue for the scRNAseq analysis was obtained by bulk dissection of the MPOA anatomical resolution is limited. This problem is addressed to some extent by careful comparison of scRNAseq results with previously published spatial transcriptomics data. The HM-HCR-FISH analysis clearly documents spatially restricted changes in gene expression, but it is hard to discern where these changes occur based on the images presented or the descriptions included in the Results. The anatomical schematic included in Figure 4 suggests that investigators are not familiar with components of the MPOA (see Allen Mouse Brain Atlas).

      Weaknesses:

      (1) A major conceptual flaw is that the authors do not distinguish between genetically determined sex differences in patterns of gene expression and differences caused by the fact that MPOA neurons are exposed to different endocrine environments in adolescent males and females, which can cause different transcriptional trajectories independent of genetic sex. This issue does not render their results invalid, but their terminology should address the issue in the discussion and "limitations" section. At the very least the endocrine status of "intact females" should be included.

      (2) A major technical flaw is that the MPOA is treated as a functionally distinct brain region (block dissections) with uniform distribution of cell types (FISH data are not illustrated or reported with sufficient spatial detail). Thus, an enormous amount of molecular data is provided that cannot be mapped to distinct neural circuits, thereby limiting the neurobiological impact. This is also a weakness of the FISH data, which is presented with only small regions illustrated without anatomical detail. In fact, some images are compared that appear to illustrate different MPOA structures, although it is impossible to be certain of this due to the lack of morphological landmarks. The analysis of how Esr1 orchestrates regulatory gene networks is impressive and interesting, but the fact that many of the observed transcriptional events occur in neural circuits that do not overlap confounds interpretation.

      (3) The locations of the AAV injections should be characterized because deleting Esr1 in multiple distinct parts of the MPOA will likely confound interpretation. This is especially problematic given the limited number of mice used for parts of the RNAscope analysis.

      (4) Although the focus of these experiments on adolescence is welcome, neither the Introduction nor the Discussion do a good job of placing these studies in the context of what is already known about brain maturation during puberty. It is true that this is very much a results-focused manuscript, but the scholarship can be improved. Simply stating that your results are consistent with previous reports places an undue burden on the reader to go figure out what is new.

      (5) Throughout the manuscript, the authors utilize obscure abbreviations, which often makes reading their text overly cumbersome. This is certainly justified in certain instances where complex names of analytical methods are used repeatedly, but the authors are encouraged to try and simply their use of non-standard abbreviations.

      Comments on revisions:

      The authors have considered issues raised during the initial review. Although there do not appear to be significant changes to analyses, figures or conclusions, the authors have added important revisions listing limitations in study design and methodology that impact interpretation.

    2. Reviewer #3 (Public review):

      The paper identifies effects of gonadal hormones within hormone-responsive GABAergic neurons in the MPOA. Although it is not surprising that hormones have effects on neurons that express hormone receptors, the current paper adds insights with higher cellular and spatial resolution than previous work and focuses on adolescence period. The paper also identifies a major role for Esr1-dependent mechanisms on behavior using an intersectional genetic strategy to ablate Esr1 in GABAergic or glutamatergic neurons in the MPOA.

      The authors have thoughtfully addressed the reviews, in particular by focusing quantitative analyses on Vgat+Esr1+ clusters and adding important technical and conceptual considerations in the limitations section.

      I have one remaining minor concern. I appreciate that the text now defines "transcriptional maturation". However, the term seems inappropriate when describing the "minimal transcriptional changes" in Vgat+hormone RLow clusters, which implies that they are transcriptionally immature. Do the authors mean to imply that transcriptional maturation is observed in Vgat+Esr1+ clusters but not Vgat+hormone RLow clusters? The authors also use the term "hormone-dependent transcriptional dynamics", which I think is more appropriate. For example, hormone-dependent transcriptional dynamics are observed in Vgat+Esr1+ clusters but not Vgat+hormone RLow clusters.

    1. Reviewer #1 (Public review):

      Summary:

      Overall, this is an interesting and well-written manuscript on a fascinating question in a "charismatic" model system.

      Strengths:

      1) The Introduction is concise, though it might be helpful to the non-specialist reader to learn a bit more about what is known about the social control of somatic growth across diverse species (including humans), which would help to make this work more generally interesting.

      (2) The experiment is well-designed.

      (3) The data collected are comprehensive.

      (4) The complementary analysis of both feeding and aggression/submission data with and without known social roles is a neat idea and compelling!

      Weaknesses:

      (1) I was surprised that the HPA/stress axis was not considered here at all. Wouldn't we expect that subordinates have increased stress axis activation, which in turn could inhibit their growth and aggressive behavior?

      (2) To what extent are growth, food intake, agonistic behavior, and/or gene expression patterns coordinated across P1 vs P2 pairs? The lack of such an analysis seems like a missed opportunity.

      (3) What was the rationale for using whole bodies for the transcriptome analysis? Given the hypotheses, the forebrain or hypothalamus and certain other organ systems (e.g., liver, gonads, skin, etc.) would have been obvious candidate tissues here. I realize that cost is always a consideration, but maybe a focus on the fore-/midbrain could have been prioritized.

      (4) Given the preceding point, why was a fold-change threshold used for assessing DEGs (supplementary Figure 3)? There is no biological justification to ever use a fold-change threshold, especially in bulk RNA-seq analysis. This is particularly true here, where whole bodies were used for RNA-seq analysis, which is a bit unusual. Relatively small cell populations (such as hypothalamic neurons that regulate growth or food intake) may show substantial gene expression variation across social types, yet will be masked by the masses of other cells in the whole body sample. However, gene expression may still vary significantly, albeit the fold-difference may be small. I therefore suggest a reanalysis that omits any fold-change threshold.

      (5) Why is the analysis of color (hue, saturation) buried in the supplementary materials? Based on the hypotheses that motivated the study, color seems just as relevant as food intake, growth, and agonistic behavior, so even if the results are negative, they should be presented in the main paper.

      (6) The Discussion is sometimes difficult to follow. The authors may want to consider including a conceptual graphic that integrates the different aspects of growth and satiety regulation, etc., into a work-in-progress model of sorts, which would also facilitate clearer hypotheses for future research.

    2. Reviewer #2 (Public review):

      In this manuscript, the authors test growth, behavior, and gene expression in pairs of clownfish as they establish social dominance hierarchies, examining patterns of gene expression in these pairs after dominance has been established. The authors show solid evidence that emerging dominant clownfish show increased growth, aggression, and food consumption compared to their submissive or solitary counterparts, eventually adopting distinct gene expression profiles.

      Major Comments:

      (1) The Introduction is comprehensive, but it could be condensed. Likewise, the discussion could be condensed. There is considerable redundancy between the methods, the results, and the legend in Figure 1. The authors should consolidate and remove the redundancy.

      (2) For Figure 3, the authors are showing PC2 and PC3; why is PC1 not shown? There is so much overlap between the three groups in PC2 vs PC3; it seems unlikely that researchers could conclusively identify any individual as belonging to a group based on the expression profile. The ovals shown do not capture all the points within each of the groups, and particularly the grey S oval seems misaligned with the datapoints shown.

      (3) The authors indicate that the 15 replicates exhibiting the greatest size difference between P1 and P2 were selected for gene profiling. Does this mean that each of the P1 and P2 were pairs with each other? Have the authors tried examining the gene expression patterns in a paired manner? E.g., for the pairs that showed the greatest size differences, do they also show the greatest differences in gene expression? Do the P1s show the most extreme differences from P2s that also show the most extreme P2 differences? Perhaps lines on Figure 3A connecting datapoints from the P1 and P2 pairs would be informative.

      (4) For the specific target pathways that are up- and downregulated in the different backgrounds, I recommend that the authors include boxplots (or heatmaps) showing the actual expression values for these targets. Figure 6 shows a heatmap for appetite-related genes, and it would be great to see a similar graph for the metabolism and glycolysis genes; it would also be informative to see similar graphs for hormonal and sexual maturation pathways as well.

      (5) Particularly given that there is a relatively small number of genes enriched in the different rank conditions, I did not understand the need to do the WGCNA module analysis. I thought that an analysis of GO terms across the dataset would have been more meaningful than the GO term analysis shown in Figure 4, which considers only genes assigned to the "brown WGCNA module". This should be simplified or clarified.

      (6) The authors say that they have identified coordinated changes in behaviors and the "underlying gene expression, leading to the emergence" of social roles. This is a little bit misleading, since the gene expression analysis occurred well after the behavioral and phenotypic differences emerged. Presumably, the hormonal and genetic shifts that actually caused the behavioral and phenotypic difference occurred during the weeks during which the experiment was underway, and earlier capture of the transcriptome would presumably reveal different patterns, and ones that would be considered more causative. The authors acknowledge this in 434-435, but it could be emphasized further.

      (7) The authors have measured a number of differences between the different dominance classes of fish. All these differences were measured relative to the other classes, but in my view, the Solitary group was the closest to a baseline control. So I'm not sure that it is fair to say that "P2 and S individuals showed consistent downregulation of these genes and pathways" (line 401). I encourage the authors to emphasize the differences in gene expression from the "perspective" of the P1 individuals compared to the baseline of P2 and S individuals. Line 474 says that "P2 fish showed significant upregulation" of a number of pathways. It should be very clear what that is compared to (compared to P1, presumably?)

      (8) Along the same lines, the authors say in line 514 that subordinates and solitaries strategically downregulate their growth. I'm not convinced that this is the case: I would consider this growth trajectory to be the default and the baseline. I would interpret that under certain social conditions, a P1 dominant pattern of growth, behavior, and gene expression is allowed to emerge.

    3. Reviewer #3 (Public review):

      Summary:

      The authors tested the hypothesis that interactions among size- and age-matched rivals will lead to the emergence of social roles, accompanied by divergence in four aspects of individual phenotypes: growth, feeding behavior, fighting behaviors, and gene expression in clownfish.

      Strengths:

      The data on growth, feeding rate, and fighting behaviors support the authors' claims.

      Weaknesses:

      Gene analysis conducted in this study is not sufficient to clarify how the relevant genes actually regulate growth and behavior.

      The information obtained from whole-body gene expression analysis is very limited. Various gene expression is associated with the regulation of fighting behaviors, food intake, growth, and metabolism, and these genes are regulated differently across tissues, even within a single individual. Gene expression analysis should be performed separately for each tissue.

      Clownfish undergo sex change depending on social status and body size, as the authors mention in the manuscript. Numerous gene expressions are affected by sex change. It is unclear how this issue was addressed.

    1. Reviewer #1 (Public review):

      Summary:

      In this study, the authors examine how a developmentally regulated cis-regulatory element controls SOX2 expression during neural differentiation of human stem cells. The results suggest that this highly conserved long-range enhancer mediates neural-specific SOX2 regulation and offer insight into the role of promoter-enhancer contacts in this process. Although the findings are interesting, several limitations need to be addressed.

      Strengths:

      A central question in developmental biology is how genes are regulated in a context-dependent manner. SOX2, a major pluripotency factor, is expressed in diverse tissues during development, and therefore understanding the mechanisms that control its spatiotemporal expression is critical. This study addresses this important question by examining the functional relevance of a neural-specific, developmentally regulated SOX2 enhancer and its associated promoter-enhancer contacts in driving gene expression during human neural development. Using multiple model systems and techniques, the authors test the requirement of this enhancer by analyzing SOX2 expression in mutant lines, providing evidence for its role in this process.

      Weaknesses:

      A key limitation of the study is the absence of data from homozygous SOX2 enhancer deletion, which leaves the analysis incomplete and tempers the conclusions that can be drawn. Furthermore, the suitability of teratomas as a model system is questionable, given their limited capacity to recapitulate the spatial patterning, regional specification, and organized developmental processes characteristic of the human forebrain. Finally, the manuscript remains largely descriptive with little mechanistic insight.

    2. Reviewer #2 (Public review):

      Summary:

      The authors use a combination of genomics, genome conformation assays, and CRISPR-mediated deletion to study the transcriptional regulation of the SOX2 gene in human neural stem cells (hNSCs).

      Strengths:

      The authors show that two distal elements, located ~550kb downstream of the SOX2 gene, are important for SOX2 transcription in hNSC. They investigate both the deletion of these elements in established hNSCs and in hNSCs generated by differentiation of human pluripotent stem cells, suggesting these elements are important in both the establishment and maintenance of SOX2 expression in hNSCs.

      Weaknesses:

      Homologous elements have been studied in the mouse genome and have conserved function in mouse NSCs, yet these findings are not mentioned. Inclusion of biological replicates for the scRNA-seq and replicate CRISPR-deleted clones would strengthen the study.

    1. Reviewer #1 (Public review):

      Summary:

      In this work, the authors investigate the mechanisms of low-frequency synaptic depression at cerebellar parallel fiber to interneuron synapses using unitary recordings that allow direct quantification of synaptic vesicle release. They show that sparse stimulation can induce robust synaptic depression even in the absence of substantial vesicle consumption, and that this depressed state is rapidly reversed when stimulation frequency is increased. To account for these observations, the authors propose a model in which low-frequency depression reflects a redistribution of vesicles within the readily releasable pool, in particular, a reduction in docking site occupancy due to vesicle undocking.

      Strengths:

      I found the experimental work to be of high quality throughout. The use of simple synapse recordings to count individual vesicle release events is particularly powerful in this context and allows questions to be addressed that are difficult to approach with more conventional approaches. The demonstration that low-frequency depression can occur independently of prior vesicle release, together with the rapid recovery observed during high-frequency stimulation, places strong constraints on possible underlying mechanisms and represents a clear strength of the study.

      The modeling framework is clearly laid out and helps organize a broad set of observations across stimulation frequencies. Several of the experimental tests appear well-motivated by the model, including the recovery train experiments, the analysis of failures, and the use of doublet stimulation. Taken together, the data provide a coherent phenomenological description of low-frequency depression and its relationship to vesicle availability within the readily releasable pool.

      Weaknesses:

      While the experimental results are strong, the manuscript would benefit from rebalancing the strength of the mechanistic conclusions drawn from the modeling in light of its limitations. The framework is clearly useful and provides a coherent interpretation of the data, but it is not uniquely constrained by the experimental observations, and alternative models or interpretations could plausibly account for the findings. The use of different model regimes concatenated across time, with substantially different parameter values, highlights the abstract nature of the approach. For these reasons, the model seems best presented as one plausible explanatory framework rather than a definitive biological mechanism. Clarifying the distinction between data-driven observations and model-based inferences would help readers assess which conclusions are strongly supported and which remain more speculative.

      The interpretation of the Ca2+-related experiments would benefit from more cautious wording. The absence of detectable changes in presynaptic Ca2+ signals does not exclude more localized or subtle Ca2+-dependent mechanisms, and conclusions regarding Ca2+ independence should therefore be framed accordingly. In addition, while low-frequency depression is still observed at reduced extracellular Ca2+, these experiments appear less diagnostic of the specific model-derived mechanism emphasized elsewhere in the manuscript - namely, a selective reduction in docking-site occupancy - and should be discussed with appropriate qualification in the text.

      Major points:

      (1) Clarify and qualify mechanistic claims derived from the model.

      Throughout the manuscript, changes in model parameters are at times described as if they directly reflected underlying physiological mechanisms. As a result, the conceptual distinction between experimentally observed phenomena, model-derived variables, and biological interpretation is not always clear. Several conclusions in the Results and Discussion are phrased as mechanistic statements, although they rest on assumptions intrinsic to the modeling framework. The authors should systematically review the text and explicitly distinguish between (i) experimentally observed changes in synaptic responses and (ii) inferences about vesicle docking states or transitions within the model.

      In particular, statements implying that vesicle undocking is the mechanism underlying low-frequency depression should be rephrased to reflect that this is an interpretation within the proposed framework rather than a uniquely demonstrated biological process. For example, statements such as "Low-frequency depression is caused by synaptic vesicle undocking" should be replaced with formulations such as "Within the framework of our model, low-frequency depression is accounted for by a redistribution of synaptic vesicles away from docking sites" or "Our results are consistent with a model in which changes in vesicle docking-state occupancy contribute to low-frequency depression."

      A particularly problematic example is the statement that "these experiments further confirm that LFD only involves a decrease in δ, without accompanying changes in ρ or IP size." Here, an experimentally defined phenomenon (LFD) is directly equated with changes in model-derived variables. Such statements should be revised to make clear that δ, ρ, and IP size are inferred quantities within the model, and that the experimental data are interpreted through this framework rather than directly confirming changes in these parameters. Similarly, over-generalizing statements such as "Undocking therefore represents the key mechanism controlling short-term depression across stimulation frequencies" should be softened to reflect that this conclusion emerges from the model rather than from direct experimental evidence.

      (2) Address the biological interpretation of time-dependent model regimes.

      The model relies on distinct parameter regimes applied at different time points, with some transitions effectively suppressed in certain regimes. While this approach captures the data well, its biological interpretation remains unclear. The authors should either (i) expand the discussion to outline plausible biological processes that could give rise to such regime changes (for example, calcium-dependent modulation of transition rates or activity-dependent changes in vesicle state stability), or (ii) more explicitly frame this aspect of the model as a descriptive abstraction rather than a mechanistic proposal. This further underscores the need to clearly separate the descriptive role of the model from claims about underlying biological mechanisms.

      (3) Reframe conclusions drawn from calcium-related experiments.

      The calcium imaging data demonstrate no detectable changes in the measured presynaptic calcium signals under the tested conditions, but they do not rule out that calcium signals contribute in ways undetectable by the assay. Conclusions should therefore be revised to reflect this limitation, avoiding statements that exclude a role for calcium-dependent mechanisms. Wording such as "we did not detect evidence for..." would be more appropriate than conclusions implying the absence of an effect.

      Similarly, while low-frequency depression is still observed at reduced extracellular calcium (1.5 mM Ca²⁺), the specific mechanistic signature emphasized elsewhere in the manuscript - namely a selectively reduced first response during a high-frequency recovery train - is no longer apparent. These experiments should therefore be discussed as consistent with the proposed framework, but not as providing independent support for a selective reduction in docking-site occupancy. Explicitly acknowledging this limitation would improve clarity and avoid over-interpreting these data.

      (4) Soften interpretations based on non-significant comparisons.

      In several places, comparisons that do not reach statistical significance are used to argue for equivalence between conditions (for example, comparisons involving failure versus non-failure trials or different LFD conditions). These conclusions should be revised to emphasize the limits of statistical power and framed as a lack of evidence for a difference rather than evidence of independence.

    2. Reviewer #2 (Public review):

      Summary:

      Silva and co-workers exploit their previously established methods of analyzing release events at single parallel fiber to molecular layer interneuron synapses. They observed synaptic depression at low transmission frequencies (< 5 Hz), which rapidly recovers during high-frequency transmission. Analysis of the time course of low-frequency depression revealed an initial rapid and a slow linearly increasing time course. Strikingly, the initial depression occurred even in the absence of preceding release, arguing against vesicle depletion as the underlying mechanism.

      Strengths:

      The main strength of the study is the careful demonstration of an interesting synaptic phenomenon challenging the classical vesicle-centered interpretation of synaptic depression.

      Weaknesses:

      No major weaknesses were identified by this reviewer.

      The finding of release-independent synaptic depression is important and would have widespread implications. Therefore, some more analyses to increase the confidence in these findings could be performed.

      My concern is whether rundown could explain the findings. If the rate of failures in s1 increases and at the same time the amplitude decreases during the experiments, an apparent depression in s2 could arise. The Supplementary Figure 5A addresses run-down, but the figure is not easy to understand, and, as far as I understood, it does not address the question of whether the release-independent depression could be caused by a rundown. To address this, the analysis of Figure 5 could be repeated by investigating the failure rate and amplitude separately or by analyzing the 1st and 2nd half of the recordings separately.

    3. Reviewer #3 (Public review):

      Summary:

      The manuscript builds on the observation that, at some synapses, low-frequency stimulation causes synaptic depression, which can be reversed by subsequent high-frequency stimulation. Such low-frequency depression (LFD) cannot be easily explained by the depletion of a single vesicle pool. Here, Silva and colleagues propose a model of activity-dependent vesicle trafficking to explain LFD at synapses between cerebellar granule cells and molecular layer interneurons.

      Strengths:

      Overall, LFD is interesting and worthy of examination, and the authors provide new experimental results that are of the high quality expected from this group.

      Weaknesses:

      The study proposes a novel model of vesicle trafficking that is not explained by known biological mechanisms, and the manuscript does not adequately compare or discuss alternative models.

      I have several concerns about how the authors interpret the data. First, the manuscript's primary conceptual advance is the idea that LFD involves vesicle undocking, rather than depletion. However, most experiments were performed under conditions that promote vesicle depletion (3 mM extracellular Ca2+). When experiments were repeated in physiological Ca2+, there appeared to be little or no LFD (stats are not provided). Second, the RS/DS/DU/undocking model, though not outside the realm of possibility, is not readily explained by known mechanisms and is only loosely supported by experimental findings. Third, when simulating LFD, the authors do not compare alternative models and use inappropriate language to imply that a model fit represents the truth (e.g., "the finding of identical experimental and simulated values confirms that the undocking mechanism accounts for LFD"). Finally, the model is presented in an overly complicated manner. The sheer amount of terms and nomenclature makes the manuscript confusing and difficult to read. Overall, the manuscript would benefit from added experiments and more statistics, a better justification and evaluation of the model, and more nuanced language.

      Major concerns:

      (1) Most experiments were performed under conditions that exacerbate depletion

      In order to attribute LFD to vesicle undocking rather than depletion, it is important to show LFD under conditions where depletion is minimal. As mentioned above, the authors only report significant LFD in elevated extracellular Ca2+. In a small number of experiments performed in more physiological Ca2+ (1.5 mM), there is no depression after a single stimulus, and it is not clear that there was statistically significant depression during a low-frequency train. Several studies cited in support of LFD share this problem:

      • Abrahamsson et al., (2007) recorded from Schaffer collaterals in 4 mM Ca, 3-4X physiological Ca2+.

      • Doussau et al., (2010) recorded from aplysia synapses in 3X Ca compared to seawater.

      • Rudolph et al., (2011) is cited as an example of LFD. However, this study performed experiments at high release probability cerebellar climbing fibers, and reported depression that increased monotonically with

      stimulation frequency, so it does not resemble the phenomenon studied in this paper. Lin et al., (2022) also largely describe monotonic depression at the calyx.

      The authors note that their results differ from those of Atluri and Regehr, but do not mention that a possible reason for the difference is the increased release probability in their experiments.

      The authors should provide statistics for the data obtained in 1.5 mM Ca, and discuss why LFD is increased in conditions that also elevate vesicle release probability.

      (2) Lack of biological mechanisms supporting the model

      The model is presented without compelling biological support. The evidence in support of vesicle undocking comes from experiments by the Watanabe lab, which showed fewer-than-expected docked vesicles under EM when cultured synapses were stimulated immediately prior to high-pressure freezing. Kusick et al were careful to note that these vesicles may have been lost to fusion.

      The putative undocking Kusick describes is immediate (< 5 ms after stimulation), and was not shown to be Ca2+ sensitive. This manuscript describes "calcium-dependent undocking" that proceeds from 10 ms - 200 ms. Multiple studies from the Watanabe lab show that a single stimulus lowers the number of docked vesicles, and subsequently, there is a transient redocking of vesicles that can be blocked by EGTA or Syt7 knockout.

      I also question the rationale for the authors' model that 2 vesicles are coupled in series to a single release site. Previous papers from this lab cited EM studies from frog and neuromuscular that showed filamentous connections between vesicles (do these synapses show LFD?). Here, the authors primarily cite their previous models to support their arguments. I encourage them to continue searching for ultrastructural evidence for 2-vesicle-docking-units and to cite such studies.

      (3) Comparison to other vesicle models

      The authors use overly assertive language to suggest that the model proves a mechanism. "Altogether, these results indicate that the slow phase of LFD ... reflects a δ decrease without significant changes in pr, in ρ or in IP size". Simulating data does not conclusively "indicate" the underlying mechanism, but the authors could state their data can be "explained by a model where..".

      However, LFD does not require activity-dependent undocking. Instead, the phenomenon has been explained by high-release probability, paired with an activity-dependent increase in either docking or release probability (Chiu and Carter, 2024; Doussau et al., 2017). Does the new model do a better job of replicating some facet of the data? If multiple models can explain the same data, how can we determine which model is correct? The "Alternative Presynaptic Depression Mechanisms" should be expanded to discuss these issues.

    1. Reviewer #1 (Public review):

      Sensory hair cells of the inner ear convert mechanical sound vibrations into electrical signals through mechano-electrical transduction (MET), a process critically dependent on the specialized organization and lipid composition of their plasma membrane. Although the protein components of the MET complex are relatively well characterized, the role of the lipid environment remains poorly understood and often overlooked. Recent discoveries that core MET proteins TMC1 and TMC2 function as lipid scramblases, disrupting membrane lipid asymmetry, expose a significant gap in our understanding of how lipid homeostasis is regulated in hair cells and how membrane dynamics influence MET function.

      In this study, the authors address this gap by identifying the P4-ATPase ATP8B1 and its chaperone TMEM30B as essential regulators of membrane lipid asymmetry in outer hair cells. They also generated HA-tagged knock-in mice to precisely localize the P4-ATPase ATP8B1 and its chaperone TMEM30B within outer hair cells, demonstrating their enrichment in stereocilia, and convincingly demonstrate that loss of these proteins causes phosphatidylserine externalization, hair cell degeneration, and hearing loss in mouse models, phenocopying defects observed in TMC1 mutant mice with constitutive scrambling activity. While these findings establish lipid flippase pathways as critical for hair cell survival and auditory function, they also raise important questions about the precise mechanisms linking lipid asymmetry disruption to MET dysfunction and hair cell pathology.

      Overall, the data convincingly support the conclusion that ATP8B1-TMEM30B flippase activity is required to maintain stereocilia lipid asymmetry and auditory function. The study substantially advances understanding of how lipid homeostasis intersects with MET. However, several points require clarification to ensure that localization claims and mechanistic interpretations are fully supported by the presented data.

      Revisions considered essential by this reviewer are:

      (1) Figure 1D.<br /> The authors should clarify how the qPCR data were normalized and specify the reference (housekeeping) genes used. This information is necessary to evaluate the robustness and comparability of the gene expression data.

      (2) Figure 1F.<br /> The lack of F-actin staining at the hair cell base raises the possibility that the permeabilization conditions may have limited antibody access to certain membrane regions. This is especially important given that the authors used a gentle permeabilization agent such as saponin to preserve membrane integrity. Because the authors conclude that ATP8B1 and TMEM30B are localized "almost exclusively to OHC bundles and the apical membrane, with minimal staining in the remaining plasma membrane," (line 128). Including co-labeling with a plasma membrane marker or more comprehensive F-actin visualization of lateral and basal regions would help ensure that the restricted localization is biological rather than technical. In the absence of such controls, the localization claim may be somewhat overstated and should be tempered accordingly.

      (3) Figure 7B.<br /> Although quantification of ATP8B1-HA intensity at the bundle appears similar between WT and Cib2 KO samples, the representative image suggests that some bundles lack detectable labeling. To better capture phenotype variability, it would be helpful to include an additional quantification showing the fraction or number of bundles with detectable ATP8B1-HA signal in Cib2 KO mice.

      (4) Lines 346-349.<br /> The manuscript suggests that IHCs lack stereocilia-enriched P4-ATPases. However, this conclusion is not directly supported by the presented data. The authors should either provide supporting localization or expression data for other P4-ATPases or soften the statement to indicate that no stereocilia-enriched P4-ATPases were detected under the conditions examined.

      Recommendations:

      (5) The authors convincingly demonstrate that TMEM30B loss results in ATP8B1 mislocalization. While not essential to the central conclusions, examining TMEM30B localization in ATP8B1 KO hair cells would clarify whether this interdependence is reciprocal, as described for other P4-ATPase-CDC50 complexes.

      (6) Lines 359-374.<br /> The discussion of Annexin V labeling is careful and balanced. This paragraph would benefit from referencing other studies that showed minimal Annexin V labeling in healthy P6 organ of Corti, reinforcing that robust PS externalization in the present study is pathological rather than developmental.

      (7) Lines 392-399.<br /> The proposed feedback model linking MET activity and ATP8B1-TMEM30B localization is compelling. The discussion could be strengthened by noting that in TMC1/2 double knockout hair cells, PS externalization is not observed, consistent with the idea that flippase activity becomes critical specifically when scrambling occurs. The mislocalization observed in Cib2 KO hair cells further supports the coupling between TMC-mediated scrambling and flippase-mediated membrane restoration.

    2. Reviewer #2 (Public review):

      Summary:

      Prior work identified TMEM30B (knockout mice) as well as ATP8B1 (human genetics and mouse model), ATP8A2 (knockout mice), and ATP811A (human genetics) as relevant for hearing. The authors also reasoned that, given the recent discovery of TMC1 and TMC2's dual function as mechanotransduction channels of the inner ear and as lipid scramblases, a counterpart flippase should be in the sensory hair-cell stereocilia bundle where mechanotransduction happens. They use CRISPR/CAS to modify the endogenous mouse genes and add an HA tag at the N-terminus of the ATP8B1, ATP8A1, ATP8A2, and ATP11A proteins. Their experiments with these mice unambiguously localized ATP8B1 at the base of outer hair cell stereocilia bundles. Knockout of ATP8B1 results in loss of outer hair cells, deficient auditory function (ABR), and degeneration of outer hair cell stereocilia bundles. Similarly, hair cells from genetically modified mice with endogenous HA-tagged TMEM30B proteins show localization of this protein to outer hair cell stereocilia bundles. TMEM30B knock-out mice phenocopy the ATP8B1 knock-out model. Interestingly, the authors show that annexing V staining precedes hair cell loss in ATP8B1 and TMEM30B knockout mice and that proper localization of these proteins is lost in mice that lack CIB2, a protein essential for hair cell mechanotransduction.

      Strengths:

      (1) Use of knock-in HA-tagged proteins, rather than antibody staining, to unambiguously localize ATP8B1 and TMEM30B.

      (2) Systematic characterization of auditory function (ABR), hair cell loss, and hair-cell stereocilia bundle morphology.

      (3) Advances our understanding of the role played by lipid homeostasis in auditory function.

      (4) Reports on mouse models that will be helpful to further understand the mechanistic role played by ATP8B1 and TMEM30B in normal hearing and hereditary deafness.

      Weaknesses:

      (1) Are the HA tags causing any functional issues? Function and localization of tagged proteins can sometimes be compromised. It would be good to know, for each knock-in model (TMEM30B, ATP8B1, ATP8A1, ATP8A2, and ATP11A ), whether the HA-tagged protein is causing any issues with the mice and particularly with hearing (ABRs). Are these mice normal? Can they hear? These data are missing.

      (2) Following on the point above, is it possible that ATP8B1-HA is well localized, but localization for the other three flippases (ATP8A1-HA, ATP8A2-HA, and ATP11A-HA) is compromised by the tag? Is this potential mislocalization causing any functional phenotypes? (ABRs of point 1). I find it surprising that there are flippases only in outer hair cells, and only formed by ATP8B1. A possible explanation is that the tag is interfering with trafficking. If so, there should be a phenotype (ABRs), although this might be masked by redundancy among these flippases or caused by systemic issues (admittedly difficult to sort out). Given that this manuscript will likely become foundational, and that there is evidence that at least two of the other flippases are involved in hearing loss, it would be good to provide more information about the mice and HA-tagged proteins in the other knock-ins (ATP8A1-HA, ATP8A2-HA, and ATP11A-HA). Depending on the data available for the knock-ins, the authors may want to discuss these scenarios and soften the statement indicating that inner-hair cells may lack flippase activity altogether.

      (3) Expression of ATP8B1 at P0 (Figure 1D), when there should not be protein in outer hair cells yet, seems high. Does this mean that other cells in the cochlea also express ATP8B1? Is this a concern?

      (4) Fluorescence scales in Figure 6 B and D and Figure 7 B and D are very different. So are the values for WT. One would expect that the WT would be similar in all cases (at least within the same compartments), given that the methods section indicates that "All images were collected using identical acquisition parameters, including zoom and laser power, across genotypes". If WT shows such variability, how can we compare?

    1. Reviewer #1 (Public review):

      Summary:

      This paper examines plasticity in early cortical (V1-V3) areas in an impressively large number of rod monochromats (individuals with achromatopia). The paper examines three things:

      (1) Cortical thickness. It is now well established that early complete blindness leads to increases in cortical thickness. This paper shows increased thickness confined to the foveal projection zone within achromats. This paper replicates work by Molz (2022) and Lowndes (2021), but the detailed mapping of cortical thickness as a function of eccentricity and the inclusion of higher retinotopic areas is particularly elegant.

      (2) Failure to show largescale reorganization of early visual areas using retinotopic mapping. This is a replication of a very recent study of Molz et al. but I believe, given anatomical variability, the larger n in this study, and how susceptible pRF findings are to small changes in procedure, this replication is also of interest.

      (3) Connective field modelling, examining the connections between V3-V1. The paper finds changes in the pattern of connections, and smaller connective fields in individuals with achromatopsia than normally sighted controls, and suggests that these reflect compensatory plasticity, with V3 compensating for the lower resolution V1 signal in individuals with achromatopsia.

      This is a carefully done study (both in terms of data collection and analysis) that is an impressive amount of work.

      *Effects of eye-movements

      The authors have carried out the eye-movement analyses I asked of them. Unfortunately, in 4 individuals they couldn't calibrate the eyetracker (it's impressive they managed in 10). I think this means that 4 of 13 (since a different participant was excluded from head motion) individuals weren't included in correlation analyses. Limiting the correlation analysis to individuals with better fixation has obvious issues. I'd recommend redoing (or additionally including) stats using non-parametric measures while classifying these 4 as having fixation instability of 3 (i.e. greater instability than the participant with the worst fixation who was successfully calibrated).

      *Interpreting pRFs

      The paper would be strengthened by a little more explicit clarity about what pRFs represent and how that affects their interpretation of their findings as plasticity vs. non-plasticity (I know the authors are aware of this, but I think it would be helpful for readers who are less experienced in pRFs). In the introduction it would be helpful to point out that pRFs represent the collective response of a large population of neurons, and as a result pRF estimates can vary depending on which population of neurons that stimulus drives.

      For example, imagine for the sake of argument that rods only project to V1 neurons with larger receptive fields. If one measured pRFs in a control observer under phototopic vs. scotopic conditions one would see smaller pRFs in the photopic conditions. This wouldn't represent 'plasticity' - it would represent the fact that the firing neurons contributing to the pRF signal are a slightly different population because of a change in the stimulus content. This is of course exactly what you see in 2C. And indeed, the authors make this identical point ". In the non-selective condition, the smaller pRFs in controls are in line with the higher spatial resolution of the<br /> cone system, which is not active in the achromat group." But this point would be clearer if more of the conceptual underpinnings were made explicit in the introduction (or at this point in the paper).

      Shifts in which population of neurons drive your pRFs can explain main of the more puzzling results in the paper without detracting from your main conclusions. For example, in 2D, I don't think it's differences in S/N driving your results (pRFs are at least theoretically meant to be robust to S/N). If smaller RFs 'drop out' under low luminance and these smaller RFs also tend to be more central, then one would expect the control results of 1D. And I think a similar argument might even be made for the smaller difference in the rod monochromats.

      It would be possible to make the point of Figure 4B more simply if Figure 4B was replaced by additional Panels in Figure 2 simply showing V3 pRF sizes/eccentricity distributions. That would make the point that you don't see the same expansion in pRF sizes in V3 in a way that is just as clear, and is closer to the data.

      *Interpreting cRFs

      Similarly, I think the paper would be improved with more clarity about the underlying signal in CF modeling. Once again, I appreciate that the authors are familiar with this, but it will help the reader in interpretation. (And I do believe thinking carefully about this may alter your interpretations). CF receptive fields 'find' the region in V1 that best predict the V3 signal in a given voxel. In resting state this likely represents a combination of:

      (1) visually driven signal - correlations that may or may not reflect connectivity but represent the fact that regions that represent the same region of visual space will be active at the same time.

      (2) global bilaterally symmetrical signal consisting of enhanced correlations between iso-eccentric regions (Raemaekers et al., 2014), which may arise from vasculature that symmetrically stems from the posterior cerebral artery (Tong et al., 2013; Tong and Frederick, 2014).

      (3) intrinsic neural fluctuations that are more strongly correlated between connected neurons. These are likely quite weak compared to the other contributions.

      I think if you ignore 2, (which is not likely to differ between rod mono and controls) and model 1 and 3, you might well see shifts in CFs towards the boundary of the scotoma - essentially the CF's location will be biased towards the region of V1 that has stronger correlations - which = the region which has a visual signal.

      I do find convincing the argument that you don't see the same shift in controls in the rod-selective condition. So I think the results of 4A are fine. But a little more clarity about 'what's under the hood' in CF modeling would be nice.

      *Interpreting the relationship between pRFs and cRFs

      So there's something here that confuses me. We are all agreed that V3 pRF sizes are similar across RM and control. V1 pRFs are larger in RM. It feels intuitive that smaller CFs would compensate but I can't make it make sense to myself when I think it through. Each pRF represents a combination of receptive field location scatter and bandwidth. You want to argue that eccentricity mapping looks pretty normal, so there's no reason to think increased rf scatter, and I can believe that (though I do think this assumption should be discussed explictly).

      So far I think we agree.

      But let's think about what drives a CF during visual stimulation ... Specifically lets think about 'the pRF of the CF' (the region of visual space represented by the cluster of voxels in the CF). If pRFs for individual voxels in V1 are big, then the pRF for the CF is also going to be large. But we know that pRFs for V3 are normal size. So, the V3 CF will 'find' a smaller number of voxels in V1, in order to try to find the 'correct sized' CF pRF. Note that this explanation is very similar to yours. But doesn't require ANY 'intrinsic' connectivity. It's really just assuming the whole thing is driven by the visual signal and the CF size is determined by the ratio of the pRF sizes in V3 vs. V1.

      One possible solution would be to regress out the visual stimulus and redo this analysis based on the residuals.

    2. Reviewer #3 (Public review):

      Summary:

      This study addresses a long-standing question in visual neuroscience concerning how the human visual system balances stability and plasticity when sensory input is altered from early in life. Using achromatopsia as a model of lifelong cone deprivation, the authors examine whether early visual cortex undergoes retinotopic reorganization to compensate for the absence of foveal cone input, or whether canonical retinotopic organization is largely preserved. By combining fMRI-based population receptive field (pRF) mapping with connective field (CF) modelling, the authors characterize changes across multiple hierarchical stages of visual processing.

      The main findings indicate that primary visual cortex (V1) shows no systematic remapping of the foveal projection zone, whereas extrastriate cortex, particularly V3, exhibits altered patterns of sampling from V1. The authors interpret these results as evidence for hierarchical adaptation, whereby downstream readout mechanisms adjust to make more efficient use of degraded rod-mediated input while preserving early-stage retinotopic organization.

      Strengths:

      A major strength of this work is the use of silent substitution to generate rod-selective stimuli. This approach enables a principled comparison between achromats and typically sighted controls by isolating rod-driven responses in both groups. In doing so, the study overcomes a key limitation of prior work, where differences in cortical organization could often be confounded by differences in photoreceptor class rather than reflecting neural reorganization per se. The inclusion of a rod-driven baseline in controls provides an important reference for distinguishing long-term adaptation from transient or stimulus-driven effects.

      Another notable strength is the integration of CF modelling alongside conventional pRF mapping. While pRF analyses alone suggest enlarged receptive fields in V1, consistent with reduced spatial resolution, the CF analysis offers a more mechanistic account by revealing changes in how V3 samples information from the V1 surface. This multi-level modelling approach moves beyond descriptive accounts of cortical map structure and provides a framework for interpreting how downstream areas may adjust their integration strategies under conditions of altered input.

      Weaknesses:

      Although the study is methodologically strong, the central claims regarding stability and compensatory plasticity require clearer conceptual framing and stronger empirical support. Stability is primarily defined as the absence of large-scale retinotopic remapping in V1, yet the presence of significantly enlarged V1 pRFs indicates substantial tuning-level plasticity at the input stage; distinguishing topographic stability from functional reorganization would therefore strengthen the interpretation. Moreover, the proposed compensatory mechanism raises a signal-processing concern, as reduced downstream sampling (smaller CFs in V3) cannot restore spatial information lost due to coarse upstream representations, and may instead limit integration. The mechanistic link between altered CF properties and normalization of extrastriate pRFs is not directly tested, as group differences are not shown to covary across individuals or visual field locations. Finally, the interpretation of these changes as compensatory implies functional benefit, yet no behavioral or performance measures are provided to establish that the observed reorganization preserves or enhances visual function, leaving open whether these effects reflect adaptive optimization or passive downstream consequences of altered input.

    1. Reviewer #1 (Public review):

      Summary:

      The paper presents a three-layered hierarchical model for simulating Drosophila larva locomotion, navigation, and learning. The model consists of a basic locomotory layer that generates crawling and turning using a coupled-oscillator framework, incorporating intermittency in movement through alternating runs and pauses. The intermediate layer enables navigation by allowing larvae to actively sense and respond to odor gradients, facilitating chemotaxis. The adaptive learning layer integrates a spiking neural network model of the Mushroom Body, simulating associative learning where larvae modify their behavior based on past experiences. The model is validated through simulations of free exploration, chemotaxis, and odor preference learning, demonstrating close agreement with empirical behavioral data. This modular framework provides a valuable advance for modeling of larva behavior.

      Strengths:

      Every modeling paper requires certain assumptions and abstractions. The main strength of this paper lies in its modular and hierarchical approach to modeling behavior, making connections to influential theories of motor control in the brain. The authors also provide a convincing discussion of the experimental evidence supporting their layered behavioral architecture. This abstraction is valuable, offering researchers a useful conceptual framework and marking a significant step forward in the field. Connections to empirical larval movement are another major strength.

      Weaknesses:

      While the model represents a conceptual advance in the field, some of its assumptions and choices fall behind state-of-the-art approaches. One limitation is the paper's simplified representation of larval neuromechanics, in which the body is reduced to a two-segment structure with basic neural control. Another limitation is the absence of an explicit neuromuscular control system, which would better capture the role of segmental central pattern generators (CPGs) and neuronal circuits in regulating peristalsis and turning in Drosophila larvae. Many detailed neuromechanical models, as cited by the authors, have already been published. These abstractions overlook valuable experimental studies that detail segmental dynamics during crawling and the larval connectome.

      The strength of the model could also be its weakness. The model follows a subsumption architecture, where low-level behaviors operate autonomously while higher layers modulate them. However, this approach may underestimate the complexity of real neural circuits, which likely exhibit more intricate feedback mechanisms between sensory input and motor execution.

    2. Reviewer #2 (Public review):

      The paper proposes a hierarchically layer approach to larval locomotion, chemotaxis and learning. The model consists of a basic locomotor layer with two coupled oscillators, one for crawls and one for turns. The intermediate layer modulates the frequency and amplitude of tunings to enables chemotaxis. The higher layer, integrates a spiking neural network model of the Mushroom Body to modify the door valence in response to experience as during learning.

      The model is compared to experimental data with a good degree of agreement. This modular framework provides a valuable advance for modeling larva behavior.

      Strengths:

      A novel multilayer level model that reflects current thinking of the neuronal organisation of motor control. The model is very useful to investigate the neuronal architecture of central pattern generators<br /> and higher order motor control circuits that could be linked to larval connectome data.

      Weaknesses:

      All the limitations of the model are discussed and therefore the paper perfectly fits its purpose.

    1. Reviewer #1 (Public review):

      Summary:

      The authors describe the results of a single study designed to investigate the extent to which horizontal orientation energy plays a key role in supporting view-invariant face recognition. The authors collected behavioral data from adult observers who were asked to complete an old/new face matching task by learning broad-spectrum faces (not orientation filtered) during a familiarization phase and subsequently trying to label filtered faces as previously seen or novel at test. This data revealed a clear bias favoring the use of horizontal orientation energy across viewpoint changes in the target images. The authors then compared different ideal observer models (cross-correlations between target and probe stimuli) to examine how this profile might be reflected in the image-level appearance of their filtered images. This revealed that a model looking for the best matching face within a viewpoint differed substantially from human data, exhibiting a vertical orientation bias for extreme profiles. However, a model forced to match targets to probes at different viewing angles exhibited a consistent horizontal bias in much the same manner as human observers.

      Strengths:

      I think the question is an important one: The horizontal orientation bias is a great example of a low-level image property being linked to high-level recognition outcomes and understanding the nature of that connection is important. I found the old/new task to be a straightforward task that was implemented ably and that has the benefit of being simple for participants to carry out and simple to analyze. I particularly appreciated that the authors chose to describe human data via a lower-dimensional model (their Gaussian fits to individual data) for further analysis. This was a nice way to express the nature of the tuning function favoring horizontal orientation bias in a way that makes key parameters explicit. Broadly speaking, I also thought that the model comparison they include between the view-selective and view-tolerant models was a great next step. This analysis has the potential to reveal some good insights into how this bias emerges and ask fine-grained questions about the parameters in their model fits to the behavioral data.

      Weaknesses:

      I'll start with what I think is the biggest difficulty I had with the paper. Much as I liked the model comparison analysis, I also don't quite know what to make of the view-tolerant model. As I understand the authors' description, the key feature of this model is that it does not get to compare target and probe at the same yaw angle, but must instead pick a best match from candidates that are at different yaws. While it is interesting to see that this leads to a very different orientation profile, it also isn't obvious to me why such a comparison would be reflective of what the visual system is probably doing. I can see that the view-specific model is more or less assuming something like an exemplar representation of each face: You have the opportunity to compare a new image to a whole library of viewpoints and presumably it isn't hard to start with some kind of first pass that identifies the best matching view first before trying to identify/match the individual in question. What I don't get about the view-tolerant model is that it seems almost like an anti-exemplar model: You specifically lack the best viewpoint in the library but have to make do with the other options. I sort of understand the reasoning that this enforces tolerance of viewpoint variability, but I'm not clear on whether or not this is a version of face familiarity and recognition that the authors think has an analog in human visual processing.

      I do think that this model is interesting in terms of the differential tuning it exhibits, but don't find it easy to align with any theoretical perspective on face recognition. Specifically, do the authors think there is a stage of face processing in which tolerance as they've operationalized it in the model is extant? What I'm looking for is a concrete description of the circumstances that the authors are saying lead to this kind of model potentially being a meaningful analog of face recognition. For example, is the idea that one may become familiar with a face in some very limited set of viewpoints and then be presented with that face in other views?

      Alternatively, if the authors prefer to say that they simply thought this was a nice exercise in terms of identifying a different model and that it may not be a meaningful proxy for face recognition. I think that's fine, to be clear! I just still don't see anything in the text that convinces me of the ecological validity of this version of view-tolerance.

    2. Reviewer #2 (Public review):

      This study investigates the visual information that is used for the recognition of faces. This is an important question in vision research and is critical for social interactions more generally. The authors ask whether our ability to recognise faces, across different viewpoints, varies as a function of the orientation information available in the image. Consistent with previous findings from this group and others, they find that horizontally filtered faces were recognised better than vertically filtered faces. Next, they probe the mechanism underlying this pattern of data by designing two model observers. The first was optimised for faces at a specific viewpoint (view-selective). The second was generalised across viewpoints (view-tolerant). In contrast to the human data, the view-specific model shows that the information that is useful for identity judgements varies according to viewpoint. For example, frontal face identities are again optimally discriminated with horizontal orientation information, but profiles are optimally discriminated with more vertical orientation information. These findings show human face recognition is biased toward horizontal orientation information, even though this may be suboptimal for the recognition of profile views of the face.

      One issue in the design of this study was the lowering of the signal-to-noise ratio in the view-selective observer. This decision was taken to avoid ceiling effects. However, it is not clear how this affects the similarity with the human observers.

      Another issue is the decision to normalise image energy across orientations and viewpoints. I can see the logic in wanting to control for these effects, but this does reflect natural variation in image properties. So, again, I wonder what the results would look like without this step.

      Despite the bias toward horizontal orientations in human observers, there were some differences in the orientation preference at each viewpoint. For example, frontal faces were biased to horizontal (90 deg) but other viewpoints had biases that were slightly off horizontal (e.g. right profile: 80 deg, left profile: 100 deg). This does seem to show that differences in statistical information at different viewpoints (more horizontal information for frontal and more vertical information for profile) do influence human perception. It would be good to reflect on this nuance in the data.

      Comments on revisions:

      I am happy with the response and changes to the comments in my review. The key findings from this study are: (1) that there is bias toward the use of horizontal information across all viewpoints for face recognition in humans using an old-new recognition task. (2) In contrast, the optimal information for matching faces varies as a function of viewpoint. The view-selective model shows horizontal information is dominant for frontal views and vertical information is dominant for profile views.

      The data from the view-tolerant model is less easy to interpret as it doesn't fit with any theoretically plausible model of face recognition. It might be a useful model for a face matching task in which participants had to match unfamiliar faces across viewpoints. This might be a possible extension of the current work.

      Nonetheless, I still think this is an interesting contribution to the literature.

    1. Reviewer #1 (Public review):

      Summary:

      This brief piece by Swartz and colleagues outlines the complexities surrounding the choice of clinical specialty for physician-scientists. It is, in general, clear and well-written, and it will be useful to research-oriented medical students choosing a path and to the mentors who are guiding them.

      Strengths:

      The writing is clear. The points made are not profound, but they are important and will be of use to the intended audience.

      Weaknesses:

      I have only minor suggestions for improvement. There are some areas of redundancy where the article could be tightened up by consolidating.

    2. Reviewer #2 (Public review):

      Summary:

      This article is a useful compendium of advice for MD/PhD students (and research-focused MD students) to consider when it is time to decide on a clinical field for residency training. The authors are a distinguished group of physician-scientists and program directors who are drawing on published data and their own experience as mentors to provide advice and resources to students about to make what can be a career-defining choice. It makes an effective argument for considering important differences between clinical fields in their ability to sustain research integration, provide mentorship, meet lifestyle expectations, and foster a long-term career as a research-focused physician-scientist.

      Strengths:

      (1) A lot has been written about physician-scientists as an endangered species. Given the important role that physician-scientists can play if they engage in research that is informed by experience in patient care, not nearly enough has been written about the choices that students make during training that can keep them on track or throw them off.

      (2) The article provides not only general advice, but specific information in the 2 tables that can help trainees to weigh their priorities and consider their options.

      (3) Among the best advice is to weigh clinical demands, maintenance of procedural skills, recognition of the impact of research time on salary, and the impact of high salaries on the tension between research effort and clinical effort in clinical departments, which is where most physician-scientists in academia are employed.

      Areas for potential improvement:

      (1) Some of the most useful pieces of advice are scattered through the text when they might be more impactful if focused. For example, what are the 4 or 5 most essential factors that someone in an MD/PhD or an MD program should weigh when they are deciding between clinical disciplines? There are also published data on the experience of past graduates in achieving a research-focused career in each clinical discipline. How should that data be applied by trainees? What are the factors that should be weighed in deciding where to work as a research-focused physician once training has been completed?

      (2) Some clinical fields at academic institutions have proved to be much more hospitable to careers as research-focused physicians than others. Published data highlight the challenges. I believe the authors have tried very hard to present a balanced perspective, but in the process, they have, I believe, missed an opportunity to guide trainees and make them aware of what they should look for to avoid making a decision that may prove incompatible with their long-term goals.

      (3) An issue that hasn't been raised: Where will be the jobs for physician-scientists who have an MD {plus minus} PhD and want to do research and discovery? How many openings will there be for physician-scientists in academia 5-10 years from now? In industry? How are recent events in Washington affecting the continuation of those jobs? Unfortunately, I am not aware of labor statistics for physician-scientists, but perhaps the authors can find them.

      (4) Additional questions that can be raised and addressed in the article: Should one of the "smart choices" in the article's title be where you do the residency, and not just which residency you do? How important is it to be at a successful, research-intensive medical center/university, both during and after residency and fellowship training? If being in an institution where there are numerous very successful physician-scientists and scientists improves the likelihood of being able to sustain a physician-scientist career, how should graduating students improve their chances of being at one of those institutions?

      (5) In every clinical discipline, there are departments that value physician-scientists more than other departments and invest accordingly. What advice would the authors give to help graduating students identify those departments?

    1. Reviewer #1 (Public review):

      This paper investigates how different learning curricula influence the way that humans piece together directly experienced transitions into a broader cognitive map. When adjacent learning trials were grouped within rows or columns of the map, subsequent navigation through the map was weaker than when adjacent learning trials came from disjoint spaces in the map. The authors speculate that the grouped curriculum resulted in mental fragmentation that made navigation across space more difficult later on.

      This is an interesting paradigm that contributes useful new findings in the domain of map learning to the growing literature on curriculum learning. The evidence for a difference between conditions is highly compelling, but, as the authors are very transparent in acknowledging in the Discussion, the evidence for their proposed mechanism - mental fragmentation under grouped learning - is somewhat weak. The study thus presents an intriguing empirical result but not an ironclad mechanistic account.

      An alternative - by their account, "less interesting" - explanation is that grouped learning was easier because trials in close succession had overlapping elements, and so participants were not trying as hard or as engaged. There is a literature on spaced (as opposed to massed) learning being better for subsequent memory because it increases retrieval effort. It seems very plausible that this could be going on here, and the control experiment reported in the supplement would not help to rule this out. This literature deserves some discussion.

      The Introduction focuses entirely on literature showing advantages in grouped over intermixed learning, setting that up as the most well-motivated expectation from the literature. Upon finding the opposite, the Discussion then mentions that interleaving has been found to be useful in "applied domains", but then returns to how surprising this is in light of recent findings in the category learning literature. But there is a substantial earlier literature on interleaved vs blocked curricula in category learning, very often finding advantages for interleaving. See, e.g., Carvalho & Goldstone, 2015, for a review. There is also a paper showing interleaving advantages in associative inference, Zhou et al., 2023, JEP:G, which is very relevant to several of the discussion section paragraphs. Thus, the treatment of the prior curriculum learning literature is currently sparse.

    2. Reviewer #2 (Public review):

      I think this paper is an excellent and timely contribution. It clearly shows that learning overlapping relationships in a disjoint training schedule (where the overlaps are not encountered close together in time) appears to aid the formation of an integrated associative memory structure (a cognitive map) and supports generalisation. I believe the methods are sound and the results are clear. I only have a couple of methodological questions that may not warrant any changes to the paper (or only very minor changes/additions):

      (1) The mixed effects models did not include random slopes for the within-subject factors ("spatial manipulation" and "block"), and so the corresponding fixed effect inferences may be unsafe. Having said that, it is likely that including these slopes may not be warranted given their contribution to the model's fit. I recommend that the authors check this.

      (2) The mixed effects models for accuracy appear to model average performance across trials rather than using a generalised linear model with a (e.g.) logit link function and the binomial distribution to characterise performance. I think this is a little sub-optimal, as the latter is often more sensitive. Nonetheless, it is not in any way wrong; the results are clear enough as is, and there may be a good reason to avoid a non-linear link function, which can alter the interpretation of effects close to the ceiling and floor.

      I think the introduction and/or discussion would benefit from contrasting their results with Berens & Bird (2022, PLOS Comp Bio). In this paper, it is shown that blocking the training of discriminations in a linear hierarchy (what we call progressive training) substantially benefited transitive inference performance. This seems at odds with the author's finding that "participants struggle to integrate information across rows and columns, i.e. across groups of transitions that were trained separately in time".

      I would really like to know what the authors think about this discrepancy (or, indeed, whether they think there is one at all). Is it possibly because "progressive" learning is some combination of "grouping", "blocking" and "chaining" (where there is a structured overlap between adjacently trained relationships)? Or is it something else, e.g., that there is a fundamental difference between learning associations and discriminations (personally, I lean on this explanation)?

      Relevant to this, the authors note that their "findings do contradict recent reports from the category learning literature, where blocking seems to help learning and generalisation (Dekker et al., 2022; Flesch et al., 2018; Noh et al., 2016). It may be that where the goal is not to learn a complex knowledge structure - like a map - but simply to compress exemplars by mapping them onto a smaller number of labels - the benefits of blocking emerge." However, the benefit of progressive (blocked) training in my own work was observed in a task that required learning a complex/relational structure in the form of a transitive hierarchy, which theoretical accounts suggest depends on learning map-like representations (Whittington et al., 2020).

    3. Reviewer #3 (Public review):

      Summary:

      This study examines how training regimes influence the formation of cognitive maps. Participants learned two relational maps over three days through pairwise transitions: one map was trained with grouped sequences that followed rows or columns, while the other was trained with disjoint transitions sampled randomly across the map. In addition, the study manipulated the temporal spacing of training blocks (blocked vs. semi-blocked) and tested whether the results generalized across two map geometries (a 5×5 grid and a 4×4 torus).

      Furthermore, they run a follow-up experiment (or condition) testing rows and columns shuffled in the grouped condition.

      While grouped training produced better performance during learning, the authors report that disjoint training led to superior performance at test on tasks probing the global map knowledge.

      Summarising experimental design:

      (1) Map geometry (between-subjects): 5×5 grid vs 4×4 torus

      (2) Training block schedule (between-subjects): Blocked vs Semi-blocked

      (3) Training regime/transition sampling (within-subject): Grouped or Disjoint (Day 1 and Day 2)

      Strengths:

      The study addresses a clear and timely theoretical question about how the training regime affects the formation of cognitive maps. A further strength is the well-controlled experimental design, allowing the authors to test their hypotheses in a systematic and informative way.

      Weaknesses:

      (1) If I understood correctly, participants learned one map on the first day and the other on the second day, with the training regime (grouped vs. disjoint) counterbalanced across maps. This raises the possibility that experience with one training regime on day one could influence performance on the second day. For example, it would be interesting to examine whether participants who experienced the disjoint regime first showed any differences when learning the grouped regime on the following day. While it may be difficult to fully disentangle such transfer effects from the main training regime effects, it would be informative to test whether performance on the second day depends on the regime experienced on the first day (e.g., whether prior exposure to the disjoint regime predicts performance on the subsequent grouped training, but not vice versa).

      (2) The author mentions a control experiment. Did the participants in the control experiment complete only the training phase or also the testing tasks used in the main experiment? If testing was included, it would be informative to report whether performance at test was comparable to that observed in the main experiment. Given that this condition appears to involve blocked transitions while moving across both rows and columns, I would expect performance to fall somewhere between the grouped and disjoint conditions.

      (3) Participants' performance did not differ between conditions in the map reconstruction task, suggesting that participants in both the grouped and disjoint regimes were ultimately able to form a cognitive map. Was this task always administered last during the testing session? I wonder whether the explicit request of the reconstruction task could have influenced participants' awareness of the map structure.

      (4) The manuscript describes the study as consisting of four experiments (two groups per map shape, differing in the blocked versus semi-blocked schedule). However, based on the design described in the Methods, this appears more accurately characterized as a single experiment with two between factors: map geometry (grid vs. torus) and blocking schedule (blocked vs. semi-blocked) manipulated between participants, and training regime (grouped vs. disjoint) manipulated within participants.

      (5) It is not entirely clear to me from the Results section whether performance at test differed between the two map geometries (grid and torus), or whether the reported effects of training regime were consistent across them.

    1. Reviewer #1 (Public review):

      Summary:

      This work aims to elucidate the molecular mechanisms affected in hypoxic conditions causing reduced cortical interneuron migration. They use human assembloids as a migratory assay of subpallial interneurons into cortical organoids and show substantially reduced migration upon 24 hours hypoxia. Bulk and scRNA-seq shows adrenomedullin (ADM) up-regulation, as well as its receptor RAMP2 confirmed at protein level. Adding ADM to the culture medium after hypoxic conditions rescues the migration deficits, even though the subtype of interneurons affected is not examined. However, the authors demonstrate very clearly that ineffective ADM does not rescue the phenotype and blocking RAMP2 also interferes with the rescue. The authors are also applauded for using 4 different cell lines and using human fetal cortex slices as an independent method to explore the DLXi1/2GFP-labelled iPSC-derived interneuron migration in this substrate with and without ADM addition (after confirming that also in this system ADM is up-regulated). Finally, the authors demonstrate PKA - CREB signalling mediating the effect of ADM addition, and also lead to up-regulation of GABAreceptors. Taken together this is a very carefully done study on an important subject - how hypoxia affects cortical interneuron migration. In my view it would be of great interest for the readers of Elife.

      Strengths:

      Its strengths are the novelty and the thorough work using several culture methods and 4 independent lines.

      Weaknesses:

      The main weakness is that we dont know which interneuron subtypes are most affected by hypoxia and which may be rescued in their migration by ADM.

      A further weakness is that the few genes confirmed to be regulated after hypoxia do not help determining which statistical cut-off can be considered reliable, given that they didn't compare strongly regulated versus weakly regulated genes.

      Comments on revisions:

      Unfortunately, the authors did not address my suggestions. While they show example stainings of interneuron subtypes, they do not show if Calretinin, calbinin or somatostatin+ interneurons are differentially affected by hypoxia or the rescue with ADM. I still consider this an important piece of information to add.

    2. Reviewer #2 (Public review):

      Summary:

      The manuscript by Puno and colleagues investigates the impact of hypoxia on cortical interneuron migration and downstream signaling pathways. They establish two models to test hypoxia, cortical forebrain assembloids and primary human fetal brain tissue. Both of these models provide a robust assay for interneuron migration. In addition, they find that ADM signaling mediates the migration deficits and rescue using exogenous ADM. The findings are novel and very interesting to the neurodevelopmental field, revealing new insights into how cortical interneurons migrate and as well, establishing exciting models for future studies.The authors use sufficient iPSC lines including both XX and XY, so analysis is robust. In addition, the RNAseq data with re-oxygenation is a nice control to see what genes are changed specifically due to hypoxia. Further, the overall level of valiation of the sequencing data and involvement of ADM signaling is convincing, including the validation of ADM at the protein level. Overall this is a very nice manuscript. I have a few comments and suggestions for the authors.

      Strengths/Weaknesses:

      (1) Can they comment on the possibility of inflammatory response pathways being activated by hypoxia - has this been shown before? While not the focus of the manuscript, it would be discussed in the Discussion as an interesting finding and potential involvement of other cells in the Hypoxic response.

      (2) Can they comment on the mechanism at play here with respect to ADM and binding to RAMP2 receptors - is this a potential autocrine loop, or is the source of ADM from other cell types besides inhibitory neurons? Given the scRNA-seq data, what cell-to-cell mechanisms can be at play? Since different cells express ADM, there could be different mechanisms at place in ventral vs dorsal areas.

      (3) For data from Figure 6 - while the ELISA assays are informative to determine which pathways (PKA, AKT, ERK) are active, there is no positive control to indicate these assays are "working" - therefore, if possible, western blot analysis from assembloid tissue could be used (perhaps using the same lysates from Fig 3) as an alternative to validate changes at the protein level (however, this might prove difficult); further to this, is P-CREB activated at the protein level using WB?

      (4) Can the authors comment further on the mechanism and what biological pathways and potential events are downstream of ADM binding to RAMP2 in inhibitory neurons? What functional impact would this have linked to the CREB pathway proposed? While the link to GABA receptors is proposed, CREB has many targets beyond this.

      (5) Does hypoxia cause any changes to inhibitory neurogenesis (earlier stages than migration?) - this might always be known but was not discussed.

      (6) In the Discussion section - it might be worth detailing to the readers what the functional impact of delayed/reduced migration of inhibitory neurons into the cortex might results in, in terms of functional consequences for neural circuit development

      Comments on revisions:

      The authors have addressed my comments thoroughly. I have no further comments or suggestions

    3. Reviewer #3 (Public review):

      Summary:

      The authors aimed to test whether hypoxia disrupts the migration of human cortical interneurons, a process long suspected to underlie brain injury in preterm infants but previously inaccessible for direct study. Using human forebrain assembloids and ex vivo developing brain tissue, they visualized and quantified interneuron migration under hypoxic conditions, identified molecular components of the response, and explored the effect of pharmacological intervention (specifically ADM) on restoring the migration deficits.

      Strengths:

      The major strength of this study lies in its use of human forebrain assembloids and ex vivo prenatal brain tissue, which provide a direct system to study interneuron migration under hypoxic conditions. The authors combine multiple approaches: long-term live imaging to directly visualize interneuron migration, bulk and single-cell transcriptomics to identify hypoxia-induced molecular responses, pharmacological rescue experiments with ADM to establish therapeutic potential, and mechanistic assays implicating the cAMP/PKA/pCREB pathway and GABA receptor expression in mediating the effect. Together, this rigorous and multifaceted strategy convincingly demonstrates that hypoxia disrupts interneuron migration and that ADM can restore this defect through defined molecular mechanisms.

      Overall, the authors achieve their stated aims, and the results strongly support their conclusions. The work has significant impact by providing the first direct evidence of hypoxia-induced interneuron migration deficits in the human context, while also nominating a candidate therapeutic avenue. Beyond the specific findings, the methodological platform-particularly the combination of assembloids and live imaging-will be broadly useful to the community for probing neurodevelopmental processes in health and disease.

      Comments on revisions:

      The authors have fully addressed my concerns by incorporating the relevant discussion into the manuscript, especially regarding how well the migration observed in hSO-hCO assembloids reflects in vivo condition. I have no further comments.

    1. Reviewer #2 (Public review):

      [Editors' note: This version was assessed by the editors. The authors have addressed a point raised by Reviewer #2, who thought the authors compared cells grown in low-serum and high serum conditions. This has been clarified in the latest version.]

      In the manuscript Ruhling et al propose a rapid uptake pathway that is dependent on lysosomal exocytosis, lysosomal Ca2+ and acid sphingomyelinase, and further suggest that the intracellular trafficking and fate of the pathogen is dictated by the mode of entry. Overall, this is manuscript argues for an important mechanism of a 'rapid' cellular entry pathway of S.aureus that is dependent on lysosomal exocytosis and acid sphingomyelinase and links the intracellular fate of bacterium including phagosomal dynamics, cytosolic replication and host cell death to different modes of uptake.

      A key strength is the nature of the idea proposed, while continued reliance on inhibitor treatment combined with lack of phenotype / conditional phenotype for genetic knock out is a major weakness.

      In the previous version, the authors perform experiments with ASM KO cells to provide genetic evidence of the role for ASM in S. aureus entry through lysosomal modulation.

    1. Reviewer #1 (Public review):

      [Editors' note: The article has been improved and several points raised by the reviewers have now been addressed. The authors should ideally further improve the clarity of the figures and the description of the experimental methods. This is particularly important for an article discussing potential confounding factors.]

      Summary:

      This important article reveals that the Nora virus can colonize the intestinal cells of Drosophila melanogaster, where it persists with minimal immediate impact on its host. However, upon aging, infection, or exposure to toxicants, stem cell activation induces Nora virus proliferation, enabling it to colonize enterocytes. This colonization disrupts enterocyte function, leading to increased gut permeability and a significant reduction in lifespan. Results are convincing and hold significant import for the Drosophila community.

      Strengths:

      (1) Building on previous studies by Habayeb et al. (2009) and Hanson et al. (2023), this study highlights cryptic Nora virus infection as a crucial factor in aging and gut homeostasis in Drosophila melanogaster.

      (2) Consistent with the oral route of Nora virus transmission, the study demonstrates that the virus resides in intestinal stem cells, with its replication directly linked to stem cell proliferation. This process facilitates the colonization of enterocytes, ultimately disrupting intestinal function.

      (3) The study establishes a clear connection between stem cell proliferation and virus replication, suggesting that various factors - such as microbiota, aging, diet, and injury - can influence Nora virus dynamics and associated pathology.

      (4) The experimental design is robust, comparing infected flies with virus-cured controls to validate findings.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors report that Nora virus, a natural Drosophila pathogen that also persistently infects many laboratory fly stocks, infects intestinal stem cells (ISCs), leading to a shorter life span and increased sensitivity to intestinal infection with the Pseudomonas bacterium. Nora virus infection was associated with an increased proliferation of ISC and disrupted gut barrier function. Genetically, the authors show that increased ISC division in Nora virus and Pseudomonas coinfected flies is driven by signaling through the JAK-STAT pathway and apoptosis.

      Accordingly, blocking apoptosis and JAK-STAT signaling reduces viral load, suggesting that in this context the JAK-STAT pathway is proviral in contrast to other previous observations in systemically infected flies. This work adds to the findings of another recent paper showing that another persistent fruit fly virus, Drosophila A virus, also increases ISC proliferation and decreases gut barrier function. Intestinal viruses should therefore be considered confounders in studies of fly intestinal physiology.

      Strengths:

      Overall, the data are convincing and robust, starting with two wildtype fly stocks (Ore-R strain) that differ in their Nora virus infection status, followed by experiments in which cleared stocks are reinfected with a purified Nora virus stock preparation. The conclusions of the paper will be of interest to scientists working on insect physiology, virology, and immunology, but should also serve as a warning for scientists that use the fly as a model to study gut physiology.

    3. Reviewer #3 (Public review):

      Summary:

      Franchet et al. sought to characterize the impact of Nora virus on host lifespan and sensitivity to a variety of infectious or stressful treatments. Through careful and rigorous analyses, they provide evidence that the Nora virus greatly impacts fly survival to infection, overall lifespan, and intestinal integrity. The authors have been thorough and rigorous, and the experimental evidence including proper isolation of the virus and Koch's Postulate reinoculation of the organism is excellent. The additional work is valuable and to the gold standard of the field, characterizing the pathology of the gut, including data showing gut leakage, the presence of the virus in the intestinal stem cells, and the importance of stem cell proliferation for virus replication and spread using elegant genetic tools to block stem cell proliferation or enterocyte death.

      Strengths:

      The authors have been rigorous and careful. The initial finding is presented through the lens of two related strains differing in virus infection. From there, the authors characterized the virus and isolated a purified culture, which they used to reinoculate a cleared strain to demonstrate proper Koch's Postulate satisfaction. The authors have also probed various parameters in terms of dietary importance in relevant conditions for many experiments. The additional work to characterize the pathology of the gut is compelling, using genetic tools to block or allow intestinal stem cell proliferation and enterocyte death through JAK-STAT and JNK signalling alongside the tracing of virus presence using a Nora virus antibody. JAK-STAT and JNK are previously described as regulators of these processes, making these tools appropriate and convincing. It is also interesting to see good evidence that the virus itself is damaging, rather than simply permitting coinfection by gut microbes (which does happen).

    1. Reviewer #2 (Public review):

      This paper describes an analysis of a commercially available panel for a spatial transcriptomic approach and introduces a computational tool to predict potential off-target binding sites for the type of probe used in the aforementioned panel. The performance of the prediction tool was validated by examining a dataset that profiled the same cancer tissue with multiple modalities. Finally, a detailed analysis of the potential pitfalls in a published study communicated by the company that commercialized the spatial transcriptomic platform in question is provided, along with best practice guidelines for future studies to follow.

      Strengths:

      - The manuscript is clearly written and easy to follow.<br /> - The authors provide clean, organized, and well-documented code in the associated GitHub repository.

      Comments on revision:

      My impressions from the first round of review haven't really changed. I don't think the software tool is well developed, and failing to incorporate thermodynamics or consider the impact of alignment settings is a major weakness.

      I do think the topical area is relevant. The inclusion of the Xenium /Hubmap data modestly strengthens the manuscript relative to the original submission.

    2. Reviewer #3 (Public review):

      Summary:

      The authors present a new computational method (OPT) for predicting off-target probe binding in the commercial 10X Xenium spatial transcriptomics platform. They identified 28 genes in the 10x xenium human breast cancer gene panel (280 genes) that are not accurately detected at the single-molecule level. They validated the predicted off-target binding using reference data from single-cell RNA-seq and 3'-sequencing-based Visium RNA-seq. This work provides a practical resource and will serve as a valuable reference for future data interpretation.

      Strengths:

      (1) Provides a toolbox for the community to identify off-target probes.

      (2) Validates the predictions using single-cell RNA-seq and sequencing-based Visium RNA-seq datasets.

      Comments on revision:

      The authors state that OPT is a new software tool and have posted example code on GitHub. However, the Jupyter notebook does not display any figures or workflows that would allow the process to be replicated. Please provide documentation and code that can reproduce the results/figures presented in the paper.

    1. Reviewer #1 (Public review):

      Summary

      Alpha oscillations have been previously proposed to shape the temporal resolution of visual perception, with a higher alpha frequency providing a finer resolution. This study goes beyond by investigating three additional processes that could influence joint visual temporal perception: the aperiodic neural signal, the integration of recent perceptual experience (serial dependence), and subjective confidence. To address their question, they developed a novel task where two Gabor patches oriented in opposite directions are presented in a continuous stream. This allows for testing for robust perceptual integration while avoiding bias from suboptimal perception. Behavioral analyses revealed an association between confidence and individual temporal integration thresholds, and demonstrated that serial dependence biases visual temporal integration as well as its associated confidence. EEG analyses first replicated the previous findings showing that faster IAF provides higher temporal resolution. Interestingly, the aperiodic neural signal was associated with both perceptual and temporal precision. Finally, the authors show that serial dependence is reduced in individuals with faster IAF and enhanced in participants exhibiting a stronger aperiodic component. Together, these findings highlighted that visual temporal integration arises from an interplay between alpha oscillations, the aperiodic signal, serial dependance and subjective confidence.

      Strengths:

      (1) The novel task proposed in the study represents a substantial improvement over the two-flash fusion task previously used to investigate the role of alpha oscillations in visual temporal perception.

      (2) Serial dependence has attracted increasing interest in vision research in recent years. Testing whether recent visual inputs also influence temporal resolution is, therefore, a valuable and timely approach. In this regard, the authors provide evidence for a serial dependence effect.

      (3) Although the functional role of brain oscillations has been extensively studied over the past decade, the role of the aperiodic neural signal has long been overlooked. This study revealed that the aperiodic component plays a role in perceptual precision and temporal resolution, thus providing evidence for an important role of the aperiodic neural signal.

      (4) The mediation analysis demonstrates that the aperiodic and oscillatory neural components act independently, providing important insights for future studies aimed at understanding their respective role.

      Weaknesses

      It would have been valuable to record EEG continuously during the experiment to investigate how spontaneous alpha oscillations and aperiodic signal dynamically influence the temporal integration, serial dependance and confidence on a trial-by-trial basis.

      Appraisal

      The authors employed a novel and thoughtfully designed task, combined with appropriate analyses, to address their research question. Their results are convincing and provide strong support for their conclusions.

      Impact

      This study provides valuable insights into the role of the aperiodic neural signal in visual temporal integration. This is important because its contribution has likely been underestimated, and future research will likely uncover increasing evidence of its impact across multiple cognitive functions.

      It was also very interesting to observe how alpha oscillations are associated with serial dependence and confidence, extending beyond their well-known role in visual temporal resolution. This opens intriguing avenues for future research on the functional role of alpha oscillations.

    2. Reviewer #2 (Public review):

      Summary:

      This paper examines resting-state electroencephalography (EEG), the electrophysiological underpinnings of the temporal integration window in perception, and its modulation by priors (serial dependence) as measured through the perceptual fusion point of two continuous alternating stimuli. The study also includes a measure of perceptual confidence. Separating periodic from aperiodic EEG activity, the results show that the faster the individual alpha-frequency at rest and the steeper the aperiodic slope (previously linked to higher sampling/ lower noise), the lower the perceptual fusion point (corresponding to narrower integration windows), with independent contributions of the period and aperiodic activity to the integration window. The data also reveal that the point of fusion depends on prior history, and that the strength of this effect depends on individual alpha frequency and aperiodic slope: the lower the individual alpha frequency and the aperiodic slope, the stronger the serial dependence, with the two contributions being again independent. Higher alpha frequency also led to higher confidence. The results are interpreted to suggest that speed of alpha oscillations and aperiodic slope of the power spectrum (presumably reflecting rate/fidelity of visual sampling and the level of background noise) jointly shape the perceptual measure under study: high rate/ fidelity and low noise promote temporal precision in integration, while lower rate/fidelity and higher noise lead to a higher reliance on prior history. It is concluded that it is the interaction between two EEG features that shapes temporal integration and hence perceptual fusion.

      Strengths:

      The strength lies in the use of a continuous visual stream of two alternating stimuli whose timing shapes fusion or separation of the two stimulus precepts, avoiding some of the pitfalls of previous fusion probes through discrete (not continuous) stimulus pairs (missed detection of one stimulus of the pair may be misinterpreted as fusion). The results seem robust (based on n=83 participants), the results are interesting, and the interpretations are sound.

      Weaknesses:

      The main weakness lies in the reliance on resting state EEG for correlation with the behavioural measures. This captures trait-based relationships, but does miss out on the brain activity dynamics within/across trials, which could be used for a direct readout of evidence accumulation to a decision, for capturing spontaneous fluctuations of the processes under study, etc. Also, in terms of resting state EEG, both eyes-closed (EC) and eyes-open (EO) data have been recorded, but their links to perceptual fusion point/ confidence seem somewhat inconsistent across the results. This is a bit confusing. Are the EO and EC signals in any way related/ correlated, and if not, what are they supposed to represent? Would an analysis of these EEG measures during task performance (e.g., in a pre-stimulus = baseline time window) provide more consistent results? These points could be resolved by additional analyses and/or more elaborate discussions.

    3. Reviewer #3 (Public review):

      Summary:

      In this study, the authors seek to explain what influences the temporal resolution of visual perception and its associated metacognitive monitoring, interindividual differences in such processes, and the neural mechanisms associated with these interindividual differences. More specifically, they investigated the factors influencing the perception of a rapid alternating stream of visual patterns as a single fused percept versus two segregated stimuli, and how these factors relate to stable features of ongoing brain activity. They introduce a novel sustained-stream temporal integration paradigm designed to address limitations of traditional two-flash tasks, and combine this with resting-state electroencephalography (EEG) to examine how individual alpha peak frequency and the aperiodic component of the power spectrum relate to temporal integration thresholds, perceptual history effects, and subjective confidence. Their overarching aim is to move beyond a purely oscillatory account of temporal sampling and to test whether periodic (alpha) and non-periodic (aperiodic) neural dynamics jointly shape perceptual decisions.

      Strengths:

      The study has several notable strengths. First, the experimental paradigm represents a thoughtful and innovative refinement of earlier approaches. By presenting alternating gratings within a continuous stream and varying the duration of each element rather than introducing discrete blank intervals, the authors mitigate well-known confounds of classical two-flash paradigms, particularly the possibility that "fusion" reports reflect missed detections rather than genuine temporal integration. The psychometric functions are well characterized, and the sample size is large for an individual-differences EEG study, with an a priori power analysis supporting the adequacy of the sample. Second, the use of spectral parameterization to separate oscillatory alpha peak frequency from the aperiodic component of the spectrum is methodologically rigorous and timely, as this distinction is increasingly recognized as important to avoid confounds in oscillatory activity estimation and the measurement of neural noise/excitatory-inhibitory balance (i.e., the aperiodic component of the power spectrum). The present work contributes to this emerging direction by relating both to behavioral indices within the same dataset. Third, the integration of perceptual thresholds, serial dependence, and subjective confidence within a unified framework provides a richer account of temporal perception than studies focusing on a single measure. In particular, the demonstration that resting alpha frequency predicts integration thresholds and that the aperiodic exponent relates to variability of the psychometric function is broadly consistent with the authors' central claims.

      Weaknesses:

      (1) At the same time, several aspects of the interpretation require caution. One conceptual issue concerns the interpretation of the psychometric slope parameter as an index of "temporal precision." The manuscript consistently equates steeper slopes with higher perceptual precision or lower internal noise. However, the slope of a binary psychometric function does not uniquely index sensory temporal resolution. It reflects the steepness of the transition between response categories and can arise from multiple sources, including variability in sensory encoding, instability of decision criteria, lapse rates, or other decisional processes. Even in the literature cited by the authors, slope is often described more generally as reflecting perceptual variability or sensory and/or decision noise rather than a pure measure of perceptual precision. An abrupt transition from "fused" to "segregated" responses, therefore, does not necessarily imply finer temporal resolution at the sensory level; it may instead reflect more consistent categorization or reduced decisional variability. The present data convincingly demonstrate relationships between spectral measures and the steepness of behavioral transitions, but they do not by themselves establish that this steepness reflects perceptual temporal precision rather than broader sources of behavioral variability.

      (2) A related concern involves the causal language used to describe the relationship between neural measures and behavior. The EEG metrics are derived from resting-state recordings and therefore reflect stable, trait-like individual differences. Nonetheless, the Discussion sometimes adopts mechanistic phrasing suggesting that slower alpha rhythms or flatter spectra lead the brain to compensate by weighting prior information more heavily, or that neural noise is being "regulated." Such formulations imply within-task adaptive processes that are not directly measured. The results demonstrate robust between-participant associations, but further research is needed to establish whether individuals regulate neural noise or adjust prior weighting dynamically.

      (3) Another point that merits clarification concerns the control analyses. The authors appropriately use spectral parameterization to dissociate oscillatory alpha peak frequency from the aperiodic component in the main analyses; however, their subsequent control analyses examining other frequency bands appear to rely on conventional band-power measures. Because band power can be influenced by the aperiodic background, null effects in other bands are difficult to interpret without similarly accounting for aperiodic structure.

      (4) In addition, the temporal structure of the stimulus stream introduces an interpretational nuance. Varying the duration of each Gabor in a continuous alternation produces quasi-periodic stimulation rates, and several of these ISIs fall within the alpha frequency range. Rhythmic visual stimulation at alpha-range frequencies is known to produce strong stimulus-locked responses and can interact with intrinsic alpha rhythms in a frequency-dependent manner (Keitel et al., 2019; Gulbinaite et al., 2017). Although the present study does not record EEG during task performance and therefore cannot directly assess stimulus-driven steady-state responses, this aspect of the design complicates a purely intrinsic sampling interpretation. The observed relationship between resting alpha frequency and integration thresholds may reflect intrinsic sampling speed, but it could also be influenced by how closely an individual's alpha rhythm aligns with alpha-range temporal structure in the stimulus.

      Conclusion:

      Despite these limitations, the study achieves many of its primary aims. The sustained-stream paradigm reliably elicits graded temporal integration behavior and robust serial dependence effects. Individual alpha frequency is convincingly associated with integration thresholds, and the aperiodic exponent relates to behavioral variability measures. These findings support the broader conclusion that temporal perception reflects an interaction between rhythmic neural dynamics and the background spectral structure of ongoing activity. The work is likely to have a meaningful impact for researchers studying perceptual timing, perceptual history, individual differences in brain rhythms, and the functional role of aperiodic neural activity.

      References:

      Keitel, C., Keitel, A., Benwell, C. S., Daube, C., Thut, G., & Gross, J. (2019). Stimulus-driven brain rhythms within the alpha band: The attentional-modulation conundrum. Journal of Neuroscience, 39(16), 3119-3129.

      Gulbinaite, R., Van Viegen, T., Wieling, M., Cohen, M. X., & VanRullen, R. (2017). Individual alpha peak frequency predicts 10 Hz flicker effects on selective attention. Journal of Neuroscience, 37(42), 10173-10184.

    1. Reviewer #2 (Public review):

      Summary:

      The work presented by Zhang and coauthors in this manuscript presents the study of the neuropeptide corazonin in modulating the post-mating response of the brown planthopper, with further validation in Drosophila melanogaster. To obtain their results, the authors used several different techniques that orthogonally demonstrate the involvement of corazonin signalling in regulating the female post-mating response in these species.

      They first injected synthetic corazonin peptide into female brown planthoppers, showing altered mating receptivity in virgin females and a higher number of laid eggs after mating. The role of corazonin in controlling these post-mating traits has been further validated by knocking down the expression of the corazonin gene by RNA interference and through CRISPR-Cas9 mutagenesis of the gene. Further proof of the importance of corazonin signaling in regulating the female post-mating response has been achieved by knocking down the expression or mutagenizing the gene coding for the corazonin receptor.

      Similar results have been obtained in the fruit fly Drosophila melanogaster, suggesting that corazonin signaling is involved in controlling the female post-mating response in multiple insect species.

      The study of the signalling pathways controlling the female post-mating response in insects other than Drosophila is scarce, and this limits the ability of biologists to draw conclusions about the evolution of the post-mating response in female insects. This is particularly relevant in the context of understanding how sexual conflict might work at the molecular and genetic levels, and how, ultimately, speciation might occur at this level. Furthermore, the study of the post-mating response could have practical implications, as it can lead to the development of control techniques, such as sterilization agents.

      The study, therefore, expands the knowledge of one of the signalling pathways that control the female post-mating response, the corazonin neuropeptide. This pathway is involved in controlling the post-mating response in both Nilaparvata lugens (the brown planthopper) and Drosophila melanogaster, suggesting its involvement in multiple insect species.

      The study uses multiple molecular approaches to convincingly demonstrate that corazonin controls the female post-mating response. The data supporting the main claim of the manuscript are solid and convincing.

    1. Reviewer #1 (Public review):

      Summary:

      This paper leverages 7T fMRI data from the Natural Scenes Dataset to investigate whether retinotopic coding, the position-selective organization of visual response structures, spontaneous resting-state interactions between the Default Network (DN) and the Dorsal Attention Network (dATN). Using individualized network parcellations and population receptive field (pRF) modeling, the authors show that DN voxels can be split into two subpopulations based on their response to visual stimulation: those with position-specific positive BOLD responses (+pRFs) and those with position-specific negative BOLD responses (-pRFs). Critically, these subpopulations relate differently to the dATN during rest: -pRFs are anticorrelated with the dATN, +pRFs are positively correlated, and non-retinotopic DN voxels show no coupling. The anticorrelation (and positive correlation) is enhanced when DN and dATN voxels share visual field preferences. An event-triggered analysis suggests that retinotopic coding shapes both "top-down" (DN-initiated) and "bottom-up" (dATN-initiated) spontaneous activity transients, supporting the claim that the retinotopic scaffold is intrinsic to the DN. These findings challenge the prevailing view of global DN-dATN antagonism and suggest retinotopic coding as an organizing principle for cross-network communication.

      Strengths:

      The central finding that what looks like network-level independence between DN and dATN decomposes into structured, bivalent interactions organized by voxel-level visual field preferences is a compelling demonstration that macro-scale network descriptions can hide meaningful substructure. The logic of the analysis is clean: pRF properties are estimated from retinotopic mapping data and then used to predict resting-state coupling in completely independent scanning sessions. This cross-session, cross-modality design rules out many circularity concerns.

      The use of individualized multi-session hierarchical Bayesian parcellation (Kong et al.) to define DN and dATN boundaries within each subject is the right methodological choice for this question. Network boundaries in posterior cortex, where DN and dATN interdigitate most closely, vary considerably across individuals, and group-average approaches would introduce exactly the kind of misassignment that would most confound the result.

      The matched-vs-random pRF analysis is well-controlled. The authors demonstrate that cortical distance between matched and randomly-matched dATN pRFs does not differ, effectively ruling out spatial proximity on the cortical surface as a confound. tSNR controls further show that signal quality differences do not drive the effect.

      The event-triggered analysis (Figure 3) is creative and adds genuine value. Showing that retinotopically-specific coupling persists during DN-initiated activity transients, not only dATN-initiated ones, is the key piece of evidence for the claim that the code is intrinsic to the DN rather than passively inherited through bottom-up visual drive.

      The result is observed consistently across all individual participants, which provides strong evidence for the robustness of the qualitative pattern despite the small sample size inherent to densely-sampled designs.

      Weaknesses

      (1) The nature of negative pRFs requires more scrutiny

      The entire interpretive framework depends on treating negative pRFs in the DN as genuine position-selective neural responses (suppression). However, negative BOLD signals are well known to arise from non-neural sources, specifically, vascular stealing (where activation in nearby tissue diverts blood from adjacent voxels) and macrovascular draining vein effects that produce spatially displaced signal inversions. These concerns are amplified at 7T, where T2*-weighted GE-EPI carries substantial macrovascular weighting. The DN and dATN interdigitate extensively in the posterior cortex, often within millimeters. A negative pRF in a DN voxel adjacent to a positive dATN voxel could, in principle, reflect the hemodynamic shadow of its neighbor rather than an independent neural response.

      The spatial dispersion control (matched vs. random pRFs have similar cortical distribution) is valuable but addresses long-range confounds, not *local* hemodynamic crosstalk. The reliability of sign and center position across runs is reassuring but does not exclude a vascular origin, as vascular architecture is itself stable across sessions. I would encourage the authors to test whether the matched-vs-random effect survives exclusion of voxels near large pial vessels (identifiable from T2* contrast or the venograms available in the NSD). These analyses would not be dispositive, but they would meaningfully strengthen the neural interpretation.

      (2) Amount of retinotopic mapping data and choice of pRF pipeline

      The NSD includes 6 runs of retinotopic mapping (~5 minutes each; 3 bar-aperture, 3 wedge/ring). The authors use only the 3 bar-aperture runs (~15 minutes total per subject) and fit their own pRFs using AFNI's 3dNLfim procedure, rather than using the pRF estimates provided as part of the NSD release (which were fitted using the analyzePRF toolbox with all 6 runs).

      Fifteen minutes of bar data is quite limited for reliable voxel-wise pRF estimation, especially in regions far from the early visual cortex, where signal-to-noise is inherently lower. Standard recommendations for robust pRF mapping in higher-order regions generally suggest substantially more data. The variance-explained threshold is close to the noise floor by design, meaning that a non-trivial number of the "retinotopic" DN voxels may be poorly estimated. Given that the core analyses depend on both the sign and the center position of these pRFs, the limited data is a significant concern.

      The authors do not explain why they chose to re-fit pRFs rather than use the NSD-provided estimates. If the motivation was methodological (e.g., the NSD pRF pipeline does not readily yield signed amplitude, or the bar-only fits were judged more appropriate for detecting negative responses), this should be made explicit. If the NSD-provided pRFs can reproduce the key findings, this would substantially increase confidence in the results. If they cannot, that divergence itself would be important to understand. I would ask the authors to address this choice and, if feasible, to report whether the core results replicate using the NSD-provided pRF estimates and/or whether using all 6 runs of retinotopy data changes the findings.

      (3) pRF model adequacy for the Default Network

      The isotropic Gaussian pRF model was developed for and validated in early and mid-level visual cortex, where it captures the dominant spatial selectivity of neuronal populations. In DN voxels where the model explains comparatively little variance, it is less clear that the model is capturing the right quantity. Specifically, the negative pRFs could conceivably be described by a model with a dominant suppressive surround (e.g., a difference-of-Gaussians model), in which what appears as a "negative pRF" in the standard model is actually the surround component of a center-surround mechanism whose center is poorly resolved. This distinction matters: a genuine inverted code (negative center response) implies a qualitatively different computation than inherited surround suppression from nearby visual cortex.

      The authors should consider discussing why the standard model is sufficient for the questions asked, or ideally, testing whether the sign distinction survives under alternative pRF model specifications.

      (4) Interpreting resting-state transients as top-down vs. bottom-up

      The event-triggered analysis labels high-amplitude DN pRF activations as "top-down events" and dATN activations as "bottom-up events." This is a reasonable inference given experience-sampling studies showing that rest involves alternation between internal and external attention, but it remains an inference. Without concurrent experience sampling, eye-tracking, or physiological monitoring, we cannot establish that a spontaneous DN transient reflects memory retrieval or internally-directed thought rather than a global arousal fluctuation. Similarly, dATN transients during rest could reflect covert shifts of spatial attention to remembered or imagined locations rather than bottom-up processing per se. I would ask the authors to soften this framing or to discuss what additional data would be needed to validate the top-down/bottom-up attribution.

      (5) The "retinotopic code" vs. "visual field bias" distinction

      The paper uses the language of a "retinotopic code" throughout and correctly distinguishes this from a "retinotopic map," noting that DN voxels do not form a continuous topographic representation on the cortical surface. This distinction deserves greater emphasis. In vision science, retinotopic maps carry computational significance through their topographic continuity and relationship to cortical wiring. A distributed collection of voxels with coarse visual field preferences but no cortical topography is a fundamentally different organizational feature. Recent reviews have drawn an explicit distinction between *retinotopic maps* and *visual field biases* (Groen, Dekker, Knapen & Silson, TiCS 2022), and the present findings may be more accurately characterized as the latter. Perhaps the authors think that the distinction is merely a signal-to-noise distinction, in which case I would invite them to clearly speak to this interpretation. In any case, this is not a criticism of the findings themselves, but clarity on this point would prevent conflation of two different organizational principles and would help position the work for both the vision and network neuroscience communities.

    2. Reviewer #2 (Public review):

      Summary:

      Using a public dataset of retinotopic mapping and resting-state data, the authors find that the default mode network has voxels that respond (positively or negatively) to visual stimulation at specific retinotopic positions, and that resting-state activity in these voxels is correlated with activity in more traditional sensory voxels with the same visual-location preference. The retinotopic specificity is bidirectional, such that high activity in default mode voxels drives activity only in voxels with matching receptive fields in sensory cortex, and vice versa. These findings are at odds with traditional views of the default mode network as having abstract (non-retinotopic) representations and competing (rather than cooperating) with external sensory representations.

      Strengths:

      This study continues an intriguing line of research about how default mode regions interact with the sensory cortex. Demonstrating that there are structured interactions between these regions at rest, and that these interactions are in fact organized according to retinotopic location (as opposed to traditional views of representational format in the default mode network), provides a new framework for thinking about large-scale internal and external brain networks. The authors make use of a well-powered public dataset that allows for precise estimates of pRFs and individual-specific resting-state networks, and develop a number of interesting analyses that characterize the relationships between DN and dATN voxels. The findings are exciting and could have a major impact on future studies in cognitive neuroimaging.

      The authors mention that these findings could shed light on internal/external interactions such as "anticipatory saccades or memory-guided attention," which is true, though I would argue that constructing DN representations of external stimuli is in fact even more fundamental than these specific cases (e.g., see Barnett and Bellana, 2025, "Situation models and the default mode network"). The "highways" identified in this study could play a vital role in real-world perceptual processes that are constantly translating external input into internal mental models.

      Weaknesses:

      (1) The criterion used for defining voxels as retinotopic seems very liberal. The authors show that only 5% of voxels have R^2>0.14 in a null analysis, and therefore define voxels with R^2>0.14 as retinotopic. Although all the networks in 1C show voxel distributions that differ from the null, the number of false positives above R^2>0.14 seems problematic, especially for the DN positive pRFs (red distribution) and to a lesser extent the DN negative pRFs (blue distribution). From visual inspection of the plot, the false discovery rate (fraction of voxels labeled as retinotopic that are false positives) looks like it would be greater than 50% for the DN-positive pRFs. The authors do show that the positive pRF voxels have above-chance consistency across runs, again providing evidence that there are true positive voxels in this set, but perhaps a stricter criterion (such as having consistent negative fits across runs) would provide more targeted identification of the DN voxels with true retinotopic sensitivity.

      (2) The claim that "opponency at rest between the DN and dATN appears to be driven by the subset of DN voxels with negative retinotopic tuning" is not well supported. The fraction of DN voxels with negative pRFs is small: 9.42% of DN voxels have pRFs, and 58.77% are negative, so about 6% of DN voxels have negative pRFs. The fact that any DN voxels have negative pRFs is notable, but the authors do not provide evidence that these 6% are driving the overall behavior of the DN. They do show (e.g., in Figure 2B) that negative and positive pRFs have opposing influences, but the overall correlation with dATN does not look similar to the negative pRF connectivity. I'm also unsure whether "opponency" is a reasonable description for two networks that are "independent (i.e., not correlated)" in this analysis.

      (3) The event-triggered analysis is effective at testing the bidirectional relationship between DN and dATN, with high activity in either network triggering a response in the other network. However, it would be helpful to show more validation that these "events" are meaningful windows of time to study. First, is 13 TRs a typical length of time that activity is elevated during one of these events? Second, the top-down and bottom-up terminology is perhaps too loaded and not well-justified; if the negative pRFs in the DN reflect a meaningful coding system, then couldn't low (rather than high) activity indicate a top-down event?

      (4) The framing of this paper relative to the authors' past week, such as Steel et al. 2024 ("A retinotopic code structures the interaction between perception and memory systems"), could be improved. The existence of negative pRFs in the DN and a functional relationship between these pRFs and the sensory pRFs have already been described in prior work. My understanding of the primary novelty here is that this paper examines resting-state data, showing that there are widespread spontaneous interactions between broad internal and external networks, but this distinction is not made explicit in the Introduction.

      (5) The definition of the default mode (DN) in this study aligns with past research, but the definition of the dorsal attention network (dATN) seems at odds with standard terminology. For example, the authors cite Fox et al. 2006, which depicts the dATN as including regions such as IPS, FEF, SMA, and MT+. Here, however, the "dATN" seems to be primarily lateral and ventral visual cortex (e.g., Figure S5). The exact location of these sensory pRFs is not critical to the authors' claims, but this labeling seems incorrect, and the motivation for defining/selecting the sensory network in this way is not described.

    3. Reviewer #3 (Public review):

      Summary:

      This paper addresses an important question (the relationship between DN and dATN, and the role of retinotopic coding) and uses a set of novel analyses.

      Strengths:

      Important question, novel analytical approaches (pRF-informed functional connectivity analysis).

      Weaknesses:

      Some of the key claims are not fully supported by the data presented. There is also a concern about over-interpretation of the results. Key issues:

      (1) The authors claim that retinotopic coding scaffolds the interaction between DMN and dATN. However, retinotopically tuned voxels account for a mere 9% of DMN voxels. So this appears to be a major overstatement. For instance, the statement that "these findings would position retinotopy as a unifying framework for brain-wide information processing" is not justified given the presented data.

      (2) Given that positive pRF voxels in DMN positively correlate with dATN voxels and negative pRF voxels in DMN negatively correlate with dATN voxels, there is a concern that these results could be contributed to by imprecise brain network parcellations. E.g., could some of the positive pRF voxels in DMN be erroneously assigned to DMN and actually belong to one of the other task-positive networks? There is insufficient validation of network parcellation to put this worry to rest, especially since it depends on ICA, which has a degree of arbitrariness built in.

      (3) The claim that retinotopic coding is intrinsic to the DN network is not supported by rigorous analysis and results. The analysis here has many arbitrary factors, including: the threshold of the 99th percentile of resting-state distribution; the designation of DN as "top-down" and dATN as "bottom-up"; the definition of "anti-matched" voxels instead of using randomly selected voxels; and the statistics being paired between matched and anti-matched voxels instead of using comparisons to baseline. Overall, I do not think that the result supports the conclusion that retinotopic coding in DN is intrinsic instead of being bottom-up-driven, given the very high threshold (99%) used and the fact that many other networks could also send bottom-up input to DN. Furthermore, the idea that bottom-up inputs only occur when the dATN (or any other RSN)'s spontaneous BOLD activity is above a certain threshold is a huge and unvalidated assumption.

    1. Reviewer #1 (Public review):

      Summary:

      Garcia-Alcala, Kratz and Cluzel investigate to what extent our understanding of bacterial physiology in bulk experiments can be applied to single-cell observations. They find that intrinsic noise may be powerful enough to even inverse the trends found in the bulk. The authors hypothesize that the asymmetric distribution of ribosomes to daughter cells during cell division plays the dominant role in the intrinsic noise and is able to generate the observed phenomenon. They do not show it directly, but the data and its agreement with the model are sufficient to support this claim.

      Strengths:

      The experimental part is convincing: the positive correlation between the elongation rate and promoter activity of unnecessary protein is clear, as well as the negative correlation between the mean values while changing the promoter strength. This was demonstrated in both rich and poor media. The causality between the growth rate and the promoter activity was shown using the negative lag time of the cross-correlation function. A simple, reasonable model accounts well for the data. This paper demonstrates an interesting phenomenon and provides a plausible theory for it, advancing our understanding of bacterial physiology on the single-cell level.

      Weaknesses:

      (1) Mean-reversion timescales were assumed to be longer than the simulation time and much longer than the cell cycle time. It is not clear whether the results are robust in case mean-reversion timescales become of the order of the cell-cycle or smaller. If not, is there an argument for such practically infinite reversion timescales?

      (2) It is not easy to understand the simulation part unless one reads Ref. [14]. k(t) is assumed Equation (1) from Reference [14]? Is it crucial that the ribosome noise appears only at the division? The ribosome noise strength \sigma_R=0.06 - is it lower or higher than the naively expected binomial division? Also, a more intuitive explanation of the Simpson paradox would help the reader.

      (3) It would be useful for the reader to see the raw data and not only the filtered one to appreciate the measurement noise level.

      (4) Negative lag time of the cross-correlation function is visible, but consider adding a statistical test for it.

      (5) Can you make similar cross-correlation plots using the model? Can you infer by using it, whether the data agrees better with the assumption that ribosomal noise appears only at division or continuous fluctuations during the cell cycle?

    2. Reviewer #2 (Public review):

      Summary:

      The manuscript by Garcia-Alcala et al. reports an interesting paradox: the cost of gene expression slows the population-average growth rate, whereas at the single-cell level, expression levels from these genes positively correlate with the growth rate. The effect is observed in the expression of flagellar genes and a gene under a synthetic promoter in E. coli. The findings are explained by the inheritance of growth factors, including ribosomes, during asymmetric division.

      Strengths:

      (1) The manuscript adds strength to an emerging body of literature showing that the population-level bacterial growth laws do not match correlations based on single-cell data. The evidence presented here is more striking than in previous works (such as Pavlou et al., Nat. Commun. 2025), as the trends in population-level data and single-cell data are reversed.

      (2) A relatively simple model correctly explains the trends in the data.

      Weaknesses:

      (1) It is not clear whether flagellar proteins are expressed proportionally to the reporter signal. Furthermore, it is questionable if E. coli bacteria in the mother machine channels are flagellated. If they are, they could potentially swim out of the channels, which is not the case when they do not carry the MotA E98K mutation. The authors should provide some evidence that E. coli expresses the actual filament proteins in the channels.

      (2) It is unclear what fraction of the total proteome mVenus represents in different measurements. Some quantification is needed (for example, using the Coomassie staining). Using f_U as high as 14.4% in simulations is questionable.

      (3) The data from the MC4100 strain does not directly match the trends of MG1655. The justification for filtering out the low-frequency components of MC4100 is not particularly convincing. It appears unlikely that ribosomes or other growth factors partition significantly differently in the MC4100 strain than in the MG1655 strain. Further discussion and a plot similar to Figure 1 (Left) for this strain are warranted.

      (4) The model needs to be described in more detail. A closed set of equations that has been simulated must be presented, along with all values of the model parameters and their sources. The authors should consider depositing their code on GitHub or another publicly accessible repository.

    1. Reviewer #1 (Public review):

      Summary:

      Effective decision-making in dynamic environments requires the brain to flexibly adjust how sensory evidence is accumulated over time, a process often modeled as an adaptive "leak." McGaughey and Gold propose that this flexibility is not solely a property of downstream integrators but is also supported by stimulus-specific sensory adaptation in the middle temporal area (MT). By recording single-unit activity in rhesus macaques during a motion direction-discrimination task, the authors found that more rapidly changing environments lead to reduced sensory encoding and discriminability in MT, which they argue accounts partially for a "leakier" integration. Furthermore, the study identifies pupil-linked arousal as a parallel, independent mechanism contributing to this adaptive process.

      Strengths:

      The study addresses an important question in cognitive neuroscience by exploring the neural substrates of perceptual flexibility. A major strength is the novel focus on how sensory adaptation, rather than just downstream integration, contributes to behavioral changes in dynamic environments. By shifting the perspective toward the encoding stage, the authors provide a more comprehensive account of how the brain manages evidence accumulation. This conceptual advance is supported by a rigorous experimental approach that combines human-like psychophysics with large-scale single-unit recordings in the middle temporal area (MT) and pupillometry.

      Weaknesses:

      (1) Alternative mechanisms for performance differences

      The authors assume that the difference in performance between the low-switch (LS) and high-switch (HS) frequency conditions is explained by a change in the "leakiness" of integration. However, several other mechanisms could potentially explain this effect:

      (i) Temporal Uncertainty: Integration might start later in the HS condition, leading to lower performance.

      (ii) Reduced Efficiency: Integration could be less efficient in the HS condition (i.e., lower signal-to-noise ratio) without a change in the leak parameter itself.

      (iii)Evidence Contamination: Motion information from the adapting stimulus in the HS condition may be integrated rather than ignored, which might be the case since the transition from the adapting to the test stimulus is not externally cued.

      To distinguish between these alternatives, I suggest two possible analyses. First, a formal model comparison could be performed, though I acknowledge this may be inconclusive in the absence of response-time data. Second, an analysis of motion energy kernels could be revealing; the leak hypothesis makes the specific prediction that for long test stimuli, early samples should contribute more to the choice in the LS condition than in the HS condition, relative to late samples.

      (2) Independence of neural and pupil-linked signals


      The authors take the lack of session-wise correlation between context-dependent contributions from neural and pupil terms as evidence that these two signals provide independent contributions to the behavioral effect. However, could this lack of correlation simply be a result of high variability or noise in these estimates? The data shown in Figure 7B suggests that measurements are very noisy, which might obscure a potential relationship.

    2. Reviewer #2 (Public review):

      McGaughey & Gold trained rhesus macaque monkeys to perform a motion-direction discrimination task in which a behaviorally irrelevant adapting stimulus with either fast or slow direction alternations preceded a variable-duration test stimulus, while simultaneously recording single-unit activity in area MT and pupil diameter. They report that adaptation to the more rapidly changing stimulus was associated with reduced behavioral sensitivity, attenuated test-evoked MT responses, and larger pupil-linked arousal signals. The authors interpret these behavioral changes as evidence for a more "leaky" evidence-accumulation process, and argue that this apparent leak is implemented in part through context-dependent sensory adaptation in MT and in part through arousal-related mechanisms. More broadly, they conclude that flexible evidence accumulation in dynamic environments arises from distributed adjustments across sensory encoding and neuromodulatory systems rather than solely from changes within a downstream accumulator. If correct, this interpretation has significant implications not only for our understanding of the neural mechanisms of perceptual decision-making but also for broader theories concerning the functional role of sensory adaptation.

      The conclusions of the paper are mostly well supported by the data. Evidence for robust adaptation-induced changes in sensory encoding, behavior, and pupil dynamics is convincing, but further clarification and refinement are needed to establish a clear mechanistic link between these effects and decision-making processes.

      Aspects of the behavioral analysis would benefit from a tighter connection between theoretical claims about evidence accumulation and the empirical features of the psychometric functions. For example, the rightward shifts observed across adapting conditions are interpreted as consistent with a reset of accumulation on switch trials, but similar patterns could also arise from failures to detect the test stimulus on a subset of trials, leading responses to default to the final adaptor direction. Likewise, changes in psychometric slope and asymptote are attributed to differences in evidence accumulation without explicit modelling or consideration of alternative explanations. Clarifying how specific features of the psychometric functions map onto distinct components of the decision process will strengthen the link between the theoretical framework and the behavioral data.

      A slight concern is the lack of a consistent analytical approach for relating behavioral changes to neural and pupil-linked measures. Different sections of the manuscript rely on different behavioral metrics-such as differences in accuracy within a selected stimulus-duration range (e.g., Figure 5C) or psychometric slope differences (Figure 6C) - without clear justification for these choices. The analytical approach likewise varies between simple correlational analyses (Figure 5C, Figure 6C), pseudo-experimental group comparisons (Figures 5D, E), and the inclusion of neural or pupil terms in the behavioral psychometric regression model (Figure 7B). While each metric and approach may be defensible in isolation, adopting a more consistent framework will help convince readers that the reported effects are robust and not contingent on the selective choice of metric or analysis.

    3. Reviewer #3 (Public review):

      Summary:

      Environments change over time; therefore, optimal decision-making ought to discount older observations of the environment in favor of newer ones in a manner consistent with the amount of temporal instability. Computational models of perceptual decision-making model this temporal discounting with a 'leak' parameter that determines the rate at which older information is discarded. In this study, McGaughey and Gold examine the neurophysiological mechanisms that could underlie adaptation to different degrees of temporal instability. They developed a novel variant of the well-established perceptual decision-making random-dot-motion paradigm, in which the stimulus being evaluated was preceded by an 'adapting' stimulus with either high or low temporal stability. When the test stimulus was preceded by the adapting stimulus with lower temporal stability, NHPs showed reduced psychometric slopes, indicative of increased temporal discounting ('leak'). While the NHPs performed this task, single-unit neural activity was recorded in area MT, along with pupillometric data. The authors use these neural and pupil datasets to investigate two potential sources of adaptive discounting under varying amounts of temporal instability: sensory adaptation (changes in instantaneous evidence encoding), and arousal-related changes in evidence accumulation. MT neurons respond differently to the test stimulus under conditions of high vs low temporal stability of the adapting stimulus - when the adapting stimulus is more stable, MT neurons have larger and more selective responses to the test stimulus. In addition, evoked pupil responses to the test stimulus were modulated by the adapting stimulus. Both the strength of the difference in MT responses across contexts and the difference in pupil diameter across contexts were correlated with context-dependent modulation of the monkeys' behavior over sessions. The paper concludes that both sources appear to independently contribute to adaptive evidence accumulation, likely operating at different processing stages in the brain.

      Strengths:

      (1) While computational models of perceptual decision-making have been very useful for explaining behavior and neural responses in decision-making areas, we are still in search of some of the neural mechanisms that could implement such models. Studies such as this one, which aim to identify neural correlates of simplified model parameters, are quite crucial.

      (2) Analysis is generally careful and well-executed.

      (3) Prompts some interesting follow-up questions that could be answered with simultaneous recordings and causal manipulations, as the authors state in the Discussion - e.g., which areas are affected by arousal-related neuromodulation correlated with evoked pupil size and how.

      Weaknesses:

      (1) The task design may not be optimal. While the amount of time the monkey is exposed to each motion direction during the adapting stimulus is matched, it's hard to know if the reduced MT responses to the test stimulus are truly due to the greater frequency of switches during the HSF adapting stimulus or because the monkeys have been exposed to more repetitions of the stimulus. It's increased sensory adaptation in either case, but it makes it problematic to interpret this as temporal context-dependent adaptation specifically. I think this could potentially be partially addressed by an analysis that is in the paper, but could potentially be emphasized/fleshed out more, specifically the results shown in Figure 4D that seem to show that most of the reduction in neural response for adapting units occurs between the first and second stimuli.

      (2) The pupillometric analysis seems to be an indirect way of assessing whether the accumulator itself might be modulated by temporal context, but the link could be made clearer. The authors show that context-dependent behavior is related to pupil size, which is related to arousal/neuromodulation, but it would be helpful to have some idea of what neural mechanisms underlying adaptive decision-making are actually impacted by this neuromodulation. Lacking neural data to address this question (e.g., from a brain region proposed to be involved in the accumulation process), at least more discussion of this would be helpful. Essentially, I'm unsure of how to interpret the pupil results: the argument that temporal context affects instantaneous evidence encoding in MT that then drives the accumulator is very clear, but I am a bit confused about what, mechanistically, I should think about the effect of neuromodulation doing.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript aims to differentiate between foveal and peripheral attentional mechanisms in visual and frontal brain regions in monkeys engaged in a free-gaze visual search task.

      Strengths:

      The manuscript is clearly written, the question is important, and the behavioral task is interesting.

      Weaknesses:

      I have two major concerns.

      (1) The authors interpret divergence in neural responses to target vs nontarget as attention. But it is not. The subject has to attend to both target and nontarget stimuli to determine the stimulus category and thereby decide on the next action. Thus, divergence between target and nontarget responses could reflect categorical discrimination, but I am not sure this can be interpreted as attentional modulation. While it may be tempting to suggest that finding a stimulus of a specific category is "feature attention", analogous to, e.g., attending to the red stimulus, I don't believe this is correct. For the former, the animals have to attend to a stimulus, and examine the stimulus to determine the stimulus category, unlike a simpler discrimination, which may pop out. Given this, I am unconvinced that the interpretations in this manuscript are valid.

      (2) Regarding the RF classification of foveal and peripheral RFs for IT and PFC, prior work suggests that neurons in IT cortex (especially AIT) and PFC have RFs that largely include the foveal visual field. So, it would be important to include figures that show the RFs of neurons classified as foveal versus peripheral for all three areas.

    2. Reviewer #2 (Public review):

      Summary:

      In natural visual behavior, such as when one is looking for a face in the crowd, the eyes are moved from site to site, seeking possible matching targets. This involves attention both to the current view at the center of vision (the foveal location) as well as to upcoming views via attention to targets in the periphery. While it has been established that attention generally enhances neuronal response (compared to simple visual activation) at the attended spatial location, this study provides solid evidence that attention during active visual search leads to neuronal response enhancement only when the eye moves towards targets that exhibit the desired feature and category. This study thus moves the field towards understanding the neural encoding of active vision.

      This study examines the neuronal basis of feature-selective attention during active, freely behaving visual search. Traditional electrophysiological studies on visual attention in monkeys commonly used an eye fixation with a covert attention paradigm, but have not sufficiently addressed the roles of both foveal and peripheral attention in play during natural looking behavior. Here, the authors present a novel paradigm in which, during eye-movement mediated search, neuronal receptive fields are recorded in multiple cortical areas (sensory V4, temporal, and prefrontal areas). In this manner, as the eye foveates, items in the array fall into foveal or non-foveal recorded sites. Thus, the experimental paradigm is elegant, offering the opportunity to make multiple types of comparisons: target/distractor, towards/away from fovea, and areal. Specifically, following a category cue (face, house, hand, flower), freely initiated saccades are made to locate a categorically matching 'target' in an array of distractors. Feature attention is assessed by comparing eye saccades made to targets vs to distractors. Spatial attention is assessed by comparing saccades made 'towards' vs 'away' from targets. Statistics are rigorous and nicely designed. The detailed association of simultaneously obtained eye movement sequences and neural parameters is well done. These are valuable data that will contribute to our understanding of attentional modulation in visual search.

      Strengths:

      The significance of these findings is fundamental. Decades of attention research in vision have been based on the paradigm of visual fixation and covert peripheral attention. However, increasingly, the field has moved towards understanding how the visual system works during active vision. Here, the authors use an active visual search paradigm and record from multiple areas (V4, IT, PFC). They find enhancement of attention both in the foveal and peripheral locations, and, furthermore, a high degree of feature and categorical specificity. This provides valuable data for the concept of a foveal-peripheral attentional window in natural vision. The controls (comparisons of neuronal response during looks to targets vs distractors, and looks towards and away from the target) and statistical rigor make these findings quite compelling.

      Weaknesses:

      While the study is generally quite strong, there are a few weaknesses to be addressed.

      (1) Little rationale is provided for recording in the selected areas, V4, IT, and PFC. Given the respective roles in sensory, object recognition, and goal-directed behavior, some rationale for this design should be offered, and commonalities/distinctions between these areas should be discussed.

      (2) Given the reliance of all analyses on saccadic behavior (towards target/distractor, towards/away from target), additional description and summaries of eye movement behavior during single trials and across trials should be provided.

      (3) The dependency of findings on top-down (categorical & feature-specific) task design should be discussed.

    3. Reviewer #3 (Public review):

      In this manuscript, the authors investigate the role of attention in foveal processing during a naturalistic task. They record neural activity from extrastriate visual areas V4 and inferotemporal cortex, as well as from the lateral prefrontal cortex, in macaques performing a free-gaze visual search task. In this task, animals searched for a face or house target among multiple complex stimuli, with no constraints on eye movements. Unlike classic studies of visual attention, which often rely on controlled fixation, this work examines neural activity in both foveal and peripheral receptive fields during naturalistic eye movements.

      The main question addressed by the authors is how feature-based attention is distributed and coordinated across foveal and peripheral visual fields during active search, and how this attentional processing influences saccade behavior. The authors show that foveal units in visual areas exhibit feature-based attentional enhancement, with stronger responses when a fixated stimulus is a target compared to when the same stimulus serves as a distractor. Peripheral units in visual and prefrontal areas show both feature-based and spatial attentional modulation, consistent with prior work. Finally, the authors show that attentional modulation depends primarily on stimulus category rather than response magnitude, with neurons showing similar enhancement for all images within the target category regardless of how strongly individual images drive the cell.

      There are several notable strengths of this paper, including:

      (1) Disentangling feature-based and spatial attention during naturalistic vision remains a central challenge. This paper tackles both simultaneously, parsing neural populations by object selectivity (face-selective, house-selective, non-selective) and RF position (foveal vs. peripheral).

      (2) The unconstrained search task (Figure 1A) moves beyond the dominant fixed-gaze, cued-attention designs (Zhou & Desimone, 2011) to study attention as it operates during natural behavior, with sequential fixations and voluntary saccades.

      (3) The scale of the multi-area recordings is a major strength and is well aligned with current trends in primate and human neuroscience toward large-scale, multi-area recordings. Simultaneous recordings from visual and prefrontal areas, comprising over 4,900 foveal units and more than 1,500 peripheral units, enable meaningful cross-area latency comparisons and area-specific analyses of attentional modulation. This study builds on the authors' previous analyses of this dataset by expanding the scope to show that feature-based attention generalizes across neuronal classes and operates on categorical identity rather than response magnitude.

      (4) The combination of simultaneous multi-area recordings and a rich behavioral paradigm provides a dataset that is well-suited for population decoding, cross-area interaction analyses, and trial-by-trial prediction of saccade choices, which could substantially deepen mechanistic understanding beyond the largely univariate comparisons presented here.

      While the data broadly support the paper's main conclusions, several issues limit the strength of the mechanistic interpretation and should be taken into consideration:

      (1) Receptive field size is not explicitly quantified and may confound foveal-peripheral comparisons. Units are classified as foveal or peripheral based on responsiveness to the cue versus the search array (Methods, p. 17), but the manuscript lacks essential information about receptive field sizes, eccentricities, and the number of search stimuli falling within each receptive field and related proper controls. This is critical because receptive fields in visual area V4 at foveal eccentricities are relatively small (Gattass et al., 1988; Desimone & Schein, 1987), whereas receptive fields in inferotemporal cortex can span several degrees to tens of degrees and often include the fovea (Op de Beeck & Vogels, 2000; DiCarlo & Maunsell, 2003; Zoccolan et al., 2007). Given the 2{degree sign} × 2{degree sign} stimulus size, multiple search items could potentially fall simultaneously within peripheral receptive fields. This introduces a potential confound, as attentional modulation is known to be strongest when multiple stimuli appear within a single receptive field (Reynolds et al., 1999). Although the authors acknowledge this issue for visual area V4 (p. 17), it is neither quantified nor controlled for. Without explicit receptive field mapping relative to the search array, comparisons between foveal and peripheral units, as well as between visual areas, are difficult to interpret cleanly.

      (2) Attentional modulation is difficult to dissociate from saccade planning and decision-related signals. The free-gaze paradigm enhances ecological validity but introduces a temporal confound: mean distractor fixation durations are approximately 156 ms (p. 9), while attentional effects emerge between 137 and 170 ms after fixation onset (Figure 2). As a result, the reported attentional modulation coincides with the preparation of the subsequent saccade. Neural activity measured in the primary analysis window (150-225 ms; p. 19), therefore, likely reflects a mixture of visual, attentional, motor planning, target recognition, and behavioral relevance signals, all of which are known to modulate responses in visual areas at similar latencies (e.g., Chelazzi et al., 1998). Moreover, target fixations (~257 ms) and distractor fixations (~156 ms) occur on fundamentally different behavioral timescales, which may inflate apparent foveal attentional effects. While the authors suggest that these timing differences support the idea that foveal feature-based attention facilitates prolonged fixation on target stimuli, this interpretation is not fully supported by the current analyses. That said, the saccade-aligned analyses of peripheral units (Figure S3) partially mitigate this concern by demonstrating that feature-based modulation persists through saccade execution.

      (3) The "attention-out" condition for spatial attention lacks directional control. In the spatial attention analyses (Figures 4D-F), the "attention-out" condition appears to include all fixations followed by saccades directed away from the receptive field, regardless of saccade direction. This differs from classic spatial attention designs, which typically use controlled anti-saccades or saccades to fixed locations opposite the receptive field (e.g., Moore & Armstrong, 2003; Gregoriou et al., 2009). Saccades directed toward locations adjacent to, but outside, the receptive field may still partially engage spatial attention mechanisms near the receptive field via broad attentional fields or motor preparation gradients (Bisley & Goldberg, 2010). In addition, the "attention-out" condition likely contains a heterogeneous mixture of trials in which the stimulus in the receptive field is either a target or a distractor, since feature-based attention effects are derived from this same pool of trials. As a result, spatial and feature attention effects are not fully orthogonal, and variance related to feature attention may already be embedded in the spatial attention baseline.

    1. Reviewer #1 (Public review):

      Summary:

      This work presents a flexible spike-sorting framework that allows users to run, swap, and benchmark individual modules commonly used in spike sorting. The paper argues and demonstrates that "opening the black box" is essential for understanding which components drive performance differences and for making progress toward more accurate and transparent spike sorting.<br /> Using this modular benchmarking pipeline, the work identifies electrode drift as a primary bottleneck for accurate sorting and introduces an end-to-end sorter ("Lupin") that combines the best-performing modules and is reported to outperform existing spike-sorting packages on their benchmark.

      Overall, this is a strong tool/resource contribution with clear potential to accelerate spike-sorting development and enable more rigorous comparisons. However, several claims, particularly around Lupin's or individual modules' superiority, are not yet supported robustly enough for the strength of the conclusions stated.

      Strengths:

      This work has high community value and practical utility. The effort to make benchmarking and spike sorting modules accessible and standardized is substantial and likely to be broadly useful.<br /> Treating spike sorting as a set of interchangeable modules is a useful approach to some extent, and it enables targeted improvements rather than 'new sorters' popping up, which are difficult to fully understand.

      Implementing this resource within SpikeInterface, an already widely used tool, will facilitate uptake and community contributions.

      Overall, I am positive about this manuscript as a resource paper. The core framework is compelling and timely.

      Weaknesses:

      (1) The main concern is the limited support for the claim that 'Lupin' and individual modules' outperform existing spike sorters.

      (2) Evidence is primarily from a single benchmark based on an intentionally simplified simulation. While the authors discuss the trade-offs between simulated and real data, the current evaluation does not provide enough diversity to justify claims of superiority.

      (3) While improving individual modules that run in a serial fashion could aid overall spike sorting performance, acknowledging that some end-to-end sorters work in an iterative fashion across multiple of these modules would be fair. Perhaps the optimal spike sorter is not a serial set of modules.

      (4) There is also a risk of benchmark overfitting. A modular approach makes it easy to select components that excel on specific benchmarks (or a specific project's data characteristics) without generalizing.

      Concrete ways to strengthen this work:

      (1) Evaluate on multiple simulation regimes, consider adding at least one biophysically detailed simulation, benchmark on multiple probe-geometries with neurons also clustered in different depth profiles (as this will affect drift solutions), and provide real-data validation. Even without full ground truth, real-data can be evaluated with expert curation, functional validation (e.g., refractory violations, quality metrics, unit waveform consistency), agreement across sorters, and consistency across time.

      (2) Related to real-data applicability, it is also important to acknowledge that modulatory approaches can enable overfitting to the needs of individual projects. Without real-data benchmarking (or benchmark diversity), it is unclear how the framework will guide users towards generalizable 'best practices' rather than optimized configurations that work for their specific conditions.

    2. Reviewer #2 (Public review):

      Summary:

      Spike sorting, that is, assigning events detected in extracellular electrophysiology data to the firing of individual neurons, is an inherently difficult computational problem involving multiple steps. The difficulty arises from low signal-to-noise, instability in signal due to the relative motion of the tissue and recording sites, and large volumes of data. Experimental ground truth data - where the correct assignment of spikes is known - is not available in large enough quantities to test algorithms. This paper describes a tool for creating fully synthetic ground truth data and benchmarking the individual steps of spike sorting to dissect the impact of signal-to-noise, firing rate, and motion correction on each step. This information is used to construct an optimized algorithm for sorting the ground truth data. One result of particular interest is the dominant role of motion correction in degrading accuracy. Another important technical result is that motion correction via interpolation of the voltage traces yields similar accuracy to interpolation of the spike templates.

      Strengths:

      The paper clearly shows the benefits of analyzing the complex process of spike sorting step by step. While this analysis has also been done in papers presenting spike sorters (for example, reference [32]), the tools presented here allow users and developers to do similar studies for their own work. This toolset will be very useful to many labs, especially those working in less studied brain areas or model systems, cases where the tuning of standard spike sorting tools is not a good match to the data.

      Weaknesses:

      The model ground truth data used in the paper does not need to be a perfect match to experimental data to provide useful benchmarking. However, as with all measurements of spike sorting accuracy, extrapolation to experimental data can be complicated. Users of these tools will need to assess how well the simulated data matches their recordings.

    3. Reviewer #3 (Public review):

      Overview:

      In this manuscript, the authors describe two additions to an existing toolbox (SpikeInterface, Buccino et al., 2020, eLife). The first addition is an empirical simulator for extracellular recordings, in which spikes from predefined templates are added up with Gaussian noise. The second addition involves granting user-level access to intermediate processing steps along spike sorting algorithms. The authors demonstrate the toolbox by evaluating functions (e.g., event detection) or sets of functions (e.g., feature extraction + clustering) on their simulated data, and suggest that a specific combination of function implementations provides performance improvement relative to kilosort4 (Pachitariu et al., 2024, Nature Methods).

      If the authors are interested in making this manuscript a suitable scientific contribution, the entire work has to be revised extensively. In particular, the simulator has to be extended and improved; the implementation of existing spike sorters has to be improved; the feedforward architecture of the modules has to be extended; the reporting of results has to follow standard reporting standards; new algorithms have to be explained in sufficient detail; and the manuscript has to undergo extensive proofreading.

      Notably, even assuming perfect implementation and descriptions, it is unclear to me whether the scope of the present work warrants a publication in a scientific journal, or is more suitable for an internal technical report or an e.g., a GitHub version release. To go beyond a scientifically-sound technical report, the authors may choose to demonstrate the utility of their new proposed sorter ("Lupin") and compare it to existing tools on multiple datasets.

      General comments:

      (1) The simulator itself has to be improved and extended. Right now, it simply generates, for every unit, a mother waveform from a sum of exponentials, scales that over channels, and then adds up multiple instantiations of every unit on every channel, along with noise. This is not a biophysical simulator: it is an ad hoc procedure, and the sentence "we firmly believe that.." (lines 482-483) does not make the procedure convincing. To make the simulator credible, the authors should: (1) use a set of biophysical equations, with multi-compartmental modeling of currents and return currents; (2) use noised data from extracellular recordings; or (3) some combination thereof.

      (2) The simulated dataset has to be extended in time. Maybe I missed something, but 500 units over 10 minutes, with some units having firing rates as low as 0.1 spikes/s, corresponds to some of the units firing an expected 60 spikes. This is clearly too short, and does not replicate the standard situation in extracellular experiments.

      (3) The simulated dataset has to be extended in space. The choice of using NeuroPixels 1.0 geometry is a poor one. Many labs use other monolithic electrode arrays (MEAs, silicon probes, other rigid arrays); tetrodes remain a major tool, and flexible probes (polyimide, mesh) are evolving. Assessing algorithms over a single spatial architecture is likely to lead to local maxima in performance and potentially erroneous conclusions.

      (4) The existing spike sorters evaluated are not completely described. Some sorters (e.g., SpyKING Circus and KS4) were described in previous publications, but it is unclear whether the implementation that was used for the present tests is exactly the same as those previously published. More importantly, some of the sorters evaluated (e.g., TDC, TDC2, SpyKING Circus 2) were never described in a peer-reviewed paper. This does not mean that they cannot be evaluated - but if they are, they must be described in full. Relying on the fact that the code is open source cannot replace a complete and accurate scientific description.

      (5) Related to the above, all relevant code should be made available online in permanent repositories, not only in author-controlled ones.

      (6) It is unclear why SpyKING Circus 2 and TDC2 are evaluated - these could potentially be described as straw men. I recommend reorganizing the manuscript so that after every module is evaluated separately based on a limited ground truth dataset, a single "best" sorter would be constructed, and then tested extensively (and compared to the de facto state of the art). Such reorganization would both demonstrate the utility of a modular approach and clarify the general usefulness of the outcome.

      (7) The new algorithms developed, for example, clustering and template matching, have to be described in more detail, and demonstrated graphically on simple datasets. This can be done in supplementary material if the authors prefer not to extend the manuscript too much.

      (8) This reviewer finds the description and interpretation of the results to be inadequate. As an example, focusing on Figure 5: The results in Figure 5A have to be supplemented and summarized as a scalar point estimate (e.g., median accuracy), an estimate of dispersion (e.g., using MAD, IQR, or SD), evaluated over multiple runs, and compared using statistical tests between tools and conditions (e.g., using a multi-dimensional analysis of variance, a mixed effect model, etc.). The results in Figure 5D must have an indication of dispersion. Any conclusions based on the numerical experiments must be based on these metrics and statistical evaluations.

      (9) The entire MS would benefit from expert proofreading; there are many language errors, mostly in indefinite articles and grammatical numbers.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript by Yang, Wang, and Cléry presents a lightweight pipeline for real-time identification of common marmosets in a laboratory setting. Models were trained and evaluated on data derived from a family of three closely related adults and a set of juvenile twins. Freely moving animals entered an enclosed space fixed to the housing cage door, which permitted the entry of individual animals for data acquisition. Utilizing YOLOv8-nano, identification was improved through the introduction of uniquely colored collar beads. Analyses of facial similarity showed close morphological relatedness amongst individuals and highlighted the need for highly discriminative classification. Overall, the authors offer a framework for identity tracking that prioritizes real-time inference. The authors demonstrate that combining facial detection with visual markers enables adequate identity assignment under controlled laboratory conditions with minimal cross-individual misclassification.

      Strengths:

      (1) The proposed pipeline offers a solution for real-time identity tracking in common marmosets. Its lightweight design enables deployment across a wide range of hardware configurations. Furthermore, if similar strategies are employed, this methodology is likely adaptable for other species with minimal modification.

      (2) Evaluation of closely related individuals provides a necessary stress test for the discrimination of facial identity tracking.

      Weaknesses:

      (1) The pipeline's reliance on controlled animal isolation and small visual markers raises questions about the approach's generalizability to unconstrained multi-animal environments. The provided confusion matrices (Figures 6-8) indicate that the most common misclassifications are background-related, possibly suggesting that detection specificity is the primary source of error. All things considered, these findings raise concerns about performance in its use in socially dynamic and visually complex environments.

      (2) The manuscript claims performance comparable to that of human experimenters but provides no explicit evidence to support these claims. While it is plausible that human experimenters may be less accurate in facial recognition tasks involving closely related marmosets, the authors don't provide evidence. Moreover, while that might be the case, the color-coded beads provide a salient identity cue for the model, which complicates the interpretation of this comparison grounded in facial recognition.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, Yang et al. develop a real-time system for automatic face detection and identification of multiple unrestrained common marmosets in a home cage setting.

      Strengths:

      The study aims to address an unmet need in behavioral neuroscience: the ability to non-invasively identify animals is crucial to the automated and rigorous study of neural behaviors; this is especially true for common marmosets, which are rapidly becoming a model system of choice for the study of complex social cognition. By using a YOLOv8 backbone, the study achieve human level performance, both in terms of precision and recall of the trained models.

      Weaknesses:

      The robustness of the system is not clear from the limited datasets presented. The use of color-coded beads undercuts the study's premise that the system achieves truly non-invasive tracking. Although the system achieves good performance in face detection, it does not perform as well for classification using faces alone (especially when the faces are similar, as in twin animals). Here, too, the color-coded beads play a key role in identity discrimination. The stated goals of the study and the actual results presented are therefore at odds.

    3. Reviewer #3 (Public review):

      Summary:

      In this manuscript, Yang et al introduce a new method for automatically identifying marmosets in their home cage using a supervised deep learning method that recognizes the face and colored beads on marmoset collars. The authors show a high precision rate of identifying marmosets to levels comparable to a human experimenter. The method overall seems robust at identifying marmosets at different life stages and different settings; however, given the current form, I'm struggling to see the generalizability and experimental utility of this method.

      Strengths:

      (1) The authors provide a near-perfect automatic identification of marmosets in their home cage.

      (2) This method is robust across lightning, camera angles, etc., making it potentially useful for marmoset (and other NHP) identification outside the housing cage as well

      Weaknesses:

      (1) Despite the almost perfect precision, in its current form, I'm failing to see how this method can be useful to other labs.

      (2) This is a nice methods manuscript, but the authors do not present results to show how their method can be used outside of identifying marmosets inside their home cages in a small field of view.

      (3) Reading the manuscript is strenuous, given its repetitive nature. Consolidating and shortening the results, as well as adding some definitions to the results section, would be helpful.

    1. Reviewer #1 (Public review):

      In this manuscript, the authors aim to determine the ligand on Plasmodium falciparum-infected erythrocytes for the NK cell integrin, LFA-1, following up on previous evidence that LFA-1 is important for immune cell-mediated recognition of iRBCs.

      They start by incubating LFA-1 with iRBCs and show by flow analysis that a substantial population of these iRBCs binds to the LFA-1 (Figure 1C). They do conduct the control with uninfected RBCs, but put this in the supplementary material. As this is a critical control, I think that it should be moved to Figure 1C as it is essential to allow interpretation of the iRBC data. The authors also do not state which strain of P. falciparum they used (line 144). This is critical information as different strains have different variant surface antigens and should be included. With these changes, this data seems convincing.

      They next incubated LFA-1 with the iRBCs, cross-linked and conducted a pulldown, identifying GP130 as a binding partner. Using cross-linkers is a dangerous strategy as it risks non-specific cross-linking. Did they try without cross-linking and find an interaction?

      They raised antibodies to PfGBP and showed IFA, which reveals that these antibodies stain iRBCs (Figure 2Ciii). This experiment lacks a critical control of uninfected RBCs, which needs to be included to show that the staining is specific. Without this, it is not possible to conclude that there is iRBC-specific staining with PfGBP.

      They then conduct a pulldown using LFA-Fc, which does show GP130 only in the presence of the LFA-Fc, but not when empty beads are used. This is convincing. BLI measurements are also used to study this interaction (Figure 2Ci). The BLI data is presented in such a way that any association phase is obscured by the y-axis, which makes it impossible to know whether there is binding here. I think that the data needs to be shown with some baseline before the addition of the ligand so that the association can be seen. The data is also a bit messy with a downward drift and the curves showing different shapes, for example, with the 1.0uM curve seeming to have a different association rate. Also, is this n=1? I think that this data needs to be repeated and replicated. As this is the only data which shows a direct interaction between LFA1 and GBP, as pulldowns are done with lysates, which might mean bridging components. I think that it is important to repeat the BLI or use additional biophysical methods to assess binding, to obtain more convincing data.

      The authors next do some modelling of the putative complex. This is done by homology modelling and docking, which is not the most up-to-date method and is overinterpreted. Personally, I would remove this data as I did not find it convincing, and it is not important for the story. If the authors wish to include it, then I think that they should validate the modelling by mutagenesis to show that the residues which the models indicate might bind are involved in the interaction.

      They next made GP130 and tested the binding of this to THP-1 cells, which are often used as a model for macrophages. They observe greater binding of PfGBP-Fc to these cells when compared with hIgG and show that LFA-1 siRNA reduces this binding. I was a little confused about how the flow plots related to the graph in the bottom right corner of Figure 3Bii. In the flow plots, hIgG control shows 12.8% of cells in the gated region, while the unstained cells has 5.63%, but the MFI data shows a decrease in binding for hIgG vs unstained cells. How is this consistent? Also, the siRNA reduces the number of cells in the gated region from 66.6% to 25.9%, which is still substantially more that 5.63% in the unstained control. This also doesn't seem quite consistent with the MFI data. Could the authors explain this? Also, perhaps an additional experiment would be to add soluble LFA-1 into this assay as an additional control to determine whether this blocks PfGBP binding to the THP-1 cells? It could be that there are additional mechanisms of binding which indicate why the siRNA has a partial effect. The same is true for the NK cell experiments in Figure 3Ci, in which the siRNA has a partial effect. The authors also test binding to HEK, HepG2 and 'stem' cells and claim 'only background levels of binding', but in each case, there is more binding to these cells by PfGBP-Fc than by hIgG, albeit less than in THP-1 and NK cells. Why have the authors decided that these increases are not significant? All in all, these experiments do indicate a role for the GBP-LFA1 interaction in the binding of immune cells to iRBCs, but perhaps not as absolutely as is suggested.

      The authors next produce CHO cells with PfGBP on the surface. These cells bind to LFA-1 specifically. When these cells were incubated with primary NK cells, they did see increases in activation markers, which were reduced by the addition of anti-CD11a, suggesting these to be specific. They also conduct the same experiment with anti-GBP with iRBCs, but this is in a different figure. It would be easier for the reader if Figure 5B were in the same figure as Figure 4B, as it is related data using the same method. I found this data convincing, showing that the LFA1:GBP interaction does contribute to immune cell recognition and activation.

      The authors next conduct an experiment in which they assess parasite growth in the presence of NK cells and in the presence of anti-GBP. They use Heochst staining as a measure of parasite growth and claim that NK cells reduce the number of parasites, but that anti-GBP abolishes this effect (Figure 5A). I found this experiment very unconvincing as there are small effects and no demonstration of significance. More commonly used approaches to study parasite growth are lactate dehydrogenase GIA assays or calcein-AM labelling. I did not find this experiment convincing and would either remove or supplement with additional data using a more robust assay, with repeats and tests of statistical significance.

      In summary, the authors present a set of data which comes together to indicate an interaction between LFA1 and PfGBP on the Plasmodium-infected erythrocyte surface. Pulldown studies show convincingly that these two proteins co-precipitate, and BLI data suggest that this is direct. Also convincing is that NK cell activation can be reduced using antibodies against either LFA1 or PfGBP, indicating that this interaction does play a role in immune cell recognition of iRBCs.

    2. Reviewer #2 (Public review):

      Summary:

      The authors used an LFA-1 αI-Fc fusion protein to pull down potential ligands and LC-MS/MS, leading to the selection of PfGBP-130 as a potential membrane protein on the surface of infected cells. PfGBP-130 antibodies were raised and used to support the surface localization. This putative ligand interacted strongly with LFA-1 (Kd = 15 nM). A presumed PfGBP-130 ectodomain interacts with monocytes and NK cells but not cells that lack LFA-1. PfGBP-130 antibodies also interfered with NK cell-mediated infected cell killing; the effect, although statistically significant, is modest. The authors propose that NK cells recognize infected cells via LFA-1 interaction with PfGBP-130 exposed on the host cell and that this interaction is critical to initiation of NK cell activation and killing of infected cells.

      Major points:

      (1) PfGBP-130 is proposed to be a membrane protein based on a single predicted transmembrane domain. Figures 2b and 3a show ribbon schematics with this TM domain at residues 51-68, in agreement with TM prediction algorithms such as TMHMM 2.0 and Phobius. However, this predicted TM is upstream of the PEXEL motif (residues 84-88, sequence RILAE), a conserved sequence for parasite protein export to host cytosol that is proteolytically processed at its 4th residue. Thus, residues 1-87 are removed from PfGBP-130 prior to export, yielding a mature protein without predicted TMs. Prior studies have determined that the mature PfGBP-130 lacks TMs and is retained as a soluble protein in host cell cytosol (PMID: 19055692, 35420481). Thus, the authors' model of PfGBP-130 as a surface-exposed membrane protein conflicts with both computational analysis of the mature protein and these prior reporter studies. An important simple experiment would be to evaluate PfGBP-130 membrane association in immunoblots using the authors' PfGBP-130 antibody after hypotonic lysis (PMID: 19055692) and after alkaline extraction (e.g. 100 mM NaCO3, pH 11 as frequently used, PMID: 33393463). If the prior studies and computational analyses are correct, the protein will be predominantly in the soluble and/or alkaline supernatant fractions.

      (2) Many findings rely on the specificity of antibodies generated against PfGPB-130 or NK cell receptors. Although the authors have included key controls (use of isotype control antibodies, lack of anti-PfGBP-130 binding to uninfected cells), cross-reactivity between P. falciparum antigens is well-recognized and could significantly undermine the interpretation of experiments (PMID: 2654292 and 1730474 provide key examples of antigens recognized by antibodies raised against other proteins). For example, the surface localization in IFA experiments (Figure 2B(iii)) could reflect anti-PfGBP-130 binding to an unrelated parasite surface antigen, a possibility not addressed by any of the authors' controls. As another example, the iRBC lysate immunoblot using this antibody in Fig. 2B(iv) suggests a MW of 95 kDa, which corresponds to the unprocessed pre-protein before export; cleavage in the PEXEL motif yields a processed mature protein of 85 kDa, which should be readily resolved from the pre-protein in immunoblots (PMID: 19055692). A better immunoblot using immature infected cell stages might show both the pre-protein and the mature protein as a doublet band.

      (3) PfGBP-130 is not essential for in vitro cultivation (PMID: 18614010 and MIS of 1.0 in the piggyBac mutagenesis screen as tabulated on plasmodb.org, indicating a highly dispensable gene). The authors should use the knockout line as a control in their IFA localization experiments to address antibody specificity. More fundamentally, their model predicts that NK cells should not recognize or kill infected cells from the knockout line when compared to their untransfected parent. Such results with the knockout line would compellingly support the authors' model without reliance on antibodies that may cross-react with other parasite antigens. PMID: 18614010 reported that the PfGBP-130 knockout exhibited increased membrane rigidity, suggesting an intracellular scaffolding protein rather than a surface localization and use as a ligand for LFA-1 interaction and NK cell-mediated killing.

      (4) PfGBP-130 non-essentiality raises the question of why the gene would be retained if it triggers NK cell-mediated killing of infected cells in vivo. Presumably, this killing would pose strong selective pressure against retention of PfGBP-130. Some speculation is warranted to support the model.

    3. Reviewer #3 (Public review):

      Summary:

      Malhotra and colleagues present evidence that the integrin LFA-1 on NK cells is a ligand for the Plasmodium falciparum protein GBP130 on the infected erythrocyte surface and that this interaction plays a role in the clearance of infected erythrocytes by NK cells.

      The authors first select a subdomain contained within the CD11a subunit of LFA-1 as a probe to discover possible binding proteins on the infected erythrocyte surface. Parasite-infected erythrocytes stained positively with this probe; the level of staining increased as the parasites progressed through the life cycle. Using the LFA-1-based probe in cross-linking pull-down experiments, GBP130 was identified by mass spectrometry as a co-purifying parasite protein. The N-terminal portion of GBP130 was recombinantly expressed and shown to interact with LFA-1 alpha-I by biolayer interferometry experiments. The full-length extracellular domain of GBP130 was then recombinantly expressed and used to stain primary human NK cells and THP-1 cells. Knocking down LFA-1 by siRNA reduced staining by GBP130. To assess the contribution of GBP130 to the activation of NK cells, CHO cells exogenously expressing GBP130 were incubated with primary NK cells. Transfecting CHO cells with GBP130 led to increased activation of co-incubated NK cells compared to mock-transfected and compared to GBP130 transfected cells, with the inclusion of anti-CD11a to block NK cell adhesion. Finally, CHO cells expressing GBP130 led to increased activation of NK cells compared to mock-transfected CHO cells.

      Overall, although the authors present data from NK cell killing assays that include appropriate controls, the data suggesting a direct interaction between PfGBP-130 and LFA-1 does not include the same necessary controls, for example, the use of blocking antibodies. Most critically, the biolayer interferometry experiments use a recombinant fragment of PfGBP-130, which does not include the residues predicted to be important for mediating specific interaction with LFA1. The biolayer interferometry data instead suggest non-specific interactions between PfGBP-130 and LFA1, as binding does not reach saturation.

  2. Mar 2026
    1. Reviewer #1 (Public review):

      Summary:

      This study provided key experimental evidence for the "Solstice-as-Phenology-Switch Hypothesis" through two temperature manipulation experiments.

      Strengths:

      The research is data-rich, particularly in exploring the effects of pre- and post-solstice cooling, as well as daytime versus nighttime cooling, on bud set timing, showcasing significant innovation. The article is well-written, logically clear, and is likely to attract a wide readership.

      Comments on revisions:

      This is the second round of review, and I am generally very satisfied with the authors' revisions. However, a few detailed issues still require attention:

      The authors identified the summer solstice (June 21) as a phenological "switch point", but the flexibility of this switch point remains poorly understood. A more precise explanation of what "flexibility" means in this context is needed, along with a description of the specific experimental results that would demonstrate this flexibility.

      The experiment did not directly measure the specific date of the phenological switch point. Instead, it was inferred by comparing temperature effects before and after the solstice. The manuscript should clearly state that this switch point remains an inferred conceptual node rather than a directly measured variable.

      In Experiment 1, the effect of bud type (terminal vs. lateral) was inconsistent across the overall model and the different leafing groups. The authors should provide a more thorough discussion of potential reasons for this inconsistency. In addition, the statistical model for Experiment 1 indicates that the measured variables (summer cooling and leaf emergence date) explain only 23.4% of the variation in bud formation timing. This leaves over 76% of the variation unexplained, suggesting that other important factors are involved. The discussion should address this limitation in greater depth, moving beyond a focus on the measured variables.

    2. Reviewer #2 (Public review):

      In 'Developmental constraints mediate the summer solstice reversal of climate effects on European beech bud set [their original title]' Rebindaine and co-authors report on two experiments on Fagus sylvatica where they manipulated temperatures of saplings between day and night and at different times of year. I think the experiments are interesting, but I found the exact methods of them somewhat extreme compared to how the authors present them. Further, given that much of the experiment happened outside, I am not sure how much we can generalize from one year for each experiment, especially when conducted on one population of one species. I was also very concerned by the revisions.

      I expand briefly on these concerns and a few others for readers of the paper (see `The below comments relate to my original review'). Subsequent edits to the paper addressed some of these by providing a new figure and moving around the methods. Further, I am at a loss about their hypothesis, when they write in their letter: "Importantly, the Solstice-as-Phenology-Switch hypothesis does not assume that the reversal is fixed to June 21." Why on earth reference the solstice if the authors do not mean to exactly reference the solstice?

      The comments below relate to my original review with many of them still applying.

      Methods: As I read the Results I was surprised the authors did not give more info on the methods here. For example, they refer to the 'effect of July cooling' but never say what the cooling was. Once I read the methods I feared they were burying this as the methods feel quite extreme given the framing of the paper. The paper is framed as explaining observational results of natural systems, but the treatments are not natural for any system in Europe of which I have worked in. For example a low of 2 deg C at night and 7 deg C during the day through end of May and then 7/13 deg C in July is extreme. I think these methods need to be clearly laid out for the reader so they can judge what to make of the experiment before they see the results.

      I also think the control is confounded with growth chamber experience in Experiment 1. That is, the control plants never experience any time in a chamber, but all the treatments include significant time in a chamber. The authors mention how detrimental chamber time can be to saplings (indeed, they mention an aphid problem in experiment 2) so I think they need to be more upfront about this. The study is still very valuable, but -- again -- we may need to be more cautious in how much we infer from the results.

      Also, I suggest the authors add a figure to explain their experiments as they are very hard to follow. Perhaps this could be added to Figure 1?

      Finally, given how much the authors extrapolate to carbon and forests, I would have liked to see some metrics related to carbon assimilation, versus just information on timing.

      Fagus sylvatica: Fagus sylvatica is an extremely important tree to European forests, but it also has outlier responses to photoperiod and other cues (and leafs out very late) so using just this species to then state 'our results likely are generalisable across temperate tree species' seems questionable at best.

      Measuring end of season (EOS): It's well known that different parts of plants shut down at different times and each metric of end of season -- budset, end of radial expansion, leaf coloring etc. -- relate to different things. Thus I was surprised that the authors ignore all this complexity and seem to equate leaf coloring with budset (which can happen MONTHS before leaf coloring often) and with other metrics. The paper needs a much better connection to the physiology of end of season and a better explanation for the focus on budset. Relatedly, I was surprised the authors cite almost none of the literature on budset, which generally suggests is it is heavily controlled by photoperiod and population-level differences in photoperiod cues, meaning results may different with a different population of plants.

      Somewhat minor comments:<br /> (1) How can a bud type -- which is apical or lateral -- be a random effect? The model needs to try to estimate a variance for each random effect so doing this for n=2 is quite odd to me. I think the authors should also report the results with bud type as fixed, or report the bud types separately.<br /> (2) I didn't fully see how the authors results support the Solstice as Switch hypothesis, since what timing mattered seemed to depend on the timing of treatment and was not clearly related to solstice. Could it be that these results suggest the Solstice as Switch hypothesis is actually not well supported (e.g., line 135) and instead suggest that the pattern of climate in the summer months affects end of season timing?

    1. Reviewer #1 (Public review):

      Naim et al. use genetically engineered mouse models and tissue culture cell lines to investigate the role of the SLAP adaptor protein in colonic epithelium and colon tumour formation. The SLAP adaptor protein is known to be a negative regulator of tyrosine kinase signaling in hematopoietic cells, but its role outside the immune system is less well defined. Here, the authors use genetically engineered SLAP-deficient mice, tissue-specific SLAP KO, and colonic organoids to demonstrate that SLAP is expressed in cells of the colonic epithelium, where it acts as a cell-autonomous regulator of proliferation and differentiation. In addition, they provide biochemical evidence that loss of SLAP expression in cultured colonic organoids results in increased Src family kinase activity and global tyrosine phosphorylation, consistent with its known role as a suppressor of tyrosine kinase activity in immune cells. Consistently, treatment with an SRC kinase inhibitor inhibited the growth of SLAP-deficient organoids. These data provide solid evidence of a cell-autonomous role of SLAP in the colonic epithelium.

      This work would be improved by further description and interpretation of the SLAP expression pattern shown in the constitutive and tissue-specific KO to further support the conclusions made. In Supplementary Figure 1, magnification of the colon epithelium areas with SLAP expression shown by b-gal and anti-SLAP staining, highlighting regions of interest, would better support the conclusions regarding SLAP expression in specific regions of the colon epithelium. In Supplementary Figure 1B, the authors should indicate that the SLAP staining referred to is epithelial and in resident immune cells, as is mentioned in the text. Also, magnification of the boxed area of LRG5 staining in Figure 1 would improve this figure.

      Using a chemically induced model of colitis-associated cancer, the authors demonstrate that inactivation of SLAP shows a trend toward increased tumor formation (though this did not reach significance) as well as increased Src family kinase activity within tumors. Tumor spheres from SLAP-deficient animals showed enhanced growth that was suppressed by treatment with a Src family kinase inhibitor. Of note, the latter effect was specific to SLAP-deficient tumor spheres. These observations are convincing and support the authors' conclusion that SLAP has a tumor suppressor role in CRC through inhibition of SFK signaling.

      Mechanistically, elevated expression of the RTK, EphB2, was detected in immunoblots of SLAP KO colonic crypts, while overexpression of SLAP in CRC cell lines downregulated EphB2 protein levels. Using an EPHB2 inhibitor, the role of EPHB2 in the growth of SLAP-deficient colonic organoids was demonstrated. While these data generally support the authors' conclusion that SLAP limits colonic organoid growth by downregulating RTKS such as EphB2 and downstream Src family kinase activity, they do not show which cell types/regions in the colonic epithelium have increased EPHB2 protein and how this relates to SLAP and phospho-SRC expression, as shown in Figure 1 and Figure S1 immunocytochemistry. The expression of EphB2 and its role in colonic tumorsphere growth were not investigated.

      Overall, this work provides evidence of SLAP adaptor function in restricting tyrosine kinase signaling in the colonic epithelium, and suggests that loss of SLAP expression could promote tumorigenesis in this context.

    2. Reviewer #2 (Public review):

      Summary:

      Protein tyrosine kinases are subject to diverse regulatory mechanisms controlling their activity in normal situations. The authors previously identified SLAP (Src-like adaptor protein), a negative regulator of receptor tyrosine kinase (RTK) signaling, as a key suppressor of the cytoplasmic tyrosine kinase SRC in the normal colon and demonstrated that SLAP is downregulated in a majority of colorectal cancers (CRCs).

      In this study, the authors further explored SLAP functions in mouse models using constitutive and inducible epithelial-specific Slap deletion (villin-CreERT2 model). They found that loss of SLAP augments colonic epithelial cell proliferation and that induction of tumorigenesis by the AOM/DSS protocol mimicking CRC leads to more aggressive tumors in the absence of SLAP. This effect is apparently cell-autonomous as growth of normal and tumoral colonic organoids is SLAP-dependent in in vitro settings. Finally, the authors define that, in colon, SLAP represses EphB2, an RTK lying upstream of SRC, and show that inhibitors of EphB2 can partially limit tumorigenic development in vitro.

      Strengths:

      The manuscript is clearly and concisely written, making it easy to follow. The data obtained in the mouse models are very convincing.

      Weaknesses:

      Direct evidence that EphB2 is activated/phosphorylated in the absence of SLAP is lacking, as conclusions are only based on results obtained with inhibitors. Some other issues have to be addressed before acceptance, in particular, the relevance of the findings in CRC patients.

    1. Reviewer #1 (Public Review):

      [Editors' note: This version has been assessed by the Reviewing Editor without further input from the original reviewers. Given the time elapsed since the original data collection, the authors have addressed the previous concerns by providing a more nuanced discussion of their results and acknowledging the limitations of the study to ensure the conclusions are supported by the existing data.]

      Throughout the paper, the authors do a fantastic job of highlighting caveats in their approach, from image acquisition to analysis. Despite this, some conclusions and viewpoints portrayed in this study do not appear well-supported by the provided data. Furthermore, there are a few technical points regarding the analysis that should be addressed.

      (1) Analysis of signaling traces

      - Relevance of "modeled signaling level": It is not clear whether this added complexity and potential for error (below) provides benefits over a more simple analysis such as taking the derivative (shown in Figure 3C). Could the authors provide evidence for the benefits? For example, does the "maximal response" given a simpler metric correlate less well with cell fate than that calculated from the fitted response?

      - Assumptions for "modeled signaling level": According to equation (1) Kaede levels are monotonically increasing. This is assumed given the stability of the fluorescent protein. However, this only holds for the "totally produced Kaede/fluorescence". Other metrics such as mean fluorescence can very well decrease over time due to growth and division. Does "intensity" mean total fluorescence? Visual inspection of the traces shown in Figure 2 suggests that "fluorescence intensity" can decrease. What does this mean for the inferred traces?

      - Estimation of Kaede reporter half-live: It is not clear how the mRNA stability of Kaede is estimated. It sounds like it was just assessed visually, which seems not entirely appropriate given the quantitative aspects of the rest of the study. Also, given that Shh signaling was inhibited on the level of Smoothened, it is not obvious how the dynamics of signaling shutdown affect the estimate. Most results in Figure 7 seem to be quite robust to the estimate of the half-live. That they are, might suggest that the whole analysis is unnecessary in the first place. However, not all are. Thus, it would be important to make this estimate more quantitative.

      (2) Assignment of fates and correlations

      - Error estimate for cell-type assignment: Trying to correlate signaling traces to cell fate decisions requires accurate cell fate assignment post-tracking. The provided protocol suggests a rather manual, expert-directed process of making those decisions. Can the authors provide any error-bound on those decisions, for example comparing the results obtained by two experts or something comparable? I am particularly concerned about the results regarding the higher degree of variability in the correlation between signaling dynamics and cell fate in the posterior neural tube. Here, the expression of Olig2 does not seem to segregate between different assigned fates, while it does so nicely in the anterior neural tube. This would suggest to me that cells in the posterior neural tube might not yet be fully committed to a fate or that there could be a relatively high error rate in assigning fates. Thus, the results could emerge from technical errors or differences in pure timing. Could the authors please comment on these possibilities?

      - Clustering and fates: One approach the authors use to analyze the correlation between signaling and fate is clustering of cell traces and comparison of the fate distributions in those clusters. There is a large number of clusters with only single traces, suggesting that the data (number of traces) might not be sufficient for this analysis. Furthermore, I am skeptical about clustering cells of different anterior-posterior identities together, given potential differences in the timing of signal reception and signaling. I am not convinced that this analysis reveals enough about how signaling maps to fate given the heterogeneity in traces in large clusters and the prevalence of extremely small clusters.

      - Signaling vector and hand-picked metrics: As an alternative approach, that might be better suited for their data, the authors then pick three metrics (based on their model-predicted signaling dynamics) and show that the maximal response is a very good predictor of fate for different anterior-posterior identities. Previous information-theoretic analysis of signaling dynamics has found that a whole time-vector of signaling can carry much more information than individual metrics (Selimkhanov et al, 2014, PMID: 25504722). Have the authors tried to use approaches that make use of the whole trace (such as simple classifiers (Granados et al, 2018, PMID: 29784812), or can comment on why this is not feasible for their data? The authors should at least make clear that their results present a lower bound to how accurately cells can make cell-fate decisions based on signaling dynamics.

      (3) Consequences of signaling heterogeneity

      The authors focus heavily on portraying that signaling dynamics are highly variable, which seems visually true at first glance. However, there is no metric used or a description given of what this actually means. Mainly, the variability seems to relate to the correlation between signaling and fate. However, given the data and analysis, I would argue that the decoding of signaling dynamics into fate is surprisingly accurate. So signaling dynamics that seem quite noisy and variable by visual inspection can actually be very well discriminated by cells, which to me appears very exciting.

      Indeed, simple features of signaling traces can predict cell fate as well as position (for anterior progenitors). Given that signaling should be a function of position, it naively seems as if signaling read-out could be almost perfect. It might be interesting to plot dorsal-ventral position vs the signaling metrics, to also investigate how Shh concentration/position maps to signaling dynamics, this would give an even more comprehensive view of signal transmission.

      There remains the discrepancy between signaling traces and fate in the posterior neural tube. The authors point towards differences in tissue architecture and difficulties in interpreting a "small" Shh gradient. However, the data seems consistent with differences in timing of cell-fate decisions between anterior and posterior cells. The authors show that fate does initially not correlate well with position in the posterior neural tube. So, signaling dynamics should likely also not, as they should rather be a function of position, given they are downstream of the Shh gradient. As mentioned above, not even Olig2 expression does segregate the assigned fates well. All this points towards a difference in the time of fate assignment between the anterior and posterior. Given likely delays in reporter protein production and maturation, it can thus not be expected that signaling dynamics correlate better with cell fate than the reporter "83%". Can the authors please discuss this possibility in the paper?

      Thus, while this paper represents an example of what the community needs to do to gain a better understanding of robust patterning under variability, the provided data is not always sufficient to make clear conclusions regarding the functional consequences of signaling dynamics.

    2. Reviewer #2 (Public Review):

      Summary:

      In this work, Xiong and colleagues examine the relationship between the profile of the morphogen Shh and the resulting cell fate decisions in the zebrafish neural tube. For this, the authors combine high-resolution live imaging of an established Shh reporter with reporter lines for the different progenitor types arising in the forming neural tube. One of the key observations in this manuscript is that, while, on average, cells respond to differences in Shh activity to adopt distinct progenitor fates, at the single cell level there is strong heterogeneity between Shh response and fate choices. Further, the authors showed that this heterogeneity was particularly prominent for the pMN fate, with similar Shh response dynamics to those observed in neighboring LFP progenitors.

      Strengths:

      It is important to directly correlate Shh activity with the downstream TFs marking distinct progenitor types in vivo and with single cell resolution. This additional analysis is in line with previous observations from these authors, namely in Xiong, 2013. Further, the authors show that cells in different anterior-posterior positions within the neural tube show distinct levels of heterogeneity in their response to Shh, which is a very interesting observation and merits further investigation.

      Weaknesses:

      This is a convincing work, however, adding a few more analyses and clarifications would, in my view, strengthen the key finding of heterogeneity between Shh response and the resulting cell fate choices.

    1. Reviewer #2 (Public review):

      Summary:

      Almansour et al., investigate whether the proximity of TAD boundaries is directly linked to gene activity. The authors use high-throughput imaging to simultaneously measure the gene activity and physical distances between boundary regions in an allele-specific manner. Using transcriptional inhibitors, expression induction, and acute depletion of CTCF and cohesin, they test whether proximity of boundaries affects, or is affected by, gene activity.

      Strengths:

      The combined use of DNA and RNA imaging enabled simultaneous measurement of boundary proximity and transcriptional status at individual alleles. This allows single-allele correlation between boundary proximity and gene activity at multiple loci across thousands of alleles.

      The use of both transcription inhibitors and transcription stimulation provides compelling and consistent evidence that boundary proximity can be disconnected from a gene's activity. The data convincingly support the conclusion that stable proximity between boundary regions is not required for ongoing transcription at the loci and timescales examined.

      This work strengthens the emerging view that genome organization at the level of domain boundaries does not impose a deterministic control over transcription.

      Strong disruption of boundary distances is only observed upon depletion of cohesin. Notably, this corresponds with the largest changes in gene activity. In contrast, depletion of CTCF actually had minimal impact on boundary distances and also had minimal impact on gene activity. This makes sense in light of previous work, where live cell imaging demonstrated that cohesin is more important for domain-structure, whereas CTCF is only important for blocking cohesin from continuing on, such that the fully formed loop occurs in a very small percentage of cells. Therefore, the fact that disruption of cohesin (more important for internal domain structure) affects gene activity while disruption of CTCF does not is exceptionally interesting.

      Weaknesses:

      In untreated cells, the distribution of distance measurements between boundary probes is exceptionally narrow. While depletion of RAD21 clearly demonstrates an ability to detect changes in this distribution, this tight baseline distribution may limit sensitivity to more subtle changes (like those one might expect from transcriptional influences).

      This approach primarily tests the role of boundary interactions rather than domain organization as a whole.

    2. Reviewer #3 (Public review):

      Summary:

      This study addresses a central question in genome organization: whether the positions of chromosomal domain boundaries are functionally coupled to gene activity. The authors use high-throughput imaging to simultaneously measure distances between boundary markers and nascent RNA production in thousands of individual cells, enabling direct comparison of boundary positions and transcriptional status at single chromosomal copies. This approach is applied across multiple loci, genes, and cell types, and is combined with acute transcriptional perturbations and depletion of architectural proteins to test the relationship between chromosome structure and gene activity in both directions.<br /> This work makes a meaningful contribution by providing direct, single-cell evidence that domain boundary positions and gene activity are largely uncoupled in this system.

      Strengths:

      A major strength of the work is its single-cell, single-allele resolution, which overcomes the averaging inherent to population-based assays. The authors consistently find that boundary proximity is largely independent of transcriptional status: active and inactive alleles have similar boundary distances, transcriptional perturbations do not shift boundary distributions, and depletion of the boundary factor CTCF does not alter gene expression, whereas cohesin depletion affects both boundary organization and transcription. These conclusions are supported by large numbers of alleles, multiple loci and cell types, and internal controls that distinguish boundary-specific effects from broader chromatin influences. The study offers a robust, scalable imaging pipeline that will be valuable for future studies linking genome organization and transcription at single-cell resolution.

      Weaknesses:

      The study has important limitations that are acknowledged by the authors. Measurements are restricted to distances between flanking boundaries and do not capture internal domain architecture, sub-domain structure, or finer-scale regulatory contacts. Resolution is limited by probe size and imaging, potentially masking subtle positional changes, and only a small set of loci is examined, leaving open how broadly the uncoupling generalizes. Some perturbation effects, particularly for RAD21, may involve mechanisms beyond boundary disruption.

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Rosero and Bai examined how the well-known thermosensory neuron in C. elegans, AFD, regulates context-dependent locomotory behavior based on the tactile experience. Here they show that AFD uses discrete cGMP signalling molecules and independent of its dendritic sensory endings regulates this locomotory behavior. The authors also show here that AFD's connection to one of the hub interneurons, AIB, through gap junction/electrical synapses, is necessary and sufficient for the regulation of this context-dependent locomotion modulation.

      Strengths:

      This is an interesting paper showcasing how a sensory neuron in C. elegans can employ a distinct set of molecular strategies and different physical parts to regulate a completely distinct set of behaviors, which were not been shown to be regulated by AFD before. The experiments were well performed and the results are clear. However, there are some questions about the mechanism of this regulation. This reviewer thinks that the authors should address these concerns before the final published version of this manuscript.

      Comments on revisions:

      In this revised manuscript, Rosero and Bai satisfactorily addressed all the concerns raised by this reviewer regarding their original manuscript. This reviewer appreciates the authors' effort. This revised and improved manuscript demonstrates that a sensory neuron in C. elegans can utilize distinct molecular strategies and circuit engagements to regulate distinct sets of behaviors. This reviewer believes that the manuscript is suitable for final acceptance in eLife.

    2. Reviewer #2 (Public review):

      The goal of the study was to uncover the mechanisms mediating tactile-context-dependent locomotion modulation in C. elegans, which represents an interesting model of behavioral plasticity. Starting from a candidate genetic screen focusing on guanylate cyclase (GCY) mutants, the authors identified the AFD-specific gcy-18 gene as essential for tactile-context-dependent locomotion modulation. AFD has been primarily characterized as a thermosensory neuron. However, key thermosensory transduction genes and the sensory ending structure of AFD were shown here to be dispensable for tactile-context locomotion modulation. AFD actuates tactile-context locomotion modulation via the cell-autonomous actions of GCY-18 and the CNG-3 cyclic nucleotide-gated channel, and via AFD's connection with AIB interneurons through electrical synapses. At the circuit level, AIB also receive inputs from the mechanosensory neuron FLP, which was also shown to be relevant for tactile-context-dependent locomotion modulation.

      For this study, the authors combined a very clever microfluidic-based behavioral assay with a large set of genetic manipulations to dissect the molecular and cellular pathways involved. Rescue experiments with single-copy transgenes are particularly convincing. The study is very clearly written, and the figures are nicely illustrated with diagrams that effectively convey the authors' interpretation. Overall, the convergence of behavioral assays, genetics, and circuit analysis provides convincing support for the proposed role of the AFD-AIB connection, potentially downstream of FLP via synapic and of other mechanosensory neurons via extra-synaptic communication.

      The facts that AFD mediates tactile-context locomotion modulation, that this role relies on GCY-18, and on electrical synapses linking AFD to AIB are new, somewhat unexpected, and interesting. The study raises intriguing and addressable questions about the role of innexin-based cellular communication in a multimodal sensory-behavior microcircuit, including the direction and nature of the signal(s) transmitted through these electrical synapses. These questions remain difficult to address in most experimental systems. The compact and genetically tractable nervous system of C. elegans provides a powerful entry point for addressing them in the context of an intact in vivo circuit.

    3. Reviewer #3 (Public review):

      Summary:

      Rosero and Bai report an unconventional role of AFD neurons in mediating tactile-dependent locomotion modulation, independent of their well-established thermosensory function. They partially elucidate the signaling mechanisms underlying this AFD-dependent behavioral modulation. The regulation does not require the sensory dendritic endings of AFD but rather the AFD neurons themselves. This process involves a distinct set of cGMP signaling proteins and CNG channel subunits separate from those involved in thermosensation or thermotaxis. Furthermore, the authors demonstrate that AIB interneurons connect AFD to mechanosensory circuits through electrical synapses. They conclude that, beyond its primary function in thermosensation, AFD contributes to context-dependent neuroplasticity and behavioral modulation via broader circuit connectivity.

      While the discovery of multifunctionality in AFD is not entirely unexpected, given the limited number of neurons in C. elegans (302 in total), the molecular and cellular mechanisms underlying this AFD-dependent behavioral modulation, as revealed in this study, provide valuable insights into the field.

      Strengths:

      (1) The authors uncover a novel role of AFD neurons in mediating tactile-dependent locomotion modulation, distinct from their well-established thermosensory function, providing an important conceptual contribution to our understanding of how individual neurons can support multiple, mechanistically separable behavioral functions.

      (2) They provide meaningful mechanistic insight into how AFD, GCY-18-dependent cGMP signaling, and AFD-AIB electrical coupling contribute to this AFD-dependent behavioral modulation.

      (3) The neural behavior assays utilizing two types of microfluidic chambers (uniform and binary chambers) are innovative and well-designed. In the revised manuscript the authors introduce a removable-barrier assay that physically separates exploration and assay phases. This independent behavioral approach addresses prior concerns about ongoing sensory input and confirms that tactile experience alone is sufficient to modulate locomotion.

      (4) By comparing AFD's role in locomotion modulation to its thermosensory function throughout the study, the authors present strong evidence supporting these as two independent functions of AFD.

      (5) The finding that AFD contributes to context-dependent behavioral modulation is significant, further reinforcing the growing evidence that individual neurons can serve multiple functions through broader circuit connectivity.

      Weaknesses:

      While the requirement for AFD, GCY-18, and AFD-AIB electrical coupling is well supported, the directionality of information flow and the precise mode of interaction between mechanosensory neurons, AIB, and AFD remain unclear and an area of future studies.

      Overall, the authors successfully achieve their primary aim of identifying and characterizing a novel role for AFD in tactile experience-dependent locomotion modulation. This work contributes meaningfully to the growing body of literature demonstrating multifunctionality and context-dependent reconfiguration of individual neurons within compact nervous systems.

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Lei and co-workers aim to uncover the genetic underpinnings of thermal adaptation across three strains of the diamondback moth (Plutella xylostella) through experimental evolution over three years under three different thermal regimes. They identify systematic differences in trait responses (e.g., survival, fecundity), metabolic profiles, gene expression, and in the amino acid sequence of the PxSODC gene, among others. These results suggest that the diamondback moth has a strong potential for rapid physiological adaptation to different thermal regimes. Overall, this is a comprehensive and generally well-executed study that addresses an important question in the face of ongoing climate change.

      Strengths:

      The authors employ multiple approaches to identify signatures of thermal adaptation across the three strains, such as trait performance comparisons, metabolomics, transcriptomics, and amino acid sequence comparisons. All these different angles form a convincing picture of the underlying factors that underpin thermal adaptation in this experimental system. The manuscript is also generally well written and easy to understand.

      Weaknesses:

      I am unable to judge the validity of all aspects of this work; I will focus only on areas within my core expertise.

      (1) The authors identify pathways that are enriched in different strain comparisons (Figure 3E), but do not provide a detailed interpretation of these results. It would be great if the authors could explain in more detail how the physiological processes of a cold-adapted strain of this species may differ from those of a warmer-adapted strain.

      (2) The authors reconstruct a phylogenetic tree of the PxSODC gene using the neighbor-joining algorithm. The limitations of this algorithm have been known for many years now, especially for sequences separated by long evolutionary distances. According to Wang et al. (2016), the last common ancestor of the species shown in Figure S4C occurred 392-350 million years ago. Given this, I would strongly recommend that the authors infer a phylogenetic tree using model-based methods, such as those implemented in RAxML-NG or IQ-TREE. Also, in the absence of a valid outgroup sequence, I would show the gene tree as unrooted or rooted based on the corresponding species tree.

      (3) There is a key piece of the puzzle that is currently missing: the structural mechanism behind the mutational effects described in this study (e.g., Figure 5). The authors could leverage AlphaFold to generate structural models of different mutants and conduct molecular dynamics simulations to examine their conformational dynamics.

      References:

      Wang, Yh., Engel, M., Rafael, J. et al. Fossil record of stem groups employed in evaluating the chronogram of insects (Arthropoda: Hexapoda). Sci Rep 6, 38939 (2016). https://doi.org/10.1038/srep38939

    2. Reviewer #2 (Public review):

      Summary:

      In this paper, the authors set out to better understand the genetic mechanisms underlying thermal adaptation in insects. They experimentally evolved diamondback moth (Plutella xylostella) populations - a pest species with a wide distribution - under both hot (12h:12h 32{degree sign}C/27{degree sign}C) and cold (15{degree sign}C/10{degree sign}C) thermal conditions, and conducted phenotypic assays and metabolic and transcriptomic profiling to analyze how populations changed to deal with this thermal stress compared to the nonevolved ancestral population (constant 26{degree sign}C). Phenotypic assays showed that evolved hot populations had increased survival at high temperatures (42-43{degree sign}C) while evolved cold populations had lower freezing points compared to the ancestral population. When measured at the constant 26{degree sign}C conditions, metabolic and transcriptomic profiles of 3rd instar larvae from the evolved population were distinctive from the ancestral population, with a set of overlapping metabolic and transcriptomic pathways that were significantly differentially expressed in both hot and cold evolved populations compared to the ancestral. The authors narrowed down this set of candidate genes further by focusing on genes with high expression levels overall, whose expression profile was correlated with differentially expressed metabolites, and that contained mutants in both hot and cold strains. From this set, they chose the PxSODC gene for further functional validation, as it has previously been shown to be involved in the response of insects to abiotic stress with its antioxidative role in cellular defense. At the constant 26{degree sign}C, this gene showed lower expression across development in evolved strains compared to the ancestral population, while it showed similar expression patterns under thermal stress. Knockdown of PxSODC resulted in decreased survival rates at high temperatures and higher freezing points compared to the ancestral population. Based on this validation, the authors hypothesize that the non-synonymous mutation in the PxSODC gene that they found in the cold and hot evolved populations might alter the conformation of the PxSODC protein, increasing enzyme capacity. Their experimental evolution experiment furthermore indicates the capacity of the pest species, the diamondback moth, to adapt to a wide range of temperatures, providing insights into its capacity for global dispersal.

      Strengths:

      (1) The authors did a tremendous amount of work to characterize the mechanisms underlying thermal adaptation in the diamondback moth, artificially selecting populations for three years in the lab and characterizing how they evolved as a result at different biological levels: from phenotypes in different life stages, to larval metabolites and gene transcription, to functionally validating how one of the resulting gene candidates influences the capacity to deal with thermal stress.

      (2) The paper identifies and provides further evidence for candidate genetic mechanisms that might be particularly important for thermal adaptation in insects, including lipid metabolism, oxidoreductase activity, and DNA methylation. It is furthermore interesting that the authors found similar mechanisms to be involved in both the adaptation to cold and hot environments. Their functional validation of some of the genes involved in these mechanisms is very useful to understand how these genes might be causally involved in insect thermal adaptation.

      (3) The paper also has applied value: the diamondback moth is a pest species with a wide distribution, so understanding its adaptive capacity to different thermal environments is important for predicting the prevalence and potential further range expansion of this species under future climate change.

      Weaknesses:

      (1) The paper in its current form is hard to digest and would benefit from improved clarification of the storyline, as well as a tighter integration between the phenotypic, omics, and functional validation data. Currently, it is not always clear what the relevance is of all the reported results, nor why certain decisions were made, or how all the different methods the authors used fit together. For example, the authors functionally validated a second gene, PxDnmt1, but it is unclear why this particular gene was chosen, nor how it relates to their selection regimes when looking at the results obtained with the phenotyping and omics data collection. Seeing how much work the authors did, this makes the paper overwhelming and difficult to read.

      (2) The authors at times stretch their results too far, as the ecological relevance of their study design and results is not clear, limiting the generalizability and value of the results for understanding species' adaptive potential under climate change. For example, the selection regimes used present the minimum and maximum known temperatures at which the species can survive and develop, but it is unclear how the temperatures relate to the natural environment of the source population, to what extent wild populations might experience these temperatures, and whether they would experience them at the extended duration used (12h at max/min temperature). Moreover, I wonder whether the comparisons made would identify the genes that matter under natural conditions, as unevolved populations were kept under constant conditions compared to 12h:12h temperature regimes for the evolved populations, and the metabolic and transcriptomic profiling was done under a constant favorable 26{degree sign}C rather than under thermal stress in a, as far as I can tell, randomly chosen life stage (larval stage).

      (3) The paper in its current form does not adequately describe the statistical analyses underlying the results, nor do the authors share their code, making it very hard to judge whether the analyses used are appropriate and the results trustworthy. I have concerns about the inappropriate use of t-tests, the lack of correcting for confounding variables, and the need for multiple testing corrections.

    1. Reviewer #1 (Public review):

      Summary:

      When contemplating the role of any sensory area in the brain, an essential question is: How much of the neural code is inherited from the inputs versus constructed de novo by the local circuitry? This study tackles that important question for the case of the mouse superior colliculus (SC), a visual brain area that receives direct input from the retina. The specific aspects of the neural code are the representation of line orientation and direction of motion in the visual image. Over the past 10 years or so, there have been reports that the preferred directions and orientations of neurons vary systematically across the SC in a global map that is not present in the retina, and therefore computed locally.

      Here, the authors revisit this question by expanding the range of measurements: They record from the axonal boutons of retinal ganglion cells in the input layer of the SC, from the post-synaptic neurons there, and from neurons in deeper layers of the SC. They conclude that at any given location in the SC, the signals in retinal boutons recapitulate the tuning of retinal ganglion cells, and that SC neurons follow that organization, though it is lost in the deeper layers. Notably, they find no evidence for a global map of these response properties other than what is contributed by retinal input.

      Strengths:

      The study combines multiple recording methods - calcium imaging and electrical recording - to capture the activity of retinal inputs to the colliculus, the tuning of neurons in the superficial layers close to the input, as well as neurons in deeper layers. Furthermore, the work connects to the recent literature on gradients of tuning properties among retinal ganglion cells. All these set the stage for testing the correspondence between retinal inputs and collicular outputs.

      Weaknesses:

      The methods used to identify direction-selective and orientation-selective neurons based on visual responses are overly permissive and don't match common usage in this research area. Furthermore, the measurements covered only a small fraction of the visual field, which limits their ability to distinguish between different hypotheses for the global map of visual response properties. Relatedly, the claim that retinal input patterns explain much of the layout in the superior colliculus should be made more quantitative.

    2. Reviewer #2 (Public review):

      In this study, the authors investigate the spatial organization of direction and orientation selectivity in the mouse superior colliculus (SC) and its retinal inputs. By combining two-photon imaging of retinal boutons and SC neurons with Neuropixels recordings, they assess whether tuning preferences form structured maps or are arranged in a salt-and-pepper fashion. They further compare SC tuning organization to previously described retinal geometric models to determine the extent to which collicular responses inherit retinal topography. The authors conclude that SC inherits a cardinally biased topographic scaffold from the retina, which progressively weakens with depth, and that strong global maps are absent.

      A major strength of the study is the impressive combination of methodologies, including imaging of retinal boutons, imaging of SC neurons, and large-scale electrophysiological recordings across SC depth. The direct comparison to retinal geometric models is particularly valuable, as it situates the SC within a broader framework of retinotopic information transfer and advances our understanding of how retinal computations are transformed in downstream targets.

      A limitation of the study, however, is that the imaging experiments sample only a relatively small and spatially homogeneous region of the visual field, whereas the electrophysiological recordings cover a different portion of SC. This separation makes it difficult to form a fully integrated, global picture of the spatial organization of direction and orientation selectivity across the entire collicular map.

    3. Reviewer #3 (Public review):

      Summary:

      The authors studied the organisation of orientation and direction-selective retinal ganglion cells' boutons in the mouse superior colliculus. They confirmed the results already published (Molotkov, 2023; de Malmazet, 2024; Vita, 2024; Laniado, 2025), that retinal ganglion cells' boutons in the superior colliculus conserve the retinal organisation. Thereby, orientation and direction preferences of retinal boutons at each collicular location reflect the tuning of retinal ganglion cells found at the corresponding retinal location, that is covering a matching receptive field location.

      The authors also studied the organization of orientation and direction-selective neurons in the superior colliculus. They report a lack of functional organisation in the superior colliculus for neurons preferring the same stimulus orientation or direction of movement. This goes against several published reports (Ahmadlou and Heimel, 2015; Liang et al., 2023; De Malmazet et al., 2018; Feinberg and Meister, 2014; Kasai and Isa, 2021; Li et al., 2020) but echoes a study from Chen et al. (Chen, 2021). The latter authors contested the strength of the anatomical clustering of tuned alike direction-selective neurons. They found, however, that in about a quarter of their recordings, direction-selective cells with similar preferred directions did cluster anatomically in the superior colliculus.

      Here, the authors of the current manuscript under review report that local clustering of tuning was weak in all neural populations and confined to very small spatial scales (10-20 μm). This is one order of magnitude smaller than previously reported clusters of around 100-300μm wide. Therefore, the authors conclude that orientation and direction tuning in the mouse superior colliculus follows a salt and pepper organisation.

      Strengths & Weaknesses:

      Although the authors performed a solid analysis contesting the functional clustering of direction and orientation selective neurons, there seemed to be some elements in their data in favour of a functional clustering of neurons.

      As an illustration, the authors plotted in Figure 1Q the distribution of preferred orientations from all their recorded orientation-selective cells. The curve shows a clear bias, indicating that neurons preferring horizontal orientations were found two times more often than neurons encoding any other orientations. Moreover, the authors recorded all their neurons from a defined anatomical location of the colliculus, marked by the dotted rectangle in Figure 3A-C. Therefore, this suggests that orientation-selective cells in this particular collicular location are biased towards preferring horizontal orientations. This supports an anatomical clustering of tuned alike orientation-selective cells in the superior colliculus.

      Similarly, Figure 1P shows a bias in the preferred directions of direction-selective neurons in the same recording area. Cells tended to respond more to upward and forward-moving stimuli. The bias is more modest than the one described above for preferred orientations. However, it still seems significant. For example, cells preferring upwards movements appeared to be four times more abundant than cells preferring downward movements. As a consequence, it indicates that preferred directions might not be uniformly distributed and equally represented across the superior colliculus.

      These anatomical biases are also visible in the receptive field analysis of the paper. In Figure 3G, the authors plotted the distribution of preferred orientations for every 10-degree bins within the recorded field of view. Out of 26 bins containing more than one neuron, only 6 seemed to include cells not overwhelmingly preferring a single orientation. These were located towards the top right of the figure. Therefore, over almost 80% of the recorded superior colliculus, the data seem in agreement with the view that orientation-selective cells tend to prefer the same orientation at a given receptive location.

      The patch analysis in Figures 4G and H also seems to show some degree of coherence in the preferred orientation and direction of neighbouring tuned collicular cells. In both Figures 4 G and H, clear patches of similar preferred orientation and direction appeared to emerge. For example, in Figure 4H, there is a predominance of horizontally tuned patches. This was expected given the recording bias consisting of a majority of horizontally tuned cells. In addition, vertical and 45-degree patches are also visible, in blue and red, respectively. These patches overlap with the corresponding retinotopic locations in Figure 3G, where the histograms show that cells tend to prefer the same orientations, horizontal, vertical or 45 degrees.

      It is important to note that in the previous studies on functional clustering of orientation and direction, variability in the tuning of cells within clusters was always reported (Ahmadlou and Heimel, 2015; Chen et al., 2021; De Malmazet et al., 2018; Feinberg and Meister, 2014; Kasai and Isa, 2021; Li et al., 2020). This was more marked for direction-selective cells than for orientation-selective cells. In general, cells preferring all four cardinal directions were often recorded at any given collicular location. Similarly, orientation-selective cells could be found to prefer deviant orientations compared to adjacent cells. Therefore, it is not surprising to see locally mixed tuning in collicular neurons. However, what appeared significant in these studies was the overall proportion of cells with similar tuning in patches of the superior colliculus. As described above, this also seems to be the case in the data of this manuscript.

      To conclude, it seems that authors tend to overlook the sources of agreement between their data and previous reports showing functional clustering of cells in the superior colliculus. Instead, the authors tend to emphasise the dissimilarities and variability to put forward a contentious view on the organisation of orientation and direction selectivity in neurons of the superior colliculus. This, I fear, is detrimental to the field because it creates a sort of manufactured chaos that produces unnecessary confusion for readers who do not attentively read the manuscript. It would be valuable for the authors to consider rewriting the manuscript, acknowledging where their data, in fact, support some level of functional clustering.

    1. Reviewer #1 (Public review):

      Summary:

      This work builds a theory to implement planning trajectories towards a goal in a known environment, inspired by analyses of prefrontal neural recordings. Unlike standard neural architectures for this task, such as value-based learning and successor representations, their proposed theory is able to adapt to novel goal locations within a trial. The key to the theory is that future times are represented by orthogonal groups of neurons. The recurrent connectivity between groups of neurons selective to specific future times and locations reflects the learned knowledge of the task. Finally, the authors show that standard networks trained on the task approximate their proposed theory.

      Strengths:

      The structure of the work is clear, and the presentation of the results is very well written, which is particularly noticeable given the consequential amount of results presented. The authors are able to link their theory with experimental findings in neural recordings. The reverse-engineering of trained recurrent neural networks is very thorough, by analyzing both dynamics and connectivity. The assumptions and predictions of their model are clearly stated.

      Weaknesses:

      It is unclear whether their proposed theory, "space-time attractors", actually is an attractor network. The authors used recurrent neural networks with very few timesteps, and long single neuron time constants with respect to the task time scales. Attractor networks, as the ones the authors cite, refer to networks that generate nontrivial patterns of activity through recurrent interactions, after long periods of time.

      The authors gloss over how the reward inputs are calculated. Computing these reward inputs should be part of the planning process, and the authors are implicitly leaving this problem aside. How does the reward input, which includes future time and location, depend on the actions that have not yet been taken by the agent? It feels like most of the planning computation is already provided by these reward inputs at the beginning of the trial. It could be that the network is only learning to process the planned sequence of actions present in the inputs.

    2. Reviewer #2 (Public review):

      This well-written manuscript proposes to use attractors in space and time (STA) as a mechanistic explanation for planning in the prefrontal cortex. The main conceptual hypothesis is that planning is implemented as attractor dynamics in a representation that encodes states at each time step jointly. Depending on inputs, the network relaxes to a trajectory that already contains future states that will be visited at each time step, rather than computing a scalar value at each point in time and space like other classical approaches from RL. The authors compare this approach to implementations such as TD learning and successor representation, and further show that trained recurrent neural networks on specific tasks involving planning develop structured subspaces resembling the ones postulated in STA.

      The idea of treating attracting trajectories unfolding in time as the computational substrate for planning is very interesting and potentially important. The explicit construction of a state x time representational space and its implementation via recurrent dynamics are appealing and convincing in the idealized tasks considered. I found the manuscript to be refreshingly explicit regarding several of the assumptions and limitations of the models, for example, the fact that certain advantages can be viewed as properties of the state space itself and not necessarily of a fundamentally new planning mechanism.

      Overall, the manuscript presents a cool attractor model that extends in time and explores its performance in a subset of illustrative tasks involving planning. My doubts concern mostly the interpretation and scope of the claims made in the manuscript. Here are a few comments where I detail my questions/concerns:

      (1) The authors nicely discuss that much of the difference between STA and classical TD or SR agents is "in some sense a property of the state space rather than the decision making algorithm," and that TD and SR could in principle be implemented in a comparable space x time representation. This is fair, but it also suggests that the central contribution of the manuscript lies primarily in the representational factorization (state x time tiling) and its dynamical implementation via attractors, rather than in a fundamentally new planning algorithm or theory, mechanistic or not. I think theory should be distinguished from mechanism, and it would therefore help the reader to describe the conceptual advancement more as a novel mechanism or implementation than a novel (mechanistic) theory for decision/planning.

      (2) Related to my previous point, I think it would be helpful to position STA more explicitly relative to computational/theoretical literature in which attractor networks encode temporally ordered patterns (so effectively including future times). For example, classical extensions of Hopfield networks with asymmetric connectivity implement retrieval of sequences and ordered transitions between patterns (Sompolinsky & Kanter, 1986). More recently, sequential attractors and limit-cycle dynamics have been constructed in structured recurrent networks by the Morrison group (Parmelee et al., 2021). These works do not implement an explicit discretized state x future-time tiling as in STA and do not specifically discuss the usage for planning. However, they do provide concrete precedents for attractor dynamics over temporally structured trajectories in terms of mechanism. It would be useful to discuss this literature and clarify a little what's new mechanistically in the view of the authors.

      (3) A central claim of the manuscript is that space-time trajectories are attractors of the STA dynamics. The manuscript does provide empirical evidence consistent with attractor-like behavior. However, it is not explicitly shown whether trajectory representations persist in the absence of sustained external inputs. So it's not clear to me whether the trajectories should be interpreted as intrinsic attractors of the recurrent system, which can be selected by delivering transient inputs, or whether they must be stabilized by a specific continuous external drive. It would be useful if the author could clarify/discuss this point.

      As far as I understand it, reward information is provided as input to specific populations encoding future time steps, and that's essential for rapid adaptation without rewiring connectivity. How such future-time-specific reward inputs would be generated and routed to distinct neural populations isn't entirely clear to me. Since this seems to be an essential component of the model, I think it would be important to discuss more deeply the source and plausibility of these reward signals related to different timesteps.

      (4) The authors note that vanilla STA scales linearly with planning horizon, and discuss potentially hierarchical extensions for longer horizons. They acknowledge that learning abstractions remains an open challenge, yet the examples of planning in the manuscript are restricted to very short temporal horizons and limited branching complexity. It is not obvious to me in what cases the current implementation and interpretation of STA remains viable (for example, in terms of relaxation iterations) as the horizon and branching factor increase. Relatively simple planning can be managed by simpler, less costly models/algorithms, whereas complex planning is a lot harder to deal with, and it's something that a mechanistic "theory" should address. In the context of the claims of the paper in its present form, I think this is possibly the most important conceptual and practical limitation in the manuscript.

      (5) The RNN analyses show that trained networks develop structured subspaces aligned with future time indices and exhibit perturbation behavior consistent with attractor-like dynamics. The manuscript also explicitly notes differences between the trained RNN and the handcrafted STA (e.g., long-range couplings between subspaces and differences in behavior of lower-value trajectories under perturbation), which I much appreciated. My doubt is on the specificity of this result, as trained RNNs on fixed-horizon tasks can develop latent dimensions correlated with temporal progress within a trial or time-to-goal. I think it would help the reader to clarify whether the results demonstrate that STA-like computations emerge in RNNs trained on planning tasks, or that RNNs generally develop some kind of structured spacetime representations when tasks involve future timesteps and some degree of flexibility in the decisions.

      A few more minor points, mainly concerning clarity:

      (1) The main dynamical equation combines a log-domain recurrent term, a floor operation, and a log-sum-exp normalization step, followed by exponentiation. The intuition/logic behind this specific formulation could be clarified for the reader. For example tt would be helpful to explain why the recurrent input appears inside a log, and also whether/how these operations relate to any multiplicative constraint.

      (2) While the computational cost of successor representation in an expanded NT x NT representation is discussed, the corresponding scaling of STA in terms of number of units and connections (as a function, for example, of the planning horizon) isn't clear to me. Perhaps the authors could compare costs more explicitly.

      (3) In the RNN analyses, structured subspaces aligned with future time indices are shown. I couldn't find a quantification of how much variance is captured by the subspaces, relative to other latent dimensions. Adding it would help get a feeling for the strength of the alignment.

    1. Reviewer #1 (Public review):

      Summary:

      This work aims to identify the transcription factor responsible for targeting constitutively active genes for repression during heat stress. While the mechanisms underlying heat-stress-induced gene activation are well characterized - primarily involving Heat Shock Factor (HSF), the GA-binding factor GAF, and RNA Polymerase II pausing regulators - far less is known about how repression of constitutive genes is directed. Because known activation factors such as HSF and GAF do not account for repression, the authors sought a DNA-binding factor that could selectively target these genes. They focused on CLAMP (Chromatin-linked adaptor for MSL complex proteins) for several reasons. First, CLAMP recognizes GA-rich DNA motifs similar to those bound by GAF, suggesting it could compete with GAF at regulatory elements and shift transcriptional outcomes. Second, CLAMP has been shown to antagonize GAF binding in certain genomic contexts, suggesting it could counteract activation mechanisms. Third, CLAMP interacts with Negative Elongation Factor (NELF), a factor known to regulate transcriptional repression during heat stress. Finally, CLAMP promotes long-range chromatin interactions, indicating it may influence local chromatin architecture during the heat-stress response. Together, these properties led the authors to hypothesize that CLAMP helps mediate heat-stress-induced transcriptional repression of constitutively active genes.

      To test this hypothesis, the authors use immunofluorescence along with three techniques: (1) nascent RNA-sequencing (SLAM-seq) to define the function of CLAMP in heat stress-induced transcriptional activation and repression; (2) Micro-C to identify CLAMP-dependent and independent genome-wide, high-resolution local changes in chromatin organization after heat stress, and (3) HiChIP to identify CLAMP-bound 3D chromatin loop anchors associated with heat-stress-dependent transcriptional regulation.

      Analysis of heat-shocked salivary glands or KC cells showed results that aligned across all experiments, indicating that CLAMP is the primary repressor of gene activation upon heat shock. CLAMP also inhibits chromatin loop formation.

      Strengths:

      The techniques used here are comprehensive, and impressively, the data is unambiguous.

      Weaknesses:

      These techniques do not reveal the molecular mechanisms, but the authors provide a strong rationale and molecular hypotheses for future studies to dissect.

    2. Reviewer #2 (Public review):

      In this manuscript, Aguilera et al. investigate the mechanisms underlying transcriptional repression of constitutively expressed genes during heat stress. While the activation of heat-shock genes has been extensively studied, the mechanisms responsible for widespread transcriptional repression remain poorly understood. The authors propose that the GA-binding transcription factor CLAMP acts as a major regulator of heat-stress-induced transcriptional repression in Drosophila. Using nascent RNA-sequencing approaches, they report that CLAMP contributes to the repression of a large fraction of genes whose transcription decreases upon heat stress. In addition, the authors generate high-resolution Micro-C datasets to analyze changes in chromatin architecture during heat stress and report widespread alterations in chromatin looping associated with transcriptional changes. Based on these results, the study proposes that CLAMP regulates repression through both direct transcriptional mechanisms and modulation of local 3D genome architecture.

      The study addresses an important question in gene regulation: how transcription is rapidly repressed during environmental stress. The work is timely because most previous studies have focused on transcriptional activation of heat-shock genes, whereas repression mechanisms remain comparatively less understood. The integration of transcriptional profiling with high-resolution chromatin conformation data is a major strength of the manuscript and provides a valuable resource for the community studying genome organization and stress responses.

      The nascent RNA-sequencing experiments appear carefully designed and allow the authors to capture rapid transcriptional responses to heat stress. These data provide convincing evidence that heat stress leads to widespread transcriptional repression of constitutive genes and that CLAMP contributes substantially to this process. The genomic analyses linking CLAMP binding to repressed genes are also informative and support the idea that CLAMP plays a direct regulatory role at many loci.

      Another strength of the study is the generation of Micro-C datasets under heat stress conditions. These datasets provide a high-resolution view of chromatin architecture and reveal dynamic changes in local chromatin looping associated with transcriptional responses. The authors' analysis suggests that heat stress induces widespread reorganization of chromatin contacts, and that CLAMP may contribute to these structural changes. This dataset is likely to be useful for future studies exploring how environmental cues influence genome organization.

      Despite these strengths, several aspects of the study would benefit from further clarification. First, the mechanism by which CLAMP mediates transcriptional repression remains insufficiently defined. While the data support a role for CLAMP in the repression of a subset of genes during heat stress, the molecular basis of this effect is not fully explored. Second, although the Micro-C dataset represents a valuable resource for studying chromatin architecture during heat stress, the functional interpretation of the observed structural changes could be further developed. In particular, it would be helpful to better establish the relationship between the identified chromatin loops and gene regulation, and to clarify whether these structural changes play a causal role in transcriptional repression or instead reflect broader chromatin reorganization associated with the stress response.

    3. Reviewer #3 (Public review):

      Summary:

      Exposure to heat shock results in major changes to gene expression programs within the cell, and over the past decades, there has been extensive characterization of the mechanisms through which heat shock activates transcription. However, heat shock also leads to widespread repression of many genes, and the transcriptional mechanisms that mediate this repression have not been well understood. Here, the authors show that the transcription factor CLAMP mediates this heat shock-dependent repression via changes in local 3D chromatin looping. Intriguingly, CLAMP is already bound to chromatin prior to heat shock, but is necessary for the loss of local chromatin loops at its bound sites and repression of genes located within the loops. This study is significant because it defines chromatin looping, depending on a key transcription factor CLAMP, as the major mechanism through which genome-wide changes in gene repression occur in response to an inducible stimulus, heat shock.

      Strengths:

      The use of the SLAM-seq and Micro-C techniques to measure the necessity of CLAMP for heat shock-dependent transcription repression and local chromatin looping is excellent, and these approaches provide valuable insight into the role of CLAMP in heat shock-dependent repression that was not apparent with older approaches. The HiChIP approach provides an excellent method to test whether CLAMP is bound at sites where there are changes in looping upon heat shock, providing good support for their conclusions that CLAMP induces heat shock repression by decreasing loops. Appropriate controls are present, and there is robust statistical analysis of the bioinformatics data.

      Weaknesses:

      The study does not provide insight into how CLAMP mechanistically affects loops upon heat shock, although the discussion raises the possibility that this could result from biophysical changes since CLAMP is an intrinsically disordered protein.

    1. Reviewer #1 (Public review):

      The paper from Hudait and Voth details a number of coarse-grained simulations as well as some experiments focused on the stability of HIV capsids in the presence of the drug lenacapavir. The authors find that LEN hyperstabilizes the capsid, making it fragile and prone to breaking inside the nuclear pore complex.

      I found the paper interesting. I have a few suggestions for clarification and/or improvement.

      (1) How directly comparable are the NPC-capsid and capsid-only simulations? A major result rests on the conclusion that the kinetics of rupture are faster inside the NPC, but are the numbers of LENs bound identical? Is the time really comparable, given that the simulations have different starting points? I'm not really doubting the result, but I think it could be made more rigorous/quantitative.

      (2) Related to the above, it is stated on page 12 that, based on the estimated free-energy barrier, pentamer dissociation should occur in ~10 us of CG time. But certainly, the simulations cover at least this length of time?

      (3) At first, I was surprised that even in a CG simulation, LEN would spontaneously bind to the correct site. But if I read the SI correctly, LEN was parameterized specifically to bind to hexamers and not pentamers. This is fine, but I think it's worth describing in the main text.

      Comments on revisions:

      I found that the authors addressed my concerns satisfactorily. The other reviewer raised a number of important points regarding the nuances of the model and the interpretation of the simulations, which the authors rebutted. I think the paper in its current form now is a worthwhile addition to the literature.

    2. Reviewer #3 (Public review):

      I have carefully reviewed the manuscript, the two referee reports, and the authors' detailed responses. I appreciate the substantial effort the authors have invested in addressing the reviewers' comments, and I also recognize the strength and ambition of the work. This is a technically sophisticated study that integrates coarse-grained modeling with live-cell imaging to address an important and timely question regarding HIV-1 capsid inhibition by lenacapavir.

      Embedded within Reviewer #2's report are several substantive points that warrant careful consideration, particularly with respect to framing, terminology, and engagement with the broader literature. I view my role here is to distinguish those issues from claims that I do not find to be supported.

      First, I do not agree with Reviewer #2's central assertion that the manuscript lacks novelty or fails to present meaningful new findings. While individual elements of the system studied here-capsid docking at the NPC, lenacapavir-induced capsid hyperstabilization, capsid rupture, and competition with FG- nucleoporins-have been observed previously, this work provides a coherent, mechanistic account of how these elements are coupled. In particular, the proposed sequence linking LEN-induced lattice hyperstabilization, preferential pentamer loss at the narrow end, NPC-induced mechanical stress, and failure of nuclear import represents a nontrivial integration that goes beyond prior phenomenological observations. I therefore do not view this work as redundant with existing literature.

      That said, Reviewer #2 is correct to note that the manuscript would benefit from broader and more explicit engagement with recent independent studies, including computational and hybrid modeling efforts that address capsid mechanics, nuclear entry, and LEN effects using different frameworks. While the authors' bottom-up coarse-grained approach is clearly distinct and, in many respects, more systematically derived, eLife readers would benefit from a clearer discussion of how the present results relate to, complement, or differ from these other approaches. I strongly encourage the authors to add a short discussion paragraph situating their work within this broader context, without disparaging alternative models.

      Second, I find that some mechanistic claims in the manuscript would benefit from more careful language distinguishing model-conditioned interpretation from de novo prediction. This applies in particular to discussions of LEN binding heterogeneity and stoichiometry, as well as to conclusions drawn from biased enhanced-sampling simulations. While I agree with the authors that parameterization does not invalidate mechanistic insight, it is important to be precise about what aspects of the behavior emerge from the simulations versus what is constrained by prior experimental knowledge. Modest tightening/revising of language (e.g., "suggests," "is consistent with," "within the model") would address this concern without weakening the scientific conclusions.

      Third, Reviewer #2 raises a legitimate semantic issue regarding the use of the term "elasticity." The manuscript infers changes in capsid mechanical response using heterogeneous elastic network models, which quantify effective stiffness and deformability rather than elasticity in the macroscopic materials sense. I recommend that the authors clarify this definition explicitly in the text to avoid confusion and unnecessary debate.

      Finally, I note that several of Reviewer #2's objections-particularly those asserting circular reasoning, misuse of enhanced sampling methods, or invalidity of coarse-grained predictions-reflect a misunderstanding of contemporary bottom-up coarse-grained modeling rather than genuine methodological flaws. I do not believe these points require further rebuttal or revision beyond what the authors have already provided.

      In summary, in my view, the manuscript represents a solid contribution to the field, provided that the authors undertake a limited set of targeted revisions aimed at improving framing, clarity, and engagement with the broader literature. Addressing these points will strengthen the manuscript and ensure that its contributions are clearly and fairly communicated to the community.

    1. Reviewer #2 (Public review):

      This article focuses on the study of two E. coli tripartite efflux pumps, both using TolC as a partner in the outer membrane, namely MacAB-TolC and AcrABZ-TolC.

      By preparing MacAB-TolC in Peptidiscs rather than in detergent for cryo-EM structure determination, they visualized an extra protein localized around TolC. The resolution was sufficient to build part of the structure, and using the AlphaFold2 database and DALI topology recognition program, they identified it as the lipoprotein YbjP. This protein has an anchorage in the outer membrane, and it was suggested that it could act as a support for TolC, which is the only OMF that does not have an N-terminal extension anchored in the outer membrane, which is very puzzling for the community working in this field of research.

      Authors used a large number of different approaches to evaluate the importance of YbjP (structure, genomic evolution, microbiology, photocrosslink in vivo, proteomic profile), but did not succeed in finding it a clear role so far, even if it could be important depending on environmental stress. Nevertheless, their results, obtained with extreme rigour, are of main interest for the comprehension of the complexity of such systems and deserve publication.

      Comments on revisions:

      Thank you for clarifying the points that puzzled me concerning the crosslink experiments. This version does not need further modifications.

    1. Reviewer #1 (Public review):

      This study established a C921Y OGT-ID mouse model, systematically demonstrating in mammals the pathological link between O-GlcNAc metabolic imbalance and neurodevelopmental disorders (cortical malformation, microcephaly) as well as behavioral abnormalities (hyperactivity, impulsivity, learning/memory deficits). Researchers comprehensively assessed the model phenotype through integrated multi-level analysis methods, including long-term behavioral monitoring, high-resolution brain structural imaging (micro-CT and MRI), histopathology, and quantitative proteomics.

      The core strength of this study lies in its multimodal experimental design. The evidence chain spanning in vivo behavior, brain structure, and molecular characteristics demonstrates high consistency and correlation. Of particular note is the combination of non-invasive behavioral tracking with quantitative neuroimaging techniques, providing objective validation for the observed phenotypes. The findings support the authors' core conclusion: O-GlcNAc homeostasis imbalance correlates with neurodevelopmental deficits, including structural abnormalities in specific brain regions and altered cognitive behaviors. Furthermore, this model reproduces certain clinical features observed in human patients.

      Nevertheless, several avenues remain open for further exploration. For instance, sample sizes in certain omics analyses remain relatively small, and investigations into downstream molecular mechanisms are still confined to the level of correlation-direct causal validation through genetic or pharmacological interventions is still required. Furthermore, as this model focuses on a single recurrent mutation, the generalizability of its findings to other OGT-ID variants remains to be verified.

      It provides the first actionable vertebrate model for neurodevelopmental disorders with unclear mechanisms, filling a critical gap in this field. The multidimensional research methods established in the paper-such as the digital behavioral phenotyping workflow-also offer valuable references for related disease studies.

    2. Reviewer #2 (Public review):

      Summary:

      The authors are trying to understand why certain mutants of O-GlcNAc transferase (OGT) appear to cause developmental disorders in humans. As an important step towards that goal, the authors generated a mouse model with one of these mutations that disrupts OGT activity. They then go on to test these mice for behavioral differences, finding that the mutant mice exhibit some signs of hyperactivity and differences in learning and memory. They then examine alterations to the structure of the brain and skull, and again find changes in the mutant mice that have been associated with developmental disorders. Finally, they identify proteins that are up or down regulated between the two mice as potential mechanisms to explain the observations.

      Strengths:

      The major strength of this manuscript is the creation of this mouse model, as a key step in beginning to understand how OGT mutants cause developmental disorders. This line will prove important for not only the authors but other investigators as well, enabling the testing of various hypotheses and potentially treatments. The experiments are also rigorously performed and the conclusions are well supported by the data.

      Weaknesses:

      The only weakness is a lack of mechanistic insight. However, this certainly may come in the future through more targeted experimentation using this mouse model. I do not recommend that these experiments need to be performed in this manuscript.

      Comments on revisions:

      The authors have addressed all of my suggestions proactively.

    1. Reviewer #1 (Public review):

      Summary:

      In this study, Besson et al. investigate how environmental nutrient signals regulate chromosome biology through the TORC1 signaling pathway in Schizosaccharomyces pombe. Specifically, the authors explore the impact of TORC1 on cohesin function-a protein complex essential for chromosome segregation and transcriptional regulation. Through a combination of genetic screens, biochemical analysis, phospho-proteomics, and transcriptional profiling, they uncover a functional and physical interaction between TORC1 and cohesin. The data suggest that reduced TORC1 activity enhances cohesin binding to chromosomes and improves chromosome segregation, with implications for stress-responsive gene expression, especially in subtelomeric regions.

      Strengths:

      This work presents a compelling link between nutrient sensing and chromosome regulation. The major strength of the study lies in its comprehensive and multi-disciplinary approach. The authors integrate genetic suppression screens, live-cell imaging, chromatin immunoprecipitation, co-immunoprecipitation, and mass spectrometry to uncover the functional connection between TORC1 signaling and cohesin. The use of phospho-mutant alleles of cohesin subunits and their loader provides mechanistic insight into the regulatory role of phosphorylation. The addition of transcriptomic analysis further strengthens the biological relevance of the findings and places them in a broader physiological context. Altogether, the dataset convincingly supports the authors' main conclusions and opens up new avenues of investigation.

      Points that remain open but are appropriately discussed by the authors:

      (1) The authors propose that nutrient status influences cohesin regulation. While this is not directly tested under defined nutrient conditions (e.g., by systematically examining cohesin dynamics or phosphorylation across nutrient states), the rationale is well explained in the text, and the study provides a strong foundation for addressing this question in future work.

      (2) The upstream signaling cascade downstream of TORC1 remains to be fully elucidated. In particular, the identity of the relevant kinases (e.g., whether Sck1/Sck2 or other effectors are involved) and whether TORC1 directly phosphorylates Mis4 or Psm1 are not resolved. The authors acknowledge these mechanistic gaps, which represent logical next steps rather than shortcomings of the current study.

    2. Reviewer #2 (Public review):

      Summary:

      In this study the authors follow up on a previous suppressor screen of a temperature-sensitive allele of mis4 (mis4-G1487D), the cohesin loading factor in S. pombe, and identify additional suppressor alleles tied to the S. pombe TORC1 complex. Their analysis suggests that these suppressor mutations attenuate TORC1 activity while enhanced TORC1 activity is deleterious in this context. Suppression of TORC1 activity also ameliorates chromosome segregation and spindle defects observed in the mis4-G1487D strain, although some more subtle effects are not reconstituted. The authors provide evidence that this genetic suppression is also tied to the reconstitution of cohesin loading. Moreover, disrupting TORC1 also enhances Mis4/cohesin association with chromatin (likely reflecting enhanced loading) in WT cells while rapamycin treatment can enhance the robustness of chromosome transmission. These effects likely arise directly through TORC1 or its downstream effector kinases as TORC1 co-purifies with Mis4 and Rad21; these factors are also phosphorylated in a TORC1-dependent fashion. Disrupting Sck2, a kinase downstream of TORC1, also suppresses the mis4-G1487D allele while simultaneous disruption of Sck1 and Sck2 enhances cohesin association with chromatin, albeit with differing effects on phosphorylation of Mis4 and Psm1/Scm1. Phosphomutants of Mis4 and Psm1 that mimic observed phosphorylation states identified by mass spectrometry that are TORC1-dependent also suppressed phenotypes observed in the mis4-G1487D background. Lastly, the authors provide evidence that the mis4-G1487D background and TORC1 mutant backgrounds display an overlap in the dysregulation of genes that respond to environmental conditions.

      Overall, the authors provide compelling evidence from genetics, biochemistry and cell biology to support a previously unknown mechanism by which nutrient sensing regulates cohesin loading with implications for the stress response. The technical approaches are generally sound, well-controlled, and comprehensive.

      The specific points that I raised in the first review have been addressed by changes/additions to the manuscript or have been determined to be beyond the scope of the study by the authors.

      One major question that remains open is the relationship between local changes in cohesin loading and gene expression through this TORC1 regulatory signaling pathway and the details of the underlying mechanisms.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript presents high resolution cryoEM structures of VPS34-complex II bound to Rab5A at 3.2A resolution. The Williams group previously reported the structure of VPS34 complex II bound to Rab5A on liposomes using tomography, and therefore the previous structure, although very informative, was at lower resolution.

      The first new structure they present is of the 'REIE>AAAA' mutant complex bound to RAB5A. The structure resembles the previously determined one except an additional molecule of RAB5A was observed bound to the complex in a new position, interacting with the solenoid of VPS15.

      Although this second binding site exhibited reduced occupancy of RAB5A in the structure, the authors determined an additional structure in which the primary binding site was mutated to prevent RAB5A binding ('REIE>ERIR'). In this structure, there is no RAB5A bound to the primary binding site on VPS34, but the RAB5A bound to VPS15 now has strong density. The authors note that the way in which RAB5A interacts with each site is distinct, though both interfaces involve the switch regions. The authors confirm the location of this additional binding site using HDX-MS.

      The authors then determine multiple structures of the wild-type complex bound to RAB5A from a single sample, as they use 3D classifications to separate out versions of the complex bound to 0, 1, or 2 copies of RAB5A. Overall the structure of VPS34-Complex II does not change between the different states, and the data indicate that both RAB5A binding sites can be occupied at the same time.

      The authors then design a new mutant form of the complex (SHMIT>DDMIE) that is expected to disrupt the interaction at the secondary site between VPS15 and RAB5A. This mutation had a minor impact on the Kd for RAB5A binding, but when combined with the REIE>ERIR mutation of the primary binding site, RAB5A binding to the complex was abolished.

      Comparison of sequences across species indicated that the RAB5A binding site on VPS15 was conserved in yeast while the RAB5A binding site on VPS34 is not.

      The authors tested the impact of a correspond yeast Vps15 mutation (SHLITY>DDLIEY) predicted to disrupt interaction with yeast Rab5/Vps21, and found this mutant Vps15 protein was mislocalized and caused defective CPY processing.

      The authors then compare these structures of the RAB5A-class II complex to recently published structures from the Hurley group of the RAB1A-class I complex, and find that in both complexes the Rab protein is bound to the VPS34 binding site in a somewhat similar manner. However, a key difference is the position of VPS34 is slightly different in the two complexes because of the unique ATL14L and UVRAG subunits in the class I and class II complexes, respectively. This difference creates a different RAB binding pocket that explains the difference in RAB specificity between the two complexes.

      Finally, the higher resolution structures enable the authors to now model portions of BECLIN1 and UVRAG that were not previously modeled in the cryoET structure.

      Strengths:

      Overall I found this to be an interesting and comprehensive study of the structural basis for interaction of RAB5A with VPS34-complex II. The authors have performed experiments to validate their structural interpretations, and they present a clear and thorough comparative analysis of the Rab binding sites in the two different VPS34 complexes. The result is a much better understanding of how two different Rab GTPases specifically recruit two different, but highly similar complexes to the membrane surface.

      Weaknesses:

      No significant weaknesses noted.

    2. Reviewer #2 (Public review):

      The work by Spokaite et al describes the discovery of a novel Rab5 binding site present in complex II of class III PI3K using a combination of HDX and Cryo EM. Extensive mutational and sequence analysis define this as the primordial Rab5 interface. The data presented are convincing that this is indeed a biologically relevant interface, and is important in defining mechanistically how vps34 complexes are regulated.

      This paper is a very nice expansion of their previous cryo-ET work from 2021, and is an excellent companion piece on high resolution cryo-EM of the complex I class III complex bound to Rab1 from the Hurley lab in 2025. Overall, this work is of excellent technical quality, and answers important unexplained observations on some unexpected mutational analysis from the previous work.

      They used their increased affinity vps34 mutant to determine the 3.2 ang structure of Rab5 bound to vps34-CII. Clear density was seen for the original Rab5 interface, but an additional site was observed. Based on this structure they mutated out the vps34 interface, allowing for a high resolution structure of the Rab5 bound at the Vps15 interface.

      They extensively validated the vps15 interface in the yeast variant of vps34, showing that the Vp215-Rab5 (Vps21) interface identified is critical in controlling complex II vps34 recruitment.

      The major strengths of this paper are that the experiments appear to be done carefully and rigorously and I have very few experimental suggestions.

      Here is what I recommend based on some very minor weaknesses I observed

      (1) My main concern has to do a little bit with presentation. My main issue is how the authors use mutant description. They clearly indicate the mutant sequence in the human isoform (for example see Fig 2A, Vps15 described as 579-SHMIT-583>DDMIE), however, when they shift to the yeast version they shift to saying vps15 mutant, but don't define the mutant, Fig 2G). I would recommend they just include the same sequence numbering and WT to mutant replacement every time a new mutant (or species) is described. It is always easier to interpret what is being shown when the authors are jumping between species when the exact mutant is included. This is particularly important in this paper, where we are jumping between both different subunits and different species, so clear description in figure/figure legends makes it much easier to read for non-specialists.

      (2) The HDX data very clearly shows that Rab5 is likely able to bind at both sites, which back ups the cryo EM data nicely. I am slightly confused by some of the HDX statements described in the methods.

      (3) The authors state "Only statistically significant peptides showing a difference greater than 0.25 Da and greater than 5% for at least two timepoints were kept." This seems to be confusing why they required multiple timepoints, and before they also describe that they required a p value of less than 0.05. It might be clearer to state that significant differences required a 0.25 Da, 5%, and p value of <0.05 (n=3). Also what do they mean by kept? Does this mean that they only fully processed the peptides with differences.

      (4) They show peptide traces for a selection in the supplement, but it would be ideal to include the full set of HDX data as an excel file, including peptides with no differences as there is a lot of additional information (deuteration levels for everything) that would be useful to share, as recommended from the Masson et al 2019 recommendations paper. This may be attached but this reviewer could not see an example of it in the shared data dropbox folder.

      Comments on revisions:

      The authors have addressed all of my issues.

    3. Reviewer #3 (Public review):

      Summary:

      The manuscript of Spokaite et al. focuses on the Vps34 complex involved in PI3P production. This complex exists in two variants, one (class I) specific for autophagy, and a second one (class II) specific for the endocytic system. Both differ only in one subunit. The authors previously showed that the Vps34 complexes interact with Rab GTPases, Rab1 or Rab5 (for class II), and the identified site was found at Vps34. Now, the authors identify a conserved and overlooked Rab5 binding site in Vps15, which is required for the function of the Class II complex. In support of this, they show cryo-EM data with a second Rab5 bound to Vps15, identify the corresponding residues, and show by mutant analysis that impaired Rab5 binding also results in defects using yeast as a model system.

      Overall, this is a most complete study with little to criticize. The paper shows convincingly that the two Rab5 binding sites are required for Vps34 complex II function, with the Vps15 binding site being critical for endosomal localization. The structural data is very much complete. What I am missing are a few controls that show that the mutations in Vps15 do not affect autophagy. I also found the last paragraph of the results section a bit out of place, even though this is a nice observation that the N-terminal part of BECLIN has these domains. However, what does it add to the story?

      Comments on revisions:

      The authors answered all my questions. I have no further requests.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript from Jones and colleagues investigates a previously described phenomenon in which P. falciparum malaria parasites display increased trafficking of proteins displayed on the surface of infected RBCs as well as increased cytoadherence in response to febrile temperatures. While this parasite response was previously described, it was not uniformly accepted, and conflicting reports can be found in the literature. This variability likely arises due to differences in the methods employed and the degree of temperature increase that the parasites were exposed to. Here the authors are very careful to employ a temperature shift that likely reflects what is happening in infected humans and that they demonstrate is not detrimental to parasite viability or replication. In addition, they go on to investigate what steps in protein trafficking are affected by exposure to increased temperature and show that the effect is not specific to PfEMP1 but rather likely affects all transmembrane domain containing proteins that are trafficked to the RBC. They also detect increased rates of phosphorylation of trafficked proteins, consistent with overall increased protein export.

      Strengths:

      The authors used a relatively mild increase in temperature (39 degrees) that they demonstrate is not detrimental to parasite viability or replication. This enabled them to avoid potential complications of more severe heat shock that might have affected previously published studies. They employed a clever method of fractionation of RBCs infected with a var2csa-nanoluc fusion protein expressing parasite line to determine which step in the export pathway was likely accelerating in response to increased temperature. This enabled them to determine that export across the PVM is being affected. They also explored changes in phosphorylation of exported proteins and demonstrated that the effect is not limited to PfEMP1 but appears to affect numerous (or potentially all) exported transmembrane domain containing proteins.

      Impact and conclusions:

      The study shows that protein export, including PfEMP1 and PSAC, are accelerated in response to mild heat shock. This has implications for disease severity as well as our understanding of protein trafficking in these unique organisms. There is increasing interest in asymptomatic infections, which have been proposed to be a major reservoir for transmission and generally are not associated with fever. It will be interesting to consider whether reduced (or slower) trafficking of these proteins has a selective advantage for parasites in asymptomatic infections.

    2. Reviewer #2 (Public review):

      This manuscript describes experiments characterising how malaria parasites respond to physiologically relevant heat-shock conditions. The authors show, quite convincingly, that moderate heat-shock appears to increase cytoadherance, likely by increasing trafficking of surface proteins involved in this process.

      While generally of a high quality and including a lot of data, I have a few small questions and comments, mainly regarding data interpretation.

      (1) The authors use sorbitol lysis as a proxy for trafficking of PSAC components. This is a very roundabout way of doing things and does not, I think, really show what they claim. There could be a myriad of other reasons for this increased activity (indeed, the authors note potential PSAC activation under these conditions). One further reason could be a difference in the membrane stability following heat shock, which may affect sorbitol uptake, or the fragility of the erythrocytes to hypotonic shock. I really suggest that the authors stick to what they show (increased PSAC) without trying to use this as evidence for increased trafficking of a number of non-specified proteins that they cannot follow directly.

      (2) Supplementary Figure 6C/D: The KAHRP signal does not look like it should. In fact, it doesn't look like anything specific. The HSP70-X signal is also blurry and overexposed. These pictures cannot be used to justify the authors' statements about a lack of colocalisation in any way.

      (3) Figure 6: This experiment confuses me. The authors purport to fractionate proteins using differential lysis, but the proteins they detect are supposed to be transmembrane proteins and thus should always be found associated with the pellet, whether lysis is done using equinatoxin or saponin. Have they discovered a currently unknown trafficking pathway to tell us about? Whilst there is a lot of discussion about the trafficking pathways for TM proteins through the host cell, a number of studies have shown that these proteins are generally found in a membrane-bound state. The authors should elaborate, or choose an experiment that is capable of showing compartment-specific localisation of membrane-bound proteins (protease protection, for example).

      (4) The red blood cell contains, in addition to HSP70-X, a number of human HSPs (HSP70 and HSP90 are significant in this current case). As the name suggests, these proteins non-specifically shield exposed hydrophobic domains revealed upon partial protein unfolding following thermal insult. I would thus have expected to find significantly more enrichment following heat shock, but this is not the case. Is it possible that the physiological heat shock conditions used in this current study are not high enough to cause a real heat shock?

      Comments on Revision:

      Although in any study there are going to be residual weaknesses, this reviewer is happy to see that the authors have gone to lengths to address many of my main concerns, and also those of other reviewers.

    3. Reviewer #3 (Public review):

      Summary:

      In this paper it is established that high fever-like 39oC temperatures cause parasite infected red blood cells become stickier. It is thought that high temperatures might help the spleen to destroy parasite infected cells, so they become stickier to remain trapping in blood vessels, so they stop passing through the spleen.

      Strengths:

      The strength of this research is that it shows that fever-like temperatures can cause parasite infected red blood cells to stick to surfaces designed to mimic the walls of small blood vessels. In a natural infection this would cause parasite infected red blood cells to stop circulating through the spleen where the parasites would be destroyed by the immune system. It is thought that fevers could lead to infected red blood cells becoming stiffer and therefore more easily destroyed in the spleen. Parasites respond to fevers by making their red blood cells stickier, so they stop flowing around the body and into the spleen. The experiments here prove fever temperatures increase the export of Velcro-like sticky proteins onto the surface of the infected red blood cells and are very thorough and convincing.

      Weaknesses:

      Minor weaknesses in the original version have now been satisfactorily addressed with additional work which is very convincing.

    1. Reviewer #1 (Public review):

      Summary:

      It is well known that neurons in the medial prefrontal cortex (mPFC) are involved in higher cognitive functions such as executive planning, motivational processing and internal state mediated decision-making. These internal states often correlate with the emotional states of the brain. While several studies point to the role of mPFC in regulating behavior based on such emotional states, the diversity of information processing in its sub-populations remains a less explored territory. In this study, the authors try to address this gap by identifying and characterizing some of these sub-populations in mice using a combination of projection-specific imaging, function-based tagging of neurons, multiple behavioral assays and ex-vivo patch clamp recordings.

      Strengths:

      The authors targeted mPFC projections to the nucleus accumbens (NAc) and basolateral amygdala (BLA). Using the open field task (OFT), the authors identified four relevant behavioral states as well as neurons active while the animal was in the center region ("center-ON neurons"). By characterizing single unit activity and using dimensionality reduction, the authors show differentiated coding of behavioral events at both the projection and functional levels. They further substantiate this effect by showing higher sensitivity of mPFC-BLA center-ON neurons during time spent in the open arms of the elevated plus maze (EPM). The authors then pivoted to the three-chamber social interaction (SI) assay to show the different subsets of neurons encode preference of social stimulus over non-social. This reveals an interesting diversity in the function of these sub-populations on multiple levels. Lastly, the authors used the tube test as a manipulation of the anxiety state of mice and compared behavioral differences before/after in the OFT and social interaction tasks. This experiment revealed that "losers" of the tube test spend less time in the center of the open field while "winners" show a stronger preference for the familiar mouse over the object. Using patch-clamp experiments, the authors also found that "winners" exhibit stronger synaptic transmission in the mPFC-NAc projection while "losers" exhibit stronger synaptic transmission in the mPFC-BLA projection. Given the popularity of the tube test assay in rank determination, this provides useful insights into possible effects on anxiety levels and synaptic plasticity. Overall, the many experiments performed by the authors reveal interesting differences in mPFC neurons relative to their involvement in high or low anxiety behaviors, social preference and social rank.

      Weaknesses:

      The authors focused primarily on female mice limiting generalizability and leaving the readers with questions about the impact of sex differences on their results. The tube test is used as a manipulation of the "emotional state" in several of the experiments. While the authors show the changes to corticosterone levels as a consequence of win/loss in the tube test, stronger claims might be made with comparisons to other gold standard stressors such as forced social defeat or social isolation.

    2. Reviewer #2 (Public review):

      Summary:

      The goal of this proposal was to understand how two separate projection neurons from the medial prefrontal cortex, those innervating the basolateral amygdala (BLA) and nucleus accumbens (NAc), contribute to the encoding of emotional behaviors. The authors record the activity of these different neuron classes across three different behavioral environments. They propose that, although both populations are involved in emotional behavior, the two populations have diverging activity patterns in certain contexts. A subset of projections to the NAc appear particularly important for social behavior. They then attempt to link these changes to the emotional state of the animal and changes in synaptic connectivity.

      Strengths:

      The behavioral data builds on previous studies of these projection neurons supporting distinct roles in behavior and extend upon previous work by looking at the heterogeneity within different projection neurons across contexts, this is important to understand the "neural code" within the PFC that contributes to such behaviours and how it is relayed to other brain structures.

      Weaknesses:

      The diversity of neurons mediating these projections and their targeting within the BLA and NAc is not explored. These are not homogeneous structures and so one possibility is that some of the diversity within their findings may relate to targeting of different sub-structures within BLA or NAc or the diversity of projection neuron subtypes that mediate these pathways. This is an important future direction for this work but does not detract from the main finding as reported. The electrophysiological data in Figure 7 have significant experimental confounds that makes their interpretation challenging.

    1. Reviewer #1 (Public review):

      A well-designed and preregistered simulation study investigating whether replication-success metrics can be applied to assess animal-to-human translation. The study is comprehensive, uses realistic parameter settings, and provides valuable insights into how different metrics behave under varied conditions.

      Strengths:

      (1) Methodologically rigorous and transparently preregistered.

      (2) Comprehensive simulation design covering a wide range of plausible scenarios.

      (3) Clear description of metrics and decision rules.

      (4) Valuable contribution to understanding the limitations of applying replication metrics to translation questions.

      Weaknesses:

      (1) The conceptual distinction between replication and translation could be more clearly emphasized.

      (2) Interpretation of results is dense and can be challenging to follow without a clear and summarized.

      (3) Some simulation parameters (effect sizes, heterogeneity, and number of animal studies) require more substantial justification.

      (4) Practical recommendations could be more explicit to guide applied researchers.

    2. Reviewer #2 (Public review):

      Summary:

      The authors attempt to address the issue of high rates of translation failure from animal studies to humans in the literature, where promising results in animal studies fail when conducting human clinical trials. Using parameters from a previous meta-analysis on prenatal amino acid supplementation and the effects it has on maternal blood pressure, the authors assessed the performance of the metrics used and whether they can quantify translation success. Performing a simulation study, the authors compared nine translation success metrics and found that no one method was uniformly optimal. The authors list several limitations of the study, such as comparability of effect sizes between animal and human studies, different goals of animal studies versus human studies, and the focus of the study on one aspect (statistics of translation) is part of a broader, more complex decision-making process before proceeding to human trials. The authors recommend using multiple metrics in combination while taking into consideration their strengths and weaknesses to assess the translation of animal studies to human outcomes. The paper achieves the aim of providing a model with several metrics to evaluate translation success from animal studies to humans.

      Strengths:

      (1) Utilizing 9 different translation success metrics in combination provides strong flexibility in evaluating whether results in animal studies can translate to humans. This would allow researchers to evaluate translation success using multiple different metrics according to the context of the study.

      (2) The authors accommodated for the limited sample size in animal studies, which are typically underpowered, and also caution that special attention should be given to heterogeneity when interpreting translation results.

      (3) Overall, this approach has the potential to be applied to other biomedical studies, provided the limitations for each of the metrics are considered. It would provide a useful tool in assessing translation from animals to humans, in addition to other factors such as safety, pharmacokinetics, etc.

      Weaknesses:

      While the study has several strengths, there are some limitations.

      (1) Preclinical animal study sizes tend to be much smaller than human studies, which results in underpowered results. The authors adjusted for this by pooling animal study data. However, high heterogeneity in the animal studies can affect translation results.

      (2) The study focuses only on evaluating the statistical component of translation, which is only one aspect of the decision-making process to move on to human trials. The study does not take into account safety and toxicological profiles, pharmacokinetics, or genetics, which are important considerations that influence the overall effect in humans.

    3. Reviewer #3 (Public review):

      Summary:

      This paper focused on how to navigate the complex decision-making process of whether to go into human trials. This is a critical topic considering the well-documented challenges in replicating and translating findings. While these are two distinct topics (i.e., replication and translation), they are related, and the authors simulated many conditions to assess the utility of replication assessment metrics.

      Strengths:

      A major strength of the study is the detailed approach to identifying relevant conditions and metrics, and to providing rich results that outline the strengths and weaknesses of each metric. Any simulation study is challenged by trying to identify the most relevant variables of interest, and this study provided sound justification for its chosen variables of interest. While this study does not make a strong recommendation (which I see as a strength), it does provide a comprehensive overview of the various metrics and conditions that were investigated.

      Weaknesses:

      The weaknesses of the study are the limited focus on specific metrics, the assumptions, particularly in the limited number of human study variables, and the less-than-ideal approachable summary of findings for a non-technical audience.

      Conclusion:

      This paper provides a much-needed investigation and discussion of how decisions are made when assessing whether to go into human trials. This is an important topic that productively challenges the status quo, considering documented challenges in replication and translation in biomedical research.

    1. Reviewer #1 (Public review):

      Summary:

      T cells that recognize lipids - CD1c - are frequent in circulation; however, their role in infection is unclear. This study aims to understand how Mtb infection can shape the responses of CD1c-specific T cells. CD1c is expressed in MTB granuloma, but in lower amounts than in nearby inflamed tissue. Mtb infection downregulates the expression of CD1c on monocyte-derived DCs. Single-cell RNA sequencing revealed the cytotoxic program inherent to the lipid-CD1c-specific T cells. Using an in vitro APC system where CD1c expression remains intact upon Mtb infection, the authors suggest that these T cells react better to Mtb-infected than uninfected Cd1c-expressing APC and reduce Mtb burden in infected cells. Therefore, Cd1c downregulation could be an immune evasion strategy used by Mtb.

      Strengths:

      This study asks an important question. The single-cell transcription analysis suggests the inherent cytotoxic program of lipid-CD1c cells and provides insights into their phenotypic and potential functional profiles. Function experiments suggest that these autoreactive T cells can react to Mtb infection, adding to the paradigm of infection control by these non-conventional T cell populations.

      Weaknesses:

      The study lacks sufficient rigor; conclusions may be strengthened with the incorporation of more controls, and some deeper characterization of the THP1 system and the CD1c-specific T cells isolated from blood. Crucial conclusions are drawn from the cell mixing experiments involving the engineered THP-1 system and CD1c-lipid-specific T cells from blood. These cells need more in-depth characterization. The expression of MHC-I/II is clearly reduced in THP1-CD1c cells. However, it is important to ensure that it is completely abolished, since a residual expression can skew the result with activation of conventional T cells in the blood or low levels of conventional T cells that may be present in the CD1c-tetra/multimer sorted T cells. CD1c-tetra/multimer sorting should include more markers than used in this study.

      Figure 2: The immunohistochemistry appears to be shown only for one biopsy; it may be worth quantifying the immunohistochemistry of all five. The expression of CD1 molecules goes up during the differentiation of MoDC. And Mtb infection prevents or dampens the upregulation. Does Mtb infection downregulate the CD1 expression of mature DCs? Can the effect of Mtb on the expression of CD1a,b,c molecules be investigated using CD1c-expressing DCs from blood? What could be the reason THP-1 cells do not downregulate CD1 molecules upon Mtb infection, and how about the expression of CD1a and b?

      Figure 3: (F) What does the X-axis read for the no infection group? The value for MOI = 0 should be incorporated for the infected T cell group.

      Figure 4: In the lysis assay, THP1-CD1c cells (uninfected and infected) incubated alone should be incorporated.

      Figure 5: A quantitative brief on the single cell TCR sequencing - including how many T cells were sequenced and the frequency of different clone including EM1 and EM2 - should be shown.

    2. Reviewer #2 (Public review):

      Summary:

      The study by Milton et al titled "Human CD1c-autoreactive T cells recognise Mycobacterium tuberculosis-infected antigen-presenting cells and display cytotoxic effector programmes" characterises CD1c-restricted autoreactive T cells and their potential role in controlling Mtb infection. The authors develop a well-controlled system to assay for the functioning/activation of autoreactive T cells. They report the presence of CD1c-restricted autoreactive T cells in the circulating blood of healthy donors. They show that these T cells respond to CD1c and get activated even in the absence of any exogenous antigen. They next show that CD1c, along with CD1a and b, are typically downregulated on APCs during Mtb infection. These autoreactive T cells are cytotoxic, indicating they respond to Mtb treatment and/or to changes in the T cell ratio. The autoreactive T cells could effectively lyse Mtb-infected or PAMP-stimulated CD1c+APCs. Next, using TCR sequencing, they show that T cell responses were mediated by specific TCR clones with common sequence features. They show that these autoreactive T cells could curtail Mtb growth as measured by luminescence. Finally, using scRNAseq, they selectively identify the CD1c-reactive T cell pool and detect enrichment of typical effector memory CD4 and CD8 cells expressing cytolytic markers such as Granzyme, granulolysin, etc. The lung biopsy staining, along with the other data presented here, suggests that while CD1c-restricted T cells could have potential anti-bacterial roles, Mtb downregulation effectively shuts down this mechanism for TB control.

      Strengths:

      The study is designed well and has developed many exciting tools to generate specific information.

      Weaknesses:

      The study has weaknesses in two important parameters - novelty and relevance in controlling TB. Further, the results could be better presented and discussed to allow easy understanding of the experimental design. For example, at several places, UV-killed or live Mtb were used. What is the rationale behind that? Why use irradiated THP1-CD1c cells for activating T cells?

      While functional assays identified only CD4+ cells as CD1c-restricted, scRNAseq shows that both CD4+ and CD8+ cells exhibit this phenotype. Identifying the specific lipid antigen presented by CD1c could add greater value to the study.

      Since autoreactivity was independent of exogenous antigen, the cytotoxic activity should also be independent of exogeneous antigens? What additional signal a THP1-CD1c cells treated with UV-killed Mtb express that is absent from the untreated cells?

      The relative Mtb growth assay is confusing. CD1c cells with Mtb infection triggers massive lytic response, as shown in Figure 4. Under similar conditions, in Figure 6, the authors report a significant decline in Mtb growth in these cells. The problem is that with the kind of lytic response observed, a lot more Mtb could be present extracellularly and would evade killing. How do we reconcile the two observations?

    3. Reviewer #3 (Public review):

      Summary:

      Despite the rising global prevalence of TB, the role of non-classical T-cell pathways in host immunity remains unclear. The present study by Milton et al. is a novel contribution to the field of unconventional T-cell immunity in Mtb infection. The study addresses the role of CD1c-autoreactive T-cells and demonstrates that upon Mtb infection, these cells are significantly activated, resulting in increased cytokine production and cytotoxicity, and a reduction in the bacterial burden, specifically against Mtb-infected CD1c+ APCs (antigen-presenting cells). This defines their role as a plausible candidate for lipid-directed immunity in TB, complementary to MHC-restricted responses.

      Strengths:

      The manuscript is well written, and the novelty, impact, and limitations of this study are precisely highlighted by the authors.

      Weaknesses:

      The authors mention that they did not identify any specific lipids presented by CD1c on Mtb-infected APCs, making it unclear whether they are of host or bacterial origin. This leaves a gap in understanding why the response is enhanced in Mtb-infected cells, whether it is through altered self-presentation of lipids arising from Mtb-induced changes, infection-induced stress signals, or Mtb lipids, or through CD1c-dependent co-stimulation/infection signals. Direct lipid identification via lipidomics/MS of CD1c-bound lipids from Mtb-infected APCs would clarify whether the enhancement arises from altered self-lipids or subtle Mtb lipids.

    1. Reviewer #1 (Public review):

      The study by Lotonin et al. investigates correlates of protection against African swine fever virus (ASFV) infection. The study is based on a comprehensive work, including the measurement of immune parameters using complementary methodologies. An important aspect of the work is the temporal analysis of the immune events, allowing to capture the dynamics of the immune responses induced after infection. Also, the work compares responses induced in farm and SPF pigs, showing the later an enhanced capacity to induce a protective immunity. Overall, the results obtained are interesting and relevant for the field. The findings described in the study further validate work form previous studies (critical role of virus-specific T cell responses), and provide new evidence on the importance of a balanced innate immune response during the immunization process. This information increases our knowledge on basic ASF immunology, one of the important gaps in ASF research that needs to be addressed for a more rational design of effective vaccines. As discussed in the manuscript, the results provide targets which can be further validated in other models, such as immunization using live attenuated vaccines.

      Overall the conclusions of the work are well supported by the results, and most of the issues mentioned during the review have been properly addressed during the revision, improving the quality of the final manuscript. While some limitations remain, I consider that they do not invalidate the results obtained and are well discussed by the authors.

      The study is highly relevant for the field, representing a step forward in our understanding of ASF protective immunity, providing immune targets to be further explored in other models and during vaccine development.

    2. Reviewer #2 (Public review):

      Summary:

      In the current study the authors attempt to identify correlates of protection for improved outcomes following re-challenge with ASFV. An advantage is the study design which compares the responses to a vaccine like mild challenge and during a virulent challenge months later. It is a fairly thorough description of the immune status of animals in terms of T cell responses, antibody responses, cytokines and transcriptional responses and the methods appear largely standard. The comparison between SPF and farm animals is interesting and probably useful for the field in that it suggests that SPF conditions might not fully recapitulate immune protection in the real world. I thought some of the conclusions were over-stated and there are several locations where the data could be presented more clearly.

      Strengths:

      The study is fairly comprehensive in the depth of immune read-outs interrogated. The potential pathways are systematically explored. Comparison of farm animals and SPF animals gives insights into how baseline immune function can differ based on hygiene, which would also likely inform interpretation of vaccination studies going forward.

      Weaknesses:

      There are limited numbers of animals assessed.

      Comments on revisions:

      The authors mostly addressed my comments to the previous version. However, in the discussion they added comments relating to and an interpretation based on their own unpublished data and I think that those statements should be removed because the data are not included in this publication and cannot be cited.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript investigates how herbivorous insects, specifically whiteflies and planthoppers, utilize salivary effectors to overcome plant immunity by targeting the RLP4 receptor.

      Strengths:

      The authors present a strong case for the independent evolution of these effectors and provide compelling evidence for their functional roles.

      Comments on revisions:

      The authors have addressed all my concerns.

    2. Reviewer #2 (Public review):

      Summary:

      The authors tested an interesting hypothesis that white flies and planthoppers independently evolved salivary proteins to dampen plant immunity by targeting a receptor like protein. Unlike previously reported receptor like proteins with large ligand-binding domains, the NtRLP4 here has a malectin LRR domain. Interestingly, it also associates with the adaptor SOBIR1. While the function of this protein remains to be further explored, the authors provide strong evidence showing it's the target of salivary proteins as the insects' survival strategy.

      The authors have nicely addressed the questions I raised.

      I noticed two small points the authors may modify:<br /> - Line 16: delete "on"<br /> - Line 185: Replace "is resistant to B. tabaci infestation" with "confers resistance against B. tabaci".

    3. Reviewer #3 (Public review):

      Summary:

      In this study, Wang et al., investigates how herbivorous insects overcome plant receptor-mediated immunity by targeting plant receptor-like proteins. The authors identify two independently evolved salivary effectors, BtRDP in whiteflies and NlSP694 in brown planthoppers, that promote the degradation of plant RLP4 through the ubiquitin-dependent proteasome pathway.

      Strengths:

      This work highlights a convergent evolutionary strategy in distinct insect lineages and advances our understanding of insect-plant coevolution at the molecular level.

      Comments on revisions:

      The authors have satisfactorily addressed all the issues I raised.

    1. Reviewer #1 (Public review):

      Summary:

      Zeng et al. characterized the dynamic brain states that emerged during episodic encoding and the reactivation of these states during the offline rest period in children aged 8-13. In the study, participants encoded scene images during fMRI and later performed a memory recognition test. The authors adopted the BSDS approach and identified four states during encoding, including an "active-encoding" state. The occupancy rate of, and the state transition rates towards, this active-encoding state positively predicted memory accuracy across participants. The authors then decoded the brain states during pre- and post-encoding rests with the model trained on the encoding data to examine state reactivation. They found that the state temporal profile and transition structure shifted from encoding to post-encoding rest. They also showed that the mean lifetime and stability (measured with self-transition probability) of the "default-mode" state during post-encoding rest predict memory performance.

      Strengths:

      How brain dynamics during encoding and offline rest support long-term memory remains understudied, particularly in children. Thus, this study addresses an important question in the field. The authors implemented an advanced computational framework to identify latent brain states during encoding and carefully characterized their spatiotemporal features. The study also showed evidence for the behavioral relevance of these states, providing valuable insights into the link between state dynamics and successful encoding and consolidation.

      Weaknesses:

      (1) If applicable, please provide information on the decoding performance of states during pre- and post-encoding rests. The Methods noted that the authors applied a threshold of 0.1 z-scored likelihood, and based on Figure S2, it seems like most TRs were assigned a reinstated state during post-encoding rest. It would be useful to know, for the decodable TRs, how strong the evidence was in favor of one state over others. Further, was decoding performance better during post- vs. pre- encoding rest? This is critical for establishing that these states were indeed "reinstated" during rest. The authors showed individual-specific correlations between encoding and post-encoding state distribution, which is an important validation of the method, but this result alone is not sufficient to suggest that the states during encoding were the ones that occurred during rest. The authors found that the state dynamics vary substantially between encoding and rest, and it would be helpful to clarify whether these differences might be related to decoding performance. I am also curious whether, if the authors apply the BSDS approach to independently identify brain states during rest periods (instead of using the trained model from encoding), they find similar states during rest as those that emerged during encoding?

      (2) During post-encoding rest, the intermediate activation state (S1) became the dominant state. Overall, the paper did not focus too much on this state. For example, when examining the relationship between state transitions and memory performance, the authors also did not include this state as a part of the analyses presented in the paper (lines 203-211). Could the author report more information about this state and/or discuss how this state might be relevant to memory formation and consolidation?

      (3) Two outcome measures from the BSDS model were the occupancy rate and the mean lifetime. The authors found a significant association with behavior and occupancy rate in some analyses, and mean lifetime in others. The paper would benefit from a stronger theoretical framing explaining how and why these two different measures provide distinct information about the brain dynamics, which will help clarify the interpretation of results when association with behavior was specific to one measure.

      (4) For performance on a memory recognition test, d' is a more common metric in the literature as it isolates the memory signal for the old items from response bias. According to Methods (line 451), the authors have computed a different metric as their primary behavioral measure (hits + correction rejections - misses - false alarms). Please provide a rationale for choosing this measure instead. Have the authors considered computing d' as well and examining brain-behavior relationships using d'?

      (5) While this study examined brain state dynamics in children, there was no adult sample to compare with. Therefore, it is hard to conclude whether the findings are specific to children (or developing brains). It would be helpful to discuss this point in the paper.

    2. Reviewer #2 (Public review):

      This paper investigates the latent dynamic brain states that emerge during memory encoding and predict later memory performance in children (N = 24, ages: 8 -13 years). A novel computational approach (Bayesian Switching Dynamic Systems, BSDS) discovers latent brain states from fMRI data in an unsupervised and parameter-free manner that is agnostic to external stimuli, resulting in 4 states: an active-encoding state, a default-mode state, an inactive state, and an intermediate state. The key finding is that the percentage of time occupied in the active-encoding state (characterized by greater activity in hippocampal, visual, and frontoparietal regions), as well as greater transitions to this state, predicts memory accuracy. Memory accuracy was also predicted by the mean lifetime and transitions to the default-mode state (characterized by greater activity in medial prefrontal cortex and posterior cingulate cortex) during post-encoding rest. Together, the results provide insights into dynamic interactions between brain regions that may be optimal for encoding novel information and consolidating memories for long-term retention.

      The approach is interesting and important for our understanding of neural mechanisms of memory during development, as we know less about dynamic interactions between memory systems in development.

      Moreover, the novel methodology may be broadly useful beyond the questions addressed in this study. The manuscript is well-written and concise. Nonetheless, there are several areas for improvement:

      (1) The study focuses on middle childhood, but there is a lack of engagement in the Introduction or Discussion about what is known about memory development and the brain during this period. Many of the brain regions examined in this study, particularly frontoparietal regions, undergo developmental changes that could influence their involvement in memory encoding and consolidation. The paper would be strengthened by more directly linking the findings to what is already known about episodic memory development and the brain.

      (2) A more thorough overview of the BSDS algorithm is needed, since this is likely a novel method for most readers. Although many of the nitty-gritty details can be referenced in prior work, it was unclear from the main text if the BSDS algorithm discovered latent states based on activation patterns, functional connectivity, or both. Figure 1F is not very informative (and is missing labels).

      (3) A further confusion about the BSDS algorithm was whether it necessarily had to work on the rest data. Figure 4A suggests that each TR was assigned one of the four states based on the maximum win from the log-likelihood estimation. Without more details about how this algorithm was applied to the rest data, it is difficult to evaluate the claim on page 14 about the spontaneous emergence of the states at rest.

      (4) Although the BSDS algorithm was validated in prior simulations and task-based fMRI using sustained block designs in adults, it is unclear whether it is appropriate for the kind of event-related design used in the current study. Figure 1G shows very rapid state changes, which is quantified in the low mean lifetime of the states (between 1-3 TRs on average) in Figure 4C. On the one hand, it is a strength of the algorithm that it is not necessarily tied to external stimuli. On the other hand, it would be helpful to see simulations validating that rapid transitions between states in fMRI data are meaningful and not due to noise.

      (5) The Methods section mentions that participants actively imagined themselves within the encoded scenes and were instructed to memorize the images for a later test during the post-encoding rest scan. This detail needs to be included in the main text and incorporated into the interpretation of the findings, as there are likely mechanistic differences between spontaneous memory replay/reinstatement vs. active rehearsal.

      (6) Information about the general linear model used to discover the 16 ROIs that showed a subsequent memory effect are missing, such as: covariates in the model (motion, etc.), group analysis approach (parametric or nonparametric), whether and how multiple-comparisons correction was performed, if clusters were overlapping at all or distinct, if the total number of clusters was 16 or if this was only a subset of regions that showed the effect.

    3. Reviewer #3 (Public review):

      Summary:

      This paper uses a novel method to look at how stable brain states and the transitions between them promote memory formation during encoding and post-encoding rest in children. I think the paper has some weaknesses (detailed below) that mean that the authors fall short of achieving their aims. Although the paper has an interesting methodological approach, the authors need better logic, and are potentially "double dipping" in their results - meaning their logic is circular. I think the method that they are using could be useful to the broader neuroimaging community, although they need to make this argument clearer in the paper.

      Strengths:

      The paper is interesting in that they use a novel method to look at brain state dynamics and how they might support memory.

      Weaknesses:

      The paper has several weaknesses:

      (1) The authors use children as their study subjects but fail to reconcile why children are used, if the same phenomena are expected to be seen in adults (or only children), and if and how their findings change with age across an age range that ranges from middle childhood into early adolescence. They need to include more consideration for the development of their subject population. The authors should make it clear why and how memory was tested in children and not adults. Are adults and children expected to encode and consolidate in a similar manner to children? Do the findings here also apply to adults? Do the findings here also apply to adults? How was the age range of 8-13-year-old children selected? Why didn't the authors look at change with age? Does memory performance change with age? Do the BSDS dynamics change with age in the authors' sample?

      (2) The authors look for brain state dynamics within a preselected set of ROIs that are selected because they display a subsequent memory effect. This is problematic because the state that is most associated with subsequent memory (S3, or State 3) is also the one that shows most activity in these regions (that have already been a priori selected due to displaying a subsequent memory effect). This logic is circular. It would be helpful if they could look at brain state dynamics in a more ROI agnostic whole brain approach so that we can learn something beyond what a subsequent memory analysis tells us. I think the authors are "double dipping" in that they selected regions for further analysis based on a subsequent memory association (remembered > forgotten contrast) and then found states within those regions showing a subsequent memory effect to further analyze for being associated with subsequent memory. Would it be possible instead to do a whole-brain analysis (something a bit more agnostic to findings) using the BSDS framework, and then, from a whole-brain perspective, look for particular brain states associated with subsequent memory? As it stands, it looks like S3 (state 3) has greater overall activation in all brain regions associated with subsequent memory, so it makes sense that this brain state is also most associated with subsequent memory. The BSDS analysis is therefore not adding anything new beyond what the authors find with the simple subsequent memory contrast that they show in Figure 1C. This particularly effects the following findings: (a) active-encoding state occupancy rate correlated positively with memory accuracy, (b) transitions to the active-encoding state were beneficial / Conversely, transitions toward the inactive state (S4) were detrimental, with incoming transitions showing negative correlations with memory accuracy / The active-encoding state serves as a "hub" configuration that facilitates memory formation, while pathways leading to this state enhance performance and transitions away from it impair encoding.

      (3) The task used to test memory in children seems strange. Why should children remember arbitrary scenes? How this was chosen for encoding needs to be made clear. There needs to be more description of the memory task and why it was chosen. Why was scene encoding chosen? What does scene encoding have to do with the stated goal of (a) "Understanding how children's brains form lasting memories", (b) "optimizing education" and (c) "identifying learning disabilities"? What was the design of the recognition memory test? How many novel scenes were included in the test, and how were they chosen? How close were the "new" images to previously seen "old" images? Was this varied parametrically (i.e., was the similarity between new and old images assessed and quantified?)

      (4) They ultimately found four brain states during encoding. It would be helpful if they could make the logic and foundation for arriving at this number clear.

      (5) There is already extant work on whether brain states during post-encoding rest predict memory outcomes. This work needs to be cited and referred to. The present manuscript needs to be better situated within prior work. The authors should look at the work by Alexa Tompary and Lila Davachi. They have already addressed many of the questions that the authors seek to answer. The authors should read their papers (and the papers they cite and that cite them) and then situate their work within the prior literature.

      More minor weaknesses:

      (1) The authors should back up the claim that "successful episodic memory formation critically depends on the temporal coordination between these systems. Brain regions must coordinate their activity through dynamic functional interactions, rapidly reconfiguring their activity and connectivity patterns in response to changing cognitive demands and stimulus characteristics." Do they have any specific evidence supporting this claim?

      (2) These claims seem overstated: "this work has broad implications for understanding memory function in children, for developing educational interventions that enhance memory formation, and enabling early identification of children at risk for learning disabilities." Can the authors add citations that would support these claims, or if not, remove them?

    1. Reviewer #2 (Public review):

      Summary:

      The molecular mechanisms underlying ciliogenesis are not well understood. Previously, work from the same group (Wu et al., 2018) identified myosin-Va as an important protein in transporting preciliary vesicles to the mother vesicles, allowing for initiation of ciliogenesis. The exocyst complex has previously been implicated in ciliogenesis and protein trafficking to cilia. Here, Lin et al. investigate the role of exocyst complex protein EXOC6A in cilia formation. The authors find that EXOC6A localizes to preciliary vesicles, ciliary vesicles, and the ciliary sheath. EXOC6A colocalizes with Myo-Va in the ciliary vesicle and the ciliary sheath, and both proteins are removed from fully assembled cilia. EXOC6A is not required for Myo-Va localization, but Myo-VA and EHD1 are required for EXOC6A to localize in ciliary vesicles. The authors propose that EXOC6A vesicles continually remodel the cilium: FRAP analysis demonstrates that EXOC6A is a dynamic protein, and live imaging shows that EXOC6A fuses with and buds off from the ciliary membrane. Loss of EXOC6A reduces, but does not eliminate, the number of cilia formed in cells. Any cilia that are still present are structurally abnormal, with either bent morphologies or transition zone defects. Overall, the analyses and imaging are well done, and the conclusions are well supported by the data. The work will be of interest to cell biologists, especially those interested in centrosomes and cilia.

      Strengths:

      The TEM micrographs are of excellent quality. The quality of the imaging overall is very good, especially considering that these are dynamic processes occurring in a small region of the cell. The data analysis is well done and the quantifications are very helpful. The manuscript is well-written and the final figure is especially helpful in understanding the model.

      The manuscript has greatly improved after revision. In particular, testing GPR161 and BBS9 localization is helpful evidence to demonstrate that transition zone function is disrupted when EXOC6A is lost. The generation of a second knockout clone and tests of antibody specificity are also great additions.

      Weaknesses:

      None

    2. Reviewer #3 (Public review):

      Summary:

      Lin et al report on the dynamic localization of EXOC6A and Myo-Va at pre-ciliary vesicles, ciliary vesicles, and ciliary sheath membrane during ciliogenesis using three-dimensional structured illumination microscopy and ultrastructure expansion microscopy. The authors further confirm the interaction of EXOC6A and Myo-Va by co-immunoprecipitation experiments and demonstrated the requirement of EHD1 for the EXOC6A-labeled ciliary vesicles formation. Additional experiments using gene-silencing by siRNA and pharmacological tools identified the involvement of dynein-, microtubule-, and actin in the transport mechanism of EXOC6A-labeled vesicles to the centriole, as they have previously reported for Myo-Va. Notably, loss of EXOC6A severely disrupts ciliogenesis, with the majority of cells becoming arrested at the ciliary vesicle (CV) stage, highlighting the involvement of EXOC6A at later stages of ciliogenesis. As the authors observe dynamic EXOC6A-positive vesicle release and fusion with the ciliary sheath, this suggests a role in membrane and potentially membrane protein delivery to the growing cilium past the ciliary vesicle stage. While CEP290 localization at the forming cilium appears normal the recruitment of other transition zone components, exemplified by several MKS and NPHP module components, was also impaired in EXOC6A-deficient cells.

      Strengths:

      - By applying different microscopy approaches, the study provides deeper insight into the spatial and temporal localization of EXOC6A and Myo-Va during ciliogenesis.

      - The combination of complementary siRNA and pharmacological tools targeting different components strengthens the conclusions.

      - This study reveals a new function of EXOC6A in delivering membrane and membrane proteins during ciliogenesis, both to the ciliary vesicle as well as to the ciliary sheath.

      - The overall data quality is high. The investigation of EXOC6A at different time points during ciliogenesis is well schematized and explained.

      - The authors confirmed central antibody reagents used in this study and validated key experiments by using two independent knockout clones (for which sequencing information was provided).

      Weaknesses:

      - The precise molecular function of EXOC6A remains open, as the presented data suggests no involvement of other exocyst components.

      Taken together, the authors achieved their goal to elucidate the role of EXOC6A in ciliogenesis, demonstrating its involvement in vesicle trafficking and membrane remodeling in both early and late stages of ciliogenesis. Their findings are supported by experimental evidence. This work is likely to have an impact on the field by expanding our understanding of the molecular machinery underlying cilia biogenesis, particularly the coordination between exocyst components and cytoskeletal transport systems. The methods and data presented offer valuable tools for dissecting vesicle dynamics and cilium formation, providing a foundation for future research into ciliary dysfunction and related diseases. By connecting vesicle trafficking to structural maturation of an organelle, the study adds important context to the broader description of cellular architecture and organelle biogenesis.

      Comments on revisions:

      We very much appreciate the extra work you put into improving your manuscript and want to congratulate you on your important discovery. We encourage you to keep up the good work!

    1. Reviewer #1 (Public review):

      Lipid transfer proteins (LTPs) play a crucial role in the intramembrane lipid exchange within cells. However, the molecular mechanisms that govern this activity remain largely unclear. Specifically, the way in which LTPs surmount the energy barrier to extract a single lipid molecule from a lipid bilayer is not yet fully understood. This manuscript investigates the influence of membrane properties on the binding of Ups1 to the membrane and the transfer of phosphatidic acid (PA) by the LTP. The findings reveal that Ups1 shows a preference for binding to membranes with positive curvature. Moreover, coarse-grained molecular dynamics simulations indicate that positive curvature decreases the energy barrier associated with PA extraction from the membrane. Additionally, lipid transfer assays conducted with purified proteins and liposomes in vitro demonstrate that the size of the donor membrane significantly impacts lipid transfer efficiency by Ups1-Mdm35 complexes, with smaller liposomes (characterized by high positive curvature) promoting rapid lipid transfer.

      This study offers significant new insights into the reaction cycle of phosphatidic acid (PA) transfer by Ups1 in mitochondria. The experiments are technically robust and carefully interpreted by the authors. They provide compelling evidence that a positive membrane curvature and the presence of negatively charged phospholipids govern the transfer of PA by the mitochondrial lipid transfer protein Ups1-Mdm35.

    2. Reviewer #2 (Public review):

      Summary:

      Lipid transfer between membranes is essential for lipid biosynthesis across different organelle membranes. Ups1-Mdm35 is one of the best-characterized lipid transfer proteins, responsible for transferring phosphatidic acid (PA) between the mitochondrial outer membrane (OM) and inner membrane (IM), a process critical for cardiolipin (CL) synthesis in the IM. Upon dissociation from Mdm35, Ups1 binds to the intermembrane space (IMS) surface of the OM, extracts a PA molecule, re-associates with Mdm35, and moves through the aqueous IMS to deliver PA to the IM. Here, the authors analyzed the early steps of this PA transfer - membrane binding and PA extraction - using a combination of in vitro biochemical assays with lipid liposomes and purified Ups1-Mdm35 to measure liposome binding, lipid transfer between liposomes, and lipid extraction from liposomes. The authors found that membrane curvature, a previously overlooked property of the membrane, significantly affects PA extraction but not PA insertion into liposomes. These findings were further supported by MD simulations.

      Strengths:

      The experiments are well-designed, and the data are logically interpreted. The present study provides an important basis for understanding the mechanism of lipid transfer between membranes. 

      Weaknesses:

      The physiological relevance of membrane curvature in lipid extraction and transfer still remains open.

      Comments on revisions:

      The authors have addressed most of my previous concerns, and the manuscript now looks much stronger.

    3. Reviewer #3 (Public review):

      The manuscript by Sadeqi et al. studies the interactions between the mitochondrial protein Ups1 and reconstituted membranes. The authors apply synthetic liposomal vesicles to investigate the role of pH, curvature, and charge on the binding of Ups1 to membranes and its ability to extract PA from them. The manuscript is well written and structured. The authors provide all relevant information and reference the appropriate literature in their introduction. The underlying question of how the energy barrier for lipid extraction from membranes is overcome by Ups1 is interesting, and the data presented by the authors offer a valuable new perspective on this process. It is also certainly a challenging in vitro reconstitution experiment, as the authors aim to disentangle individual membrane properties (e.g., curvature, charge, and packing density) to study protein adsorption and lipid transfer.

    1. Reviewer #3 (Public review):

      Summary:

      This manuscript by Kim and Parsons presents an overview of the nitroreductase/metronidazole (NTR/MTZ) cell ablation system.

      Strengths:

      This manuscript nicely places the NTR/MTZ system in the context of other cell ablation methods, with a discussion of their respective advantages and disadvantages. This review is particularly useful for highlighting the many ways the NTR/MTZ system has been applied to study the regeneration of multiple cell types and to model different degenerative human diseases. The review concludes with a discussion on recent improvements made to the system and practical considerations and "best practices" for NTR-based experiments. This review could be a helpful resource, especially for researchers new to regeneration or cell ablation studies.

      Weaknesses:

      Although the NTR/MTZ system has been used in other model organisms, this review is primarily focused on its uses in zebrafish. While this is understandable given the wide adoption of NTR/MTZ in the zebrafish field, discussion of the unique considerations and/or challenges for non-zebrafish systems would be an interesting addition and could broaden the potential audience for this review. Additional minor revisions, as suggested below, could also improve readability.

    2. Reviewer #2 (Public review):

      Summary:

      Kim and Parsons reviewed the nitroreductase (NTR)/prodrug system: when engineered cells expressing the enzyme NTR are treated with prodrug (e.g. metronidazole), NTR converts the prodrug into a cytotoxic compound that kills these cells. The review covers how the system has been developed, spatiotemporal control of targeted cell ablation, and its broad utility to study regenerative mechanisms, model human diseases, and screen chemicals to discover pro-regenerative and protective compounds. They further discussed the newer version of NTR, a more potent prodrug, and experimental design, which not only expands the possible utility of the NTR/prodrug system, but also allows the research community to develop a precise, reproducible and versatile platform.

      Strengths:

      The review summarized landmark work application of the NTR/prodrug system, and recent studies, with focus on the model organism zebrafish. The review provides a good gateway to understanding the system and considering regenerative studies.

      Weaknesses:

      No weaknesses were identified by this reviewer.

    3. Reviewer #1 (Public review):

      Summary:

      Kim and Parsons present a timely overview of the NTR/prodrug system and its applications in regenerative biology research, with particular emphasis on tissue-specific cell ablation. The system has substantially advanced the field by enabling non-invasive, conditional cell elimination, and has proven especially powerful in zebrafish, though applications in other classical model organisms are also noted. The review covers the historical origins of the NTR system, its use in regeneration studies, small-molecule screening, and genetic and CRISPR-based screening, as well as future directions, including the development of the highly efficient NTR2 enzyme variant.

      Strengths:

      This is a useful and well-structured contribution. The manuscript is a valuable resource for the regeneration biology community.

      Weaknesses:

      The impact and scientific value of this paper could be meaningfully enhanced by addressing several points outlined below. The concerns centre on completeness, conceptual precision, and the depth of mechanistic discussion.

      (1) Title: Species specificity.

      Given that the review's primary focus is the zebrafish model, it would be appropriate to include the species name in the title. This would improve discoverability and accurately set the scope of the article for prospective readers.

      (2) Subchapter: Physical injury.

      The subchapter enumerates different types of physical injury models but would benefit from a more substantive comparative discussion. In particular, the authors are encouraged to address the following:

      (2.1) Outcome comparison: Surgical and other invasive approaches cause damage to entire tissue structures comprising multiple cell types, whereas tissue-specific genetic ablation eliminates a defined cell population while leaving the surrounding architecture largely intact. This fundamental distinction has direct implications for the interpretation of regenerative outcomes and should be clearly articulated.

      (2.2) Inflammatory response: Invasive injuries typically trigger a robust inflammatory response, which itself can be a potent driver of regeneration. By contrast, genetic cell ablation may elicit a qualitatively different inflammatory reaction. A comparative discussion of this distinction would help readers appreciate a critical limitation of genetic ablation systems relative to models of natural, accidental tissue damage.

      (3) Subchapter: Cell-specific toxins.

      This subchapter would benefit from several targeted expansions:

      (3.1) Off-target effects: The authors should include evidence that the exemplified drugs have known off-target activities, with a discussion of how these confounded the interpretation of experimental data. At least a few concrete published examples should be cited.

      (3.2) Completeness of the toxin list: The current list appears illustrative rather than comprehensive. A more complete enumeration would be valuable, particularly for neurotoxins and drugs targeting sensory cells, as these are highly relevant to the zebrafish regeneration field.

      (3.3) Interspecies differences: It would be informative to specify whether drug specificity differs across species, as this is a practical consideration for researchers working in organisms other than zebrafish.

      (4) Subchapter: Optogenetic cell ablation.

      The authors note that optogenetic cell ablation has not yet been applied in conventional regeneration studies. It would strengthen this section to include a discussion of the underlying reasons for this gap, whether technical or biological, so that readers can appreciate the barriers and potential for future adoption.

      (5) Terminology: "Suicide gene".

      The use of the term "suicide gene" to nitroreductase is conceptually imprecise and merits reconsideration. Strictly speaking, a suicide gene is one whose expression alone is sufficient to kill the cell, as in the case of genes encoding direct triggers of apoptosis or the catalytic A subunit of diphtheria toxin (DTA). NTR does not meet this criterion: it requires the exogenous administration of a prodrug (e.g., metronidazole) to produce a cytotoxic metabolite, and is therefore only conditionally lethal.

      It is worth noting that nitroreductases evolved in bacteria and fungi as enzymes involved in chemoprotection and detoxification, converting potentially toxic and mutagenic nitroaromatic compounds into less harmful metabolites (PMID: 18355273). This biological context further underscores that NTR is not inherently a lethal protein. The authors are encouraged to replace or qualify the term "suicide gene" and instead adopt terminology that more accurately reflects the conditional, prodrug-dependent nature of the system.

      (6) NTR/MTZ in regenerative studies: Mechanistic depth.

      While the review catalogues several studies employing the NTR/MTZ system, it lacks mechanistic depth regarding the cellular basis of ablation. The following questions should be addressed, where evidence exists in the literature:

      (6.1) Temporal dynamics of cell death: What is known about the kinetics of NTR/MTZ-induced lethality across different tissue types in larval and adult zebrafish, as well as other organisms? Are there age- and tissue-specific differences in the speed or completeness of ablation?

      (6.2) Mechanism of cell death: What is the cellular basis of NTR/MTZ-induced cytotoxicity in zebrafish? In particular, do the toxic metabolites preferentially cause mitochondrial damage or nuclear DNA damage, and what downstream death pathways are engaged?

      (6.3) Proliferative versus post-mitotic cells: Are proliferating and non-proliferating cells equally sensitive to the NTR/MTZ system, or does the proliferative status of a cell influence susceptibility? This is a practically important question for researchers designing ablation experiments in tissues with mixed cell populations.

      (6.4) Ablation of progenitor cells: Are there published examples demonstrating that co-ablation of differentiated functional cells and organ-specific progenitor cells abolishes regenerative capacity? Such examples would be highly informative in illustrating the system's power to dissect the cellular requirements for regeneration.

      Addressing the points above, particularly the comparative discussion of injury models and inflammatory responses, the clarification of terminology, and the mechanistic discussion of NTR/MTZ-induced cell death would substantially strengthen the review's scientific contribution and utility.

    1. Reviewer #1 (Public review):

      Summary:

      Chen et al. describe metabolic phenotypes in Dp16 Down Syndrome mice, specifically the Dp(16)1Yey/+ mice - segmental duplication model carrying a majority of the triplicated Hsa21 gene orthologs. The group has performed metabolic phenotyping data in chow and high-fat diets, as well as undertaking a transcriptomic and metabolomic approach in tissues such as white and brown adipose tissues, liver, skeletal muscle, and hypothalamus to reveal both shared and sex-specific differences. The group describes sexual dimorphism in body weight, body temperature, food intake, and physical activity. Core shared features are insulin resistance, glucose intolerance, impaired lipid clearance, and dyslipidaemia in the Dp16 mice. They report tissue signatures of immune activation and a pro-inflammatory state, ER and oxidative stress, fibrosis, impaired glucose and fatty acid catabolism, altered lipid and bile acid profiles, and reduced mitochondrial respiration in Dp16 mice.

      Strengths:

      Overall, this is a good study with detailed, comprehensive data from an excellent group who have previously published on metabolic phenotyping of 2 other Down Syndrome mouse models. Although somewhat descriptive, it does certainly add to the current field and understanding of strengths and weaknesses of Down Syndrome mouse models, as well as identifying new features whilst strengthening previously suggested mechanisms.

      Weaknesses:

      Many aspects of this study have been described in other Down syndrome mouse models, though there are certainly aspects that are new. It would be useful if the authors could do a direct critique and comparison with previous publications in the area, utilising the same Down Syndrome mouse model. There are also a few limitations in the number of animals used and the interpretation of the data that should be acknowledged.

    2. Reviewer #2 (Public review):

      Summary:

      Human DS is associated with metabolic dysfunction in humans, but the precise details of this have not been studied in detail. Here, the authors use a mouse model of DS to study systemic metabolic and transcriptional responses in key metabolic tissues to provide a deep understanding of the metabolic changes associated with DS. As part of his work, the authors also aimed to help inform the selection of a mouse model that best reflects the metabolic profile of DS, through comparison with other DS model metabolic data.

      The data presented in this model will be of interest to those in the field of metabolism. The immediate impact is unclear, but the breadth of data presented makes this a very useful resource.

      Strengths:

      (1) This work builds on other comprehensive analyses that the authors have performed in other DS mouse models.

      (2) The authors note common metabolic disturbances between male and female mice (e.g., insulin resistance) alongside clearly sexually dimorphic phenotypes (e.g., body weight). Studying both sexes in this context is important.

      (3) The authors have written the paper in a way that integrates a large number of observations well. There is complex data, and a high degree of sexual dimorphism. The study has generated a valuable and wide-ranging dataset comprising molecular, biochemical, and physiological data that will be useful for further, more mechanistic studies of metabolism in DS.

      (4) For specific observations, like the findings of altered body temperature in male and female mice, the authors undertake follow-up hypothesis-driven analyses of BAT mitochondria and specific hormones. Although these analyses do not explain the change in temperature, they ensure the study is not purely descriptive in nature.

      Weaknesses:

      (1) Assessing metabolism using dynamic testing is a strength. ITT, GTT and LTTs are included.

      (2) The dosing for GTTs, ITTs and LTTs was performed per body weight. But the mice under chow and HFD had different body weights. This may compromise the interpretation of the data. Further, ITTs are presented as percentage change, and this can be heavily influenced by baseline glucose measures. The changes appear quite dramatic, so can the authors plot the raw data instead?

      (3) In addition, throughout the manuscript, it is not clear which tissues are the most dominant in disrupting metabolism. The ITT and GTT are composite measures across tissues. Tissue-specific analyses using a clamp technique or isolated tissues may provide more clarity here.

      (4) One of the aims of the study was "to help inform the selection of mouse model that best reflects the metabolic profile of DS". The discussion does not contain a comparison between the previous work on different strains and relative to known human data.

      (5) Data availability. Raw metabolomic data should be made available.

    3. Reviewer #3 (Public review):

      Summary:

      The article by Chen et al. describes the comprehensive metabolic profiling of DP16 mice, a Down syndrome model that carries a duplicated segment of the mouse chromosome syntenic to human chromosome 21. The authors note that this model is superior to previously used models, based on genetics, as ~65% of the chromosome 21 orthologues. The metabolic phenotypes also appear to be more consistent with those observed in humans with Down Syndrome. The study lays the groundwork for a more detailed genetic dissection of dosage-sensitive genes that contribute to the metabolic deficits observed in Down Syndrome.

      Strengths:

      There is an enormous amount of data in this manuscript, and the methods are described with adequate attention to detail. A strength of the manuscript is that both male and female mice were analyzed, so that concordant and discordant phenotypes were identified. Both males and females had evidence of insulin resistance. Transcriptomic and metabolomic data revealed impaired pathways for lipid metabolism, a pro-inflammatory state, reduced mitochondrial health and oxidative stress. Although the effects of a high-fat diet on weight gain were divergent, this diet caused worsened insulin resistance in both males and females.

      The discussion is excellent. Limitations of the study are well described. This reviewer does not identify any critical missing data.

      Weaknesses:

      It might have been helpful to have included blood pressure measurements, given the differences in 19-Nor-deoxycorticosterone. The discussion references several articles that describe sex-dependent differences in metabolic phenotypes in humans with Down syndrome, and it might have been helpful to state more explicitly whether these differences correlate with those observed here in mice.

    1. Reviewer #1 (Public review):

      Summary:

      Noell et al have presented a careful study of the dissociation kinetics of Kinesin (1,2,3) classes of motors moving in vitro on a microtubule. These motors move against the opposing force from a ~1 micron DNA strand (DNA tensiometer) that is tethered to the microtubule and also bound to the motor via specific linkages (Figure 1A). The authors compare the time for which motors remain attached to the microtubule when they are tethered to the DNA, versus when they are not. If the former is longer, the interpretation is that the force on the motor from the stretched DNA (presumed to be working solely along the length of the microtubule) causes the motor's detachment rate from the microtubule to be reduced. Thus, the specific motor exhibits "catch-bond" like behaviour.

      Strengths:

      The motivation is good - to understand how kinesin competes against dynein through the possible activation of a catch bond. Experiments are well done, and there is an effort to model the results theoretically.

      Weaknesses:

      The motivation of these studies is to understand how kinesin (1/2/3) motors would behave when they are pitted in a tug of war against dynein motors as they transport cargo in a bidirectional manner on microtubules. Earlier work on dynein and kinesin motors using optical tweezers has suggested that dynein shows a catch bond phenomenon, whereas such signatures were not seen for kinesin. Based on their data with the DNA tensiometer, the authors would like to claim that (i) Kinesin1 and Kinesin2 also show catch-bonding and (ii) the earlier results using optical traps suffer from vertical forces, which complicates the catch-bond interpretation.

      While the motivation of this work is reasonable, and the experiments are careful, I find significant issues that the authors have not addressed:

      (1) Figure 1B shows the PREDICTED force-extension curve for DNA based on a worm-like chain model. Where is the experimental evidence for this curve? This issue is crucial because the F-E curve will decide how and when a catch-bond is induced (if at all it is) as the motor moves against the tensiometer. Unless this is actually measured by some other means, I find it hard to accept all the results based on Figure 1B.

      (2) The authors can correct me on this, but I believe that all the catch-bond studies using optical traps have exerted a load force that exceeds the actual force generated by the motor. For example, see Figure 2 in reference 42 (Kunwar et al). It is in this regime (load force > force from motor) that the dissociation rate is reduced (catch-bond is activated). Such a regime is never reached in the DNA tensiometer study because of the very construction of the experiment. I am very surprised that this point is overlooked in this manuscript. I am therefore not even sure that the present experiments even induce a catch-bond (in the sense reported for earlier papers).

      (3) I appreciate the concerns about the Vertical force from the optical trap. But that leads to the following questions that have not at all been addressed in this paper:

      (i) Why is the Vertical force only a problem for Kinesins, and not a problem for the dynein studies?

      (ii) The authors state that "With this geometry, a kinesin motor pulls against the elastic force of a stretched DNA solely in a direction parallel to the microtubule". Is this really true? What matters is not just how the kinesin pulls the DNA, but also how the DNA pulls on the kinesin. In Figure 1A, what is the guarantee that the DNA is oriented only in the plane of the paper? In fact, the DNA could even be bending transiently in a manner that it pulls the kinesin motor UPWARDS (Vertical force). How are the authors sure that the reaction force between DNA and kinesin is oriented SOLELY along the microtubule?

      (4) For this study to be really impactful and for some of the above concerns to be addressed, the data should also have included DNA tensiometer experiments with Dynein. I wonder why this was not done?

      While I do like several aspects of the paper, I do not believe that the conclusions are supported by the data presented in this paper for the reasons stated above.

    2. Reviewer #2 (Public review):

      Summary:

      To investigate the detachment and reattachment kinetics of kinesin-1, 2, and 3 motors against loads oriented parallel to the microtubule, the authors used a DNA tensiometer approach comprising a DNA entropic spring attached to the microtubule on one end and a motor on the other. They found that for kinesin-1 and kinesin-2, the dissociation rates at stall were smaller than the detachment rates during unloaded runs. With regard to the complex reattachment kinetics found in the experiments, the authors argue that these findings were consistent with a weakly-bound 'slip' state preceding motor dissociation from the microtubule. The behavior of kinesin-3 was different and (by the definition of the authors) only showed prolonged "detachment" rates when disregarding some of the slip events. The authors performed stochastic simulations that recapitulate the load-dependent detachment and reattachment kinetics for all three motors. They argue that the presented results provide insight into how kinesin-1, -2, and -3 families transport cargo in complex cellular geometries and compete against dynein during bidirectional transport.

      Strengths:

      The present study is timely, as significant concerns have been raised previously about studying motor kinetics in optical (single-bead) traps where significant vertical forces are present. Moreover, the obtained data are of high quality, and the experimental procedures are clearly described.

      Weaknesses:

      However, in the present version of the manuscript, the conclusions drawn from the experiments, the overall interpretation of the results, and the novelty over previous reports appear less clear.

      Major comments:

      (1) The use of the term "catch bond" is misleading, as the authors do not really mean consistently a catch bond in the classical sense (i.e., a protein-protein interaction having a dissociation rate that decreases with load). Instead, what they mean is that after motor detachment (i.e., after a motor protein dissociating from a tubulin protein), there is a slip state during which the reattachment rate is higher as compared to a motor diffusing in solution. While this may indeed influence the dynamics of bidirectional cargo transport (e.g., during tug-of-war events), the used terms (detachment (with or without slip?), dissociation, rescue, ...) need to be better defined and the results discussed in the context of these definitions. It is very unsatisfactory at the moment, for example, that kinesin-3 is at first not classified as a catch bond, but later on (after tweaking the definitions) it is. In essence, the typical slip/catch bond nomenclature used for protein-protein interaction is not readily applicable for motors with slippage.

      (2) The authors define the stall duration as the time at full load, terminated by >60 nm slips/detachments. Isn't that a problem? Smaller slips are not detected/considered... but are also indicative of a motor dissociation event, i.e., the end of a stall. What is the distribution of the slip distances? If the slip distances follow an exponential decay, a large number of short slips are expected, and the presented data (neglecting those short slips) would be highly distorted.

      (3) Along the same line: Why do the authors compare the stall duration (without including the time it took the motor to reach stall) to the unloaded single motor run durations? Shouldn't the times of the runs be included?

      (4) At many places, it appears too simple that for the biologically relevant processes, mainly/only the load-dependent off-rates of the motors matter. The stall forces and the kind of motor-cargo linkage (e.g., rigid vs. diffusive) do likely also matter. For example: "In the context of pulling a large cargo through the viscous cytoplasm or competing against dynein in a tug-of-war, these slip events enable the motor to maintain force generation and, hence, are distinct from true detachment events." I disagree. The kinesin force at reattachment (after slippage) is much smaller than at stall. What helps, however, is that due to the geometry of being held close to the microtubule (either by the DNA in the present case or by the cargo in vivo) the attachment rate is much higher. Note also that upon DNA relaxation ,the motor is likely kept close to the microtubule surface, while, for example, when bound to a vesicle, the motor may diffuse away from the microtubule quickly (e.g., reference 20).

      (5) Why were all motors linked to the neck-coil domain of kinesin-1? Couldn't it be that for normal function, the different coils matter? Autoinhibition can also be circumvented by consistently shortening the constructs.

      (6) I am worried about the neutravidin on the microtubules, which may act as roadblocks (e.g. DOI: 10.1039/b803585g), slip termination sites (maybe without the neutravidin, the rescue rate would be much lower?), and potentially also DNA-interaction sites? At 8 nM neutravidin and the given level of biotinylation, what density of neutravidin do the authors expect on their microtubules? Can the authors rule out that the observed stall events are predominantly the result of a kinesin motor being stopped after a short slippage event at a neutravidin molecule?

      (7) Also, the unloaded runs should be performed on the same microtubules as in the DNA experiments, i.e., with neutravidin. Otherwise, I do not see how the values can be compared.

      (8) If, as stated, "a portion of kinesin-3 unloaded run durations were limited by the length of the microtubules, meaning the unloaded duration is a lower limit." corrections (such as Kaplan-Meier) should be applied, DOI: 10.1016/j.bpj.2017.09.024.

      (9) Shouldn't Kaplan-Meier also be applied to the ramp durations ... as a ramp may also artificially end upon stall? Also, doesn't the comparison between ramp and stall duration have a problem, as each stall is preceded by a ramp ...and the (maximum) ramp times will depend on the speed of the motor? Kinesin-3 is the fastest motor and will reach stall much faster than kinesin-1. Isn't it obvious that the stall durations are longer than the ramp duration (as seen for all three motors in Figure 3)?

      (10) It is not clear what is seen in Figure S6A: It looks like only single motors (green, w/o a DNA molecule) are walking ... Note: the influence of the attached DNA onto the stepping duration of a motor may depend on the DNA conformation (stretched and near to the microtubule (with neutravidin!) in the tethered case and spherically coiled in the untethered case).

      (11) Along this line: While the run time of kinesin-1 with DNA (1.4 s) is significantly shorter than the stall time (3.0 s), it is still larger than the unloaded run time (1.0 s). What do the authors think is the origin of this increase?

      (12) "The simplest prediction is that against the low loads experienced during ramps, the detachment rate should match the unloaded detachment rate." I disagree. I would already expect a slight increase.

      (13) Isn't the model over-defined by fitting the values for the load-dependence of the strong-to-weak transition and fitting the load dependence into the transition to the slip state?

      (14) "When kinesin-1 was tethered to a glass coverslip via a DNA linker and hydrodynamic forces were imposed on an associated microtubule, kinesin-1 dissociation rates were relatively insensitive to loads up to ~3 pN, inconsistent with slip-bond characteristics (37)." This statement appears not to be true. In reference 37, very similar to the geometry reported here, the microtubules were fixed on the surface, and the stepping of single kinesin motors attached to large beads (to which defined forces were applied by hydrodynamics) via long DNA linkers was studied. In fact, quite a number of statements made in the present manuscript have been made already in ref. 37 (see in particular sections 2.6 and 2.7), and the authors may consider putting their results better into this context in the Introduction and Discussion. It is also noteworthy to discuss that the (admittedly limited) data in ref. 37 does not indicate a "catch-bond" behavior but rather an insensitivity to force over a defined range of forces.

    3. Reviewer #3 (Public review):

      Summary:

      Several recent findings indicate that forces perpendicular to the microtubule accelerate kinesin unbinding, where perpendicular and axial forces were analyzed using the geometry in a single-bead optical trapping assay (Khataee and Howard, 2019), comparison between single-bead and dumbbell assay measurements (Pyrpassopoulos et al., 2020), and comparison of single-bead optical trap measurements with and without a DNA tether (Hensley and Yildiz, 2025).

      Here, the authors devise an assay to exert forces along the microtubule axis by tethering kinesin to the microtubule via a dsDNA tether. They compared the behavior of kinesin-1, -2, and -3 when pulling against the DNA tether. In line with previous optical trapping measurements, kinesin unbinding is less sensitive to forces when the forces are aligned with the microtubule axis. Surprisingly, the authors find that both kinesin-1 and -2 detach from the microtubule more slowly when stalled against the DNA tether than in unloaded conditions, indicating that these motors act as catch bonds in response to axial loads. Axial loads accelerate kinesin-3 detachment. However, kinesin-3 reattaches quickly to maintain forces. For all three kinesins, the authors observe weakly attached states where the motor briefly slips along the microtubule before continuing a processive run.

      Strengths:

      These observations suggest that the conventional view that kinesins act as slip bonds under load, as concluded from single-bead optical trapping measurements where perpendicular loads are present due to the force being exerted on the centroid of a large (relative to the kinesin) bead, needs to be reconsidered. Understanding the effect of force on the association kinetics of kinesin has important implications for intracellular transport, where the force-dependent detachment governs how kinesins interact with other kinesins and opposing dynein motors (Muller et al., 2008; Kunwar et al., 2011; Ohashi et al., 2018; Gicking et al., 2022) on vesicular cargoes.

      Weaknesses:

      The authors attribute the differences in the behaviour of kinesins when pulling against a DNA tether compared to an optical trap to the differences in the perpendicular forces. However, the compliance is also much different in these two experiments. The optical trap acts like a ~ linear spring with stiffness ~ 0.05 pN/nm. The dsDNA tether is an entropic spring, with negligible stiffness at low extensions and very high compliance once the tether is extended to its contour length (Fig. 1B). The effect of the compliance on the results should be addressed in the manuscript.

      Compared to an optical trapping assay, the motors are also tethered closer to the microtubule in this geometry. In an optical trap assay, the bead could rotate when the kinesin is not bound. The authors should discuss how this tethering is expected to affect the kinesin reattachment and slipping. While likely outside the scope of this study, it would be interesting to compare the static tether used here with a dynamic tether like MAP7 or the CAP-GLY domain of p150glued.

      In the single-molecule extension traces (Figure 1F-H; S3), the kinesin-2 traces often show jumps in position at the beginning of runs (e.g., the four runs from ~4-13 s in Fig. 1G). These jumps are not apparent in the kinesin-1 and -3 traces. What is the explanation? Is kinesin-2 binding accelerated by resisting loads more strongly than kinesin-1 and -3?

      When comparing the durations of unloaded and stall events (Fig. 2), there is a potential for bias in the measurement, where very long unloaded runs cannot be observed due to the limited length of the microtubule (Thompson, Hoeprich, and Berger, 2013), while the duration of tethered runs is only limited by photobleaching. Was the possible censoring of the results addressed in the analysis?

      The mathematical model is helpful in interpreting the data. To assess how the "slip" state contributes to the association kinetics, it would be helpful to compare the proposed model with a similar model with no slip state. Could the slips be explained by fast reattachments from the detached state?

    1. Reviewer #1 (Public review):

      Summary:

      Eroglu and Hobert demonstrate that injecting CRISPR guides and repair constructs to target three genes at a time, tagging each with a different fluorescent protein, and selecting which gene to tag with which fluorophore based on genes' expression levels, can improve the efficiency of gene tagging.

      Strengths:

      This manuscript demonstrates that three genes can be targeted efficiently with three different fluorophores. It also presents some practical considerations, like using the fluorophore least complicated by agar/worm autofluorescence for genes with low expression levels, and cost calculations if the same methods were used on all genes.

      Weaknesses:

      Eroglu has demonstrated in a previous publication that single-stranded DNA injection can increase the efficiency of CRISPR in C. elegans while inserting two fluorescent proteins and a co-CRISPR marker into three loci. The current work is, therefore, an incremental advance. In general, I applaud the authors' willingness to think ahead to how whole proteome tagging might be accomplished, but I predict that the advance here will be one of many small advances that will get the field to that goal. The title vastly oversells the advance in my view, and the first sentence of the Discussion seems a more apt summary of the key advance here.

      Some injections target genes on the same chromosome together, which will create unnecessary issues when doing necessary backcrossing, especially if the mutation rate is increased by CRISPR. Also, the need for backcrossing and perhaps sequencing made me wonder if injecting 3 together really is helpful vs targeting each gene separately, since only 5 worms need to be injected.

      The limited utility of current blue fluorescent proteins makes me wonder if it's worth using at all at this stage, before there are better blue (or far red) fluorescent proteins.

      Some literature reviews, particularly in the Introduction and Abstract, rely too much on recent examples from the authors' laboratory instead of presenting the state of the field. I'd like to have known what exactly has been done with simultaneous injection targeting multiple loci more thoroughly, comparing what has been accomplished to date by various laboratories' advances to date.

    2. Reviewer #2 (Public review):

      The manuscript by Eroglu and Hobert presents a set of strains each harboring up to three fluorescently tagged endogenous proteins. While there is technically nothing wrong with the method and the images are beautiful, we struggled to appreciate the advance of this work - who is this paper for?

      As a technical method, the advance is minimal since the first author had already demonstrated that three mutations (fluorophore insertion and co-CRISPR marker) could be introduced simultaneously.

      As a pilot for creating genome-scale resources, it is not clear whether three different fluorophores in one animal, while elegantly designed and implemented, will be desired by the broader community.

      Finally, the interpretation of the patterns observed in the created lines is somewhat lacking. A Table with all the observations must be included. This can replace the descriptions of the observations with the different lines, which could be somewhat laborious for the reader, and are often wrong. There are numerous mistaken expectations of protein expression here, but two examples include:

      (1) The expectation that ACDH-10 is enriched in the intestine and epidermal tissues (hypodermis)<br /> There are multiple paralogs of this protein (see WormPaths or WormFlux) that may share functions in different tissues. There is also no reason to assume that fatty acid metabolism does not occur in other tissues (including the germline). Finally, there are no published studies about this enzyme, so we really don't know for sure what it's doing.

      (2) The expectation that HXK-1 is ubiquitously expressed<br /> Three paralogous enzymes are all associated with the same reaction, and we have shown that these three function redundantly in vivo, perhaps in different tissues (PMID: 40011787). Moreover, single-cell RNA-seq data (PMID: 38816550) also show enrichment of hxk-1 in gonadal sheath cells.

      The table should have at least the following information: gene/protein name - Wormbase ID - TPM levels of single cell data assigned to tissues for L2, L4, and adult (all published) - tissues in which expression is observed in the lines presented by the authors.

    3. Reviewer #3 (Public review):

      Summary:

      The authors argue that establishing the expression pattern and subcellular localisation of an animal's proteome will highlight many hypotheses for further study. To make this point and show feasibility, they developed a pipeline to knock in DNA encoding fluorescent tags into C. elegans genes.

      Strengths:

      The authors effectively make the points above. For example, they provide evidence of two populations of mitochondria in the C. elegans germline that differ qualitatively in the proteins they express. They also provide convincing evidence that labelling the whole proteome is an achievable goal with relatively limited resources and time.

      Weaknesses:

      Cell biology in C. elegans is challenging because of the small size of many of its cells, notably neurons. This can make establishing the sub-cellular localisation of a fluorescently tagged protein, or co-localizing it with another protein, tricky. The authors point out in their introduction that advances in light microscopy, such as diSPIM, STED, and ISM (a close relative of SIM), have increased the resolution of light microscopy. They also point out that recent advances in expansion microscopy can similarly help overcome the resolution limit.

      (1) Have the authors investigated if the three fluorescent tags they use are appropriate for super-resolution microscopy of C. elegans, e.g., STED or SIM? Would Elektra be better than mTAGBFP2? How does mScarlet3-S2 compare to mScarlet 3?

      (2) Have the authors investigated what tags could be used in expansion microscopy - that is, which retain antigenicity or even fluorescence after the protocol is applied? It may be useful to add different epitope tags to the knock-in cassettes for this purpose.

      The paper is fine as it stands. The experiments above could add value to it and future-proof it, but are not essential. If the experiments are not attempted, the authors could refer to the points above in the discussion.

    1. Reviewer #1 (Public review):

      This manuscript describes the pattern of relaxed selection observed at spermatogenesis genes in gorillas, presumably due to the low sperm competition associated with single-male polygyny. The analyses to detect patterns of selection are very thorough, as are the follow-up analyses to characterize the function of these genes. Furthermore, the authors take the extra steps of in vivo determination of function with a Drosophila model.

      This is an excellent paper. It addresses the interesting phenomenon of relaxation of selection as a genomic signal of reproductive strategies using multiple computational approaches and follow-up analyses by pulling in data from GO, mouse knockouts, human infertility database, and even Drosophila RNAi experiments. I really appreciate the comprehensive and creative approach to analyze and explore the data. As far as I can tell, the analyses were performed soundly and statistics are appropriate. The Introduction and Discussion sections are thoughtful and well-written. I have no major criticisms of the manuscript, just a few minor thoughts.

      In the "Caveats and Limitations" section of the Discussion, the first paragraph of this section states the obvious that genetic manipulation of gorillas is not feasible. Beyond a reminder to the reader that this was a rationale for the Drosophila work, it isn't really adding much insight.

      I do agree with one of the initial reviewers that a comparative approach would add powerful perspective on the evolution of these genes. At the same time, I agree with the authors that the present work is comprehensive and can stand in its own in providing convincing evidence that many male reproductive genes in gorillas have experienced relaxed selection, without reference to other species with similar mating systems. I do not think that the elephant seal data adds a useful perspective.

    2. Reviewer #3 (Public review):

      In this study the authors tested for alterations in selection intensity across ~13,000 protein coding genes along the gorilla lineage in order to test the hypothesis that the evolution of a polygynous social system resulted in relaxed selective constraint through a reduction in sperm competition. Of these genes, 578 exhibited signatures of relaxed purifying selection that were enriched for functions in male germ cells including meiosis and sperm biology. These genes were also more likely to be expressed in male germ cells and to contain deleterious mutations. Functional analysis of genes not previously implicated in male reproduction identified 41 new genes essential to male fertility in a Drosophila model. Moreover, genes under relaxed selective constraint in the gorilla lineage were more likely to contain loss of function variants in a cohort of infertile men. The authors conclude that their results support the hypothesis that the emergence of a polygynous social system may have reduced the degree of selective pressures exerted through sperm competition.

      (1) The identification of novel genes involved in spermatogenesis using signatures of relaxed selective constraint coupled to in vivo RNAi in Drosophila offers a proof of principal as to the power of evolutionarily-informed functional genomics that has been largely underutilized.

      (2) The analysis is restricted to protein-coding regions of genes that have single, orthologous sequences spanning 261 mammalian species, and as such is a non-random set of 13,310 genes that have higher evolutionary conservation. While this approach is necessary for the analyses being performed, it excludes non-coding regions, recently duplicated genes/gene families, and rapidly evolving genes, which are all likely subject to stronger selection as compared to evolutionarily conserved genes (and gene regions). Thus, the conclusions of relaxed selective constraint as being pervasive could be missing a large number of the most strongly selected genes, many of which may include sex and reproduction related genes.

      (3) The identification of genes showing relaxed selection along the gorilla lineage, which are overrepresented in male reproduction, supports the hypothesis that the emergency of polygyny resulted in relaxed sperm competition and is the driving force behind their observations. To more fully test this hypothesis the authors contrast their findings to observations in elephant seals, however of the 573 genes under relaxed selection in gorillas only 14 show a similar pattern. These genes are not enriched for male reproductive function, and may be under-powered or result from variation in reproductive strategies in gorillas as compared to elephant seals that mate seasonally.

      (4) The comparisons of human males with infertility to a large number of healthy males from a separate cohort can lead to genetic differences related to population structure or differences in study recruitment independent of infertility, and care must be taken to avoid confounding. Population structure is more likely to affect patterns of rare variation (including loss of function mutations), even when controls are ascertained using similar enrollment criteria, geographic regions, racial/ethnic and national identities. In this study, the MERGE cohort is largely recruited from Germany, vs. a geographically more broadly recruited control cohort gnomeAD. The authors performed a sub-cohort analysis among individuals identified as having predominantly European genetic ancestry within MERGE, to that of non-Finnish European individuals from genomeAD, and find similar results, thus strengthening their findings.

    1. Reviewer #1 (Public review):

      Summary:

      This study presents an interesting investigation into the role of trained immunity in inflammatory bowel disease, demonstrating that β-glucan-induced reprogramming of innate immune cells can ameliorate experimental colitis. The findings are novel and clinically relevant, with potential implications for therapeutic strategies in IBD. The combination of functional assays, adoptive transfer experiments, and single-cell RNA sequencing provides comprehensive mechanistic insights. However, some aspects of the study could benefit from further clarification to strengthen the conclusions.

      Strengths:

      (1) This study elegantly connects trained immunity with IBD, demonstrating how β-glucan-induced innate immune reprogramming can mitigate chronic inflammation.

      (2) Adoptive transfer experiments robustly confirm the protective role of monocytes/macrophages in colitis resolution.

      (3) Single-cell RNA sequencing provides mechanistic depth, revealing the expansion of reparative Cx3cr1⁺ macrophages and their contribution to epithelial repair.

      (4) The work highlights the therapeutic potential of trained immunity in restoring gut homeostasis, offering new directions for IBD treatment.

      Weaknesses:

      While β-glucan may exert its training effect on hematopoietic stem cells, performing ATAC-seq on HSCs or monocytes to profile chromatin accessibility at antibacterial defense and mucosal repair-related genes would further validate the trained immunity mechanism. Alternatively, the authors could acknowledge this as a study limitation and future research direction.

      Comments on revisions:

      My concerns have been fully addressed. I have no additional comments.

    2. Reviewer #2 (Public review):

      This study investigates how BG-induced myeloid reprogramming influences inflammatory bowel disease in a mouse model of DSS-induced colitis. The authors use in vivo functional experiments, adoptive transfer, and scRNA-seq to assess whether innate immune reprogramming can confer protection in colitis.

      In the revised versions of the manuscript, the authors clarified the mechanistic scope of the study, softened the conclusions, and acknowledged the lack of direct epigenetic validation of trained immunity in this model. The manuscript now also better emphasizes the context-dependent nature of BG-induced reprogramming.

      While some aspects remain correlative and will require further investigation, the central findings are well supported.

      Overall, this work provides a meaningful contribution to the field, and I support its publication.

      Comments on revisions:

      No further comments.

    3. Reviewer #3 (Public review):

      Summary:

      In the present work, the authors offer evidence for the therapeutic potential of trained immunity in the context of inflammatory bowel disease (IBD). Prior research has demonstrated that innate cells pre-treated (trained) with β-glucan show an enhanced pro-inflammatory response upon a second challenge with the same or different stimulus. While an increased immune response can be beneficial and protect against bacterial infections, there is also the risk that it will worsen symptoms in various inflammatory disorders.

      Remarkably, the authors show that β-glucan training of bone marrow hematopoietic progenitors and peripheral monocytes mitigates the pro-inflammatory effects of colitis, with protection extending to naïve recipients of the trained cells. Additionally, the authors demonstrate that mice preconditioned with β-glucan have enhanced resistance to Staphylococcus aureus and Salmonella typhimurium infections, indicating heightened immune responses.<br /> Using a dextran sulfate sodium (DSS)-induced model of colitis, β-glucan pre-treatment significantly dampens disease severity. Importantly, the use of Rag1^-/- mice, which lack adaptive immune cells, confirms that the protective effects of β-glucan are mediated by innate immune mechanisms. Further, experiments using Ccr2^-/- mice underline the necessity of monocyte recruitment in mediating this protection, highlighting CCR2 as a key factor in the mobilization of β-glucan trained monocytes to inflamed tissues. In addition, β-glucan training highlights a distinct monocyte subpopulation with enhanced activation and phagocytic capacity. These monocytes, marked by increased expression of Cx3cr1, are suggestive of an increased ability to infiltrate inflamed colonic tissue and differentiate into macrophages.

      Transcriptomic profiling reveals that β-glucan training upregulates genes associated with pattern recognition, antimicrobial defense, immunomodulation, and interferon signaling pathways, suggesting broad functional reprogramming of the innate immune compartment. Moreover, among the trained monocyte and macrophage subsets, gene expression signatures are associated with tissue and mucosal repair, suggesting a role in promoting resolution and regeneration following inflammatory insult. Furthermore, this was coupled with analysis of chromatin accessibility in publicly available data.

      Strengths:

      By employing a range of well-characterized murine models, the authors investigate specific mechanisms involved in the effects of β-glucan training. Furthermore, the study provides functional evidence that the protection conferred by the trained cells persists within the hematopoietic progenitors and can be transferred to naïve recipients. The integration of transcriptomic profiling allows the identification of changes in key genes and molecular pathways underlying the trained immune phenotype.

      Weaknesses:

      Further studies would benefit from investigating the cytokine responses of intestinal macrophages, particularly CX3CR1⁺ macrophages, following ex vivo stimulation of previously BCG-trained cells. Moreover, assessing the metabolic state of these macrophages would provide valuable insight into the mechanisms underlying trained immunity in this context.

      Impact:

      Overall, the authors present a mechanistically insightful investigation that advances our understanding of trained immunity in IBD. This is an important study that demonstrates that β-glucan-trained innate cells can confer protection against colitis and promote mucosal repair through trained-immunity related mechanisms. These findings underscore the potential of harnessing innate immune memory as a therapeutic approach for chronic inflammatory diseases.

    1. Reviewer #1 (Public review):

      Summary:

      Noell et al have presented a careful study of the dissociation kinetics of Kinesin (1,2,3) classes of motors moving in-vitro on a microtubule. These motors move against the opposing force from a ~1 micron DNA strand (DNA tensiometer) that is tethered to the microtubule and also bound to the motor via specific linkages (Fig 1A). Authors compare the time for which motors remain attached to the microtubule when they are tethered to the DNA, versus when they are not. If the former is longer, the intepretation is that the force on the motor from the stretched DNA (presumed to be working solely along the length of the microtubule) causes the motor's detachment rate from the microtubule to be reduced. Thus, the specific motor exhibits "catch-bond" like behaviour.

      Strengths:

      The motivation is good - to understand how kinesin competes against dynein through the possible activation of a catch bond. Experiments are well done and there is an effort to model the results theoretically.

      Weaknesses:

      The motivation of these studies is to understand how kinesin (1/2/3) motors would behave when they are pitted in a tug of war against dynein motors as they transport cargo in bidirectional manner on microtubules. Earlier work on dynein and kinesin motors using optical tweezers has suggested that dynein shows catch bond phenomenon, whereas such signatures were not seen for kinesin. Based on their data with DNA tensiometer, the authors would like to claim that (i) Kinesin1 and kinesin2 also show catch-bonding and (ii) The earlier results using optical traps suffer from vertical forces, which complicates the catch-bond interpretation.

      Comments on revised version:

      I am not fully convinced about the responses from authors, so I would like to retain my original assessment of the paper. The same may be made available for public viewing, along with the responses of the authors. Readers can go through both and form their opinion.

    2. Reviewer #2 (Public review):

      Summary:

      To investigate the detachment and reattachment kinetics of kinesin-1, 2 and 3 motors against loads oriented parallel to the microtubule, the authors used a DNA tensiometer approach comprising a DNA entropic spring attached to the microtubule on one end and a motor on the other. They found that for kinesin-1 and kinesin-2 the dissociation rates at stall were smaller than the detachment rates during unloaded runs. With regard to the complex reattachment kinetics found in the experiments, the authors argue that these findings were consistent with a weakly-bound 'slip' state preceding motor dissociation from the microtubule. The behavior of kinesin-3 was different and (by the definition of the authors) only showed prolonged "detachment" rates when disregarding some of the slip events. The authors performed stochastic simulations which recapitulate the load-dependent detachment and reattachment kinetics for all three motors. They argue that the presented results provide insight into how kinesin-1, -2 and -3 families transport cargo in complex cellular geometries and compete against dynein during bidirectional transport.

      Strengths:

      The present study is timely, as significant concerns have been raised previously about studying motor kinetics in optical (single-bead) traps where significant vertical forces are present. Moreover, the obtained data are of high quality and the experimental procedures are clearly described.

      Comments on revision:

      The authors extensively entered into a scientific debate with the reviewers in their Response Letter. This led to a few changes and some (limited) new data in the manuscript. This is great and did improve the manuscript.

      However, in the view of this reviewer, (i) a significant number of responses fall short of actually addressing the concerns of the three reviewers (e.g. wrt using the same kinesin-1 neck-coil domains for all motors) and or (ii) a significant number of arguments now only occur in the response letter but not in the manuscript. The authors may check themselves critically for both. In principle, each longer discussion in the response letter warrants mentioning the appropriate facts and arguments in the main text of the manuscript.

    3. Reviewer #3 (Public review):

      Summary:

      Several recent findings indicate that forces perpendicular to the microtubule accelerate kinesin unbinding, where perpendicular and axial forces were analyzed using the geometry in a single-bead optical trapping assay (Khataee and Howard, 2019), comparison between single-bead and dumbbell assay measurements (Pyrpassopoulos et al., 2020), and comparison of single-bead optical trap measurements with and without a DNA tether (Hensley and Yildiz, 2025).

      Here, the authors devise an assay to exert forces along the microtubule axis by tethering kinesin to the microtubule via a dsDNA tether. They compared the behavior of kinesin-1, -2, and -3 when pulling against the DNA tether. In line with previous optical trapping measurements, kinesin unbinding is less sensitive forces when the forces are aligned with the microtubule axis. Surprisingly, the authors find that both kinesin-1 and -2 detach from the microtubule more slowly when stalled against the DNA tether than in unloaded conditions, indicating that these motors act as catch bonds in response to axial loads. Axial loads accelerate kinesin-3 detachment. However, kinesin-3 reattaches quickly to maintain forces. For all three kinesins, the authors observe weakly-attached states where the motor briefly slips along the microtubule before continuing a processive run.

      Strengths:

      These observations suggest that the conventional view that kinesins act as slip bonds under load, as concluded from single-bead optical trapping measurements where perpendicular loads are present due to the force being exerted on the centroid of a large (relative to the kinesin) bead, need to be reconsidered. Understanding the effect of force on the association kinetics of kinesin has important implications for intracellular transport, where the force-dependent detachment governs how kinesins interact with other kinesins and opposing dynein motors (Muller et al., 2008; Kunwar et al., 2011; Ohashi et al., 2018; Gicking et al., 2022) on vesicular cargoes.

      Weaknesses:

      The authors attribute the differences in the behaviour of kinesins when pulling against a DNA tether compared to an optical trap to the differences in the perpendicular forces. However, the compliance is also much different in these two experiments. The optical trap acts like a ~ linear spring with stiffness ~ 0.05 pN/nm. The dsDNA tether is an entropic spring, with negligible stiffness at low extensions and very high compliance once the tether is extended to its contour length (Fig. 1B). The effect of the compliance on the results is not fully considered in the manuscript.

      Compared to an optical trapping assay, the motors are also tethered closer to the microtubule in this geometry. In an optical trap assay, the bead could rotate when the kinesin is not bound. The authors should discuss how this tethering is expected to affect the kinesin reattachment and slipping. While likely outside the scope of this study, it would be interesting to compare the static tether used here with a dynamic tether like MAP7 or the CAP-GLY domain of p150glued.

      In the single-molecule extension traces (Fig. 1F-H; S3), the kinesin-2 traces often show jumps in position at the beginning of runs (e.g. the four runs from ~4-13 s in Fig. 1G). These jumps are not apparent in the kinesin-1 and -3 traces. What is the explanation? Is kinesin-2 binding accelerated by resisting loads more strongly than kinesin-1 and -3? In their response, the authors provide an explanation of the appearance of jumps due to limited imaging speeds. The authors state that the qualitative difference in the kinesin-2 traces compared to the kinesin-1 an -3 traces may be due to the specific rebinding kinetics of kinesin-2.

      When comparing the durations of unloaded and stall events (Fig. 2), there is a potential for bias in the measurement, where very long unloaded runs cannot be observed due to the limited length of the microtubule (Thompson, Hoeprich, and Berger, 2013), while the duration of tethered runs is only limited by photobleaching. Was the possible censoring of the results addressed in the analysis? The authors addressed this concern by applying a Markov model to estimate the duration parameter.

      The mathematical model is helpful in interpreting the data. To assess how the "slip" state contributes to the association kinetics, it would be helpful to compare the proposed model with a similar model with no slip state. Could the slips be explained by fast reattachments from the detached state? In their response, the authors addressed this question by explaining that a three-state model is required to model the recovery time distributions.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript by Xiong and colleagues presents a compelling validation of UniDesign, a fully computational protein design framework, by using it to engineer a novel, PAM-relaxed variant of Staphylococcus aureus Cas9 (SaCas9) named KRH. The core achievement is the successful de novo generation of a high-performance nuclease (E782K/N968R/R1015H) solely through in silico modeling, without any subsequent experimental optimization or directed evolution. The authors demonstrate that KRH expands the SaCas9 PAM specificity from NNGRRT to NNNRRT, achieving genome editing and base editing efficiencies across multiple human cell types that are comparable to, and sometimes exceed, the well-known evolution-derived KKH variant. The work positions UniDesign not merely as an analytical tool, but as a powerful engine for the generative design of complex molecular functions, offering a scalable and mechanistically insightful alternative to traditional experimental screening.

      Strengths:

      This is an outstanding manuscript that serves as a powerful proof-of-concept for the next generation of computational protein design. The primary selling point-the raw predictive and generative power of UniDesign-is convincingly demonstrated throughout.

      The manuscript shows that the tool can:<br /> (1) successfully navigate a complex sequence landscape to identify a minimal set of three mutations (KRH) that remodel a critical protein-DNA interface;<br /> (2) accurately model and balance the delicate interplay between specific base contacts and non-specific backbone interactions to achieve relaxed PAM specificity;<br /> (3) deliver a final product whose performance is indistinguishable from, and in some cases superior to, a variant that required extensive wet-lab evolution.

      The experimental validation is rigorous, thorough, and directly supports the computational predictions. This work will stand as a landmark study for the field, illustrating that computational design has matured to the point where it can reliably generate sophisticated tools for genome engineering.

      (1) Demonstration of Generative Power:

      The most significant finding is that UniDesign, without any experimental feedback, generated a variant (KRH) that matches the performance of the evolution-derived KKH. This is a remarkable achievement. The iterative design strategy-first reducing PAM bias (R1015H), then restoring binding through non-specific interactions (e.g., N968R, E782K)-is a textbook example of rational design, but it is executed entirely by the algorithm. This validates UniDesign's energy function and search algorithm as capable of capturing the subtle biophysical principles governing PAM recognition.

      (2) Mechanistic Insight as a Built-in Feature:

      A key advantage of UniDesign highlighted by this work is its inherent ability to provide mechanistic explanations. The computational models not only predicted which mutations would work (e.g., N968R over N968K in the KRH variant) but also why they work. The structural and energetic analyses showing the bidentate salt bridge formed by Arg968 versus the single bond formed by Lys968 (Figure 4A) is a perfect example of how the tool's output can rationalize functional differences, a level of insight that is rarely attainable from directed evolution campaigns alone.

      (3) Scalability and Accessibility for Engineering:

      The authors explicitly contrast UniDesign's efficiency (minutes to hours per design run) with the computational expense of methods like COMET and the experimental overhead of directed evolution. The improvements to UniDesign v1.2, specifically the mutation-count and sequence-uniqueness penalties, directly address a key challenge in computational design (generating diverse, low-energy point-mutant libraries). This positions the tool as a highly accessible and scalable platform for engineering other CRISPR systems, a point that will be of immense interest to the community.

      Weaknesses:

      (1) Title and Abstract Emphasis:

      The title and abstract are effective but could be slightly sharpened to emphasize the primary message. Consider a title like "Fully computational design of a PAM-relaxed SaCas9 variant with UniDesign demonstrates power to match directed evolution." The abstract could more explicitly state upfront that the design was achieved without any experimental iteration.

      (2) Figure 1, Panel M:

      The data points in panel M are currently presented at a font size that makes them difficult to read, particularly the labels for the many triple-mutant variants. This density obscures the clear identification of the top-performing designs, such as the KRH variant selected for experimental validation. I recommend that the authors increase the font size of all text elements within this panel, including axis labels, tick marks, and data point labels, to improve legibility. If necessary, the panel dimensions can be adjusted or the layout reorganized to accommodate the larger text without compromising clarity. Ensuring this figure is readable is important, as it visually communicates the energetic convergence that led to the selection of KRH.

      (3) Generality of the Design Strategy for Other PAM Positions:

      The design strategy focused on relaxing specificity at the highly constrained third position of the PAM (the guanine in NNGRRT). How transferable is this specific strategy (i.e., disrupting a key specific contact and compensating with non-specific backbone binders) to relaxing other positions in the PAM or to other Cas enzymes with different PAM-interaction architectures? A short discussion on this point would help readers understand the broader applicability of the "fine-tuning the balance" principle.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript describes the fully in silico design of a new variant of Staphylococcus aureus Cas9 (SaCas9) using an improved UniDesign workflow.

      The design strategy consists of three sequential steps:<br /> (1) reducing positional bias at PAM position 3;<br /> (2) restoring DNA binding through nonspecific interactions;<br /> (3) combining individually favorable substitutions.

      The overall pipeline is conceptually elegant and logically structured, and the genome-editing activity of the designed variants is comprehensively characterized. The resulting KRH variant exhibits relaxed PAM specificity, expanding the targeting range of SaCas9 across diverse cell types. Notably, the KRH variant demonstrates performance comparable to that of the evolution-derived KKH variant, underscoring the effectiveness of the proposed computational design framework.

      Strengths:

      The design pipeline is entirely computational and does not rely on experimental data for pretraining or iterative optimization.

      Weaknesses:

      The computationally generated KRH mutant differs from the experimentally evolved KKH variant by only a single residue, which may reflect insufficient exploration of the available sequence space.

    3. Reviewer #3 (Public review):

      Summary:

      This study reports KRH, a SaCas9 variant computationally engineered via UniDesign to recognize an expanded NNNRRT PAM with substantially enhanced editing efficiency at non-canonical sites. KRH achieves genome- and base-editing efficiencies comparable to or exceeding the evolution-derived KKH variant across multiple human cell types, demonstrating that computational design can effectively remodel PAM specificity while preserving nuclease activity.

      Strengths:

      The research follows a clear line of reasoning, and the results appear sound. The computational design strategy presented offers a valuable alternative to directed evolution, with potential applicability beyond Cas9 engineering.

      Weaknesses:

      The benchmarking of the UniDesign method is insufficient. How its performance compares to other protein design algorithms, whether the energy function parameters were systematically optimized, and if the design strategy can be generalized to other Cas9 orthologs or genome engineering tasks.

    1. Reviewer #1 (Public review):

      Summary:

      While the results show some loss in the eyelid meibomian glands, there is significant gland retention in HSD3b6 KO mice, as shown in Figure 2. This is supported by the lack of DEG patterns showing downregulation of Meibum lipid genes (AWAT2, Far2, Soat1, Plin2, SCD, etc.), and no decrease in Pparg expression, known to be critical for meibomian gland lipid gene expression.

      Weaknesses:

      It should be noted that while the authors indicate that CD38 is significantly up-regulated in the HSD3b6 KO mouse, the increase was not sufficient to show a significant adjusted P-value. Bulk RNA sequencing also shows no significant change in meibum lipid gene expression for aged mice that are treated with 78c, an inhibitor of CD38, which the authors indicate increases NAD levels, leading to increased meibomian gland size compared to vehicle-treated mice. Unfortunately, there was no increase in meibum lipid gene expression with 78c, as identified by adjusted P-value. However, it should be noted that the supplemental file covering DEG expression was labeled as a Microarray analysis. This did not include the 78c+NMN treated mice, which the authors contend show a more impactful effect on the meibomian gland.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors demonstrate strong correlations between a pro-inflammatory state, the activity of an intracrine hormone (3 beta-hydroxysteroid dehydrogenase, 3B-HSD), and the NAD co-factor. Specifically, in a 3B-HSD knockout mouse, there was an upregulation in pro-inflammatory cytokines and increased CD38+ cells (CD38 is an enzyme that depletes NAD, a necessary cofactor for 3B-HSD activity). Conversely, induction of inflammation in the eyelids resulted in reductions in 3B-HSD activity. Supplementation with 5 alpha-dihydrotestosterone (DHT) or the NAD precursor NMN, and inhibition of CD38 activity (78c), corrected the pathologies observed in both the 3B-HSD knockout mouse and the pro-inflammatory model (LPS injection into eyelids).

      Strengths:

      The experiments were performed with good rigor, assessing the impact of inflammation and 3B-HSD activity using multiple model systems. The endpoints represented a combination of transcriptional changes, protein quantification, enzymatic activity, and immunofluorescent microscopy. The authors use human tissue from both younger and older individuals to justify their hypotheses that increased CD38 + cells and reduced 3B-HSD quantity exist in older individuals. The data provide the foundation for assessing more global changes to the tear film and ocular surface.

      Weaknesses:

      The main weaknesses of the study include the following:

      (1) An absence of information on meibomian gland health, tear film, and ocular surface.

      (2) Too few human subjects to validate the hypotheses.

      Conclusion:

      Overall, this study demonstrates an important relationship that exists between intracrine signaling, inflammation, and cofactor signaling. It represents a novel approach in therapeutic design for patients with meibomian gland dysfunction.

    3. Reviewer #3 (Public review):

      Summary:

      The authors aimed to investigate whether disruption of intracrine steroid hormone metabolism contributes to meibomian gland dysfunction and proposed a "vicious cycle" of gland dysfunction and inflammation, using a global Had3b6 knockout mouse model. The work addresses an important aspect of MGD, but its impact may be limited unless the intracrine mechanism can be more clearly distinguished from systemic hormonal effects.

      Strengths:

      This study addressed an important question. The hormonal regulation of the meibomian gland has long been recognized. If clarified, the concept of local steroid metabolism influencing gland homeostasis could have implications for understanding disease mechanisms and identifying therapeutic targets.

      Weaknesses:

      The use of a global knockout makes it difficult to separate local intracrine effects from systemic hormonal changes, and key controls and hormone measurements are lacking.<br /> LPS-induced inflammation may not reflect the chronic nature of MGD.

    1. Reviewer #1 (Public review):

      It is widely accepted that the number of muscle stem cells (MuSCs) declines with aging, leading to diminished regenerative capacity. In this study, when MuSCs were labeled with YFP at a young age, the authors found that the YFP-positive MuSC population remained stable with aging. However, VCAM1 and Pax7 expression levels were reduced in the YFP-positive MuSCs. These VCAM1-negative/low cells exhibited limited proliferative potential and reduced regenerative ability upon transplantation into MuSC-depleted mice. Furthermore, Vcam1-/low MuSCs were highly sensitive to senolysis and represented the population in which Vcam1 expression could be restored by DHT. Finally, the authors identified CD200 and CD63 as markers capable of detecting the entire geriatric MuSC population, including Vcam1-/low cells. Although numerous studies have reported an age-related decline in MuSC numbers, this study challenges that consensus. Therefore, the conclusions require further careful validation.

      Major comments:

      (1) As mentioned above, numerous studies have reported that the number of MuSCs declines with aging. The authors' claim is valid, as Pax7 and Vcam1 were widely used for these observations. However, age-related differences have also been reported even when using these markers (Porpiglia et al., Cell Stem Cell 2022; Liu et al., Cell Rep 2013). When comparing geriatric Vcam1⁺ MuSCs with young MuSCs in this study, did the authors observe any of the previously reported differences? Furthermore, would increasing the sample size in Figure 1 reveal a statistically significant difference? The lack of significance appears to result from variation within the young group. In addition, this reviewer requests the presentation of data on MuSC frequency in geriatric control mice using CD200 and CD63 in the final figure.

      (2) Can the authors identify any unique characteristics of Pax7-VCAM-1 GER1-MuSCs using only the data generated in this study, without relying on public databases? For example, reduced expression of Vcam1 and Pax7. The results of such analyses should be presented.

      (3) In the senolysis experiment, the authors state that GER1-MuSCs were depleted. However, no data are provided to support this conclusion. Quantitative cell count data would directly address this concern. In addition, the FACS profile corresponding to Figure 4D should be included.

      (4) Figure S4: It remains unclear whether DHT enhances regenerative ability through restoration of the VCAM1 expression in GER1-MuSCs, as DHT also acts on non-MuSC populations. Analyses of the regenerative ability of Senolysis+DHT mice may help to clarify this issue.

      (5) Why are there so many myonuclear transcripts detected in the single-cell RNA-seq data? Was this dataset actually generated using single-nucleus RNA-seq? This reviewer considers it inappropriate to directly compare scRNA-seq and snRNA-seq results.

      Comments on revisions:

      Related to Comment#3: The percentage is also influenced by the number of other cell types. Therefore, to demonstrate cell removal, it is necessary to present the absolute number of cells. If the cells were removed and were not replenished from Vcam1+ cells, the absolute number of cells should be reduced.

      Related to Comment#4: Without the DHT+Senolysis experiment proposed by this reviewer or related experiments, there is no evidence demonstrating that GERI-MuSCs functionally rejuvenate. The current data only show that VCAM1 expression is restored.

      Related to Comment#8: Individual results from 3-4 biological replicates should be shown in Figure 4. It will help readers to recognize the variation of each sample.

    2. Reviewer #2 (Public review):

      Kim et al. investigate heterogeneity in aged muscle stem cells using a model that enables lifelong lineage tracing. The questions addressed in the paper are highly relevant to the fields of aging and stem cell biology, and the experimental approach overcomes some of the limitations of previous studies.

      The study provides evidence for phenotypic and functional heterogeneity within the lineage-traced aged MuSC pool. However, the data as presented do not yet support the broader conclusions that MuSC abundance is maintained with age or that a previously unrecognized aged MuSC subpopulation has been identified. These claims would require stronger age-matched cohorts, absolute cell counts normalized to tissue mass, and direct comparison to previously described aged muscle stem cell states.

      If the core observations were experimentally reinforced, this study could prompt the field to reassess muscle stem cell loss, heterogeneity, and age-associated changes in canonical marker expression in geriatric mice. However, because several of the central claims depend on analyses that are currently incomplete, the conceptual impact should be treated as provisional. The deposited bulk RNA-seq and scRNA-seq datasets should be useful for mapping these states to existing atlases and for re-analysis by groups interested in quiescent and senescent programs in geriatric muscle stem cells.

    3. Reviewer #3 (Public review):

      Summary:

      The manuscript by Kim et al. describes a MuSC subpopulation that loses VCam expression in geriatric muscle and shows reduced ability to contribute to muscle regeneration. They propose that this population underlies the reported decline of MuSCs in aged mice, suggesting that these cells remain present in geriatric muscle but are overlooked due to low or absent VCam expression. The identification of a subpopulation that changes with aging would be compelling and of interest to the field.

      Strengths:

      The authors employ a wide range of assays, from in vitro to in vivo systems, to characterize Vcam-low/negative cells from geriatric muscle. The loss of Vcam appears strong in geriatric mice. They further identify CD63 and CD200 as potential surface markers that remain stable with age, thereby enabling the isolation of MuSCs across different age groups.

      Weaknesses:

      Some issues remain before establishing whether this population represents a true functional subset or explains the reported decline in MuSC numbers in aged mice. Stronger fate assessment of Vcam-low/negative cells is needed to assess their propensity for cell death and whether this contributes to the conclusions. Comparisons include young, middle-aged, and geriatric mice, but not aged (~24 months) mice, which would help comparisons to previous reports of age-related MuSC decline. The suggestion that the Vcam-low/negative population reflects a senescence-like state remains unclear, as these cells display limited canonical senescence markers, exhibit reversible cell-cycle exit, and yet are reported to be sensitive to senolytic treatment. Validation of CD63 and CD200 as reliable age-independent MuSC markers requires further testing, specifically using the Pax7-YFP tracing model and co-labeling in geriatric mice. Finally, the grouping patterns in some analyses suggest that the Vcam-low/negative fraction may be present in only a subset of geriatric mice, raising the possibility that it reflects health status or pathology rather than a consistent aging-associated phenotype.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript by Xu et al. reported base-resolution mapping of RNA pseudouridylation in five bacterial species, utilizing recently developed BID-seq. They detected pseudouridine (Ψ) in bacterial rRNA, tRNA, and mRNA, and found growth phase-dependent Ψ changes in tRNA and mRNA. They then focused on mRNA and conducted comparative analysis of Ψ profiles across different bacterial species. Finally, they developed a deep learning model to predict Ψ sites based on RNA sequence and structure.

      Strengths:

      This is the first comprehensive Ψ map across multiple bacterial species, and systematically reveals Ψ profiles in rRNA, tRNA, and mRNA under exponential and stationary growth conditions. It provides a valuable resource for future functional studies of Ψ in bacteria.

      Weaknesses:

      Ψ is highly abundant on non-coding RNA such as rRNA and rRNA, while its level on mRNA is very low. The manuscript focuses primarily on Ψ on mRNA, which is prone to false positives. Many conclusions in the manuscript are speculative, based solely on the sequencing data, but not supported by additional experiments.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, Xu et al. present a transcriptome-wide, single-base resolution map of RNA pseudouridine modifications across evolutionarily diverse bacterial species using an adapted form of BID-Seq. By optimizing the method for bacterial RNA, the authors successfully mapped modifications in rRNA, tRNA, and, importantly, mRNA across both exponential and stationary growth phases. They uncover evolutionarily conserved Ψ motifs, dynamic Ψ regulation tied to bacterial growth state, and propose functional links between pseudouridylation and bacterial transcript stability, translation, and RNA-protein interactions. To extend these findings, they develop a deep learning model that predicts pseudouridine sites from local sequence and structural features.

      Strengths:

      The authors provide a valuable resource: a comprehensive Ψ atlas for bacterial systems, spanning hundreds of mRNAs and multiple species. The work addresses a gap in the field - our limited understanding of bacterial epitranscriptomics, by establishing both the method and datasets for exploring post-transcriptional modifications.

      Weaknesses:

      The main limitation of the study is that most functional claims (i.e. translation efficiency, mRNA stability, and RNA-binding protein interactions) are based on correlative evidence. While suggestive, these inferences would be significantly strengthened by targeted perturbation of specific Ψ synthases or direct biochemical validation of proposed RNA-protein interactions (e.g., with Hfq). Additionally, the GNN prediction model is a notable advance.

    3. Reviewer #3 (Public review):

      Summary:

      This study aimed to investigate pseudouridylation across various RNA species in multiple bacterial strains using an optimized BID-seq approach. It examined both conserved and divergent modification patterns, the potential functional roles of pseudouridylation, and its dynamic regulation across different growth conditions.

      Strengths:

      The authors optimized the BID-seq method and applied this important technique to bacterial systems, identifying multiple pseudouridylation sites across different species. They investigated the distribution of these modifications, associated sequence motifs, their dynamics across growth phases, and potential functional roles. These data are of great interest to researchers focused on understanding the significance of RNA modifications, particularly mRNA modifications, in bacteria.

    1. Reviewer #1 (Public review):

      Summary:

      This important study functionally profiled ligands targeting the LXR nuclear receptors using biochemical assays in order to classify ligands according to pharmacological functions. Overall, the evidence is solid, but nuances in the reconstituted biochemical assays and cellular studies and terminology of ligand pharmacology limit the potential impact of the study. This work will be of interest to scientists interested in nuclear receptor pharmacology.

      Strengths:

      (1) The authors rigorously tested their ligand set in CRTs for several nuclear receptors that could display ligand-dependent cross-talk with LXR cellular signaling and found that all compounds display LXR selectivity when used at ~1 µM.

      (2) The authors tested the ligand set for selectivity against two LXR isoforms (alpha and beta). Most compounds were found to be LXRbeta-specific.

      (3) The authors performed extensive LXR CRTs, performed correlation analysis to cellular transcription and gene expression, and classification profiling using heatmap analysis-seeking to use relatively easy-to-collect biochemical assays with purified ligand-binding domain (LBD) protein to explain the complex activity of full-length LXR-mediated transcription.

      Comments on revisions:

      The authors have addressed the comments from the prior round of review with care. I find the revised manuscript significantly strengthened.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript by Laham and co-workers, the authors profiled structurally diverse LXR ligands via a coregulator TR-FRET (CRT) assay for their ability to recruit coactivators and kick off corepressors, while identifying coregulator preference and LXR isoform selectivity.

      The relative ligand potencies measured via CRT for the two LXR isoforms were correlated with ABCA1 induction or lipogenic activation of SRE depending on cellular contexts (i.e, astrocytoma or hepatocarcinoma cells). While these correlations are interesting, there is some leg room to improve the quantitative presentation of these correlations. Finally, the CRT signatures were correlated with the structural stabilization of the LXR: coregulator complexes. In aggregate, this study curated a set of LXR ligands with disparate agonism signatures that may guide the design of future nonlipogenic LXR agonists with potential therapeutic applications for cardiovascular disease, Alzheimer's and type 2 diabetes, without inducing mechanisms that promote fat/lipid production.

      Strengths:

      This study has many strengths, from curating an excellent LXR compound set, to the thoughtful design of the CRT and cellular assays. The design of a multiplexed precision CRT (pCRT) assay that detects corepressor displacement as a function of ligand-induced coactivator recruitment is quite impressive as it allows measurement of ligand potencies to displace corepressors in the presence of coactivators, which cannot be achieved in a regular CRT assay that looks at coactivator recruitment and corepressor dissociation in separate experiments.

      Comments on revisions:

      These weaknesses have been satisfactorily addressed by the authors in the revised preprint.

    1. Reviewer #1 (Public review):

      Summary:

      This preprint from Shaowei Zhao and colleagues presents results that suggest tumorous germline stem cells (GSCs) in the Drosophila ovary mimic the ovarian stem cell niche and inhibit the differentiation of neighboring non-mutant GSC-like cells. The authors use FRT-mediated clonal analysis driven by a germline-specific gene (nos-Gal4, UASp-flp) to induce GSC-like cells mutant for bam or bam's co-factor bgcn. Bam-mutant or bgcn-mutant germ cells produce tumors in the stem cell compartment (the germarium) of the ovary (Fig. 1). These tumors contain non-mutant cells - termed SGC for single-germ cells. 75% of SGCs do not exhibit signs of differentiation (as assessed by bamP-GFP) (Fig. 2). The authors demonstrate that block in differentiation in SGC is a result of suppression of bam expression (Fig. 2). They present data suggesting that in 73% of SGCs BMP signaling is low (assessed by dad-lacZ) (Fig. 3) and proliferation is less in SGCs vs GSCs. They present genetic evidence that mutations in BMP pathway receptors and transcription factors suppress some of the non-autonomous effects exhibited by SGCs within bam-mutant tumors (Fig. 4). They show data that bam-mutant cells secrete Dpp, but this data is not compelling (see below) (Fig. 5). They provide genetic data that loss of BMP ligands (dpp and gbb) suppresses the appearance of SGCs in bam-mutant tumors (Fig. 6). Taken together, their data support a model in which bam-mutant GSC-like cells produce BMPs that act on non-mutant cells (i.e., SGCs) to prevent their differentiation, similar to what in seen in the ovarian stem cell niche. This preprint from Shaowei Zhao and colleagues presents results that suggest tumorous germline stem cells (GSCs) in the Drosophila ovary mimic the ovarian stem cell niche and inhibit the differentiation of neighboring non-mutant GSC-like cells. The authors use FRT-mediated clonal analysis driven by a germline-specific gene (nos-Gal4, UASp-flp) to induce GSC-like cells mutant for bam or bam's co-factor bgcn. Bam-mutant or bgcn-mutant germ cells produce tumors in the stem cell compartment (the germarium) of the ovary (Fig. 1). These tumors contain non-mutant cells - termed SGC for single-germ cells. 75% of SGCs do not exhibit signs of differentiation (as assessed by bamP-GFP) (Fig. 2). The authors demonstrate that block in differentiation in SGC is a result of suppression of bam expression (Fig. 2). They present data suggesting that in 73% of SGCs BMP signaling is low (assessed by dad-lacZ) (Fig. 3) and proliferation is less in SGCs vs GSCs. They present genetic evidence that mutations in BMP pathway receptors and transcription factors suppress some of the non-autonomous effects exhibited by SGCs within bam-mutant tumors (Fig. 4). They show data that bam-mutant cells secrete Dpp, but this data is not compelling (see below) (Fig. 5). They provide genetic data that loss of BMP ligands (dpp and gbb) suppresses the appearance of SGCs in bam-mutant tumors (Fig. 6). Taken together, their data support a model in which bam-mutant GSC-like cells produce BMPs that act on non-mutant cells (i.e., SGCs) to prevent their differentiation, similar to what in seen in the ovarian stem cell niche.

      Strengths:

      (1) Use of an excellent and established model for tumorous cells in a stem cell microenvironment

      (2) Powerful genetics allow them to test various factors in the tumorous vs non-tumorous cells

      (3) Appropriate use of quantification and statistics

      Weaknesses:

      (1) What is the frequency of SGCs in nos>flp; bam-mutant tumors? For example, are they seen in every germarium, or in some germaria, etc or in a few germaria.

      This concern was addressed in the rebuttal. The line number is 106, not line 103.

      (2) Does the breakdown in clonality vary when they induce hs-flp clones in adults as opposed to in larvae/pupae?

      This concern was addressed in the rebuttal. However, these statements are no on lines 331-335 but instead starting on line 339. Please be accurate about the line numbers cited in the rebuttal. They need to match the line numbers in the revised manuscript.

      (3) Approximately 20-25% of SGCs are bam+, dad-LacZ+. Firstly, how do the authors explain this? Secondly, of the 70-75% of SGCs that have no/low BMP signaling, the authors should perform additional characterization using markers that are expressed in GSCs (i.e., Sex lethal and nanos).

      The authors did not perform additional staining for GSC-enriched protein like Sex lethal and nanos.

      (4) All experiments except Fig. 1I (where a single germarium with no quantification) were performed with nos-Gal4, UASp-flp. Have the authors performed any of the phenotypic characterizations (i.e., figures other than figure 1) with hs-flp?

      In the rebuttal, the authors stated that they used nos>flp for all figures except for Fig. 1I. It would be more convincing for them to prove in Fig. 1 than there is not phenoytpic difference between the two methods and then switch to the nos>FLP method for the rest of the paper.

      (5) Does the number of SGCs change with the age of the female? The experiments were all performed in 14-day old adult females. What happens when they look at young female (like 2-day old). I assume that the nos>flp is working in larval and pupal stages and so the phenotype should be present in young females. Why did the authors choose this later age? For example, is the phenotype more robust in older females? or do you see more SGCs at later time points?

      The authors did not supply any data to prove that the clones were larger in 14-day-old flies than in younger flies. Additionally, the age of "younger" flies was not specified. Therefore, the authors did not satisfactorily answer my concern.

      (6) Can the authors distinguish one copy of GFP versus 2 copies of GFP in germ cells of the ovary? This is not possible in the Drosophila testis. I ask because this could impact on the clonal analyses diagrammed in Fig. 4A and 4G and in 6A and B. Additionally, in most of the figures, the GFP is saturated so it is not possible to discern one vs two copies of GFP.

      In the rebuttal, the authors stated that they cannot differential one vs two copies of GFP. They used other clone labeling methods in Fig. 4 and 6. I think that the authors should make a statement in the manuscript that they cannot distinguish one vs two copies of GFP for the record.

      (7) More evidence is needed to support the claim of elevated Dpp levels in bam or bgcn mutant tumors. The current results with dpp-lacZ enhancer trap in Fig 5A,B are not convincing. First, why is the dpp-lacZ so much brighter in the mosaic analysis (A) than in the no-clone analysis (B); it is expected that the level of dpp-lacZ in cap cells should be invariant between ovaries and yet LacZ is very faint in Fig. 5B. I think that if the settings in A matched those in B, the apparent expression of dpp-lacZ in the tumor would be much lower and likely not statistically significantly. Second, they should use RNA in situ hybridization with a sensitive technique like hybridization chain reactions (HCR) - an approach that has worked well in numerous Drosophila tissues including the ovary.

      The HCR FISH in Fig.5 of the revised manuscript needs an explanation for how the mRNA puncta were quantified. Currently, there is no information in the methods. What is meant but relative dpp levels. I think that the authors should report in and unbiased manner "number" of dpp or gbb puncta in TFs. For the germaria, I think that they should report the number of puncta of dpp or gbb divide by the total area in square pixels counted. Additionally, the background fluorescence is noticeably much higher in bamBG/delta86 germaria, which would (falsely) increase the relative intensity of dpp and gbb in bam mutants. Although, I commend the authors for performing HCR FISH, these data are still not convincing to me.

      (8) In Fig 6, the authors report results obtained with the bamBG allele. Do they obtain similar data with another bam allele (i.e., bamdelta86)?

      The authors did not try any experiments with the bamdelta86 allele, despite this allele being molecularly defined, where the bamBG allele is not defined.

    2. Reviewer #2 (Public review):

      In the current version, Zhang et al. have made substantial improvements to the manuscript. It is now easier to read, and the data are more solid compared with the previous version, supporting their conclusion that tumor GSCs secrete stemness factors (BMPs and Dpp) to suppress the differentiation of neighboring wild-type GSCs. This study should benefit a broad readership across developmental biology, germ cell biology, stem cell biology, and cancer biology.

      However, the following suggestions may further improve the clarity and rigor of the research content:

      (1) Clarification of sample size (n).<br /> Each germarium can contain highly variable numbers of SGCs, sometimes reaching 50-100. When reporting "n" values, the authors are encouraged to also indicate the number of germaria analyzed. For example, in lines 126-128:<br /> "Notably, 74% of SGCs (n = 132) were GFP-negative, while the remaining 26% were GFP-positive (Figure 2B, C). This suggests that SGCs can be categorized into two distinct groups: those resembling GSCs (GSC-like) and those resembling cystoblasts (cystoblast-like)."<br /> Please clarify how many germaria were examined to obtain n = 132. In addition, it is unclear whether the authors intend to suggest that the GFP-negative SGCs are GSC-like or cystoblast-like; this point should be clarified.

      (2) Improvement of Fig. 6 in situ hybridization images.<br /> The in situ hybridization images in Fig. 6 are not fully convincing. The control images, in particular, would benefit from higher resolution and enlarged views of the germarium region. In panel C, abundant signals are also present outside the germarium, which may complicate interpretation and should be clarified or controlled for.

      Alternatively, the authors could strengthen the in situ analysis by using bam mutants or bam dpp / bam gbb double mutants as controls to better define signal specificity.

    3. Reviewer #3 (Public review):

      Zhang et al. investigated how germline tumors influence the development of neighboring wild-type (WT) germline stem cells (GSC) in the Drosophila ovary. They report that germline tumors generated by differentiation-arrested mutations (bam and bgcn) inhibit the differentiation of neighboring WT GSCs by arresting them in an undifferentiated state, resulting from reduced expression of the differentiation-promoting factor Bam. They find that these tumor cells produce low levels of the niche-associated signaling molecules Dpp and Gbb, which suppress bam expression and consequently inhibit the differentiation of neighboring WT GSCs non-cell-autonomously. Based on these findings, the authors propose that germline tumors mimic the niche to suppress the differentiation of the neighboring wild-type germline stem cells.

      Strengths:

      The study uses a well-established in vivo model to address an important biological question concerning the interaction between germline tumor cells and wild-type (WT) germline stem cells in the Drosophila ovary. If the findings are substantiated, this study could provide valuable insights that are applicable to other stem cell systems.

      Weaknesses:

      The authors have addressed some of my concerns in the revised submission. However, the data presented do not allow the authors to distinguish whether the failed differentiation of WT stem cells/germline cells results from "arrested differentiation due to the loss of the differentiation niche" or from "direct inhibition by tumor-derived expression of niche-associated molecules Dpp and Gbb". The critical supporting data, HCR in situ results, are not sufficiently convincing.

    1. Reviewer #2 (Public review):

      Summary:

      This paper is an exciting follow-up to two recent publications in eLife: one from the same lab, reporting that slender forms can successfully infect tsetse flies (Schuster, S et al., 2021), and another independent study claiming the opposite (Ngoune, TMJ et al., 2025). Here, the authors address four criticisms raised against their original work: the influence of N-acetyl-glucosamine (NAG), the use of teneral and male flies, and whether slender forms bypass the stumpy stage before becoming procyclic forms.

      Strengths:

      We applaud the authors' efforts in undertaking these experiments and contributing to a better understanding of the T. brucei life cycle. The paper is well-written and the figures are clear.

      Comments on revisions:

      We thank the authors for the revised manuscript and for considering our comments.

      We outline below the 3 points that, in our opinion, remain to be clarified.

      (1) Effect of NAG on slender-form infections in tsetse flies<br /> The conclusion that "NAG has a negligible effect on slender infections in tsetse flies" based on Figure 1, cannot be fully supported in the absence of a positive control. A relevant positive control is well established in the literature, namely that NAG promotes Tsetse infection by stumpy forms. Without such a control, it is not possible to exclude technical issues (for example, an ineffective NAG treatment), which would yield results similar to those presented in Figure 1.

      (2) Infection of non-teneral flies<br /> Because the experiments shown in Figure 1 (teneral flies) and Figure 2 (non-teneral flies) were not conducted in parallel or under identical conditions, it is important that the figure legends clearly state the parasite numbers used in each case. Specifically, infections of teneral flies were performed with 200 parasites/mL (approximately 4 parasites per bloodmeal), whereas non-teneral infections used 1 × 10⁶ parasites/mL (approximately 20,000 parasites per bloodmeal?). At present, this information is scattered across the Methods and Supplementary Tables 1 and 2, making it difficult for readers to immediately appreciate that the parasite load differs by roughly 5,000-fold between these conditions.

      As previously shown by the authors (Schuster et al., 2021) and in the Rotureau laboratory (Tsagmo Ngoune et al.), and as generally expected, the initial parasite dose strongly influences infection outcomes in teneral flies. In this context, it would be informative to know whether the authors have attempted infections of non-teneral flies using lower parasite numbers (noting that Tsagmo Ngoune et al. used a maximum of 10,000 parasites) and what the infection rate was.<br /> Relatedly, the statement in line 370 appears to be an overgeneralization, as fly age was not directly tested under matched experimental conditions:

      Line 370 - "Here, we unambiguously show that, in the absence of immunosuppressive treatment, slender forms can establish infections in tsetse flies, irrespective of the fly's age or sex."

      (3) Transcriptomic analysis<br /> Supplementary Figure 8 lacks statistical analysis, which limits its interpretability. Two types of comparisons would be particularly helpful:<br /> (i) a comparison of PAD1/2 expression levels between slender and stumpy forms at 0 h; and<br /> (ii) for each gene, a comparison of the overall change in expression (from 0 to 72 h) between infections initiated with slender versus stumpy forms.<br /> In addition, the figure legend should clarify what "expression levels" refer to. TPM? Normalized counts?

      Finally, for the benefit of the field, eLife could encourage publishing a collaborative study in which the Engstler and Rotureau laboratories exchange parasite lines and culture protocols (including media with and without methylcellulose) and perform tsetse fly infections in parallel in their respective laboratories. Such an approach could help resolve the remaining discrepancies and provide a valuable reference for the community.

    1. Reviewer #1 (Public review):

      Summary:

      This study investigates the roles of the two tumor necrosis factor genes (tnfa and tnfb) in zebrafish during inflammatory responses. TNF is a central regulator of inflammation across vertebrates; however, while mammalian TNF signaling is well characterized, the functional divergence of duplicated TNF genes in teleosts remains less well understood. In this work, the authors generate novel zebrafish fluorescent reporter lines for tnfb and use them to perform comparative analyses of the spatial and temporal expression patterns of tnfa and tnfb during inflammation. They report that these paralogous genes are produced by distinct immune cell populations and exhibit different induction kinetics during inflammatory processes. Based on these observations, the authors propose that tnfa and tnfb may fulfill non-redundant roles in the zebrafish immune response.

      Strengths:

      The study addresses an important gap in understanding the functional divergence of TNF paralogs in teleosts. Given that gene duplication events are common in fish genomes, clarifying how duplicated cytokines partition their functions is valuable for both evolutionary immunology and zebrafish model research. The work makes effective use of the zebrafish model, which is particularly well suited for in vivo imaging of dynamic immune cell behaviors during inflammation. A key strength of the study is the integration of analyses of cell-type specificity, transcriptional regulation, and temporal expression dynamics. In particular, the live imaging experiments are compelling and provide clear visual evidence that tnfa and tnfb differ in both cellular sources and expression kinetics, which strengthens the claim that these paralogs may have diverged in their regulation and potentially their function. By distinguishing these aspects of the two cytokines, the study provides useful conceptual and methodological guidance for future investigations of inflammatory signaling in zebrafish.

      Weaknesses:

      (1) While the manuscript convincingly documents distinct expression patterns, the functional consequences of these differences remain unexplored. The conclusions regarding non-redundant roles would benefit from functional perturbation experiments. Relatedly, the authors propose that tnfa and tnfb may play different immunological roles, but the mechanistic basis underlying these differences is not addressed. For example, do the two cytokines engage different receptors or signaling pathways? Do they trigger distinct downstream transcriptional programs?

      (2) Some imaging-based observations appear largely qualitative. Additional quantitative analyses, such as statistical comparisons of expression levels across time points or cell populations, would strengthen the robustness of the conclusions. For instance, in Figure 4, the expression levels of tnfa and tnfb reporter transgenes in immune cells should be quantitatively compared between control and amputated conditions.

      (3) It would also be important to clarify whether the distinct maturation kinetics of the fluorescent reporters were taken into account when interpreting expression timing. Since GFP typically matures more rapidly than mCherry in vivo, the authors should comment on whether this difference could influence the apparent expression kinetics of tnfa versus tnfb.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, van Dijk et al analyse the expression of the largely ignored paralogue of TNF in zebrafish, tnfb. They generate reporter transgenic lines and show that the reporter expression is consistent with endogenous mRNA expression in zebrafish larvae. Unlike its better-known paralogue tnfa, tnfb is constitutively expressed in mantle cells of neuromasts, and in a few leukocytes. It is also inducible in macrophages and some neutrophils upon wounding or detection of microbes, with faster kinetics than tnfa or il1b.

      Strengths:

      Generation and convincing validation of new transgenic reporter lines for tnfb with either green or red fluorescent proteins. Superb imaging and careful analysis of these lines crossed to complementary reporter transgenics, backed with in situ hybridization and qRT-PCR analysis of FACS-sorted cells. Excellent methods section.

      Weaknesses:

      Lack of functional analysis; these lines are a potentially valuable tool, but so far provide no clue regarding the role of tnfb. Is it a pro-inflammatory cytokine acting in synergy with tnfa, or is it an antagonist? What are its receptor(s)? What signalling pathways and downstream genes does it induce? Addressing at least some of these questions should greatly increase the impact of the paper.

    1. Reviewer #1 (Public review):

      Summary:

      It has long been known that Drosophila embryonic ventral nerve cord neuroblasts incorporate both spatial and temporal transcription factor expression to generate 30 distinct neuroblasts and lineages per hemisegment. This manuscript aims to elucidate the mechanism by which this integration of spatial and temporal transcription factors occurs through "direct regulation" or "epigenetic regulation". Direct regulation is defined as both spatial and temporal factors binding to open chromatin and working together to dictate specific lineages. Epigenetic regulation is defined as a spatial factor priming the chromatin in a neuroblast-specific manner to allow for the integration of temporal factors to generate specific lineages. The authors conclude that there is a two-step model in which a spatial transcription factor code "primes" the chromatin in terms of accessibility and then recruits temporal factors to ensure lineage-specific enhancer activation.

      Strengths:

      The authors tested two models, "direct regulation" vs "epigenetic regulation" in a well-defined pool of neural stem cells during normal development.

      Weaknesses:

      The data in this study cannot clearly substantiate these two models.

      Overall, there are a number of issues that are inconsistent and not supportive of the model proposed in this manuscript. Firstly, there is no evidence of pioneer factor activity in any of the NB lineages described - i.e., any changes in chromatin accessibility being shown over time. The authors must show chromatin conformation changes during the window of spatial transcription factor expression in order to convince the readers of this phenomenon. Secondly, the phenotypic data do not align with the sequencing data - the story would be more cohesive if the sequencing data and phenotypic data were in the same NB subtypes. On one hand, we are shown that Gsb misexpression induces loss of chromatin accessibility in NB 7-4, however in the widespread loss model, we are not shown a phenotype in these NB7-4 - which suggest that the chromatin accessibility at these sites (sites that have already been distinguished as SoIs for that NB subtype) does not play an important role in distinguishing NB 7-4 identity. However, the authors report loss of NB3-5 identity but have no evidence as to how the chromatin has changed (or if it has at all) in that subtype, leaving the readers to wonder how the loss of identity occurred.

    2. Reviewer #2 (Public review):

      Summary:

      This article by Bhattacharya et al. investigates how neural stem cells (NSCs, NBs) in Drosophila integrate spatial and temporal cues to activate neuron-specific terminal selector (TS) genes. Prior to this work, it was understood that NSCs utilize spatial transcription factors (STFs) and temporal transcription factors (TTFs) to determine lineage identity and birth order, but the mechanisms of integration were not fully elucidated. The authors employed chromatin profiling techniques to analyze the binding of STFs and TTFs in two specific neuroblast lineages, NB5-6 and NB7-4. They found that Gsb (an STF) binds both accessible and less-accessible chromatin in NB5-6, while En (another STF) binds only to pre-accessible chromatin in NB7-4. The findings support an "STF code" where the combination of pioneer and non-pioneer spatial factors, along with temporal factors, triggers neuroblast-specific enhancer activation and determines lineage identity.

      Strengths:

      The experiments are well-executed, the interpretations are generally sound, and the figures are clear and elegant. However, some conclusions are drawn too broadly without essential functional data. Therefore, additional work is needed to more effectively convey the central message.

      Weaknesses:

      (1) Integration of TaDa and functional data on Gsb for the STF model

      The authors demonstrate that TaDa profiling maps Gsb binding across the genome and identifies candidate chromatin-priming sites in NB5-6. Gsb LOF/GOF experiments reveal effects on NB identity. Combining TaDa data with LOF and GOF analyses indicates that Gsb influences NB5-6 specification by binding to both open and relatively closed chromatin, helping maintain NB5-6 identity while limiting NB3-5 fate.

      However, the study does not establish a direct link between specific LOF/GOF phenotypes and particular genomic targets. For instance, analyzing Gsb occupancy at lineage-specific identity factors or terminal selector genes (such as Lbe, Ap, or Eya for NB5-6; and Ems, etc., for NB3-5) in wild-type and manipulated conditions (Gsb misexpression) would directly connect chromatin binding to the regulation of fate determinants. These investigations would strengthen the mechanistic connection between the correlative TaDa profiles and the observed identity changes, supporting the idea that Gsb functions as a context-dependent chromatin-priming factor within the STF code, rather than as a generic transcription factor.

      (2) Gsb misexpression reveals bidirectional chromatin remodelling

      Experiments with ectopic Gsb expression demonstrate bidirectional chromatin remodeling in NB7-4, showing decreases in accessibility at some binding sites and increases at others. While the authors show that Gsb can disrupt chromatin upon misexpression, interpreting its "pioneer-like" or chromatin-priming activity is complex due to several factors: the misexpression occurs in a non-native lineage, the direct versus indirect effects rely on whole-embryo Dam-Gsb peaks instead of NB7-4-specific binding, and heat-shock-induced chromatin changes are not fully accounted for. These issues make it challenging to definitively determine Gsb's role in chromatin priming.

      A complementary approach would be to perform Gsb knockdown/loss-of-function in its native NB5-6 lineage and profile chromatin accessibility (TaDa or CATaDa). This would allow a cleaner, more physiologically relevant assessment of Gsb's contribution to priming, SoI establishment, and Hb recruitment. Such an experiment would strengthen the causal link between Gsb occupancy and chromatin state and clarify whether Gsb truly acts as a context-dependent pioneer in vivo, rather than producing indirect effects due to ectopic misexpression.

      (3) En is not a pioneer factor

      The authors conclude that Engrailed (En) is not a pioneer factor, based on the observation that En binding correlates with accessible chromatin and that En is not enriched at NB5-6-specific SOIs. However, this conclusion is not sufficiently supported by the functional data.

      First, the absence of En binding at NB5-6-specific SOIs does not necessarily indicate an inability to engage closed chromatin. These regions were not selected for the presence of En consensus motifs, so their lack of occupancy may simply reflect the absence of En binding motifs rather than a lack of pioneering capacity. A systematic motif analysis at NB5-6-specific SOIs is needed to determine whether En binding sites are present but unoccupied.

      Second, the claim that En lacks pioneer activity relies solely on a single steady-state TaDa/DamID occupancy assay at one developmental stage. Because pioneer factor interactions can be transient, low-affinity, and stage-specific, such binding may not be detected by TaDa, which also depends on local GATC density and methylation kinetics and may yield false negatives. Given these technical limitations, the absence of En binding at less accessible regions does not definitively rule out a priming role.

      In the absence of direct functional assays (En LOF/GOF), the authors should explicitly acknowledge these technical and conceptual limitations and tone down the claim that "En lacks pioneer activity".

      (4) Clarity of STF-code Model and Central Message

      The manuscript begins by presenting two models, direct and epigenetic, but the central takeaway of the paper is not clear. Specifically, the nuanced roles of the spatial factors Gsb and En as chromatin-priming versus stabilizing/effector factors within an STF code, and the resulting division of labor, are not clearly illustrated. The distinction between Gsb as a chromatin-priming factor and En as a cofactor-dependent activator/stabilizer should be explicitly presented in a stepwise model for better clarity. The authors could strengthen this by providing a schematic with two sequential stages illustrating how neuroblast identity factors (STF code) change chromatin states to drive lineage-specific enhancer activation. The schematic can be shown from the neuroectoderm to individual NB lineages to make it more panoramic.

      (5) Identification of Priming Factors in NB7-4

      While the authors suggest that an unknown priming factor might be responsible for establishing sites of integration in NB7-4, they do not identify or explore potential candidates for this role. Further investigation into what factors might be involved in chromatin priming in NB7-4 could provide a more complete understanding of the mechanisms at play.

      (6) Functional Validation of STF Code Components

      The study proposes an STF code for each neuroblast lineage, but the specific components of these codes, beyond Gsb and En, are not fully explored. Identifying and validating additional factors that contribute to the STF code in each lineage could strengthen the conclusions.

    1. Reviewer #1 (Public review):

      Summary:

      This study presents an interesting approach for finding electrophysiological models that match experimental patch-clamp data. The authors develop a new method for deriving optimized current clamp protocols by training a neural network on synthetic data. This optimized current clamp is then used on both computational training data and on experimental data to predict current gating and conductance parameters that correctly reconstruct the electrical phenotype.

      Strengths:

      (1) The fitting of gating variables through an optimized patch clamp protocol is interesting.

      (2) The inclusion of experimental data is important, and the approach is shown to be effective in fitting them.

      Weaknesses:

      (1) Some clarity is necessary on the generation and selection of variable IPSC models. With such a large variation in so many parameters, I would expect some resulting parameters to generate non-realistic phenotypes, quiescent cells, etc. Are all 200,000 or 1,100,000 generated cells viable? Or are they selected somehow for realistic cell properties?

      (2) The error shown in Figure 4 between different population sizes is not completely explained in the text - there seems to be a minimal difference between a population of 1,000 and 10,000, followed by a very good fit at 200,000. Is there a particular threshold that needs to be crossed where the error drops off? Related, how was the 200,000 number chosen?

      (3) Related to the point above, the 1,100,000 population for fitting experimental data also needs a more complete explanation: how was this number chosen, and how does the error compare with the other population sizes shown in Figure 4?

      (4) Why are the optimized current clamp protocols different between panels A and B in Figure 5? Are they somehow informed by experimental data?

      (5) Figure 6D: Is the EAD risk in panel D specific to cell 1, 2, or the pooled variants of both?

      (6) How sensitive is the fitting to minor parameter variation? Further, if one were to pick, let's say, the next-best fitting value, would that fall close to the best one? Is the solution found unique, or are there multiple sets with good fits?

    2. Reviewer #2 (Public review):

      Summary:

      The authors present a computational framework for generating "cell-specific" digital twins of human iPSC-CMs from a single optimized voltage clamp recording. Using deep learning trained on > 1 million artificial cells, the authors demonstrate that the model can infer 52 biophysical parameters governing 6 major ionic currents, and the resulting digital twins can reproduce experimentally recorded action potentials.

      Strengths:

      The framework has clear potential for understanding cellular heterogeneity in iPSC-CMs, predicting individual drug responses, and reducing the experimental burden of multiple patch clamp protocols.

      Weaknesses:

      There are several concerns about the validation of the model and its clarity. First, the biological variability being modeled in this manuscript is not defined well. It is unclear whether the framework addresses cell-to-cell differences within a single differentiation batch, variability across iPSC lines, or donor-to-donor differences. This ambiguity makes it difficult to interpret what the "digital twin populations" actually represent biologically. Second, the main claim, "the digital twins enable drug testing and arrhythmia prediction that would be impractical experimentally", is not experimentally validated. For example, the E-4031 simulations predict EAD rates, but no direct experimental head-to-head comparison is provided to confirm that these predictions are accurate. Third, technical reproducibility and biological representativeness are not assessed. Single voltage clamp recordings are inherently noisy. Without knowing how much variability comes from the recording process (technical variation) vs true biological differences, it is difficult to judge whether observed "cell-specific" parameter differences are meaningful. In addition, the optimized protocol is claimed to be superior to conventional approaches, but again, no experimental comparison is shown.

      The authors should address these concerns, with particular emphasis on clarifying the biological context and providing direct experimental validation. Below are detailed specific points:

      (1) Ambiguous definition of iPSC-CM heterogeneity.

      The authors model "typical iPSC-CM heterogeneity" by varying 52 parameters +/- 40% around a baseline model (Figure 1), generating > 1 million synthetic cells. However, the manuscript does not clearly state what biological variability this model is intended to capture. Is this modeling within-line, cell-to-cell variability (e.g., cells from the same dish or differentiation batch that differ due to stochastic gene expression or maturation state)? Or is this modeling between-line or between-donor variability (e.g., genetic background differences, reprogramming efficiency)? This distinction is critical for interpretation. If the goal is to understand why different cells in the same dish behave differently, then training data should reflect that. If the goal is to compare patient lines or disease models, the framework needs validation across multiple donors or lines.

      For example, the experimental validation in Figure 5 uses a single iPSC line (iPS-6-9-9T.B), but how many differentiation batches or dishes were tested, or whether cells came from the same preparation are unclear. Another example is that the wide AP diversity in the training population (Figure 1A) is impressive, but there is no demonstration that real experimental cells actually fall within this assumption range of +/- 40%.

      From a biological perspective, iPSC-CMs are known to be highly heterogeneous within lines (maturation state, metabolic differences, epigenetic variation, spatial differences within the same dish, etc) and between lines (different donor/genetic background). Thus, please explicitly state whether the +/- 40% variation is intended to model within-line or between-line heterogeneity, and justify this choice with wet experiment data (or reference to experimental literature on iPSC-CM variability). Please clarify how many dishes, differentiation batches, and time points post-differentiation were used for experimental recordings (Figures 5-6). If the framework is intended to generalize across lines from different donors, please test the model on multiple independent iPSC lines (from different donors).

      (2) Biological representativeness of single-cell measurements.

      The framework generates digital twins from single voltage clamp recordings. The patch clamp recordings in iPSC-CMs are subject to substantial technical variability. The manuscript does not address a fundamental question: "How representative are the measurements from a single cell on the dish (or line)?" In other words, if I measure one cell from a dish of a million cells, does that cell's digital twin tell me something about the dish as a whole, or just about that one cell? The manuscript presents Cell 1 and Cell 2 (Figures 5-6) as distinct individuals, but it's unclear whether these differences reflect true biological heterogeneity or simply sampling variability. I think the authors should perform replicate recordings on multiple cells (e.g., > 10 cells) from the same dish (same differentiation batch) and quantify how much the inferred parameters vary, and then compare between lines.

      (3) No experimental validation of the main claim that in silico populations can replace wet experiments.

      The most exciting claim in the manuscript is that digital twins enable drug testing and arrhythmia prediction "at scale" without requiring hundreds of patch clamp experiments. Specifically, the authors show that in silico populations derived from two experimental cells (Figure 6C) predict dose-dependent EAD incidence for the IKr blocker E-4031 (Figure 6D), with ~3% of cells showing EADs at 50 nM.

      However, this prediction is not validated experimentally. If I actually patch 20-30 real iPSC-CMs and apply 50 nM E-4031, will ~3% of them show EADs, as the model predicts? Without this validation, I think the drug testing framework is purely hypothetical. The model may be internally consistent (e.g., Cell 1's twin behaves differently from Cell 2's twin), but there is no evidence that these in silico populations reflect real biological variability in drug response. Please provide experimental validation that justifies the prediction by digital twins.

      (4) Experimental validation and head-to-head comparison of optimized protocol.

      The authors claim that their deep learning-optimized voltage clamp protocol (Figure 3, Figure 4A) is superior to conventional approaches, but they have not validated this experimentally by doing a head-to-head comparison. The manuscript does not compare the optimized protocol to any published voltage clamp designs. If the optimized protocol is genuinely easier to implement and more informative than existing approaches, this would be a major practical advance. But without side-by-side comparison, it is impossible to judge whether the optimization made a real difference.

    3. Reviewer #3 (Public review):

      Summary:

      This work uses a convolutional neural network to optimize a voltage clamp protocol to identify features and parameters from human pluripotent stem cell-derived cardiomyocytes.

      Yang et al. introduce an innovative experimental framework that integrates computational modeling and deep learning to generate a digital twin of human pluripotent stem cell-derived cardiomyocytes (hPSC-CMs).

      Strengths:

      The major strength is the methodology used to bridge in silico prediction of cell behavior and mechanistic insights from the experimental dataset.

      The approach used in this study represents a significant step toward precision medicine by enabling in silico prediction of cellular behavior and mechanistic insight from experimental datasets. The study addresses an important and timely challenge in stem cell-based and personalized medicine, and the authors compellingly leverage state-of-the-art methods alongside strong expertise in computational modeling and cardiac electrophysiology

      Weaknesses:

      While the overall approach is highly compelling and the potential impact is substantial, there are two areas where clarification and refinement, particularly in the phrasing and framing used throughout the manuscript, would further strengthen the work.

      (1) While the overall goal of the study is compelling, the manuscript would benefit from clearer articulation of how the proposed framework is intended to be used in practice. In particular, it is not entirely clear whether the authors envision this approach as:

      a) a method to extract population-level trends that, when paired with biological data, enhance statistical power and interpretability, or

      b) a strategy capable of constructing a population-based model from limited single-cell recordings. If the latter is intended, additional guidance on the number of action potentials required per cell and the assumptions underlying this extrapolation would greatly clarify the scope and applicability of the method.

      (2) The manuscript would also benefit from a clearer explanation of how electrophysiological heterogeneity observed in hPSC-CMs is linked to inter-patient variability. Although the authors state that this framework can be generalized to compare patient-specific hiPSC-CM lines, it remains unclear how this generalization is achieved, given the substantial sources of variability intrinsic to hiPSC-CMs (e.g., batch effects, reprogramming strategy, differentiation protocol, and maturation state). As acknowledged by the authors, addressing this level of variability likely requires large datasets; further clarification of how the proposed approach mitigates or accommodates these challenges would strengthen the translational claims.

      Below are my suggestions that could help strengthen the claims in the manuscript:

      (1) Adding a dedicated section describing the electrophysiological phenotype of the hPSC-CMs used in this study would help justify the choice of the underlying ionic model and the selection of the six ion currents analyzed. These currents are not only developmentally regulated but may also vary substantially across different hPSC-CM lines, which has implications for generalizability.

      (2) If feasible, inclusion of patch-clamp data from an additional hPSC-CM line would significantly strengthen the claim that this framework can harmonize and generalize across datasets and cell sources.

      (3) The authors note that the experimental cells exhibited high variability in action potential morphology. This is an important observation that directly supports the motivation for the study and should be explicitly presented, even if only in the supplementary materials.

      (4) In the hERG-blocker experiments, further clarification is needed regarding the biological relevance of the reported 3% incidence of early afterdepolarizations (EADs). Additionally, an interrupted sentence in this section makes it unclear whether the goal is to demonstrate that the digital twin can capture rare arrhythmic risk events or whether the digital twin is necessary to determine whether this level of risk is clinically meaningful.

      (5) The manuscript states that some action potentials were excluded from the experimental dataset. A brief explanation of the exclusion criteria, along with guidance on how to distinguish high-quality from low-quality recordings, would improve transparency and reproducibility.

    1. Reviewer #1 (Public review):

      Summary:

      Here, Mattenburger et al use structural biology, biochemistry, and genetics to analyze the membrane-attacking end (spike/spike tip) of the contractile injection systems of two DNA phages (P2 and T4). Understanding how a phage tail mediates host recognition and injects DNA into the host is an important question. This manuscript is divided into two stories. First is a biochemical fractionation showing that the fused spike-spike tip protein of P2 (GpV) is translocated into the host periplasm. Second is a somewhat separate story about the spike tip protein of T4 (gp5.4), which is structurally characterized and shown to aid in infection of E. coli with truncated lipopolysaccharides (LPS). I find the suggestion that gp5.4 aids in penetration of the bacterial envelope the most compelling portion of the manuscript, but I find this conclusion to be insufficiently supported, and the presentation could be described as awkward. Further, while the experiments are generally elegant, I believe additional experiments and a discussion to fully connect the two stories of the manuscript would increase impact.

      Strengths:

      The manuscript is methodologically careful and adds nuance to our understanding of P2 and T4 spike function. The T4 gp5.4 structure is extensively characterized, with crystallography and cryo-EM support. Many experiments are elegant and clever, specifically the P2 periplasmic fractionation and the ex vivo gp5.4 phage reconstitution. If completely supported and explained, the finding that gp5.4 aids in penetration of the bacterial envelope rather than adsorption is compelling.

      Weaknesses:

      The novelty of the work is somewhat incremental, as phage injection is known to occur into the periplasm and gp5.4 is known to be part of the spike tip (Taylor et al, 2016). The finding that gp5.4 promotes penetration and DNA delivery in strains with truncated LPS is incompletely supported. The gp5.4am phage plaquing data are incompletely explained, and may generate a more modest effect for gp5.4 than is claimed. The P2 results, although well-performed, do not directly support the T4 experiments given the evolutionary divergence between these two phages. Lastly, the overall organization of the manuscript and writing is lacking as (1) the P2 results are presented within the T4 data, (2) many figures are presented out of order, and (3) there is no discussion to contextualize the results for the reader.