10,000 Matching Annotations
  1. Jan 2026
    1. Note de synthèse : La Prosocialité Humaine et les Mécanismes de Coopération

      Cette note de synthèse explore les thèmes principaux et les idées clés issues des extraits de la conférence "L'expérience sociale la plus intéressante de ces dernières décennies".

      Elle se concentre sur la nature de la coopération humaine, ses déclencheurs, ses freins et les mécanismes sociaux développés pour la maintenir.

      1. La Nature Intrinsèque de la Prosocialité Humaine

      Le discours débute par la description du "jeu du bien public", une expérience courante en économie expérimentale qui révèle des insights fondamentaux sur le comportement humain.

      Dans ce jeu, les participants reçoivent une somme d'argent (par exemple, 20 €) et peuvent miser une partie de cette somme dans un pot commun qui sera ensuite doublé et redistribué équitablement.

      • Coopération spontanée et initiale : Contrairement à l'hypothèse de l'Homo Economicus purement égoïste et rationnel, les humains, même entre inconnus et sous anonymat, tendent à coopérer spontanément au premier tour.

      "En général la moitié des gens participent enfin les gens participent spontanément même entre inconnus même quand il y a des avec de l'anonymat même s'ils sont entre personnes qui qui n'ont jamais vu au premier tour ils vont quand même participer à hauteur de la moitié de de ce qu'ils ont". Cette tendance est observée "partout dans le monde".

      • L'Homo Economicus comme modèle de laboratoire :

      Le modèle de l'humain rationnel et égoïste est qualifié d'"animal de laboratoire", un "modèle théorique qui aurait dû juste rester au laboratoire". L'être humain est "beaucoup plus prosocial que ce que dit le modèle".

      • L'intuition au service de la coopération : Une expérience de Harvard montre que lorsque les participants sont contraints de répondre rapidement et intuitivement ("dépêchez-vous de répondre réfléchissez pas vous avez 2 secondes pour répondre et pour miser ou pas"), ils misent davantage dans le pot commun.

      À l'inverse, lorsque le mode "rationnel" est activé ("prenez le temps réfléchissez répondez pas trop vite"), la participation diminue.

      "Plus on réfléchit plus on est dans le mental plus on se méfie moins on participe".

      • Stress et prosocialité : Le stress peut également augmenter la coopération.

      Les participants à qui on annonçait une prise de parole en public stressante par la suite "ont plus misé dans le mot peau commun que quand que si on que à ceux qu'on avait dit qu'ils allaient pas parler en public".

      • L'empathie comme fondement : Cette prosocialité est "très ancré en nous" et découle de notre capacité à l'empathie.

      L'existence de "neurones de miroir" permet de "vivre ce que l'autre sent", et cette capacité n'est pas limitée aux humains, s'étendant à d'autres espèces et même à des "bouts de bois" ou de simples symboles visuels.

      • Altruisme précoce chez les bébés : Des études sur les enfants et les bébés montrent que "les capacités prosociales d'empathie et d'altruisme se retrouvent chez les bébés jusqu'à 6 mois même 5 mois".

      Avant même le langage et le raisonnement, les bébés peuvent distinguer les coopérateurs des non-coopérateurs et chercher à aider, et même une récompense peut "démotiver à aider", soulignant une nature intrinsèquement altruiste.

      2. L'Érosion de la Coopération et les Mécanismes de Stabilisation

      Malgré cette tendance initiale à la coopération, le jeu du bien public montre que "au fil des bah des tours (...) l'entraide s'effrite et puis la la confiance s'effrite et puis finalement on se retire bien commun".

      C'est le défi : "comment on fait pour ne pas que ça s'effrite avec le temps".

      Les cultures ont développé des "systèmes des mécanismes sociaux pour stabiliser l'entraide et pour stimuler l'entraide".

      • La réciprocité renforcée : C'est le mécanisme le plus courant et le plus efficace.

      • Récompenser les altruistes : Encourager et reconnaître ceux qui contribuent positivement.

      • Punir les tricheurs et les égoïstes : L'introduction de cette règle dans l'expérience du bien public a eu des "effets miraculeux", faisant exploser et stabiliser les niveaux de prosocialité.

      Les humains sont prêts à dépenser de l'argent ("punition altruiste") pour punir les non-coopérateurs, "ça va même jusqu'à une une grande proportion du salaire mensuel c'est une passion".

      • Plaisir neuronal associé : Coopérer, voir autrui coopérer, ou même anticiper un acte d'altruisme, active le circuit de la récompense dans le cerveau, procurant un "vrai plaisir", même chez les enfants.

      Inversement, le "circuit de dégoût" est activé par la punition d'un altruiste ou la récompense d'un égoïste/tricheur (ex: "quand au hasard quelqu'un du gouvernement est mise en examen pour corruption et est relâché").

      Cela montre l'importance de la justice perçue pour la coopération (ex: plaisir à payer des impôts si l'argent est bien dépensé).

      La réciprocité indirecte et la réputation :

      • Ce mécanisme implique que l'aide donnée à une personne peut inciter une tierce personne à aider le donneur initial, ou qu'un acte altruiste est observé par des témoins, ce qui étend l'entraide dans le groupe.

      • Les "ragots et les Cancans" comme moteur : Ces interactions sociales informelles sont cruciales car elles "créent la réputation". Avoir une "bonne réputation" est un "capital social" précieux qui renforce la coopération.

      • L'expérience de la réputation : Une expérience a montré que lorsque le jeu du bien public est alterné avec un jeu de réputation, les niveaux de coopération restent élevés.

      Cependant, si les participants apprennent que la fin du jeu est proche et que la réputation n'aura plus d'importance, la coopération s'effondre ("ils se sont mis à en profiter à mort ils en avaient plus rien à foutre la réputation n'était pas en plus en jeu").

      • Le sentiment d'être observé : Se sentir observé ("Big Brother", les religions avec un Dieu omniscient) augmente significativement la coopération. Même de simples points évoquant un visage sur un mur peuvent avoir cet effet inconscient.

      3. Les Fondements Profonds de la Relation Humaine et l'Élargissement du Cercle d'Empathie

      La distinction entre interagir avec un humain ou un ordinateur est fondamentale : la coopération avec un ordinateur n'active pas le circuit de la récompense, indiquant que "c'est quelque chose de profondément humain".

      • La relation "Je et Tu" : Le philosophe Martin Buber est cité avec son concept de "Je et Tu" par opposition à "Je et ça".

      La relation "Je et Tu" implique une reconnaissance mutuelle de l'autre comme sujet doté d'empathie, créant une "relation de miroir" infinie.

      • La déshumanisation : L'horreur survient lorsque l'on "sort quelqu'un de notre champ d'empathie", transformant une relation "Je et Tu" en "Je et ça", et déshumanisant l'autre.

      "C'est ce qui s'est passé pour les juifs pendant la guerre au Rwanda avec les les utou et les tutti et probablement en Ukraine dans toutes les guerres on on peut arriver basculer dans l'horreur lorsqu'on sort les humains de notre champ d'empathie ça peut arriver très vite".

      • Élargir le cercle d'empathie : Le défi contemporain est d'élargir ce cercle d'empathie au-delà des seuls humains (souvent limité aux animaux domestiques), pour inclure les animaux et les plantes.

      Considérer le monde non pas comme "entouré d'objets mais entouré de sujets" permettrait de "retrouver des relations de réciprocité et donc de prosocialité et donc tous les circuits vont s'enclencher et ça va faire un un monde totalement différent".

      En conclusion, la prosocialité est une caractéristique fondamentale et spontanée de l'être humain, ancrée dans l'empathie et activée par l'intuition.

      Bien qu'elle puisse s'effriter avec le temps, des mécanismes sociaux tels que la récompense des altruistes, la punition des tricheurs et l'importance de la réputation sont essentiels pour stabiliser et renforcer la coopération.

      Le maintien et l'élargissement de notre "cercle d'empathie" sont cruciaux pour prévenir la déshumanisation et construire un monde plus coopératif et juste.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Weaknesses:

      The technical approach is strong and the conceptual framing is compelling, but several aspects of the evidence remain incomplete. In particular, it is unclear whether the reported changes in connectivity truly capture causal influences, as the rank metrics remain correlational and show discrepancies with the manipulation results.

      We agree that our functional connectivity ranking analyses cannot establish causal influences. As discussed in the manuscript, besides learning-related activity changes, the functional connectivity may also be influenced by neuromodulatory systems and internal state fluctuations. In addition, the spatial scope of our recordings is still limited compared to the full network implicated in visual discrimination learning, which may bias the ranking estimates. In future, we aim to achieve broader region coverage and integrate multiple complementary analyses to address the causal contribution of each region.

      The absolute response onset latencies also appear slow for sensory-guided behavior in mice, and it is not clear whether this reflects the method used to define onset timing or factors such as task structure or internal state.

      We believe this may be primarily due to our conservative definition of onset timing. Specifically, we required the firing rate to exceed baseline (t-test, p < 0.05) for at least 3 consecutive 25-ms time windows. This might lead to later estimates than other studies, such as using the latency to the first spike after visual stimulus onset (Siegle et al., 2021) or the time to half-max response (Goldbach, Akitake, Leedy, & Histed, 2021).

      The estimation of response onset latency in our study may also be affected by potential internal state fluctuations of the mice. We used the time before visual stimulus onset as baseline firing, since firing rates in this period could be affected by trial history, we acknowledge this may increase the variability of the baseline, thus increase the difficulty to statistically detect the onset of response.

      Still, we believe these concerns do not affect the observation of the formation of compressed activity sequence in CR trials during learning.

      Furthermore, the small number of animals, combined with extensive repeated measures, raises questions about statistical independence and how multiple comparisons were controlled.

      We agree that a larger sample size would strengthen the robustness of the findings. However, as noted above, the current dataset has inherent limitations in both the number of recorded regions and the behavioral paradigm. Given the considerable effort required to achieve sufficient unit yields across all targeted regions, we wish to adjust the set of recorded regions, improve behavioral task design, and implement better analyses in future studies. This will allow us to both increase the number of animals and extract more precise insights into mesoscale dynamics during learning.

      The optogenetic experiments, while intended to test the functional relevance of rank increasing regions, leave it unclear how effectively the targeted circuits were silenced. Without direct evidence of reliable local inhibition, the behavioral effects or lack thereof are difficult to interpret.

      We appreciate this important point. Due to the design of the flexible electrodes and the implantation procedure, bilateral co-implantation of both electrodes and optical fibers was challenging, which prevented us from directly validating the inhibition effect in the same animals used for behavior. In hindsight, we could have conducted parallel validations using conventional electrodes, and we will incorporate such controls in future work to provide direct evidence of manipulation efficacy.

      Details on spike sorting are limited.

      We have provided more details on spike sorting in method section, including the exact parameters used in the automated sorting algorithm and the subsequent manual curation criteria.

      Reviewer #2 (Public review):

      Weaknesses:

      I had several major concerns:

      (1) The number of mice was small for the ephys recordings. Although the authors start with 7 mice in Figure 1, they then reduce to 5 in panel F. And in their main analysis, they minimize their analysis to 6/7 sessions from 3 mice only. I couldn't find a rationale for this reduction, but in the methods they do mention that 2 mice were used for fruitless training, which I found no mention in the results. Moreover, in the early case, all of the analysis is from 118 CR trials taken from 3 mice. In general, this is a rather low number of mice and trial numbers. I think it is quite essential to add more mice.

      We apologize for the confusion. As described in the Methods section, 7 mice (Figure 1B) were used for behavioral training without electrode array or optical fiber implants to establish learning curves, and an additional 5 mice underwent electrophysiological recordings (3 for visual-based decision-making learning and 2 for fruitless learning).

      As we noted in our response to Reviewer #1, the current dataset has inherent limitations in both the number of recorded regions and the behavioral paradigm. Given the considerable effort required to achieve high-quality unit yields across all targeted regions, we wish to adjust the set of recorded regions, improve behavioral task design, and implement better analyses in future studies. These improvements will enable us to collect data from a larger sample size and extract more precise insights into mesoscale dynamics during learning.

      (2) Movement analysis was not sufficient. Mice learning a go/no-go task establish a movement strategy that is developed throughout learning and is also biased towards Hit trials. There is an analysis of movement in Figure S4, but this is rather superficial. I was not even sure that the 3 mice in Figure S4 are the same 3 mice in the main figure. There should be also an analysis of movement as a function of time to see differences. Also for Hits and FAs. I give some more details below. In general, most of the results can be explained by the fact that as mice gain expertise, they move more (also in CR during specific times) which leads to more activation in frontal cortex and more coordination with visual areas. More needs to be done in terms of analysis, or at least a mention of this in the text.

      Due to the limitation in the experimental design and implementation, movement tracking was not performed during the electrophysiological recordings, and the 3 mice shown in Figure S4 (now S5) were from a separate group. We have carefully examined the temporal profiles of mouse movements and found it did not fully match the rank dynamics for all regions, and we have added these results and related discussion in the revised manuscript. However, we acknowledge the observed motion energy pattern could explain some of the functional connection dynamics, such as the decrease in face and pupil motion energy could explain the reduction in ranks for striatum.

      Without synchronized movement recordings in the main dataset, we cannot fully disentangle movement-related neural activity from task-related signals. We have made this limitation explicit in the revised manuscript and discuss it as a potential confound, along with possible approaches to address it in future work.

      (3) Most of the figures are over-detailed, and it is hard to understand the take-home message. Although the text is written succinctly and rather short, the figures are mostly overwhelming, especially Figures 4-7. For example, Figure 4 presents 24 brain plots! For rank input and output rank during early and late stim and response periods, for early and expert and their difference. All in the same colormap. No significance shown at all. The Δrank maps for all cases look essentially identical across conditions. The division into early and late time periods is not properly justified. But the main take home message is positive Δrank in OFC, V2M, V1 and negative Δrank in ThalMD and Str. In my opinion, one trio map is enough, and the rest could be bumped to the Supplementary section, if at all. In general, the figure in several cases do not convey the main take home messages. See more details below.

      We thank the reviewer for this valuable critique. The statistical significance corresponding to the brain plots (Figure 4 and Figure 5) was presented in Figure S3 and S5 (now Figure S5 and S7 in the revised manuscript), but we agree that the figure can be simplified to focus on the key results.

      In the revised manuscript, we have condensed these figures to focus on the most important comparisons to make the visual presentation more concise and the take-home message clearer.

      (4) The analysis is sometimes not intuitive enough. For example, the rank analysis of input and output rank seemed a bit over complex. Figure 3 was hard to follow (although a lot of effort was made by the authors to make it clearer). Was there any difference between the output and input analysis? Also, the time period seems redundant sometimes. Also, there are other network analysis that can be done which are a bit more intuitive. The use of rank within the 10 areas was not the most intuitive. Even a dimensionality reduction along with clustering can be used as an alternative. In my opinion, I don't think the authors should completely redo their analysis, but maybe mention the fact that other analyses exist

      We appreciate the reviewer’s comment. In brief, the input- and output-rank analyses yielded largely similar patterns across regions in CR trials, although some differences were observed in certain areas (e.g., striatum) in Hit trials, where the magnitude of rank change was not identical between input and output measures. We have condensed the figures to only show averaged rank results, and the colormap was updated to better covey the message.

      We did explore dimensionality reduction applied to the ranking data. However, the results were not intuitive as well and required additional interpretation, which did not bring more insights. Still, we acknowledge that other analysis approaches might provide complementary insights.

      Reviewer #3 (Public review):

      Weaknesses:

      The weakness is also related to the strength provided by the method. It is demonstrated in the original method that this approach in principle can track individual units for four months (Luan et al, 2017). The authors have not showed chronically tracked neurons across learning. Without demonstrating that and taking advantage of analyzing chronically tracked neurons, this approach is not different from acute recording across multiple days during learning. Many studies have achieved acute recording across learning using similar tasks. These studies have recorded units from a few brain areas or even across brain-wide areas.

      We appreciate the reviewer’s important point. We did attempt to track the same neurons across learning in this project. However, due to the limited number of electrodes implanted in each brain region, the number of chronically tracked neurons in each region was insufficient to support statistically robust analyses. Concentrating probes in fewer regions would allow us to obtain enough units tracked across learning in future studies to fully exploit the advantages of this method.

      Another weakness is that major results are based on analyses of functional connectivity that is calculated using the cross-correlation score of spiking activity (TSPE algorithm). Functional connection strengthen across areas is then ranked 1-10 based on relative strength. Without ground truth data, it is hard to judge the underlying caveats. I'd strongly advise the authors to use complementary methods to verify the functional connectivity and to evaluate the mesoscale change in subnetworks. Perhaps the authors can use one key information of anatomy, i.e. the cortex projects to the striatum, while the striatum does not directly affect other brain structures recorded in this manuscript

      We agree that the functional connectivity measured in this study relies on statistical correlations rather than direct anatomical connections. We plan to test the functional connection data with shorter cross-correlation delay criteria to see whether the results are consistent with anatomical connections and whether the original findings still hold.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      (1) The small number of mice, each contributing many sessions, complicates the  interpretation of the data. It is unclear how statistical analyses accounted for the small  sample size, repeated measures, and non-independence across sessions, or whether  multiple comparisons were adequately controlled.

      We realized the limitation from the small number of animal subjects, yet the difficulty to achieve sufficient unit yields across all regions in the same animal restricted our sample size. Though we agree that a larger sample size would strengthen the robustness of the findings, however, as noted below the current dataset has inherent limitations in both the scope of recorded regions and the behavioral paradigm.

      Given the considerable effort required to achieve sufficient unit yields across all targeted regions, we wish to adjust the set of recorded regions, improve behavioral task design, and implement better analyses in future studies. This will allow us to both increase the number of animals and extract more precise insights into mesoscale dynamics during learning.

      (2) The ranking approach, although intuitive for visualizing relative changes in  connectivity, is fundamentally descriptive and does not reflect the magnitude or  reliability of the connections. Converting raw measures into ordinal ranks may obscure  meaningful differences in strength and can inflate apparent effects when the underlying  signal is weak.

      We agree with this important point. As stated in the manuscript, our motivation in taking the ranking approach was that the differences in firing rates might bias cross-correlation between spike trains, making raw accounts of significant neuron pairs difficult to compare across conditions, but we acknowledge the ranking measures might obscure meaningful differences or inflate weak effects in the data.

      We added the limitations of ranking approach in the discussion section and emphasized the necessity in future studies for better analysis approaches that could provide more accurate assessment of functional connection dynamics without bias from firing rates.

      (3) The absolute response onset latencies also appear quite slow for sensory-guided  behavior in mice, and it remains unclear whether this reflects the method used to  determine onset timing or factors such as task design, sensorimotor demands, or  internal state. The approach for estimating onset latency by comparing firing rates in  short windows to baseline using a t-test raises concerns about robustness, as it may  be sensitive to trial-to-trial variability and yield spurious detections.

      We agree this may be primarily due to our conservative definition of onset timing. Specifically, we required the firing rate to exceed baseline (t-test, p < 0.05) for at least 3 consecutive 25-ms time windows. This might lead to later estimates than other studies, such as using the latency to the first spike after visual stimulus onset (Siegle et al., 2021) or the time to half-max response (Goldbach, Akitake, Leedy, & Histed, 2021).

      The estimation of response onset latency in our study may also be affected by potential internal state fluctuations of the mice. We used the time before visual stimulus onset as baseline firing, since firing rates in this period could be affected by trial history, we acknowledge this may increase the variability of the baseline, thus increase the difficulty to statistically detect the onset of response.

      Still, we believe these concerns do not affect the observation of the formation of compressed activity sequence in CR trials during learning.

      (4) Details on spike sorting are very limited. For example, defining single units only by  an interspike interval threshold above one millisecond may not sufficiently rule out  contamination or overlapping clusters. How exactly were neurons tracked across days  (Figure 7B)?

      We have added more details on spike sorting, including the processing steps and important parameters used in the automated sorting algorithm. Only the clusters well isolated in feature space were accepted in manual curation.

      We attempted to track the same neurons across learning in this project. However, due to the limited number of electrodes implanted in each brain region, the number of chronically tracked neurons in each region was insufficient to support statistically robust analyses.

      This is now stated more clearly in the discussion section.

      (5) The optogenetic experiments, while designed to test the functional relevance of  rank-increasing regions, also raise questions. The physiological impact of the inhibition  is not characterized, making it unclear how effectively the targeted circuits were  actually silenced. Without clearer evidence that the manipulations reliably altered local  activity, the interpretation of the observed or absent behavioral effects remains  uncertain.

      We appreciate this important point. Due to the design of the flexible electrodes and the implantation procedure, bilateral co-implantation of both electrodes and optical fibers was challenging, which prevented us from directly validating the inhibition effect in the same animals used for behavior. In hindsight, we could have conducted parallel validations using conventional electrodes, and we will incorporate such controls in future work to provide direct evidence of manipulation efficacy. 

      (6) The task itself is relatively simple, and the anatomical coverage does not include  midbrain or cerebellar regions, limiting how broadly the findings can be generalized to more flexible or ethologically relevant forms of decision-making.

      We appreciate this advice and have expanded the existing discussion to more explicitly state that the relatively simple task design and anatomical coverage might limit the generalizability of our findings.

      (7) The abstract would benefit from more consistent use of tense, as the current mix of  past and present can make the main findings harder to follow. In addition, terms like  "mesoscale network," "subnetwork," and "functional motif" are used interchangeably in  places; adopting clearer, consistent terminology would improve readability.

      We have changed several verbs in abstract to past form, and we now adopted a more consistent terminology by substituting “functional motif” as “subnetwork”. We still feel the use of

      “mesoscale network” and “subnetwork” could emphasize different aspects of the results according to the context, so these words are kept the same.

      (8) The discussion could better acknowledge that the observed network changes may  not reflect task-specific learning alone but could also arise from broader shifts in  arousal, attention, or motivation over repeated sessions.

      We have expanded the existing discussion to better acknowledge the possible effects from broader shifts in arousal, attention, or motivation over repeated sessions.

      (9) The figures would also benefit from clearer presentation, as several are dense and  not straightforward to interpret. For example, Figure S8 could be organized more  clearly to highlight the key comparisons and main message

      We have simplified the over-detailed brain plots in Figure 4-5, and the plots in Figure 6 and S8 (now S10 in the revised manuscript).

      (10) Finally, while the manuscript notes that data and code are available upon request,  it would strengthen the study's transparency and reproducibility to provide open access  through a public repository, in line with best practices in the field.

      The spiking data, behavior data and codes for the core analyses in the manuscript are now shared in pubic repository (Dryad). And we have changed the description in the Data Availability secition accordingly.

      Reviewer #2 (Recommendations for the authors):

      (A) Introduction:

      (1) "Previous studies have implicated multiple cortical and subcortical regions in visual  task learning and decision-making". No references here, and also in the next sentence.

      The references were in the following introduction and we have added those references here as well.

      We also added one review on cortical-subcortical neural correlates in goal-directed behavior (Cruz et al., 2023).

      (2) Intro: In general, the citation of previous literature is rather minimal, too minimal.  There is a lot of studies using large scale recordings during learning, not necessarily  visual tasks. An example for brain-wide learning study in subcortical areas is Sych et  al. 2022 (cell reports). And for wide-field imaging there are several papers from the  Helmchen lab and Komiyama labs, also for multi-area cortical imaging.

      We appreciate this advice. We included mainly visual task learning literature to keep a more focused scope around the regions and task we actually explored in this study. We fear if we expand the intro to include all the large-scale imaging/recording studies in learning field, the background part might become too broad.

      We have included (Sych, Fomins, Novelli, & Helmchen, 2022) for its relevance and importance in the field.

      (3) In the intro, there is only a mention of a recording of 10 brain regions, with no  mention of which areas, along with their relevance to learning. This is mentioned in the  results, but it will be good in the intro.

      The area names are now added in intro.

      (B) Results:

      (1) Were you able to track the same neurons across the learning profile? This is not  stated clearly.

      We did attempt to track the same neurons across learning in this project. However, due to the limited number of electrodes implanted in each brain region, the number of chronically tracked neurons in each region was insufficient to support statistically robust analyses.

      We now stated this more clearly in the discussion section.

      (2) Figure 1 starts with 7 mice, but only 5 mice are in the last panel. Later it goes down  to 3 mice. This should be explained in the results and justified.

      We apologize for the confusion. As described in the Methods section, 7 mice (Figure 1B) were used for behavioral training without electrode array or optical fiber implants to establish learning curves, and an additional 5 mice underwent electrophysiological recordings (3 for visual-based decision-making learning and 2 for fruitless learning).

      (3) I can't see the electrode tracks in Figure 1d. If they are flexible, how can you make  sure they did not bend during insertion? I couldn't find a description of this in the  methods also.

      The electrode shanks were ultra-thin (1-1.5 µm) and it was usually difficult to recover observable tracks or electrodes in section.

      The ultra-flexible probes could not penetrate brain on their own (since they are flexible), and had to be shuttled to position by tungsten wires through holes designed at the tip of array shanks. The tungsten wires were assembled to the electrode array before implantation; this was described in the section of electrode array fabrication and assembly. We also included the description about the retraction of the guiding tungsten wires in the surgery section to avoid confusion.

      As an further attempt to verify the accuracy of implantation depth, we also measured the repeatability of implantation in a group of mice and found a tendency for the arrays to end in slightly deeper location in cortex (142.1 ± 55.2 μm, n = 7 shanks), and slightly shallower location in subcortical structure (-122.6 ± 71.7 μm, n = 7 shanks). We added these results as new Figure S1 to accompany Figure 1.

      (4) In the spike rater in 1E, there seems to be ~20 cells in V2L, for example, but in 1F,  the number of neurons doesn't go below 40. What is the difference here? 

      We checked Figure 1F, the plotted dots do go below 40 to ~20. Perhaps the file that reviewer received wasn’t showing correctly?

      (5) The authors focus mainly on CR, but during learning, the number of CR trials is  rather low (because they are not experts). This can also be seen in the noisier traces  in Figure 2a. Do the authors account for that (for example by taking equal trials from  each group)? 

      We accounted this by reconstructing bootstrap-resampled datasets with only 5 trials for each session in both the early stage and the expert stage. The mean trace of the 500 datasets again showed overall decrease in CR trial firing rate during task learning, with highly similar temporal dynamics to the original data.

      The figure is now added to supplementary materials (as Figure S3 in the revised manuscript).

      (6) From Figure 2a, it is evident that Hit trials increase response when mice become  experts in all brain areas. The authors have decided to focus on the response onset  differences in CRs, but the Hit responses display a strong difference between naïve  and expert cases.

      Judged from the learning curve in this task the mice learned to inhibit its licking action when the No-Go stimuli appeared, which is the main reason we focused on these types of trials.

      The movement effects and potential licking artefacts in Hit trials also restricted our interpretation of these trials.

      (7) Figure 3 is still a bit cumbersome. I wasn't 100% convinced of why there is a need  to rank the connection matrix. I mean when you convert to rank, essentially there could  be a meaningful general reduction in correlation, for example during licking, and this  will be invisible in the ranking system. Maybe show in the supp non-ranked data, or  clarify this somehow

      We agree with this important point. As stated in the manuscript and response to Reviewer #1, our motivation in taking the ranking approach was that the differences in firing rates could bias cross-correlation between spike trains, making raw accounts of significant neuron pairs difficult to compare across conditions, but we acknowledge the ranking measures might obscure meaningful differences or inflate weak effects in the data.

      We added the limitations of ranking approach in the discussion section and emphasized the necessity in future studies for better analysis approaches that could provide more accurate assessment of functional connection dynamics without bias from firing rates.

      (8) Figure 4a x label is in manuscript, which is different than previous time labels,  which were seconds.

      We now changed all time labels from Figure 2 to milliseconds.

      (9) Figure 4 input and output rank look essentially the same.

      We have compressed the brain plots in Figures 4-5 to better convey the take-home message.

      (10) Also, what is the late and early stim period? Can you mark each period in panel A? Early stim period is confusing with early CR period. Same for early respons and late response.

      The definition of time periods was in figure legends. We now mark each period out to avoid confusion.

      (11) Looking at panel B, I don't see any differences between delta-rank in early stim,  late stim, early response, and late response. Same for panel c and output plots.

      The rankings were indeed relatively stable across time periods. The plots are now compressed and showed a mean rank value.

      (12) Panels B and C are just overwhelming and hard to grasp. Colors are similar both  to regular rank values and delta-rank. I don't see any differences between all  conditions (in general). In the text, the authors report only M2 to have an increase in  rank during the response period. Late or early response? The figure does not go well  with the text. Consider minimizing this plot and moving stuff to supplementary.

      The colormap are now changed to avoid confusion, and brain plots are now compressed.

      (13) In terms of a statistical test for Figure 4, a two-way ANOVA was done, but over  what? What are the statistics and p-values for the test? Is there a main effect of time  also? Is their a significant interaction? Was this done on all mice together? How many  mice? If I understand correctly, the post-hoc statistics are presented in the  supplementary, but from the main figure, you cannot know what is significant and what  is not.

      For these figures we were mainly concerned with the post-hoc statistics which described the changes in the rankings of each region across learning.

      We have changed the description to “t-test with Sidak correction” to avoid the confusion.

      (14) In the legend of Figure 4, it is reported that 610 expert CR trials from 6 sessions,  instead of 7 sessions. Why was that? Also, like the previous point, why only 3 mice?

      Behavior data of all the sessions used were shown in Figure S1. There were only 3 mice used for the learning group, the difficulty to achieve sufficient unit yields across all regions in the same animal restricted our sample size

      (15) Body movement analysis: was this done in a different cohort of mice? Only now  do I understand why there was a division into early and late stim periods. In supp 4,  there should be a trace of each body part in CR expert versus naïve. This should also  be done for Hit trials as a sanity check. I am not sure that the brightness difference  between consecutive frames is the best measure. Rather try to calculate frame-to frame correlation. In general, body movement analysis is super important and should  be carefully analyzed.

      Due to the limitation in the experimental design and implementation, movement tracking was not performed during the electrophysiological recordings, and the 3 mice shown in Figure S4 (now S5) were from a separate group. We have carefully examined the temporal profiles of mouse movements and found it did not fully match the rank dynamics for all regions, and we have added these results and related discussion in the revised manuscript. However, we acknowledge the observed motion energy pattern could explain some of the functional connection dynamics, such as the decrease in face and pupil motion energy could explain the reduction in ranks for striatum.

      Without synchronized movement recordings in the main dataset, we cannot fully disentangle movement-related neural activity from task-related signals. We have made this limitation explicit in the revised manuscript and discuss it as a potential confound, along with possible approaches to address it in future work.

      (16) For Hit trials, in the striatum, there is an increase in input rank around the  response period, and from Figure S6 it is clear that this is lick-related. Other than that,  the authors report other significant changes across learning and point out to Figure 5b,c. I couldn't see which areas and when it occurred.

      We did naturally expect the activity in striatum to be strongly related to movement.

      With Figure S6 (now S7) we wished to show that the observed rank increase for striatum could not simply be attributed to changes in time of lick initiation.

      As some readers may argue that during learning the mice might have learned to only intensely lick after response signal onset, causing the observed rise of input rank after response signal, we realigned the spikes in each trial to the time of the first lick, and a strong difference could still be observed between early training stage and expert training stage.

      We still cannot fully rule out the effects from more subtle movement changes, as the face motion energy did increase in early response period. This result and related discussion has been added to the results section of revised manuscript.

      (17) Figure 6, again, is rather hard to grasp. There are 16 panels, spread over 4 areas,  input and output, stim and response. What is the take home message of all this?  Visually, it's hard to differentiate between each panel. For me, it seems like all the  panels indicate that for all 4 areas, both in output and input, frontal areas increase in  rank. This take-home message can be visually conveyed in much less tedious ways.  This simpler approach is actually conveyed better in the text than in the figures  themselves. Also, the whole explanation on how this analysis was done, was not clear  from the text. If I understand it, you just divided and ranked the general input (or  output) into individual connections? If so, then this should be better explained.

      We appreciate this advice and we have compressed the figures to better convey the main message.The rankings for Figure 6 and Figure S8 (now Figure S9) was explained in the left panel of Figure 3C. Each non-zero element in the connection matrix was ranked to value from 1-10, with a value of 10 represented the 10% strongest non-zero elements in the matrix.

      We have updated the figure legends of Figure 3, and we have also updated the description in methods (Connection rank analyses) to give a clearer description of how the analyses were applied in subsequent figures.

      (18) Figure 7: Here, the authors perform a ROC analysis between go and no-go  stimuli. They balance between choice, but there is still an essential difference between  a hit and a FA in terms of movement and licks. That is maybe why there is a big  difference in selective units during the response period. For example, during a Hit trial  the mouse licks and gets a reward, resulting in more licking and excitement. In FAs,the mouse licks, but gets punished, which causes a reduction in additional licking and  movements. This could be a simple explanation why the ROC was good in the late  response period. Body movement analysis of Hit and FA should be done as in Figure  S4.

      We appreciate this insightful advice.

      Though we balanced the numbers of basic trial types, we couldn’t rule out the difference in the intrinsic movement amount difference in FA trials and Hit trials, which is likely the reason of large proportion of encoding neurons in response period.

      We have added this discussion both in result section and discussion section along with the necessity of more carefully designed behavior paradigm to disentangle task information.

      (19) The authors also find selective neurons before stimulus onset, and refer to trial  history effects. This can be directly checked, that is if neurons decode trial history.

      We attempted encoding analyses on trial history, but regrettably for our dataset we could not find enough trials to construct a dataset with fully balanced trial history, visual stimulus and behavior choice.

      (20) Figure 7e. What is the interpretation for these results? That areas which peaked  earlier had more input and output with other areas? So, these areas are initiating  hubs? Would be nice to see ACC vs Str traces from B superimposed on each other.  Having said this, the Str is the only area to show significant differences in the early  stim period. But is also has the latest peak time. This is a bit of a discrepancy.

      We appreciate this important point.

      The limitation in the anatomical coverage of brain regions restricted our interpretation about these findings. They could be initiating hubs or earlier receiver of the true initiating hubs that were not monitored in our study.

      The Str trace was in fact above the ACC trace, especially in the response period. This could be explained by the above advice 18: since we couldn’t rule out the difference in the intrinsic movement amount difference in FA trials and Hit trials, and considering striatum activity is strongly related to movement, the Str trace may reflect more in the motion related spike count difference between FA trials and Hit trials, instead of visual stimulus related difference.

      This further shows the necessity of more carefully designed behavior paradigm to disentangle task information.

      The striatum trace also in fact didn’t show a true double peak form as traces in other regions, it ramped up in the stimulus region and only peaked in response period. This description is now added to the results section.

      In the early stim period, the Striatum did show significant differences in average percent of encoding neurons, as the encoding neurons were stably high in expert stage. The striatum activity is more directly affected Still the percentage of neurons only reached peak in late stimulus period.

      (21) For the optogenetic silencing experiments, how many mice were trained for each  group? This is not mentioned in the results section but only in the legend of Figure 8. This part is rather convincing in terms of the necessity for OFC and V2M

      We have included the mice numbers in results section as well.

      (C) Discussion

      (1) There are several studies linking sensory areas to frontal networks that should be  mentioned, for example, Esmaeili et a,l 2022, Matteucci et al., 2022, Guo et a,l 2014,Gallero Salas et al, 2021, Jerry Chen et al, 2015. Sonja Hofer papers, maybe. Probably more.

      We appreciate this advice. We have now included one of the mentioned papers (Esmaeili et al., 2022) in the results section and discussion section for its direct characterization of the enhanced coupling between somatosensory region and frontal (motor) region during sensory learning.The other studies mentioned here seem to focus more on the differences in encoding properties between regions along specific cortical pathways, rather than functional connection or interregional activity correlation, and we feel they are not directly related to the observations discussed.

      (2) The reposted reorganization of brain-wide networks with shifts in time is best  described also in Sych et al. 2021.

      We regret we didn’t include this important research and we have now cited this in discussion section.

      (3) Regarding the discussion about more widespread stimulus encoding after learning,  the results indicate that the striatum emerges first in decoding abilities (Figure 7c left  panel), but this is not discussed at all.

      We briefly discussed this in the result section. We tend to attribute this to trial history signal in striatum, but since the structure of our data could not support a direct encoding analysis on trial history, we felt it might be inappropriate to over-interpret the results.

      (4) An important issue which is not discussed is the contribution of movement which  was shown to have a strong effect on brain-wide dynamics (Steinmetz et al 2019;  Musall et al 2019; Stringer et al 2019; Gilad et al 2018) The authors do have some movement analysis, but this is not enough. At least a discussion of the possible effects of movement on learning-related dynamics should be added.

      We have included these studies in discussion section accordingly. Since the movement analyses were done in a separate cohort of mice, we have made our limitation explicit in the revised manuscript and discuss it as a potential confound, along with possible approaches to address it in future work.

      (D) Methods

      (1) How was the light delivery of the optogenetic experiments done? Via fiber  implantation in the OFC? And for V2M? If the red laser was on the skull, how did it get  to the OFC?

      The fibers were placed on cortex surface for V2M group, and were implanted above OFC for OFC manipulation group. These were described in the viral injection part of the methods section.

      (2) No data given on how electrode tracking was done post hoc

      As noted in our response to the advice 3 in results section, the electrode shanks were ultra-thin (1-1.5 µm) and it was usually difficult to recover observable tracks or electrodes in section.

      As an attempt to verify the accuracy of implantation depth, we measured the repeatability of implantation in a group of mice and found a tendency for the arrays to end in slightly deeper location in cortex (142.1 ± 55.2 μm, n = 7 shanks), and slightly shallower location in subcortical structure (-122.6 ± 71.7 μm, n = 7 shanks). We added these results as new Figure S1 to accompany Figure 1.

      Reviewer #3 (Recommendations for the authors):

      (1) The manuscript uses decision-making in the title, abstract and introduction.  However, nothing is related to decision learning in the results section. Mice simply  learned to suppress licking in no-go trials. This type of task is typically used to study behavioral inhibition. And consistent with this, the authors mainly identified changes  related to network on no-go trials. I really think the title and main message is  misleading. It is better to rephrase it as visual discrimination learning. In the  introduction, the authors also reviewed multiple related studies that are based on  learning of visual discrimination tasks.

      We do view the Go/No-Go task as a specific genre of decision-making task, as there were literature that discussed this task as decision-making task under the framework of signal detection theory or updating of item values (Carandini & Churchland, 2013; Veling, Becker, Liu, Quandt, & Holland, 2022).

      We do acknowledge the essential differences between the Go/No-Go task and the tasks that require the animal to choose between alternatives, and since we have now realized some readers may not accept this task as a decision task, we have changed the title to visual discrimination task as advised.

      (2) Learning induced a faster onset on CR trials. As the no-go stimulus was not  presented to mice during early stages of training, this change might reflect the  perceptual learning of relevant visual stimulus after repeated presentation. This further  confirms my speculation, and the decision-making used in the title is misleading. 

      We have changed the title to visual discrimination task accordingly.

      (3) Figure 1E, show one hit trial. If the second 'no-go stimulus' is correct, that trial  might be a false alarm trial as mice licked briefly. I'd like to see whether continuous  licking can cause motion artifacts in recording. 

      We appreciate this important point. There were indeed licking artifacts with continuous licking in Hit trials, which was part of the reason we focused our analyses on CR trials. Opto-based lick detectors may help to reduce the artefacts in future studies.

      (4) What is the rationale for using a threshold of d' < 2 as the early-stage data and d'>3  as expert stage data?

      The thresholds were chosen as a result from trade-off based on practical needs to gather enough CR trials in early training stage, while maintaining a relatively low performance.

      Assume the mice showed lick response in 95% of Go stimulus trials, then d' < 2 corresponded to the performance level at which the mouse correctly rejected less than 63.9% of No-Go stimulus trials, and d' > 3 corresponded to the performance level at which the mouse correctly rejected more than 91.2% of No-Go stimulus trials.

      (5) Figure 2A, there is a change in baseline firing rates in V2M, MDTh, and Str. There  is no discussion. But what can cause this change? Recording instability, problem in  spiking sorting, or learning?

      It’s highly possible that the firing rates before visual stimulus onset is affected by previous reward history and task engagement states of the mice. Notably, though recorded simultaneously in same sessions, the changes in CR trials baseline firing rates in the V2M region were not observed in Hit trials.

      Thus, though we cannot completely rule out the possibility in recording instability, we see this as evidence of the effects on firing rates from changes in trial history or task engagement during learning.

      References:

      Carandini, M., & Churchland, A. K. (2013). Probing perceptual decisions in rodents. Nat Neurosci, 16(7), 824-831. doi:10.1038/nn.3410.

      Cruz, K. G., Leow, Y. N., Le, N. M., Adam, E., Huda, R., & Sur, M. (2023).Cortical-subcortical interactions in goal-directed behavior. Physiol Rev, 103(1), 347-389. doi:10.1152/physrev.00048.2021

      Esmaeili, V., Oryshchuk, A., Asri, R., Tamura, K., Foustoukos, G., Liu, Y., Guiet, R., Crochet, S., & Petersen, C. C. H. (2022). Learning-related congruent and incongruent changes of excitation and inhibition in distinct cortical areas. PLOS Biology, 20(5), e3001667. doi:10.1371/journal.pbio.3001667

      Goldbach, H. C., Akitake, B., Leedy, C. E., & Histed, M. H. (2021). Performance in even a simple perceptual task depends on mouse secondary visual areas. Elife, 10, e62156. doi:10.7554/eLife.62156.

      Siegle, J. H., Jia, X., Durand, S., Gale, S., Bennett, C., Graddis, N., Heller, G.,Ramirez, T. K., Choi, H., Luviano, J. A., Groblewski, P. A., Ahmed, R., Arkhipov, A., Bernard, A., Billeh, Y. N., Brown, D., Buice, M. A., Cain, N.,Caldejon, S., Casal, L., Cho, A., Chvilicek, M., Cox, T. C., Dai, K., Denman, D.J., de Vries, S. E. J., Dietzman, R., Esposito, L., Farrell, C., Feng, D., Galbraith, J., Garrett, M., Gelfand, E. C., Hancock, N., Harris, J. A., Howard, R., Hu, B.,Hytnen, R., Iyer, R., Jessett, E., Johnson, K., Kato, I., Kiggins, J., Lambert, S., Lecoq, J., Ledochowitsch, P., Lee, J. H., Leon, A., Li, Y., Liang, E., Long, F., Mace, K., Melchior, J., Millman, D., Mollenkopf, T., Nayan, C., Ng, L., Ngo, K., Nguyen, T., Nicovich, P. R., North, K., Ocker, G. K., Ollerenshaw, D., Oliver, M., Pachitariu, M., Perkins, J., Reding, M., Reid, D., Robertson, M., Ronellenfitch, K., Seid, S., Slaughterbeck, C., Stoecklin, M., Sullivan, D., Sutton, B., Swapp, J., Thompson, C., Turner, K., Wakeman, W., Whitesell, J. D., Williams, D., Williford, A., Young, R., Zeng, H., Naylor, S., Phillips, J. W., Reid, R. C., Mihalas, S., Olsen, S. R., & Koch, C. (2021). Survey of spiking in the mouse visual system reveals functional hierarchy. Nature, 592(7852), 86-92. doi:10.1038/s41586-020-03171-x

      Sych, Y., Fomins, A., Novelli, L., & Helmchen, F. (2022). Dynamic reorganization of the cortico-basal ganglia-thalamo-cortical network during task learning. Cell Rep, 40(12), 111394. doi:10.1016/j.celrep.2022.111394

      Veling, H., Becker, D., Liu, H., Quandt, J., & Holland, R. W. (2022). How go/no-go training changes behavior: A value-based decision-making perspective. Current Opinion in Behavioral Sciences, 47,101206.

      doi:https://doi.org/10.1016/j.cobeha.2022.101206.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      The authors' goal was to arrest PsV capsids on the extracellular matrix using cytochalasin D. The cohort was then released, and interaction with the cell surface, specifically with CD151, was assessed.

      The model that fragmented HS associated with released virions mediates the dominant mechanism of infectious entry has only been suggested by research from a single laboratory and has not been verified in the 10+ years since publication. The authors are basing this study on the assumption that this model is correct, and these data are referred to repeatedly as the accepted model despite much evidence to the contrary.

      We stated in the introduction on line 65/66 ´Two release mechanisms are discussed, that mutually are not exclusive´. This implies that we do not consider the shedding model as ‘the accepted model’. Furthermore, we do not state in the discussion neither that the shedding model is the preferred one. However, we referred to the shedding model in the discussion, because we find HS associated with transferred PsVs, which is in line with this model.

      The discussion in lines 65-71 concerning virion and HSPG affinity changes is greatly simplified. The structural changes in the capsid induced by HS interaction and the role of this priming for KLK8 and furin cleavage have been well researched. Multiple laboratories have independently documented this. If this study aims to verify the shedding model, additional data need to be provided.

      Our findings are compatible with both models, and we do not aim to verify the shedding model neither want to disprove the priming model. However, as we understand, the referee wishes more visibility of the priming model. Therefore, using inhibitors previously used in the field, we tested whether inhibition of KLK8 or furin reduces PsV translocation to the cell body (after CytD wash off). Leupeptin blocks transport, while Furin inhibitor I still allows some initial translocation. We incorporated this new data as Figure 2 (line 265): “…we would expect that inhibition of L1 processing during the CytD incubation prevents the recovery of PsV translocation from the ECM to the cell body (Figure 2A and D). To test for this possibility, as employed in earlier studies, the protease inhibitor leupeptin was used to inhibit proteases including KLK8 which is required for L1 cleavage (Cerqueira et al. 2015). Employing this inhibitor, the PCC between PsV-L1 and F-actin staining remains negative after CytD removal, showing that for translocation indeed the action of proteases is required (Figure 2B and D). In contrast, inhibition of L2 cleavage by a furin specific inhibitor has no effect on the PCC (Figure 2C and D). However, it should be noted that we occasionally observe PsVs not completely translocating but accumulating at the border of the F-actin stained area (for example see Figure 2C (60 min)). This results in an increase of the PCC almost equal to complete translocation, explaining why the PCC remains unaffected despite a furin inhibitory effect. Hence, furin inhibition may have some effect on translocation that, however, is undetected in this type of analysis.’

      Moreover, we have added a paragraph discussing how our data integrates into the established model of the HPV infection cascade (line 604): ‘HPV infection is the result of several steps, starting with the initial binding of virions via electrostatic and polar interactions (Dasgupta et al. 2011) to the primary attachment site HS (Richards et al. 2013), which induces capsid modification (Feng et al. 2024; Cerqueira et al. 2015) and HS cleavage (Surviladze et al. 2015), enabling the virion to be released from the ECM or the glycocalyx. Next, virions bind to the cell surface to a secondary receptor complex that forms over time, and become internalized via endocytosis, before they are trafficked to the nucleus (Ozbun and Campos 2021; Mikuličić et al. 2021). Regarding the transition from the primary attachment site to cell surface binding, as already outlined in the introduction, two models are discussed. In one model, proteases cleave the capsid proteins. After priming, the capsids are structurally modified and the virion can dissociate from its HS attachment site. It has been suggested that capsid priming is mediated by KLK8 (Cerqueira et al. 2015) and furin (Richards et al. 2006). In our system, KLK8 inhibition blocks PsV transport, while furin inhibition has some effect that, however, cannot be detected in this analysis (Figure 2) suggesting furin engagement at later steps in the infection cascade. This is in line with earlier in vitro studies on the role of cell surface furin (Surviladze et al. 2015; Day et al. 2008; Day and Schiller 2009). In any case, our results align with both models of ECM detachment: one involving HS cleavage (HS co-transfer) and another involving capsid modification (by e.g., KLK8).’

      The model should be fitted into established entry events,…

      Please see our reply above.

      or at minimum, these conflicting data, a subset of which is noted below, need to be acknowledged.

      (1) The Sapp lab (Richards et al., 2013) found that HSPG-mediated conformational changes in L1 and L2 allowed the release of the virus from primary binding and allowing secondary receptor engagements in the absence of HS shedding.

      (2) Becker et al. found that furin-precleaved capsids could infect cells independently of HSPG interaction, but this infection was still inhibited with cytochalasin D.

      (3) Other work from the Schelhaas lab showed that cytochalasin D inhibition of infection resulted in the accumulation of capsids in deep invaginations from the cell surface, not on the ECM

      (4) Selinka et al., 2007, showed that preventing HSPG-induced conformational changes in the capsid surface resulted in noninfectious uptake that was not prevented with cytochalasin D.

      (5) The well-described capsid processing events by KLK8 and furin need to be mechanistically linked to the proposed model. Does inhibition of either of these cleavages prevent engagement with CD151?

      The authors need to consider an explanation for these discrepancies.

      We do not see any discrepancies; our observations are compatible with aspects of both the shedding and the priming model. That PsVs carry HS-cleavage products doesn´t imply that HS cleavage is sufficient or required for infection, or that the priming model would be wrong. We do not view our data as being in conflict with the priming model. Most of the above-mentioned papers are now cited.

      Altogether, we acknowledge that the study gains importance by directly testing the priming model within our experimental system. We are thankful for the above comments and addressed this issue.

      Other issues:

      (1) Line 110-111. The statement about PsVs in the ECM being too far away from the cell surface to make physical contact with the cell surface entry receptors is confusing. ECM binding has not been shown to be an obligatory step for in vitro infection.

      Not obligatory, but strongly supportive (Bienkowska-Haba et al., Plos Path., 2018; Surviladze et al., J. Gen. Viro., 2015). As recently published by the Sapp lab (Bienkowska-Haba et al., Plos Path., 2018), ´Direct binding of HPV16 to primary keratinocytes yields very inefficient infection rates for unknown reasons.´ Moreover, the paper shows that HaCaT cell ECM binding of PsVs increases the infection of NHEK by 10-fold and of HFK by almost 50-fold.

      This idea is referred to again on lines 158-159 and 199. The claim (line 158) that PsV does not interact with the cell within an hour needs to be demonstrated experimentally and seems at odds with multiple laboratories' data. PsV has been shown to directly interact with HSPG on the cell surface in addition to the ECM. Why are these PsVs not detected?

      The reviewing editor speculated that HaCaT cells may be a model system in which the in vivo relevant binding to the ECM can be better studied as in non-polarized cell types. This is because binding to the ECM cannot be bypassed by direct cell surface binding. The observation that only few PsVs bind to the basal cell membrane indeed suggests restricted diffusional access of PsVs to binding receptors of the basal membrane. The reviewing editor asked for an experiment showing that more PsVs bind after cell detachment. We performed this experiment and indeed find more PsVs binding to the cell surface of detached cells. This point is very important for the understanding of the study and now we mention it in several sections of the manuscript, as outlined in the following.

      Line 125: ‘Many PsVs that bind to the ECM may locate distal from the cell surface and are thus unable to establish direct contact with entry receptors. However, they are capable of migrating by an actindependent transport along cell protrusions towards the cell body (Smith et al. 2008; Schelhaas et al. 2008). We aimed for blocking this transport in HaCaT cells, a cell line that is widely used as a cell culture model for HPV infection. HaCaT cells closely resemble primary keratinocytes in key aspects: they are not virally transformed and produce large amounts of ECM that facilitates infection (Bienkowska-Haba et al. 2018; Gilson et al. 2020). In addition, HaCaT cells exhibit cellular polarity that enforces binding of virus particles to the ECM, as the virions cannot bind to receptors/entry components, such as CD151, Itgα6 and HSPGs that co-distribute on the basolateral membrane of polarized keratinocytes (Sterk et al. 2000; Cowin et al. 2006; Mertens et al. 1996), making them inaccessible by diffusion.’

      Line 205: ‘During the CytD incubation, PsVs bind to HSPGs of the basolateral membrane for 5 h. Still, in the cell body area hardly any PsVs are present (0.14 PsV/µm<sup>2</sup>, Supplementary Figure 1B). In the control, the PsV density is several-fold larger (Supplementary Figure 1B). This is expected, as the PsVs bind to the ECM and translocate to the cell body. We wondered whether there are more binding sites at the basal membrane that remain inaccessible to PsVs by diffusion because of the insufficient space between glass-coverslip and basolateral membrane. For clarification, we incubated EDTA detached HaCaT cells in suspension with PsVs for 1 h at 4 °C, followed by re-attachment for 1 h. Under these conditions, we find a PsV density 12.4-fold larger than after 5 h of CytD incubation of adhered cells (Supplementary Figure 1B and D). However, it should be noted that these values cannot be directly compared. Aside from the different treatments, another difference lies in the size of the basal membrane, as re-attachment of cells is not complete after only 1 h (compare size of adhered membranes in Supplementary Figure 1A and C). Therefore, the imaged membranes are likely strongly ruffled, which results in the underestimation of the size of the adhered membrane. As a result, we overestimate the PsVs per µm<sup>2</sup> (please note that we cannot re-attach cells for longer times as we would then lose PsVs due to endocytosis). On the other hand, we would underestimate the PsV density at the basal membrane if after re-attachment we image in part also some apical membrane. In any case, the experiment suggests that PsVs bind more efficiently if membrane surface receptors are accessible by diffusion. This is in support of the above notion that the basal membrane may provide more entry receptors than one would expect from the low density of PsVs bound after 5 h CytD (Supplementary Figure 1B). This suggests that under our assay conditions, PsVs cannot easily bypass the translocation from the ECM to the cell body by diffusing directly to the basal membrane. Hence, the large majority of PsVs that enter the cell were previously bound to the ECM. Therefore, HaCaT cells serve as an ideal model for studying the transfer of ECM bound HPV particles to the cell surface, which is similar to in vivo infection of basal keratinocytes after binding to the basement membrane (Day and Schelhaas 2014; Kines et al. 2009; Schiller et al. 2010; Bienkowska-Haba et al. 2018).’

      Line 529: ‘Filopodia usage not only facilitates infection but also increases the likelihood of virions to reach their target cells during wound healing, namely the filopodia-rich basal dividing cells. In fact, several types of viruses exploit filopodia during virus entry (Chang et al. 2016), hinting at the possibility that for HPV and other types of viruses actin-driven virion transport may play a more important role than it is currently assumed. If this is the case, sub-confluent HaCaT cells, or even better single HaCaT cells, would be an ideal model system for the study of these very early infection steps that involve ECM attachment and subsequent filopodia-dependent transport. As shown in Supplementary Figure 1, HaCaT cells have many binding sites for the HPV16 PsVs. However, as they are polarized and the binding receptors are only at the basal membrane, they remain relatively inaccessible by diffusion. Therefore, the ECM binding that is also observed in vivo (Day and Schelhaas 2014) and subsequent transport via filopodia are used upon infection of HaCaT cells that locate at the periphery of cell patches. Here, PsVs bind to the ECM which strongly enhances infection of primary keratinocytes (Bienkowska-Haba et al. 2018). In contrast, HPV can readily bind to HSPGs on the cell surface of nonpolarized cells, and by this bypasses ECM mediated virus priming and the filopodia dependency. We propose that HaCaT cells are a valuable system for studying the very early events in HPV infection that allows for dissecting capsid interaction with ECM resident priming factors and cell surface receptors.’

      Finally, please note that in the previous version of the manuscript, we did not question that in many cellular systems PsVs interact with heparan sulfate proteoglycans (HSPGs) present on the cell surface, or both on the cell surface and the ECM. We stated on line 59 ´While in cell culture virions bind to HS of the cell surface and the ECM, it has been suggested that in vivo they bind predominantly to HS of the extracellular basement membrane (Day and Schelhaas, 2014; Kines et al., 2009; Schiller et al., 2010).´

      We hope that after adding the above explanations and the experiment requested by the reviewing editor it is now clear why only few PsVs bind directly (not via the ECM) to the cell surface. We appreciate the reviewer’s and the reviewing editor’s input that has significantly improved the manuscript.

      (2) The experiments shown in Figure 5 need to be better controlled. Why is there no HS staining of the cell surface at the early timepoints? This antibody has been shown to recognize N-sulfated glucosamine residues on HS and, therefore, detects HSPG on the ECM and cell surface.

      There is staining. However, as the staining at the periphery is stronger and images are shown at the same settings of brightness and contrast, the impression is given that the cell surface is not stained. We have added more images showing HS cell surface staining.

      (i) Supplementary Figure 4C shows an enlarged view of the CytD/0 min cell shown in Figure 6A. In the area stained by Itgα6, that marks the cell body, HS staining is present, although less abundant in comparison to the ECM.

      (ii) In Figure 8, CytD/30 min, a cell is shown with abundant HS in the cell body region (compare cyan and green LUT).

      (iii) In newly added Figure 3A, lower panel, another cell with HS in the cell body region is shown.

      Please note that the staining is highly variable. We indicate this by stating on Line 373: ‘The pattern of the HS staining (cyan LUT) and the overlap of HS with PsVs and Itgα6 are highly variable (Figure 6A).’

      Therefore, the conclusion that this confirms HS coating of PsV during release from the ECM (line 430431) is unfounded. How do the authors distinguish between "HS-coated virions" and HSPG-associated virions?

      The transient increase in the PCC at CytD/30 min can be interpreted as PsV/HS co-transport or as direct binding of PsVs to cell surface HSPGs. However, two arguments support co-transport.

      First, we find that CytD/PsVs increases the HS intensity (see newly added Figure 3, confirming old Figure 5 that is now Figure 6). We state on line 290 ‘… that without actin-dependent PsV translocation HS cleavage products are retained in the ECM, consistent with the hypothesis that cleaved HS remains associated with PsVs (Ozbun and Campos 2021).

      Second, the distance between HS and Itgα6 (the cell body marker) decreases over time after CytD removal, which suggests movement of HS to the cell body (Supplementary Figure 8D). We state on line 422: ‘The movement of HS towards the cell body after removal of CytD, which indirectly demonstrates that PsVs are coated with HS, is suggested by a shortening of the HS-Itgα6 distance over time (Supplementary Figure 8D).’

      It is difficult to comprehend how the addition of 50 vge/cell of PsV could cause such a global change in HS levels.

      Some areas are covered with confluent cells, to which hardly any PsVs are bound, because accessing their basolateral membrane is nearly impossible, and PsVs do not bind to the exposed apical membrane as well. We assume this is a major difference to cultures of unpolarized cells, where PsVs should distribute more or less equally over cells. This means that in our experiments the vge/cell is not a suitable parameter for relating the magnitude of an effect to a defined number of PsVs. In the ECM, the PsV density is very high, enabling one cell to collect, in theory, several hundred PsVs, much more than expected from the 50 vge/cell.

      We state on line 135: ‘Frequently, we observe patches of confluent cells which are common to HaCaT cells. Cells at the center of these patches are dismissed during imaging, because there are no anterogradely migrating PsVs at these cells. A second reason for our dismissal of these cells is that hardly any PsVs are bound to them, possibly because their basal membranes are inaccessible by diffusion. Instead, we focus on isolated HaCaT cells or cells at the periphery of cell patches. In these cells, we find more PsVs per cell than one would expect from the employed 50 viral genome equivalents (vge) per cell, indicating that PsVs are unequally distributed between the cells.’

      The claim that the HS levels are decreased in the non-cytochalasin-treated cells due to PsV-induced shedding needs to be demonstrated.

      We did not claim that PsVs induce shedding, we rather believe they retain shedded HS. Without PsVs, the shedded HS is washed off from the ECM. We have reproduced the observation made in old Figure 5 (now Figure 6) in the newly added Figure 3 that also shows that PsVs alone have no effect on the HS intensity, only when present together with CytD. We state on line 277: ‘As outlined above, during the 5 h incubation with CytD, proteases in the ECM are expected to cleave HS chains. These cleavage products should be able to diffuse out of the ECM, unless they remain associated with nontranslocating PsVs. In the control, PsV associated HS cleavage products would leave the ECM through PsV translocation…. Using an antibody that reacts with an epitope in native heparan sulfate chains, only after CytD and if PsVs are present, the level of HS staining is significantly increased (Figure 3B). As shown in Figure 3A, stronger HS staining at PsVs (open arrows) and as well in PsV free areas (closed arrows) was observed… Collectively, our findings indicate that without actin-dependent PsV translocation HS cleavage products are retained in the ECM, consistent with the hypothesis that cleaved HS remains associated with PsVs (Ozbun and Campos 2021).’

      If HS is actually shed, staining of the cell periphery could increase with the antibody 3G10, which detects the HS neoepitope created following heparinase cleavage.

      We have tested the antibody by which we obtain only a very weak staining (Supplementary Figure 2), not allowing to differentiate between an increase in the cell periphery and the cell body area. We still include the experiment as it suggests that CytD has no effect on HS processing. We state on line 286: ‘As additional control and shown in Supplementary Figure 2, we use an antibody that reacts with a HS neo-epitope generated by heparitinase-treated heparan sulfate chains (Yokoyama et al. 1999; for details see methods). This neo-epitope staining is independent of the presence of CytD and the incubation time, suggesting that CytD does not directly affect HS processing.’

      Reviewer #2 (Public review):

      Summary:

      Massenberg and colleagues aimed to understand how Human papillomavirus particles that bind to the extracellular matrix (ECM) transfer to the cell body for later uptake, entry, and infection. The binding to ECM is key for getting close to the virus's host cell (basal keratinocytes) after a wounding scenario for later infection in a mouse vaginal challenge model, indicating that this is an important question in the field.

      Strengths:

      The authors take on a conceptually interesting and potentially very important question to understand how initial infection occurs in vivo. The authors confirm previous work that actin-based processes contribute to virus transport to the cell body. The superresolution microscopy methods and data collection are state-of-the art and provide an interesting new way of analysing the interaction with host cell proteins on the cell surface in certain infection scenarios. The proposed hypothesis is interesting and, if substantiated, could significantly advance the field.

      Weaknesses:

      As a study design, the authors use infection of HaCaT keratinocytes, and follow virus localisation with and without inhibition of actin polymerisation by cytochalasin D (cytoD) to analyse transfer of virions from the ECM to the cell by filopodial structures using important cellular proteins for cell entry as markers.

      First, the data is mostly descriptive besides the use of cytoD, and does not test the main claim of their model, in which virions that are still bound to heparan sulfate proteoglycans are transferred by binding to tetraspanins along filopodia to the cell body.

      The study identifies a rapid translocation step from the ECM to CD151 assemblies. We have no data that demonstrates a physical interaction between PsVs and CD151. In the model figure, we draw CD151 as part of the secondary receptor complex. We are sorry for having raised the impression that PsVs would bind directly to CD151 and have modified the model Figure accordingly. In the new model figure (Figure 9), the first contact established is to a CD151 free receptor.

      Second, using cytoD is a rather broad treatment that not only affects actin retrograde flow, but also virus endocytosis and further vesicular transport in cells, including exocytosis. Inhibition of myosin II, e.g., by blebbistatin, would have been a better choice as it, for instance, does not interfere with endocytosis of the virus.

      As we focus on early events, we are not concerned about CytD blocking as well late steps in the infection cascade, like endocytosis. However, we agree that a comparison between CytD and blebbistatin would be very interesting. We added Figure 8, showing that blebbistatin only partially stops migration.

      Line 429: ‘Actin retrograde transport, which underlies the here observed virion transport, is the integrative result of three components (Smith et al. 2008; Schelhaas et al. 2008)…. As CytD broadly interferes with F-actin dependent processes, we investigated the effects upon inhibition of only one of the three components, namely the myosin II mediated retrograde movement towards the cell body. Instead of CytD, we employed in the 5 h preincubation the myosin II inhibitor blebbistatin. For the control (0 min), we show in Figure 8A one example of a cell with comparatively many PsVs at the periphery (as mentioned above, the PsV pattern is highly variable) to better illustrate the difference to the PsV pattern occasionally seen with blebbistatin. After blebbistatin treatment (0 min), PsVs are still distal to the cell body but less dispersed than after CytD treatment, seemingly as if translocation started but stopped in the midst of the pathway (Figure 8A, blebbistatin). The PCC between PsVs and HS, like after CytD (Figure 6C), is elevated after blebbistatin, albeit the effect is not significant (Figure 8C). The cell body PCC, is not at 30 min (CytD) but already at 0 min elevated (compare Figure 6D to Figure 8D), which can be explained by partial translocation. This is further supported by the fact that only 8% of PsVs are closely associated with HS (Figure 8E; blebbistatin, 0 min) compared to 15% after CytD treatment (Figure 6E; 0 min). Furthermore, after 0 min PsV incubation with blebbistatin we observe no effect on the HS intensity (compare Figure 8B to Figure 3B and Figure 6B). Hence, in contrast to CytD, blebbistatin does not trap the PsVs in the ECM where they associate with HS, but ongoing actin polymerization pushes actin filaments along with PsVs towards the cell body.’

      Third, the authors aim to study transfer from ECM to the cell body and the effects thereof. However, there are substantial, if not the majority of, viruses that bind to the cell body compared to ECM-bound viruses in close vicinity to the cells.

      Please see our detailed reply to referee #1 that has raised the same issue. In brief, we agree that in multiple cell culture systems viruses bind preferentially to the cell surface directly. However, in HaCaT cells, the majority of PsVs does not bind directly to the basal membrane but gets there after initial binding to the ECM. Thus, we believe our system appropriately models the physiologically relevant scenario of ECM-to-cell transfer, as also speculated by the reviewing editor that has suggested an experiment showing that more PsVs bind to detached cells (please see above).

      This is in part obscured by the small subcellular regions of interest that are imaged by STED microscopy, or by the use of plasma membrane sheets. As a consequence, the obtained data from time point experiments is skewed, and remains for the most part unconvincing due to the fact that the origin of virions in time and space cannot be taken into account. This is particularly important when interpreting association with HS, the tetraspanin CD151, and integral alpha 6, as the low degree of association could originate from cell-bound and ECM-transferred virions alike.

      As already stated above, we observe massive binding of PsVs to the ECM, in contrast to very few PsVs that diffuse beneath the basolateral membrane of the polarized HaCaT cells and do bind directly to the cell surface. In other cellular systems, cells may hardly secrete ECM, are not polarized, and therefore virions can easily bypass ECM binding. Therefore, it is reasonable to assume that in HaCaT cells the large majority of PsVs found on the cell body originates from the ECM.

      Fourth, the use of fixed images in a time course series also does not allow for understanding the issue of a potential contribution of cell membrane retraction upon cytoD treatment due to destabilisation of cortical actin. Or, of cell spreading upon cytoD washout.

      The newly added blebbistatin experiment suggests that the initial translocation is exclusively dependent on retrograde actin flow. However, we agree that we are not able to unravel more details regarding the different possible contributions to the movement. Importantly, the lack of PCC increase after CytD/leupeptin removal (Figure 2D) suggest there is not much cell spreading into the area of accumulated PsVs. Please see our more detailed reply to the same issue raised by the same referee in the recommendations for the authors.

      The microscopic analysis uses an extension of a plasma membrane stain as a marker for ECM-bound virions, which may introduce a bias and skew the analysis.

      The dye TMA-DPH stains exclusively cellular membranes and not the ECM. The stain is actually used to delineate the cell body from the ECM area (please see Figure 1).

      Fifth, while the use of randomisation during image analysis is highly recommended to establish significance (flipping), it should be done using only ROIs that have a similar density of objects for which correlations are being established.

      We agree that the way of how randomization is done is very important. Regarding the association of PsVs with CD151 and HS, we corrected for random background association, which is now explained in more detail in in the Figure legend of Supplementary Figure 7: “On flipped images, we often find values more than half of the values of the original images, demonstrating that many PsVs have a distance ≤ 80 nm to CD151 merely by chance (background association)… (C) Each time point in (A) and (B) obtained from flipped images is the average of three biological replicates. We use these altogether 24 data points, plotting the fraction of closely associated PsVs against the CD151 maxima density. The fraction increases with the maxima density, as the chance of random association increases with the maxima density. The fitted linear regression line describes the dependence of the background association from the maxima density. As a result, the background association (y) can be calculated for any maxima density (x) in original images with the equation y = 2.04x. Please note that the CytD/0 min may be overcorrected as we subtract background association with reference to the CD151 maxima density of the entire ROI (for an example ROI see Supplementary Figure 6A), although the local maxima density at distal PsVs is lower. On the other hand, PsVs at the cell border may have a larger local CD151 maxima density and consequently are undercorrected.’

      For instance, if one flips an image with half of the image showing the cell body, and half of the image ECM, it is clear that association with cell membrane structures will only be significant in the original.

      We are aware of this problem. For instance, it would produce ‘artificially’ low PCCs after flipping images of PsV/HS stainings (please see negative PCC value after flipping in Supplementary Figure 8). In this case, we do not use as argument that in flipped images the PCC is lower. Instead, we would argue that over time the PCC changes in the original images. We still provide the PCC values of flipped images, as additional information, showing that in most cases we obtain after flipping a PCC of zero, as expected

      Hence, we fully agree that careful controls in image analysis is required, and used the above-described method for the correction of background association when the fraction of closely associated PsVs is analyzed. We do not use a lower PCC value in flipped images as argument if not appropriate.

      I am rather convinced that using randomisation only on the plasma membrane ROIs will not establish any clear significance of the correlating signals.

      Figure 6D and 8D show the PCC specifically of the cell body (only of plasma membrane ROIs). In flipped images (not shown in the previous version for clarity), we obtain significantly lower PCCs (Supplementary Figure 8F/G and Supplementary Figure 10C/D. We propose that in this case it would be appropriate to use a lower PCC of flipped images as argument for specific association. Still, also in this experiment we argue with a change in the PCC over time, and not with a PCC of zero after flipping. As above, we still provide the PCC values of flipped images as additional information.

      Also, there should be a higher n for the measurements.

      One replicate is based on the average of 14-15 cells for each condition (more for figure 4). Hence, in a typical experiment (Control and CytD with 4 time points) about 120 cells are analyzed, which is a broad basis for the averages of one replicate.

      We realize that with three biological replicates we find significant effects only if we have strong effects or moderate effects with very low variance.

      Recommendations for the authors:

      Reviewing Editor:

      The focus on the events of HPV infection between ECM binding and keratinocyte-specific receptor binding is unique and interesting. However, I agree with the reviewers that some of the conclusions could use more experimental support, as detailed in their comments. The failure to detect direct binding of the PsV to HSPGs on the cell surface in in vitro assays contradicts much of the published literature. For example, others have found that HPV capsids bind cultured cell lines in suspension, i.e, in the absence of ECM. Do EDTA-suspended HaCaT cells bind PsV? Is the binding HSPG dependent? If the authors think that failure to detect direct cell binding of HaCaTs is an unusual feature of these cell lines or culture condition,s then it would be helpful to provide an explanation. However, it is worth noting that an in vitro system where the cells do not directly bind capsids through HSPG interactions would be a much better model for studying the stages of HPV infection that are the focus of this study, since there is no direct binding of keratinoctyes in vivo.

      We are thankful for this comment that had a strong influence on the revision. The suggested experiment has been incorporated as new Supplementary Figure 1. It shows that many more PsVs bind to the cell surface of cells in suspension than to adhered cells. As suggested by the reviewing editor, we explain now that HaCaT cells are a suitable model system for studying the in vivo transport from the ECM to the cell body that in these cells, due to their polarization, cannot be bypassed (for more details please see our replies above addressing these issues).

      Because conclusions drawn regarding HS interactions are largely based on experiments using a single HS mAb, it is important that the specificity of this mAb is described in more detail, either based on the literature or further experimentation.

      We provide now detailed information about the HS antibodies used in the study. We state on line 282 ‘Using an antibody that reacts with an epitope in native heparan sulfate chains…’ and on line 286 ‘we use an antibody that reacts with a HS neo-epitope generated by heparitinase-treated heparan sulfate chains…’ and in the methods section ‘For Heparan sulfate (HS) a mouse IgM monoclonal antibody (1:200) (amsbio, cat# 370255-S) was used that reacts with an epitope in native heparan sulfate chains and not with hyaluronate, chondroitin or DNA, and poorly with heparin (mAb 10E4 (David et al., 1992)). For HS neo-epitope (Yokoyama et al., 1999) detection, a mouse monoclonal antibody (1:200) (amsbio, cat#370260-S) was used that reacts only with heparitinase-treated heparan sulfate chains, proteoglycans, or tissue sections, and not with heparinase treated HSPGs. The antibody recognizes desaturated uronic acid residues (mAb 3G10 (David et al., 1992)).’

      Reviewer #1 (Recommendations for the authors):

      (1) The phrase "tight association" or similar is repeatedly used and is not acceptable for microscopic studies; use "close association", which has no affinity connotations.

      Has been changed as suggested by the referee.

      (2) Why are lysine-coated coverslips used for microscopy? HaCaT cells adhere tightly to untreated glass, and this coating could affect the distribution of ECM and extracellular PsV.

      We believe a tight association of the basal cell membrane to its substrate, as in vivo, where the basal membrane is tightly adhered to other cells, is important in these experiments. In weakly adherent cells more PsVs may bind to the cell surface, bypassing the transport step. Hence, although HaCaT cells may not require the coat and would be able to adhere to glass, the association may not be tight enough to mimic in vivo conditions.

      (3) What is the reason to use detection of the pseudogenome for some of the experiments instead of L1 detection throughout? The process of EdU detection is sufficiently denaturing to affect some protein epitopes. The introduction of this potential artifact doesn't seem warranted for capsid detection experiments.

      The L1 and the Itgα6 antibody are from the same species, wherefore we have used in Figures 4 and 6 click-labeling of the reporter plasmid. We do not disagree with the notion of the referee, that EdU detection may denature the epitope of some proteins. For instance, we have observed a different staining pattern for CD151; for Itgα6 and HS we saw no obvious difference in the staining patterns. In double staining experiments using L1 antibody and click-labeling, both staining patterns overlapped very well, indicating that click-labeling is suitable to visualize PsVs.

      (4) What concentration of TMA-DPH was used?

      TMA-DPH is a poorly water-soluble dye that becomes strongly fluorescent upon insertion into a membrane. Because of its poor water solubility, a precise concentration cannot be given. We added 50 µl of a saturated TMA-DPH solution in PBS to 1 ml of PBS in the imaging chamber. We state this now in the methods section.

      (5) Line 419: This statement is misleading. Although PsV interaction with HSPG on the ECM is crucial for infectious transfer to cells, the majority of the PsV binding on the ECM has been attributed to interaction with laminin 332. Treatment of PsV with heparin causes sequestration to the ECM.

      We are sorry for the confusion and have removed the misleading statement.

      (6) Some reference choices are poor:

      Line 54: Ozbun and Campos, this is not the correct reference

      In the review we cited, in the introduction it is stated that PsVs establish infection via a break in the epithelial barrier? However, we have replaced this reference by a review that focuses more on epithelial wounding: ‘Ozbun, Michelle A. (2019): Extracellular events impacting human papillomavirus infections: Epithelial wounding to cell signaling involved in virus entry. In Papillomavirus research (Amsterdam, Netherlands) 7, pp. 188–192. DOI: 10.1016/j.pvr.2019.04.009.’

      Line 2012: Doorbar et al., this is not the correct reference.

      Thank you for pointing this out (..we assume the referee refers to line 104 and not line 2012). We have noticed this error during revision. As it is difficult to get a specialized review on this topic, we now cite Ozbun and Campus, 2021 that states PsVs are ‘structurally and immunologically indistinguishable from lesion- and tissue-derived HPVs.’

      Minor issues:

      (1) It is difficult to appreciate the ECM and cell surface binding pattern from the provided images, which do not even contain an entire cell. We need to see a few representative field views with the ECM delineated with laminin 332 staining, as HS antibodies stain both the ECM and cell surface.

      We now provide overview images in Supplementary Figure 4. The only experiment requiring a clear delineation between ECM and cell surface is the experiment of Figure 4. Here, we do not use the HS as a reference staining because it stains both the ECM and the cell surface.

      (2) For Figure 1E, the cells were only infected for 24 hours. The half-time for infectious internalization of HaCaT cells was shown to be 8 hours for cell-associated PsV and closer to 20 hours for PsV that was associated with the ECM prior to cell association (Becker et al., 2018). Why was such a short infection time chosen?

      During assay establishment it has been observed that after 24 h the luciferase activity is optimal.

      (3) Figure 5, the staining of uninfected cells +/- cyto treatment needs to be included.

      Now visible in new Figure 3.

      I am confused by lines 54-57. It seems as if the authors are claiming that HSPGs are not present on the ECM. This sentence, as written, is misleading.

      We agree, and state now on line 58 ‘Here, virions bind to the linear polysaccharide heparan sulfate (HS) that is present in the extracellular matrix (ECM) but as well on the plasma membrane surface. HS is attached to proteins forming so called heparan sulfate proteoglycans (HSPGs).’

      Reviewer #2 (Recommendations for the authors):

      There are further issues that are not pertaining to the study design that I find important.

      (1) It remains speculative whether the virions that are transferred from the ECM are actually structurally modified.

      The newly added Figure 2, showing that leupeptin blocks infection in our assay, suggests that virions indeed are primed.

      (2) The origin of HS correlated with virions on the cell body after transfer is also not clear: does the virus associate with cell surface HS, or does it bring HS from the ECM? Simply staining HS against Nsulfated moieties does not allow such conclusions.

      This issue has been already raised in the public review to which we replied above. In brief, we agree that the transient increase of the PCC between PsVs and HS in the cell body region can be also explained by PsVs coming from the ECM without HS and binding to cell surface HS, or from PsVs binding directly (not via the ECM) to cell surface HSPGs. However, there are two more arguments indicating that PsVs are coated with HS. Please see our detailed reply above.

      (3) Figure 1: There are few, if any, filopodia in untreated cells. It would be good to quantify their abundance to substantiate that resting HaCat cells are indeed a good model for filopodial transport bs. membrane retraction / spreading. In HaCat ECM, the virus also binds to laminin-332 for a good part. Would this not also confound the analysis?

      At first glance, the number of filopodia appears to be too low to account for such an efficient transport. However, please note that the formation of filopodia is very dynamic, and that they can form and disappear within minutes (see below). We also often observe many PsVs aligned at one filopodium. Moreover, not every cell periphery exhibits large accumulations of PsVs. Therefore, we believe it is in principle possible that filopodia are largely responsible for the transport. We cannot exclude that we overestimate the transport rate due to partial cell spreading after CytD removal, which, however, we consider as rather unlikely as in Figure 2 we observe no increase in the PCC when leupeptin was present during the CytD incubation. Under these conditions, PsVs do not translocate but cells could spread, and this would increase he PCC between PsVs and F-actin if cells would spread into the area of accumulated PsVs.

      We now state on line 304: ‘This suggests that the half-time of PsV translocation from the periphery to the cell body is about 15 min. In fact, the half-time maybe longer, as we cannot exclude that cell spreading after CytD removal contributes to less PsVs measured in the cell periphery.’ and on line 477 ‘As mentioned above, the half-time could be longer if cell spreading is in part responsible for the translocation of PsVs onto the cell body. However, we assume that this is rather unlikely, as cell spreading would increase the PCC between PsVs and F-actin under a condition where filopodia mediated transport is blocked but not cell spreading, which is not the case (Figure 2B and D, CytD/leupeptin).’

      (4) Figure 2: This would benefit from live cell analysis. There are considerable amounts of virions on the cell body, which partially contradicts statements from Figure 1.

      Does the referee refer to the images shown in Figure 4 (old Figure 2)? Please note that at CytD/0 min there are hardly any PsVs in the cell body region, the fluorescence (magenta LUT) is autofluorescence (this is explained in the results section). Only at later time points PsVs are in the cell body region.

      The fast transfer to the cell body after cyto D washout is based on the assumption that filopodia formation and transport along them (and not membrane extension) occur quickly. Is this reasonable?

      We are no experts on filopodia, but one finds references suggesting that they grow at rates of several µm per minutes and have lifetimes between a few seconds and several minutes. Hence, within the 15 min we determine for the transport, cells may need a few minutes to recover from CytD, a few minutes to form filopodia that reach out into the ECM, and a few minutes for the transport itself. However, we agree that we cannot exclude membrane extension contributing to our observed transport, although we consider this as rather unlikely (see above).

      (5) Figure 3: The rationale of claiming the existence of 'endocytic structures' needs to be better explained and quantified in the according supplementary figure.

      We now state in the legend ‘We propose that the agglomerated CD151 maxima close to PsVs feature the characteristics of endocytic structures, as CD151 has been shown to co-internalize with PsVs (Scheffer et al. 2013), and as these structures invaginate into the cell, like PsV filled tubular organelles previously described by electron microscopy (Schelhaas et al. 2012).’ For a proper quantification of these highly variable structures a much larger sample would be required.

      The formation of virus-filled tubules upon cytoD treatment has been previously reported. Are these viruses that come from the cell body or from the ECM?

      With the new data and explanations that have been added to the manuscript, it should be clear that it is reasonable to assume that they come largely from the ECM.

      (6) Figure 4: How are the subcellular ROIs chosen? Is there not a bias by not studying a full cell?

      We now explain better how we chose cells for analysis. We state on line 138 ‘Instead, we focus on isolated HaCaT cells or cells at the periphery of cell patches. In these cells, we find more PsVs per cell than one would expect from the employed 50 viral genome equivalents (vge) per cell, as PsVs are unequally distributed between the cells. Moreover, these PsVs usually are not homogenously distributed around the cell but concentrate at one region. We investigate the translocation of PsVs from these regions, defining ROIs for analysis that cover PsVs at the periphery and the cell body (see Supplementary Figures 6A and 8A).’

      (7) Figure 5/6: The data needs a better analysis on correlation by using randomisation as explained above.

      Please see our reply to the same point of the public review raised by the same referee.

      (8) Figure 7: This model involves CD151 being a mediator in transfer, but this has not been functionally shown. There are HaCaT CD151 KO cells available (from the Sonnenberg lab), it would be good to use those to test the model and whether transfer indeed involves CD151.

      As already stated above, we are sorry for having raised the impression that PsVs bind directly to CD151. The model Figure has been modified. Please see our reply above.

      (9) The manuscript would benefit from a number of experiments addressing the most crucial issues:

      (a) As mentioned before, the use of blebbistatin, which blocks myosin II function and arrests actin retrograde flow within seconds of addition, would be a good inhibitor to control for transfer in at least some of the most crucial experiments.

      In Figure 8 we have tested blebbistatin. Please see our reply above.

      (b) Live cell analysis would allow for monitoring of whether membrane retraction upon cytoD treatment would have to be taken into account for the analysis of the data. The same is true for the cytoD washouts, upon which most cells exhibit pronounced membrane spreading. The latter is important to support filopodial transport rather than membrane ruffling and spreading, leading to the clearance of extracellular virions from the ECM.

      We agree that this would be desirable. As replied above, we now discuss the issue of possible membrane spreading and reason why we consider it as rather unlikely.

      (c) To rid oneself of the issue of plasma membrane-bound virions as a confounding factor, one could use cells treated by sodium chlorate, which leads to undersulfation of HS on the cell surface, and seed them onto ECM with functional HSPGs. This would then indeed establish that the HS and virus are transferred together.

      We agree that this would be a smart experiment. As the main focus of our study is not clarifying whether PsVs are coated with HS or not, we gave other experiments priority.

      (10) The manuscript is, while carefully and thoughtfully worded on the issue of microscopy analysis, for a good part, extrapolating too strongly from the authors' data and unsubstantiated assumptions to conclude on their model. It would be good if the authors would support their claims with previous or their own experimental work. Just two examples of several: the assumption that cell-bound virions are negligible should be substantiated, as the literature would indicate otherwise.

      We determined the PsV density in adhered, CytD treated cells, and find around 0.14 per µm<sup>2</sup> (Supplementary figure 1B), which is 4 to 5-fold less when compared to the PsV density quantified in an area covering the cell body and the periphery (Figure 1B, see line 174 for PsVs/µm<sup>2</sup> values). Quantifying the PsV density only in the periphery would yield a severalfold larger difference. However, due to the limited resolution of the microscope we would strongly underestimate the PsV density in the accumulations. We prefer not to discuss this in detail, as exact numbers are difficult to obtain.

      Line 129: Cyto D should not inhibit the enzymes modifying HS or proteins (including virions). This is true, but cytoD may limit their secretion and abundance.

      We show in Figure 3 that CytD does not reduce HS staining (e.g., by limiting HS secretion, as suggested by the referee), suggesting that it rather does not limit secretion.

      We thank the referee´s and the reviewing editor for their helpful comments!

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      __Reviewer #1 __

      *This study "Interpreting the Effects of DNA Polymerase Variants at the Structural Level" comprises an in-depth analysis of protein sequence variants in two DNA polymerase enzymes with particular emphasis on deducing the mechanistic impact in the context of cancer. The authors identify numerous variants for prioritisation in further studies, and showcase the effectiveness of integrating various data sources for inferring the mechanistic impact of variants. *

      *All the comments below are minor, I think the manuscript is exceptionally well written. *

      *> The main body of the manuscript has almost as much emphasis on usage of the MAVISp tool as analysis of the polymerase variants. I don't think this is an issue, as an illustrated example of proper usage is very handy. I do, however, think that the title and abstract should better reflect this emphasis. E.g. "Interpreting the Effects of DNA Polymerase Variants at the Structural Level with MAVISp". This would make the paper more discoverable to people interested in learning about the tool. *

      We have changed the manuscript title according to the reviewer’s suggestions, and the current title is “Interpreting the Effects of DNA Polymerase Variants at the Structural Level using MAVISp and molecular dynamics simulations.”

      • *

      *> Figure 1. I don't believe there is much value in showing the intersection between the datasets (especially since the in-silico saturation dataset intersects perfectly with all the others). As an alternative, I suggest a flow-chart or similar visual overview of the analysis pipeline. *

      • *

      We moved the former Figure 1 to SI. We decided to keep it at least in SI because it provides guidance on the number of variants relative to the total reported across the different disease-related datasets annotated with the MAVISp toolkit. On the other hand, the suggestion of a visual scheme for the pipeline followed in the analyses is a great idea. We have thus added Figure 1, which illustrates the pipeline workflows for analysis of known pathogenic variants and for discovery of VUS and other unknown variants, as suggested by the reviewer.

      *> Please note in the MAVISp dot-plot figure legends that the second key refers to the colour of the X-axis labels rather than the dots *

      We have revised the code that produces the dotplot so the second key is placed closer to the x-axis and clearer to read.

      Missing figure reference (Figure XXX) at the bottom of page 16

      We apologize for this mistake. Figures, contents, and the order have changed significantly to address all reviewers’ comments; this statement is no longer included. Also, we have carefully proofread the final version of the manuscript before resubmitting it.


      __Reviewer #2 __

      • *

      This manuscript reports a comprehensive study of POLE and POLD1 annotated clinical variants using a recently developed framework, MAVISp, that leverages scores and classifications from evolutionary-based variant effect predictors. The resource can be useful for the community. However, I have a number of major concerns regarding the methodology, the presentation of the results.

      *** On the choice of tools in MAVISp and interpretation of their outputs *

      - Based on the ProteinGym benchmark: https://proteingym.org/benchmarks*, GEMME outperforms EVE for predicting the pathogenicity of ClinVar mutations, with an AUC of 0.919 for GEMME compared to 0.914 for EVE. Thus, it is not clear for me why the authors chose to put more emphasis on EVE for predicting mutation pathogenicity. It seems that GEMME can better predict this property, without any adaptation or training on clinical labels. *

      • *

      We appreciate this comment, but we should not exclude EVE entirely from our data collection or from VEP coverage under MAVISp, based on a difference in AUC of 0.005. It was not our intention to place more emphasis on EVE predictions, and we have revised it accordingly. We would like to clarify the workflow we use for applications of the MAVISp framework in “discovery mode,” i.e., for variants not reported as pathogenic in ClinVar. This relies on AlphaMissense to prioritize the pathogenic variants and then retain further only the ones that also have an impact according to DeMaSk, which provides further indication for loss/gain-of-fitness. DeMaSk nicely fits the MAVISp framework, as it was trained on data from experimental deep mutational scans, which we generally import in the EXPERIMENTAL_DATA module. We have revised the text to make this clearer. GEMME and EVE (or REVEL) can be used for complementary analysis in the discovery workflow. Other users of MAVISp data might want to combine them with a different design, and they have access to all the original scores in the MAVISp database CSV file and the code for downstream analysis to do so. The choice for our MAVISp discovery workflow is mainly dictated by the fact that we have noticed we do not always have full coverage of all variants in many protein instances for EVE, GEMME, and REVEL. In particular, since the reviewer highlights GEMME over EVE, GEMME is currently unavailable for a few cases in the MAVISp database. This is because we need to rely on an external web server to collect the data, which slows down data collection on our end.

      Additionally, we have encountered instances where GEMME was unable to provide an output for inclusion in the MAVISp entries. When we designed the workflow for variant characterization in focused studies, we also made practical considerations. We are also exploring the possibility of using pre-calculated GEMME scores from

      https://datadryad.org/dataset/doi:10.5061/dryad.vdncjsz1s, but we encountered some challenges at the moment that deserve further investigations and considerations. For example, MAVISp annotations rely on the canonical isoform as reported in Uniprot, which can lead to mismatches with the GeMME pre-computed scores. So far, we have identified a couple of entries whose canonical isoforms no longer match the one in the pre-computed GEMME score dataset. Another limitation is the absence of the original MSA files in the dataset, which we would need for a more in-depth comparison with the ones we used for our calculations. We are facing some challenges in reproducing the MSA output from MMseq2-based ColabFold protocol in this context that need to be solved first. Overall, the dataset shows potential for integration into MAVISp, but we need to define the inclusion criteria and compare it with the existing results in more detail.

      Additionally, since the principle behind MAVISp is to provide a framework rooted in protein structure, AlphaMissense was the most reasonable choice for us as the primary indicator among the VEPs for our discovery workflow, and it has performed reasonably well in this case study and others.

      Of course, our discovery design is one of the many applications and designs that could be envisioned using the data provided and collected by MAVISp. We also include all raw scores in the database's final CSV files, allowing other end users to decide how to use them in their own computational design. The design choice we made for the discovery phase of focused studies, using MAVISp to identify variants of interest for further studies, has been applied in other publications (see https://elelab.gitbook.io/mavisp/overview/publications-that-used-mavisp-data) in some cases together with experiments. It is also a fair choice for the application, as the ultimate goal is to provide a catalog of variants for further studies that may have a potentially damaging impact, along with a corresponding structural mechanism.

      We have now revised the results section text where Table 1 is cited to clarify this. We also revised the terminology because we are using the VEPs' capability to predict damaging variants, rather than the pathogenic variants themselves. Experiments on disease models should validate our predictions before concluding whether a variant is pathogenic in a disease context, and we want to avoid misunderstandings among readers regarding our stance on this matter.

      - Which of the predictors, among AM, EVE, GEMME, and DeMaSK, provide a classification of variants and which ones provide continuous scores? This should be clarified in the text. If some predictors do not output a classification, then evaluating their performance on a classification task is unfair. The MAVISp framework sets thresholds on the predicted scores to perform the classification and it is unclear from reading the manuscript whether these thresholds are optimal nor whether using universal cutoff values is pertinent. For instance, for GEMME, a recent study shows that fitting a Gaussian mixture to the predicted score distribution yields higher accuracy than setting a universal threshold (https://doi.org/10.1101/2025.02.09.637326*). Along this line, for predictors that do not provide a classification, I am not convinced of the benefit for the users of having access to only binary labels, instead of the continuous scores. The users currently do not have any idea of whether each variant is borderline (close to theshold) or confident (far from threshold). *

      We agree with the reviewer, and this is due to us not being sufficiently clear in the manuscript. We have now revised the first part of the results to clarify this and to explain how we use the MAVISp data for application to focused studies, where the goal is to identify the most interesting variants that are potentially damaging and have a linked structural mechanism. Of course, there are other applications for leveraging the data in the database. We do offer scores to variants instead of just classification labels in the MAVISp csv file. They can be accessed, together with the full dataset, through the MAVISp website and reused for any applications.

      Additionally, we used the scores in the revised manuscript for the VUS variant ranking (Figure 5), applying a strategy recently designed as an addition to the downstream analysis tool kit of MAVISp (​​https://github.com/ELELAB/MAVISp_downstream_analysis), thereby allowing the scores themselves to be taken into account. Also, in the final part of the manuscript, the VEP scores have been used to introduce the ACMG-like classification of the variants in response to reviewer 3 (Figure 9 and Tables S3-S4). We absolutely agree that it is informative to keep the continuous scores, and we have never overlooked this aspect. However, we also need a strategy with a simpler classification to highlight the most interesting variants among thousands or more to start an exploration. This is why we included the support with dotplots and lolliplots, for example. Our purpose here is to identify, among many cases, those with a potentially damaging signature (and thus we need a binary classification for simplicity). Next, we evaluate whether this signature entails a fitness effect (with DeMaSk), and finally, retain only the cases we can identify with a structural mechanism to study further.

      The thresholds we set as the default for data analysis of dotplots in GEMME and DeMaSk are discussed in __Supplementary Text S3 __of the original MAVISp article. In brief, we carried out an ROC analysis against the scores for known pathogenic and benign variants in ClinVar with review status higher than 2. For applicative purposes, one could design other strategies to analyze the MAVISp data too; it is not limited to the workflow we decided to set as the primary one for our focused studies, as already mentioned above.

      We have now also included classification based on the GMM model applied to GEMME scores for POLE and POLD1, so it can be evaluated against other designs for our protein of interest (see Table 1 in the revised version). The method section has been revised to include this part, and the ProteoCast pre-print is cited as a reference. We have not yet officially included this classification in the MAVISp database because we must first follow internal protocols to meet the inclusion criteria for new methods or analyses. We will do so by performing a similar comparison on the entire MAVISp dataset and focusing on high-quality variants, as ClinVar annotations, as we did to set the current thresholds for GEMME in Supplementary Table S3 of the original MAVISp article. We need to allocate time and resources to this pilot, which is scheduled for Q1 2026.

      ** On the presentation and impact of the results

      • While reading the manuscript, it is difficult to grasp the main messages. The text contains abundant discussion about the potential caveats of the framework, the care that should be taken in interpreting the results, and the dependency on the clinical context. Although these aspects are certainly important, this extensive discussion (spread throughout the manuscript) obscures the results. Moreover, the way variants are catalogued throughout the text makes it difficult to grasp key highlights. The reader is left unsure about whether the framework can actually help the clinical practitioners.

      We have revised the text to make it easier to read, including additional MD simulations of three variants of interest and more downstream analyses to clarify the mechanisms of action. We also added a recap of the most interesting variants and their associated mechanisms, along with the ranking of the variants using the different features available in the MAVISp csv file for the VUS. We hope that this makes it more accessible and valuable. In the original publication, Table 2 aimed to provide a summary of the interesting variants, and we have revised it now in light of the ranking results and the additional analyses that allow us to clarify the mechanisms of action further. We have also introduced__ Figure 9 and Tables S3 and S4__, which present data on ACMG-like classification for VUS that can fall into the likely pathogenic or benign categories.

      • In many cases, the authors state that experimental validation is required to validate the results. Could they be more explicit on the experimental design and the expected outcome?

      We have added a section on the point above at pages 21 and 30, where, alongside the summary of mechanisms per variant, we propose the experimental readouts to use based on known MAVE assays or assays that could be designed.

      • AlphaMissense seems to tend to over-predict pathogenicity. Could the authors comment on that?

      We are unsure whether this comment relates to our specific case or to a general feature of AlphaMissense.

      In the latest iteration of our small benchmarking dataset for POLE and POLD1 (as shown in the paper), we achieve a sensitivity of 1 and a balanced specificity of 0.96 for AlphaMissense, which suggests that AlphaMissense does not over-predict pathogenicity very significantly in these proteins, predicting true negatives (i.e., non-pathogenic) mutations quite accurately. As performance was sufficient in our case, we deemed recalibrating the classification threshold for AlphaMissense unnecessary.

      We are aware that this is not necessarily the case for every gene, e.g., it has been shown that AlphaMissense shows lower specificity in some cases (see e.g. 10.3389/fgene.2024.1487608, 10.1038/s41375-023-02116-3). This is also why we found it essential to evaluate its performance with its recommended classification on a gene-specific basis, as done here. In the future, we will keep a critical eye on our predictors to understand whether they are suitable for the specific case of study, or whether they require threshold recalibration or the use of a different predictor.

      ** On specific variants

      • The mention of H1066R, H1068, and D1068Y is very confusing. There seems to be a confusion between residue numbers and amino acid types.

      We have revised the text for typos and errors. This part of the text changed, so these specific variants are no longer mentioned.

      • A major limitation of the 3D modeling is this impossibility to include Zn2+ coordination by cysteine residues. This limitation holds for both POLE and POLD1. Could the authors comment on the implication of this limitation for interpreting the mechanistic impact of variants. In particular, there are several variants reported in the study that consist in gain of cysteines. The authors discuss the potential impact of some of these mutations on the structural stability but not that on Zn coordination or the formation of disulphide bridges.

      This is a great suggestion. We had, for a long time, a plan in the pipeline to include a module to tackle changes in cysteines. We have now used this occasion to include a new module that allows identifying mutations: 1) that are likely to disrupt native disulphide bridges and annotate them as damaging or 2) potential de novo formation of disulphide bridges upon a mutation of a residue to a cysteine, also annotated as damaging with respect to the original functionality. We also included a step that evaluates if the protein target is eligible for the analysis based on the cellular localization, since in specific compartments the redox condition (such as the nucleus) would not favour disulfide bridges. The module has been added to MAVISp, and we are collecting data with the module for the existing entries in the database to be able to release them at one of the following updates. More details are on the website in the Documentation section (https://services.healthtech.dtu.dk/services/MAVISp-1.0/). We could not apply the module to POLE and POLD1 since they are nuclear proteins, and it would not be meaningful to look into this structural aspect either in connection with loss of native cysteines or de novo disulfide bridge formation upon mutations that change a wild-type residue to a cysteine.

      We would like to clarify that the structures we use, as it is a focused study rather than high-throughput data collection for the first inclusion in the MAVISp database, have been modelled with zinc at the correct position. It is just the first layer of high-throughput collection with MAVISp, which uses models without cofactors unless the biocurator attempts to model them or we move to collect further data for research studies (as done here). Prompted by this confusion, we have now added a field to the metadata of a MAVISp entry indicating the cofactor state. Nevertheless, the RaSP stability prediction does not account for the cofactor's presence, even when it is bound in the model. This is discussed in the Method Section. We thus did not further analyze the variants in sites directly coordinating the metal groups due to these limitations.

      • MAVISp does not identify any mechanistic effect for a substantial portion of variants labelled as pathogenic. Could the authors comment on this point?

      We are not sure how to interpret this question. It can be read two ways. Either the reviewer is asking about the known pathogenic ClinVar variants without mechanistic indicators, or more generally, the ones that we label “pathogenic” in discovery (we actually refer to more usually damaging in the dotplots), and for which we cannot associate a mechanism.

      Overall, as a general consideration, it would be challenging to envision a mechanism for each variant predicted to be functionally damaging. For example, in the case of POLE and POLD1, we still lack models of complexes that did not meet the quality-control and inclusion criteria for the binding-free-energy scheme used by the LOCAL INTERACTION module. Also, when it comes to effects on catalysis or to analyzing effects in more detail at the cofactor sites, we could miss effects that would require QM/MM calculations. Other points we have not yet covered include cases related to changes in protein abundance due to degron exposure for degradation, which is one of the mechanistic indicators we are currently developing. Moreover, we used only unbiased molecular simulations of the free protein, and we would need future studies with enhanced sampling approaches and longer timescales to better address conformational changes and changes in the population of different protein conformational states induced by the mutation (including DNA). This can be handled formally by the MAVISp framework using metadynamics approaches, but it would be outside the scope of this work and is a direction for future studies on a subset of variants to investigate in even greater detail.

      Furthermore, modifications related to PTM differ from phosphorylations. Anyway, our scope is to use the platform to provide structure-based characterization of either known pathogenic variants or potentially damaging ones predicted by VEPs, and focus on more detailed analyses of those. As we develop MAVISp further and design new modules, we will also be able to tackle other mechanistic aspects. This discussion, however, is more relevant to the MAVISp method paper itself.

      Moreover, none of the variants discussed are associated with allosteric effect. Is this expected?

      .

      In general, allosteric mutations are rare. Nevertheless, in these case studies, the size of the proteins under investigation also poses some challenges for the underlying coarse-grain model used in the simple mode to generate the allosteric signalling map, as we have found it performs best on protein structures below 1000 residues

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      The manuscript utilized the MAVISp framework to characterize 64,429 missense variants (43,415 in POLE and 21,014 in POLD1) through computational saturation mutagenesis. The authors integrate protein stability predictions with pathogenicity predictors to provide mechanistic insights into DNA polymerase variants relevant to cancer predisposition and immunotherapy response. There are discussions of known PPAP-associated variants and somatic cancer mutations in the context of known data and some proposed variants of interest (which are not validated).

      Major comments:

      I was unaware of the MAVISp framework. It concerns me that alebit this paper has a lot of technical details about the framework, its not the paper about the framework. I did look into the paper https://www.biorxiv.org/content/10.1101/2022.10.22.513328v5 which keeps benign updated (version five now) for three years, but I do not see a peer reviewed version. It would be unfair of me to peer review the underlying framework of the work but together with the previous comments, I am a bit concerned.

      We have intentionally left the MAVISp resource paper as a living pre-print until we have sufficient data in the database that could be useful to the rest of the community. We have been actively revising the manuscript, thanks to comments from users in previous versions, to ensure it provides a solid resource. We had attempted approximately one and a half years ago a submission to a high-impact journal and even addressed the reviewers’ comments there. Still, we did not receive feedback for a long time, and ultimately, we were not sent to the reviewers again despite more than six months of work on our side. After that, we realized that we would benefit from collecting a larger dataset, and we invested time and effort in that and submitted again for revision, this time through Review Commons in the Summer of 2025. Anyway, the paper has been peer-reviewed by three reviewers through Review Commons. We submitted the revised version and response to reviewers, and it is now under revision with Protein Science. The reviewers’ comments and our responses can be found in the “Latested Referred Preprints” on the Review Commons website with the date of 17th of October 2025.

      We would also like to clarify another point on this. In our experience, it is common practice to keep sofware on BioRxiv even for a long and to bring it to a more complete form in parallel with the community already applying it. This allows feedback from peers in a broad manner. We had similar experiences with MoonlightR, where the first publications with applications within the TCGA-PanCancer papers came before the publication of the tool itself, and the same has been for any of our main workflows, such as MutateX or RosettaDDGPrediction, which are widely used by the community. Finally, it can be considered that the MAVISp framework has already been used in different published peer-review studies (since 2023), attesting to its integrity and potential. Here, the reviewer can read more about the studies that used MAVISp data or modules: https://elelab.gitbook.io/mavisp/overview/publications-that-used-mavisp-data

      For example, the authors are using AlphaFold models to predict DDG values. Delgado et al. (2025, Bioinformatics) explicitly tested FoldX on such models and concluded that "AlphaFold2 models are not suitable for point mutation ΔΔG estimation" after observing a correlation of 0.06 between experimental and calculated values. AlphaFold's own documentation states it "has not been validated for predicting the effect of mutations". Pak et al. (2023, PLOS ONE) showed correlation between AlphaFold confidence metrics and experimental ΔΔG of -0.17. Needless to say that these concerns seriously undermine the validity of a major part of the study.

      We appreciate the reviewer’s comments and would like to clarify a point regarding the MAVISp STABILITY module, which we believe may have been misunderstood. Based on the studies cited by the reviewer, which critique the use of AF-generated mutant structures for assessing stability effects, we understand that this assumption may have led to the concern.

      The STABILITY module utilises three in silico tools (FoldX, Rosetta, and RaSP) to assess changes in protein stability resulting from missense mutations. Importantly, the input to these assessments consists of AF models of the WT protein structures, not of AF-generated mutant structures. The mutants are generated using the FoldX and Rosetta protocols, along with estimates of the changes in free energy. For further details and clarification, we kindly refer the reviewer to the MAVISp original publication.

      Also, one should consider the goal of our use of free energy calculations: not to identify the exact ΔΔG values, but to correlate with data from in vitro or biophysical experiments, such as those from cellular experiments like MAVE. We, other researchers, have shown that we have a good agreement in the MAVISp paper (case study on PTEN as an example in the original MAVISp publication and https://pmc.ncbi.nlm.nih.gov/articles/PMC5980760/ https://pubmed.ncbi.nlm.nih.gov/28422960/,10.7554/eLife.49138). Also, we had, before even designing the STABILITY module for MAVISp, verified that we can use WT structures from AlphaFold (upon proper trimming and quality control with Prockech) instead of experimental structure without compromising accuracy in the publications of the two main protocols of the STABILITY module (MutateX and RosettaDDGPrediction and a case study on p53, https://doi.org/10.1093/bib/bbac074,https://doi.org/10.1002/pro.4527). In the focused studies, we also carefully consider whether the prediction is at a site with a low pLDDT score or surrounded by other sites with a low pLDDT score before reaching any conclusions. The pLDDT score is reported in the MAVISp csv file exactly to be used for flagging variants or looking closer at them, as we discuss in this study (see, for example, Figure 2). Additionally, it should be noted that we employ a consensus approach across the two classes of methods in MAVISp to account for their limitations arising from their empirical energy function or backbone stiffness. Furthermore, in the focused studies, we also collected molecular dynamics simulations for the ensemble mode and reassessed the stability on different conformations from the trajectory to compensate for the issues with backbone stiffness of FoldX, RaSP, and Rosetta ΔΔG protocols.

      I have to add that this is also true for the technical choices: Several integrated predictors (DeMaSk, GEMME) are outperformed by newer methods according to benchmarking studies (https://www.embopress.org/doi/full/10.15252/msb.202211474). AlphaMissense, while state-of-the-art, shows substantial overcalling of pathogenic variants. could ensemble meta-predictors (REVEL, BayesDel) improve accuracy?

      The MAVISP framework includes REVEL as one of the VEPs available for data analysis. In this way, we were representing one of the ensemble meta-predictors. This is explained in the MAVISp original paper. We were not aware of BayesDel, which we will consider for one of the next pilot projects to assess new tools for the framework (see more details below on how we generally proceed). Currently, we cannot use REVEL for all variants because we do not necessarily have genomic coordinates for them. We retrieve genomic-level variants corresponding to our protein variants from mutation databases, where available (e.g., ClinVar, COSMIC, or CbioPortal). However, as we strive to cover every possible mutation, several of the variants in MAVISp are not in the database, which means we do not have the corresponding genomic variation for those, limiting our ability to annotate them with VEPs. In the future (see GitHub issue https://github.com/ELELAB/cancermuts/issues/235), we will revise the code to identify the genomic variants that could give rise to each protein mutation of interest, thereby increasing the coverage of VEP annotations.

      We can see from the work cited by the reviewer that ESM-1v, EVE, and DeepSequence are among the top performers, whereas reviewer 2 cited another work in which GEMME outperforms EVE. We have been covering all of them, except ESM-1v, in our framework. We are planning to evaluate for inclusion in MAVISP some of the new top-performing predictors, including ESM-1v, in Q2 2026 (according to the protocol described later in this answer), which is why it is not available yet.

      In our discovery protocol (i.e., when we work on VUS or variants not classified in ClinVar), we generally use AlphaMissense as the first indicator of potentially damaging variants. EVE, REVEL, or GEMME could be used in the case that AlphaMissense data are missing or as a second layer of evidence in the case we want, for example, to select a smaller pool of variants for experimental validation in a protein target with too many uncharacterized variants and too many that pass the evaluation with our discovery workflow. Finally, we rely on DeMaSk, as it also provides information on possible loss- or gain-of-fitness signatures to further filter the variant of interest for the search of mechanistic indicators. Since the MAVISp framework is modular, other users may want to use the data differently and design a different workflow. They have access to them (scores and classifications) through the web portal. The fact that we combine AlphaMissense with DeMaSk could yield final results after further variant filtering and mitigate the issue that AlphaMissense risks over-predicting pathogenicity.

      In general, we work to keep MAVISp up-to-date, and we have developed a protocol for the inclusion of new methodologies in the available module before generating and releasing data with new tools in the database. In particular, we perform comparative studies using data already available in the database to evaluate the performance of new approaches against that of the tools already included. Depending on the module, we use different golden standards that we are also curating in parallel, and it would make sense to apply for that specific module. For example, if the question is to evaluate VEP, we would compare it against ClinVar known variants with good review status. If the VEP performs better than the currently included ones, we can include it as an additional source of annotations and evaluate whether we could change the protocol for the discovery/characterization of variants. We operate similarly for the structural modules. For example, for stability, we are importing experimental data from MAVE assays on protein abundance and use them as a golden standard where we evaluate new approaches against the current FoldX and Rosetta-based consensus for changes in folding free energies. Instead, If we find evidence that suggests switching to a new method or integrating it would be beneficial, we will do so as a result of these investigations. An example of our working mode for evaluating tools for inclusion in the framework is illustrated by how we handled the comparison between RaSP and Rosetta in the MAVISp original article (Supplementary file S2) before officially switching to RaSP for high-throughput data collection. We still maintain Rosetta, especially in focused studies, to validate further variants classified as uncertain.

      *Further, I found the web site of the framework, where I looked for the data on these models, rather user unfriendly. Selecting POLD1, POLD2, or POLE tells me I am viewing entries A2ML1, ABCB11, ABCB6 respectively, when I search for POL and then click: these are the first three entries of the table, bot the what I click on. displaying the whole table and clicking on POLD1, gets me to POLD1. However, when I selected "Damaging mutations on structure" I get "Could not fetch protein structure model from the AlphaFold Protein Structure Database". Many other features are not working (Safari or Chrome, in a Mac). That is a concern for the usability of the dataset. *

      • *

      We have been able to reproduce the bugs identified by the reviewer and have fixed them. The second was connected to recent updates on the AlphaFold Protein Structure Database. We are not really sure how to work and act on the “other features that are not working” due to lack of specificity in this comment. Still, we have worked to make the website more robust: the coauthors of this work and other colleagues in the MAVISp team have extensively tested it across different proteins and with various browsers and operating systems, and we have fixed all identified issues. We also have a GitHub repository where users can open issues to share problems they have been experiencing with the website, which we will fix as promptly as we can (https://www.github.com/ELELAB/MAVISp), as we do for any of the tools we develop and maintain. If the reviewer were to come across other specific problems with the website, we recommend to (anonymously) open issues on the MAVISp repository so that they can be described more in detail and dealt with appropriately.

      This comment seems more related to the MAVISP paper itself than to the POLE and POLD1 entries. We have been doing several revisions to the web app to improve it over time. We are also afraid that the reviewer consulted it during one of these changes, and we hope it will be better now. For POLE and POLD1, the CSV files were, in any case, also available through the MAVISp website itself (https://services.healthtech.dtu.dk/services/MAVISp-1.0/), as well as in the OSF repository connected to this paper (https://osf.io/z8x4j/overview), in case the reader needed to consult them or as a reference for the analyses reported in this paper.

      Albeit this is a thorough analysis with the existing tools, and the authors make some sparse attempts to put the mutants classification in context with examples, the work stays descriptive for know effects in literature, or point out that e.g. "further functional and in vitro assays are required". The examples are not presented in a systematic way, or in an appealing manner. Thus, what this manuscript adds to the web site is unclear. It is a description of content, which could be at least more appealing if examples woudl be more clearly outlined in a conceptual framework, and illustrated more consistently. For exmaple I read in the middle of mage 16 "One such example is the F931S (p.Phe931Ser) variant (Figure 5A)" and then I see "F931 forms contacts with D626, a critical residue for the coordination of Mg2+ which is essential for the correct orientation of the incoming nucleotide (Figure XXX)". Figure 5B is not XXX as this has just many mutations labeled. These issues are very discouraging. I woudl recommend to put much more effort in examples, put them in clearer paragraphs, and decribe results rather than the methodology. Doing both in an intemigled way, clearly does not work for me.

      We have revised the storyline to make it more straightforward for the reader, focusing on the essential messages and avoiding excessive description in the results section, instead conveying the key points directly. We also included new simulation data on three variants and downstream analyses of other variants. We revised the section to focus less on methodologies and more on the actual biological results. We have also added a ranking approach for the VUS and an ACMG-like classification to facilitate the identification of the most important results.

      Additionally, we included a summary Table (Table 2) and Figure 9 that present the main findings on the VUS, and we discussed in the text the possible associated experimental validation.

      We also do not fully understand the reviewer’s comment “the work stays descriptive for know effects in literature”. We agree that we should make a better effort to write the results in a logical and easy-to-follow manner, without risking the reader getting lost in too many details, and with more dedicated subsections. However, the paper does not describe just known effects in the literature. We had, in the previous version, a section aimed at identifying mechanistic indicators for ClinVar-reported variants that are also (in some cases) functionally characterized. This is true, but it is the very first part of the results, and it is still adding structure-based knowledge to these variants. After this, we also reported predicted results with mechanisms for VUS and variants in other databases. We took the opportunity in this revised version to elaborate more on the results of the variants reported in COSMIC and cBioPortal.

      We are afraid that we also do not fully understand the reviewer's comment on the fact that “Thus, what this manuscript adds to the website is unclear.” We have generated POLE and POLD1 data with the MAVISp toolkit in both ensemble and simple mode, and the whole pool of local interactions with other proteins and DNA, specifically for this publication. It should be acknowledged that we have generated new data in ensemble mode, which relies on all-atom microsecond molecular dynamics simulations, and additional modules for the simple mode, including calculations with the flexddg protocol of Rosetta, which is also computationally demanding, to provide a comprehensive overview of the effects of variants in POLE and POLD1. The two proteins were available in the database only in simple mode with the basic default modules, and the remaining data were collected during this research article. This can also be inferred by the references in the csv file of the ensemble mode, which refer only to the DOI of the pre-print of this article. This entails a substantial effort in computing and analysis. The website is the repository for data that researchers collect using the MAVISp protocols or modules; in our opinion, it cannot replace a research project. We designed the database to store the data generated by the framework for others to consult and use for various purposes (e.g., biological studies, preparing datasets for benchmarking approaches against existing ones, or using features for machine learning applications). The entry point in the database is the simple mode, along with some compulsory modules (VEPs, STABILITY, PTM, EFOLDMINE, SASA). After this initial entry point, a biocurator or a team of researchers can decide to expand data coverage by moving into the other modules. Still, at some point, one would need to design focused studies to have a comprehensive overview of the effects on specific targets, as we did here, or, for example, in the publication https://doi.org/10.1016/j.bbadis.2024.167260.

      Furthermore, there are analyses here, especially in the simulations, that are not directly available from consulting the database; in these cases, one needs to use other resources beyond MAVISp to investigate further the mechanisms underlying the predicted mechanistic indicators. We also included simulations of mutant variants to validate the hypothesis further. And another example is the analysis of the effects on the splicing site that is not covered by a structure-based framework, such as MAVISp, but is still an essential aspect in the analysis of the variants' effects.

      Will the community find this analysis useful?

      The analysis provided here will be helpful, especially for researchers interested in experimental studies of these enzymes, because they have throughout the study an extensive portfolio of structural data to consult, including a ranked list of variants by class of effect. We originally started designing MAVISp because we realized it was needed by our experimental collaborators, both in cellular biology and in more clinical research, whenever they needed to predict or simulate variants, and we expanded the concept into a robust, versatile framework for broader use. Especially for those genes where extensive MAVE data are not available (as in this case), having a set of variants to test experimentally is crucial support, as it provides the potential mechanism behind the predicted damaging variant.

      How many ClinVar VUS could be reclassified using MAVISp data under current ACMG/AMP guidelines?

      • *

      The ACMG/AMP variant classification guidelines, to the best of our knowledge, include computational evidence (PP3/BP4) and well-established functional studies (PS3/BS3). Because MAVISp provides multi-level mechanistic predictions derived from structural modelling, these data formally fall within the PP3/BP4 computational category. They cannot be used to reclassify ClinVar VUS independently under ACMG/AMP rules. This is not really the goal of our framework, which is to provide a structure-based framework for investigating potentially damaging variants predicted by VEPs. However, the suggestion of the reviewer is something we wanted to explore too in general with MAVISp data, and we failed because of a lack of time. We checked the requirements for PP3, BP4, and PM1 and developed a classifier for VUS reported in ClinVar, using MAVISp features in accordance with the ACMG/AMP guidelines. Using ClinVar pathogenic and benign variants with at least a review status of 1 for calibration, we obtained thresholds for all MAVISp-supported VEPs (REVEL, AlphaMissense, EVE, GEMME, and DeMaSk). These thresholds were then applied to all ClinVar VUS to determine PP3 (pathogenic-supporting) and BP4 (benign-supporting) evidence. In parallel, we constructed a PM1-like mechanistic evidence category that integrates MAVISp structural stability, protein–protein interactions, DNA interactions, long-range allosteric paths, functional sites, and PTM-mediated regulatory effects. Variants classified as damaging in MAVISp according to such criteria were assigned PM1-like support. These evidence tags provide mechanistic insight to support VUS classification for polymerase proofreading genes. The workflow and complete annotated VUS table are now included in the revised manuscript and in the OSF repository. Although these findings cannot formally reclassify variants under ACMG/AMP criteria, they provide prioritization for PS3/BS3 experimental validation and highlight variants that are likely to be reclassified once supporting functional evidence becomes available.

      How do MAVISp predictions meet calibrated thresholds, as in https://genomemedicine.biomedcentral.com/articles/10.1186/s13073-023-01234-y* for the exonuclease domain of POLE and POLD1? *

      • *

      Mur et al. (Genome Medicine 2023) restricted their ACMG/AMP recommendations to the exonuclease domain (ED) because (i) nearly all known pathogenic germline variants in POLE/POLD1 cluster within the ED, (ii) the ED has a well-characterised structure–function architecture, and (iii) sufficient pathogenic and benign variants exist only within the ED to support empirical calibration. To mirror this approach, we performed the calibration workflow exclusively on ED variants (POLE residues 268–471; POLD1 residues 304–533). For these ED-restricted variants, we recalibrated all MAVISp-derived computational predictors (REVEL, AlphaMissense, EVE, GEMME, DeMaSk) using ClinVar P/LP and B/LB variants. We applied the resulting POLE/POLD1-specific thresholds to all ClinVar VUS within the ED. We also applied our PM1-like structural/functional evidence exclusively to ED variants. The results of this ED-specific analysis are now reported in the revised manuscript (Figure 9 Supplementary Tables S3 and S4), as also explained in the response to the previous question. This ensures that MAVISp predictions are applied in a manner that is consistent with the principles of Mur et al. and ACMG/AMP variant interpretation.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript used the MAVISp framework to characterize 64,429 missense variants (43,415 in POLE, 21,014 in POLD1) through computational saturation mutagenesis. The authors integrate protein stability predictions with pathogenicity predictors to provide mechanistic insights into DNA polymerase variants relevant to cancer predisposition and immunotherapy response. There are discussions of known PPAP-associated variants and somatic cancer mutations in the context of known data and some proposed variants of interest (which are not validated).

      Major comments:

      I was unaware of the MAVISp framework. It concerns me that alebit this paper has a lot of technical details about the framework, its not the paper about the framework. I did look into the paper https://www.biorxiv.org/content/10.1101/2022.10.22.513328v5 which keeps benign updated (version five now) for three years, but I do not see a peer reviewed version. It would be unfair of me to peer review the underlying framework of the work but together with the previous comments, I am a bit concerned. For example, the authors are using AlphaFold models to predict DDG values. Delgado et al. (2025, Bioinformatics) explicitly tested FoldX on such models and concluded that "AlphaFold2 models are not suitable for point mutation ΔΔG estimation" afte observing a correlation of 0.06 between experimental and calculated values. AlphaFold's own documentation states it "has not been validated for predicting the effect of mutations". Pak et al. (2023, PLOS ONE) showed correlation between AlphaFold confidence metrics and experimental ΔΔG of -0.17. Needless to say that these concerns seriously undermine the validity of a major part of the study. I have to add tha this is also true for toher technical choices: Several integrated predictors (DeMaSk, GEMME) are outperformed by newer methods according to benchmarking studies (https://www.embopress.org/doi/full/10.15252/msb.202211474). AlphaMissense, while state-of-the-art, shows substantial overcalling of pathogenic variants. could ensemble meta-predictors (REVEL, BayesDel) improve accuracy?

      Further, I found the web site of the framework, where I looked for the data on these models, rather user unfriendly. Selecting POLD1, POLD2, or POLE tells me I am viewing entries A2ML1, ABCB11, ABCB6 respectively, when I search for POL and then click: these are the first three entries of the table, bot the what I click on. displaying the whole table and clicking on POLD1, gets me to POLD1. However, when I selected "Damaging mutations on structure" I get "Could not fetch protein structure model from the AlphaFold Protein Structure Database". Many other features are not working (Safari or Chrome, in a Mac). That is a concern for the usability of the dataset.

      Albeit this is a thorough analysis with the existing tools, and the authors make some sparse attempts to put the mutants classification in context with examples, the work stays descriptive for know effects in literature, or point out that e.g. "further functional and in vitro assays are required". The examples are not presented in a systematic way, or in an appealing manner. Thus, what this manuscript adds to the web site is unclear. It is a description of content, which could be at least more appealing if examples woudl be more clearly outlined in a conceptual framework, and illustrated more consistently. For exmaple I read in the middle of mage 16 "One such example is the F931S (p.Phe931Ser) variant (Figure 5A)" and then I see "F931 forms contacts with D626, a critical residue for the coordination of Mg2+ which is essential for the correct orientation of the incoming nucleotide (Figure XXX)". Figure 5B is not XXX as this has just many mutations labeled. These issues are very discouraging. I woudl recommend to put much more effort in examples, put them in clearer paragraphs, and decribe results rather than the methodology. Doing both in an intemigled way, clearly does not work for me.

      Will the community find this analysis useful? How many ClinVar VUS could be reclassified using MAVISp data under current ACMG/AMP guidelines? How do MAVISp predictions meet calibrated thresholds as in https://genomemedicine.biomedcentral.com/articles/10.1186/s13073-023-01234-y for the exonuclease domain of POLE and POLD1? Such questions might undermien teh appear of the work and coudl been looked into.

      Referee cross-commenting

      I agree with all the comments raised by reviewer 2; she/he elaborates more on some issues I brought up too briefly (e.g. the choice of GEMME) while other issues that I made more comments about are also mentioned. I only want to note that the statement "A major limitation of the 3D modeling is this impossibility to include Zn2+ coordination by cysteine residues" is not accurate, as there are many 3D structure prediction tools and modeling tools that are capable og handling zinc ions coordinated by cysteines.

      While I respect that Referee 1 is clearly more positive and less concerned by methodological issues, I note that while I agree that "The authors identify numerous variants for prioritisation in further studies" (albeit in a sparse and not well organised manner in my view), I am not convinced by the present manuscript that "the effectiveness of integrating various data sources for inferring the mechanistic impact of variants" is really shown: there are hypotheses generated, but none are tested, so the effectiveness of the approach remains to be proven in my view.

      I still view this as a thorough study and a very brave attempt to be integrative and inclusive, but several methodological limitations and lack of concrete novel insight, seriously dampen my enthusiasm.

      Significance

      Strengths:

      A very comprehensive analysis of POLE and POLD1 missense variants (64,429 total), approximately 600-fold more coverage than the ~100 experimentally characterized variants in the PolED database. The multi-layered MAVISp approach provides mechanistic interpretability beyond simple pathogenic/benign classifications, potentially valuable for understanding variant effects on stability, DNA binding, protein interactions, and allosteric communication. The clinical context is highly relevant given POLE/POLD1 roles in disease.

      Limitations:

      The methodological concerns were outlined above. No solid new insight examples in a validated manner. Examples of how the datasets can be really used are not well-organised as they appear in the context of the approach in perplexed manner.

      Advance:

      The advance is primarily technical and database-driven rather than conceptually novel. Scale, Multi-dimensional assessment, Mechanistic insight and consideration of Clinical framework integration is a clear advance.

      Audience:

      The audience is the POLDPOLE experts; I however doubt if clinical scientists will find the paper useful, especially in the context of the absence of a dedicated resource and the fact that the entried in the MAVISp web-toold are not easily and intuitively accessible and clinical requirements(eg Integration with ACMG/AMP classification frameworks) are not clearly met.

      Reviewer expertise: I am a structural biologist with experience in structure analysis of experimental and predicted models, but no specific expertise or interest in polymerases.

    1. How to form an Internet Resiliency Club: Collect a group of internet-y people within ~10 km of each other Decide how to communicate normally (Signal, Matrix, email, etc.) Buy everyone LoRa (Long Range) radios and a powerbank with trickle charge Install Meshtastic on the LoRa radios Choose a LoRa channel to communicate on Organize meetups, send messages over Meshtastic, have fun If you work for a internet infrastructure company, you can suggest giving interested employees a LoRa radio, a mobile phone powerbank, and maybe even a small solar panel for their personal use (perhaps as part of an annual gift or bonus).

      this is half a plan. The LoRA stuff is to be able to keep comms within the group up if they fail. And the group is then expected to bootstrap general connectivity. The rest of the page only deals with LoRa and choosing devices. No word on the actual work of achieving internet resilience, other than 'collect a group of internet-y people'.

    1. Author response:

      eLife Assessment

      This study provides valuable mechanistic insight into the mutually exclusive distributions of the histone variant H2A.Z and DNA methylation by testing two hypotheses: (i) that DNA methylation destabilizes H2A.Z nucleosomes, thereby preventing H2A.Z retention, and (ii) that DNA methylation suppresses H2A.Z deposition by ATP-dependent chromatin remodeling complexes. Through a series of well-designed and carefully executed experiments, findings are presented in support of both hypotheses. However, the evidence in support of either hypothesis is incomplete, so that the proposed mechanisms underlying the enrichment of H2A.Z on unmethylated DNA remain somewhat speculative.

      We would like to thank the editor and reviewers for their critical assessments of our manuscript. While we do acknowledge the limitations of our work, we believe that our results provide important mechanistic insights into the long-standing question of how H2A.Z is preferentially enriched in hypomethylated genomic DNA regions. First, our structural and biochemical data suggest that DNA methylation increases the openness and physical accessibility of H2A.Z, albeit the effect is relatively subtle and is sequence-dependent. Second, using Xenopus egg extracts and synthetic DNA templates, we provide the first clear and direct evidence that DNA methylation-sensitive H2A.Z deposition is due to the H2A.Z chaperone SRCAP-C, corroborated by our discovery that SRCAP-C binding to DNA is suppressed by DNA methylation. Although the molecular details by which DNA methylation inhibits binding of SRCAP-C is an important area of future study, in our current manuscript, we do provide evidence that directly links the presence of SRCAP-C to the establishment of the DNA methylation/H2A.Z antagonism in a physiological system. Thanks to criticisms by the reviewers, we realized that we did not clearly state in our Abstract that the impact of DNA methylation on intrinsic H2A.Z nucleosome stability is relatively subtle, although we did explain these observations and limitations in the main text. In our revised manuscript, we are willing to edit the text to better clarify the criticisms raised by the reviewers.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      The authors considered the mechanism underlying previous observations that H2A.Z is preferentially excluded from methylated DNA regions. They considered two non-mutually exclusive mechanisms. First, they tested the hypothesis that nucleosomes containing both methylated DNA and H2A.Z might be intrinsically unstable due to their structural features. Second, they explored the possibility that DNA methylation might impede SRCAP-C from efficiently depositing H2A.Z onto these DNA methylated regions.

      Their structural analyses revealed subtle differences between H2A.Z-containing nucleosomes assembled on methylated versus unmethylated DNA. To test the second hypothesis, the authors allowed H2A.Z assembly on sperm chromatin in Xenopus egg extracts and mapped both H2A.Z localization and DNA methylation in this transcriptionally inactive system. They compared these data with corresponding maps from a transcriptionally active Xenopus fibroblast cell line. This comparison confirmed the preferential deposition or enrichment of H2A.Z on unmethylated DNA regions, an effect that was much more pronounced in the fibroblast genome than in sperm chromatin. Furthermore, nucleosome assembly on methylated versus unmethylated DNA, along with SRCAP-C depletion from Xenopus egg extracts, provided a means to test whether SRCAP-C contributes to the preferential loading of H2A.Z onto unmethylated DNA.

      Strengths:

      The strength and originality of this work lie in its focused attempt to dissect the unexplained observation that H2A.Z is excluded from methylated genomic regions.

      Weaknesses:

      The study has two weaknesses. First, although the authors identify specific structural effects of DNA methylation on H2A.Z-containing nucleosomes, they do not provide evidence demonstrating that these structural differences lead to altered histone dynamics or nucleosome instability. Second, building on the elegant work of Berta and colleagues (cited in the manuscript), the authors implicate SRCAP-C in the selective deposition of H2A.Z at unmethylated regions. Yet the role of SRCAP-C appears only partial, and the study does not address how the structural or molecular consequences of DNA methylation prevent efficient H2A.Z deposition. Finally, additional plausible mechanisms beyond the two scenarios the authors considered are not investigated or discussed in the manuscript.

      Although we acknowledge the limitations of our study and are willing to expand our discussion to more thoroughly discuss these points, we believe our manuscript provides several important mechanistic insights which this reviewer may not have fully appreciated.

      Our first conclusion that H2A.Z nucleosomes on methylated DNA are more open and accessible compared to their unmethylated counterparts is supported by both our cryo-EM study and the restriction enzyme accessibility assay. Although the physical effect of DNA methylation is relatively subtle and is likely sequence dependent, as we clearly noted within the manuscript, the difference does exist and is valuable information for the chromatin field at large to consider.

      The second major conclusion of our manuscript is that SRCAP-C exhibits preferential binding to unmethylated DNA over methylated DNA, and that SRCAP-C represents the major mechanism that can explain the biased deposition of H2A.Z to unmethylated DNA in Xenopus egg extracts. Furthermore, our experiments using Xenopus egg extract clearly demonstrated that H2A.Z is deposited by both DNA-methylation sensitive and insensitive mechanisms. Depletion of SRCAP-C almost completely eliminated the levels of DNA-methylation-sensitive H2A.Z deposition and reduced the total level of H2A.Z on chromatin to less than half of that seen in non-depleted extract. This result demonstrated that DNA methylation-sensitive H2A.Z loading is primarily regulated by SRCAP-C, at least in our experimental context where transcription, replication, and other epigenetic modifications are not involved. It is likely that additional mechanisms do further contribute, implicated by our sequencing experiments, particularly at regions with active transcription, and we have noted these possibilities and the rationale for their existence in the Discussion.

      Our study also suggests that a SRCAP-independent, DNA methylation-insensitive mechanism of H2A.Z loading exists, which we suspect to be mediated by Tip60-C. In line with this possibility, our data suggest that Tip60-C binds DNA in a DNA methylation-insensitive manner in Xenopus egg extract. Since antibodies to deplete Tip60-C from Xenopus egg extract are currently unavailable, we were unable to directly test that hypothesis and decided not to include Tip60-C into our final model as we lacked experimental evidence for its role. However, whether or not Tip60-C is the complex responsible for the DNA methylation-insensitive pathway does not influence our final conclusion that SRCAP-C plays a major role in DNA methylation-sensitive H2A.Z loading. We are planning to edit our manuscript to more comprehensively discuss these points.

      Please note that while Berta et al reported that DNA methylation increases at H2A.Z loci in tumors defective in SRCAP-C, they selected those regions based off where H2A.Z is typically enriched within normal tissues (Berta et al., 2021). They did not show data indicating whether H2A.Z is still retained specifically at those analyzed loci upon mutation of SRCAP-C subunits. Thus, although we greatly admire their work and are pleased that many of our findings align with theirs, their paper did not directly address whether SRCAP-C itself differentiates between DNA methylation status nor the impact that has on H2A.Z and DNA methylation colocalization. In contrast, our Xenopus egg extract system, where de novo methylation is undetectable (Nishiyama et al., 2013; Wassing et al., 2024) offers a unique opportunity to examine the direct impact of DNA methylation on H2A.Z deposition using controlled synthetic DNA substrates. Corroborated with our demonstration that DNA binding of SRCAP-C is suppressed by DNA methylation, we believe that our manuscript provides a specific mechanism that can explain the preferential deposition of H2A.Z at hypomethylated genomic regions.

      Reviewer #2 (Public review):

      This manuscript aims to elucidate the mechanistic basis for the long-standing observation that DNA methylation and the histone variant H2A.Z occupy mutually exclusive genomic regions. The authors test two hypotheses: (i) that DNA methylation intrinsically destabilizes H2A.Z nucleosomes, thereby preventing H2A.Z retention, and (ii) that DNA methylation suppresses H2A.Z deposition by ATP-dependent chromatin-remodelling complexes. However, neither hypothesis is rigorously addressed. There are experimental caveats, issues with data interpretation, and conclusions that are not supported by the data. Substantial revision and additional experiments, including controls, would be required before mechanistic conclusions can be drawn. Major concerns are as follows:

      We appreciate the critical assessment of our manuscript by this reviewer. Although we acknowledge the limitations of our study and will revise the manuscript to better describe them, we would like to respectfully argue against the statement that our "conclusions […] are not supported by the data".

      (1) The cryo-EM structure of methylated H2A.Z nucleosomes is insufficiently resolved to address the central mechanistic question: where the methylated CpGs are located relative to DNA-histone contact points and how these modifications influence H2A.Z nucleosome structure. The structure provides no mechanistic insights into methylation-induced destabilization.

      The fact that the DNA resolution in the methylated structure was not high enough to resolve the positions of methylated CpGs despite a high overall resolution of 2.78 Å implies that 1) the Sat2R-P DNA was not as stably registered as the 601L sequence, requiring us to create two alternative Sat2R-P atomic models to account for the variable positioning in our samples, and 2) that the presence of DNA methylation increases that positional variability. We understand that one may prefer to see highly resolved density around each methylation mark, but we do believe that our inability to accomplish that is actually a feature rather than a weakness and has important biological implications. The decrease in local DNA resolution on the methylated Sat2R-P structure compared to its unmethylated counterpart is meaningful and suggests to us that DNA methylation weakens overall DNA wrapping and positioning on the nucleosome, supported by the increased flexibility seen at the linker DNA ends as well as an increase in the population of highly shifted nucleosomes amongst the methylated particles. Additionally, one major view in the DNA methylation/nucleosome stability field is that the presence of DNA methylation can make DNA stiffer and harder to bend, causing opening and destabilization of nucleosomes (Ngo et al., 2016). The increased opening of linker DNA ends and accessibility of methylated H2A.Z nucleosomes in our hands also aligns with such an idea, again suggesting decreased histone-DNA contact stability on methylated DNA substrates. We plan to revise the writing in our manuscript to better reflect these ideas.

      The experimental system also lacks physiological relevance. The template DNA sequence is artificial, despite the existence of well-characterised native genomic sequences for which DNA methylation is known to inhibit H2A.Z incorporation. Alternatively, there are a number of studies examining the effect of DNA methylation on nucleosome structure, stability, DNA unwrapping, and positioning. Choosing one of these DNA sequences would have at least allowed a direct comparison with a canonical nucleosome. Indeed, a major omission is the absence of a cryo-EM structure of a canonical nucleosome assembled on the same DNA template - this is essential to assess whether the observed effects are H2A.Z-specific.

      The reviewer raises a fair question about whether canonical H2A would experience the same DNA methylation-dependent structural effects. We had considered solving the H2A structures, however, ultimately decided against it for a few reasons. First, there already exists crystal structures of canonical H2A nucleosomes using a DNA sequence highly similar to our Sat2R-P with and without the presence of DNA methylation (PDB: 5CPI and 5CPJ). The authors of this study did not see any physical differences present in their structures (Osakabe et al., 2015). Additionally, we had included canonical H2A conditions within our restriction enzyme accessibility assay and did not see a significant impact of DNA methylation on those samples (Fig 3). Because of the previous report and our own negative data, we expected that only limited additional insights would be obtained from the canonical H2A structures and decided not to pursue that analysis.

      One of the primary reasons we chose the Sat2R-P sequence was, as noted above, that there already was a published study examining how DNA methylation affects nucleosome structure using a variant of this sequence which we could compare to our results, as the reviewer has suggested. We did have to modify the sequence, namely by making it palindromic, in order to increase the final achievable resolution. We viewed the Sat2R-P sequence as an attractive candidate because it is physiologically relevant; the initial sequence was taken directly from human satellite II. Several modifications were made for technical reasons, including making the sequence palindromic as described above and also ensuring that each CpG is recognizable by a methylation-sensitive restriction enzyme so that we could be certain about the degree of methylation on our substrates. These practical concerns outweighed the necessity of maintaining a strict physiological sequence to us. However, we still believe the final Sat2R-P more closely mimics physiological sequences than Widom 601. Additionally, human satellite II is a highly abundant sequence in the human genome that is known to undergo large methylation changes on the onset of many disorders, like cancer, as well as during aging. Thus, there are interesting biological questions surrounding how the methylation state of this particular sequence affects chromatin structure. Furthermore, it has been reported that satellite II is devoid of H2A.Z (Capurso et al., 2012). Beyond those reasons, the satellite II sequence is generally interesting to our lab because we have been studying genes involved in ICF syndrome, where hypomethylation of satellite II sequences forms one of the hallmarks of this disorder (Funabiki et al., 2023; Jenness et al., 2018; Wassing et al., 2024). We understand that sequence context plays a large role in nucleosome wrapping and stability. This is why we strived to test multiple sequences in each of our assays. We do agree that it would be interesting to use DNA sequences where H2A.Z binding has already been described to be affected in a DNA methylation-dependent manner, forming an exciting future study to pursue.

      Furthermore, the DNA template is methylated at numerous random CpG sites. The authors' argument that only the global methylation level is relevant is inconsistent with the literature, which clearly demonstrates that methylation effects on canonical nucleosomes are position-dependent. Not all CpG sites contribute equally to nucleosome stability or unwrapping, and this critical factor is not considered.

      We did not argue that only the global methylation level is relevant. We also would appreciate it if the reviewer could provide specific references that "clearly demonstrates that methylation effects on canonical nucleosomes are position-dependent". We are aware of a series of studies conducted by Chongli Yuan's group, including one testing the effect of placing methylated CpGs at different positions along the Widom 601 sequence. In that study (Jimenez-Useche et al., 2013), they did find that positioning of mCpGs has differential impacts on the salt resistance of the nucleosomes, with 5 tandem mCpG copies at the dyad causing the most dramatic nucleosome opening whereas having mCpGs only at the DNA major grooves, but not elsewhere, increased nucleosome stability. However, they did also find that methylation of the original Widom 601 sequence also caused destabilization, albeit to a lesser degree, and another study by the same group (Jimenez-Useche et al., 2014) also found that CpG methylation decreased nucleosome-forming ability for all tested variants of the Widom 601 sequence, regardless of CpG density or positioning.

      Other studies monitored how distribution of methylated CpGs correlates with nucleosome positioning (Collings et al., 2013; Davey et al., 1997; Davey et al., 2004). However, these studies assessed the sequence-dependent effects specifically on nucleosome assembly during in vitro salt dialysis, which is a different physical process than the one our manuscript focuses on, especially when considering the fact that H2A.Z is deposited onto preassembled H2A-nucleosome. Our cryo-EM analysis examines the structural changes induced by DNA methylation on already formed nucleosomes rather than the process of formation. Thus, probing accessibility changes using a restriction enzyme was the more appropriate biochemical assay to verify our structures.

      We do very much agree that DNA context can influence nucleosome stability under different conditions. A study of molecular dynamics simulations concluded that the "combination of overall DNA geometrical and shape properties upon methylation" makes nucleosomes resistant to unwrapping (Li et al., 2022), while another modeling study suggests that DNA methylation impacts nucleosome stability in a manner dependent on DNA sequence, where "[s]trong binding is weakened and weak binding is strengthened" (Minary and Levitt, 2014). While G/C-dinucleotides are preferentially placed at major groove-inward positions in the nucleosomes in vivo (Chodavarapu et al., 2010; Segal et al., 2006) and G/C-rich segments are excluded from major groove-outward positions in Widom 601-like nucleosomes (Chua et al., 2012), methylated CpG dinucleotides are preferably, if not exclusively, located at major groove-outward positions in vivo. Mechanisms behind this biased mCpG positioning on the nucleosome remain speculative, likely caused by a combination of multiple factors, but the fact that we did not observe clear structural impacts using the Widom 601L sequence, where mCpGs are located at the major groove-outward and -inward positions ((Chua et al., 2012) and our structure), deserves a space for discussion. On the other hand, positioning of mCpG on satellite II-derived sequences that we used in this study was based on a physiological sequence, and thus it may not be appropriate to say that those CpGs are placed at multiple "random" positions. Although we decided not to discuss the position of 5mC on our Sat2R nucleosome structure due to ambiguous base assignments, neither of our two atomic models is consistent with an idea that DNA methylation repositions the CpG to the outward major grooves. As the potential contribution of how DNA methylation affects the nucleosome structure via modulating DNA stiffness has been extensively studied (Choy et al., 2010; Li et al., 2022; Ngo et al., 2016; Perez et al., 2012), we believe that it is appropriate to consider overall DNA properties along the whole DNA sequence, though we are willing to discuss potential positional effects in the revised manuscript.

      Perhaps one of the most important points that we did not emphasize enough in our original manuscript was that in contrast to the subtle intrinsic effect of DNA methylation that was DNA sequence dependent, we observed SRCAP-dependent preferential H2A.Z deposition to unmethylated DNA over methylated DNA in both 601 and satellite II DNAs. In the revised manuscript, we will make the value of comparative studies on 601 and satellite II in two distinct mechanisms.

      Finally, and most importantly, the reported increase in accessibility of the methylated H2A.Z nucleosome is negligible compared with the much larger intrinsic DNA accessibility of the unmethylated H2A.Z nucleosome. These data do not support the authors' hypothesis and contradict the manuscript's conclusions. Claims that methylated H2A.Z nucleosomes are "more open and accessible" must therefore be removed, and the title is misleading, given that no meaningful impact of DNA methylation on H2A.Z nucleosome stability is demonstrated.

      We respectfully disagree with this reviewer's criticism. We investigated the potential impact of DNA methylation on nucleosome stability to the best of our abilities through complementary assays and reported our observations. The effect of DNA methylation is smaller than the difference between H2A.Z and H2A, but we were able to see an effect. It is also not uncommon for small differences to have functional impacts in biological systems. We agree that further testing is required to determine whether this subtle effect is functionally important, and it remains the subject of future research due to the many technical challenges associated with addressing said question. We would like to note that 18 years have passed since Daniel Zilberman first reported the antagonistic relationship between H2AZ and DNA methylation (Zilberman et al., 2008) but very few studies have since directly tested specific mechanistic hypotheses. We believe that our study lays the groundwork for exciting future investigation that better elucidates the pathways that contribute to this antagonism and will have meaningful impacts on the field in general. However, thanks to the reviewer's criticism, we realized that we did not clearly state in the Abstract the relatively subtle effect of DNA methylation on the intrinsic H2A.Z nucleosome stability. Therefore, we will accordingly revise the Abstract to make this point clearer.

      (2) The cryo-EM structures of methylated and unmethylated 601L H2A.Z nucleosomes show no detectable differences. As presented, this negative result adds little value. If anything, it reinforces the point that the positional context of CpG methylation is critical, which the manuscript does not consider.

      We believe the inclusion and factual reporting of negative data is important for the scientific community as one of the major issues currently in biology research is biased omission of negative data. We considered eLife as a venue to publish this work for this reason. We understand that the reviewer believes our 601L structures may detract from the overall message of our manuscript. We believe this data rather emphasizes the importance of DNA sequence context, something that the reviewer also rightfully notes. It is standard practice in the nucleosome field to use the Widom 601 sequence, along with its variants. Our experience has shown that use of an artificially strong positioning sequence may mask weaker physical effects that could play a physiological role. Thus, we were careful to validate all further assays with multiple DNA sequences and believed it important to report these sequence-dependent effects on nucleosome structure.

      (3) Very little H3 signal coincides with H2A.Z at TSSs in sperm pronuclei, yet this is neither explained nor discussed (Supplementary Figure 10D). The authors need to clarify this.

      Our H3 signal, which represents the global nucleosome population, is more broadly distributed across the genome than H2A.Z, which is known to localize at specific genomic sites. Since both histone types were sequenced to similar read depths, H3 peaks are generally shallower than H2A.Z and peak heights cannot be directly compared (i.e. they should be represented in separate appropriate data ranges).

      (4) In my view, the most conceptually important finding is that H2A.Z-associated reads in sperm pronuclei show ~43% CpG methylation. This directly contradicts the model of strict mutual exclusivity and suggests that the antagonism is context-dependent. Similarly, the finding that the depletion of SRCAP reduces H2A.Z deposition only on unmethylated templates is also very intriguing. Collectively, these result warrants further investigation (see below).

      (5) Given that H2A.Z is located at diverse genomic elements (e.g., enhancers, repressed gene bodies, promoters), the manuscript requires a more rigorous genomic annotation comparing H2A.Z occupancy in sperm pronuclei versus XTC-2 cells. The authors should stratify H2A.Z-DNA methylation relationships across promoters, 5′UTRs, exons, gene bodies, enhancers, etc., as described in Supplementary Figure 10A.

      (below is response to (4) and (5) together)

      We agree that the substantial presence of co-localized H2A.Z and DNA methylation specifically in the sperm pronuclei samples and the changes in pattern between nuclear types are highly interesting and require further investigation. However, we faced technical challenges in our sequencing experiments that made us refrain from conducting a more detailed analysis for fear of over-interpreting potential artifacts. These challenges mainly stemmed from the difficulties in collecting enough material from Xenopus egg extracts and Tn5’s innate bias towards accessible regions of the genome. Because of this, open regions of the genome tend to be overrepresented in our data (as noted in our Discussion), making it challenging to rigorously compare methylation profiles and H2A.Z/H3 associated genomic elements.

      While the degree of separation seems to be dependent on nuclei type, we still believe the antagonism exists in both the sperm pronuclei and XTC-2 samples when comparing H2A.Z methylation profiles to the corresponding H3 condition. Our study also demonstrates that H2A.Z is preferentially deposited to hypomethylated DNA in a manner dependent of SRCAP-C (the loss of SRCAP only reduces H2A.Z on unmethylated substrates) but an additional methylation-insensitive H2A.Z deposition mechanism also exists. We realized that this interesting point was not clearly highlighted in Abstract, so we will revise it accordingly.

      (6) Although H2A.Z accumulates less efficiently on exogenous methylated substrates in egg extract, substantial deposition still occurs (~50%). This observation directly challenges the strong antagonistic model described in the manuscript, yet the authors do not acknowledge or discuss it. Moreover, differences between unmethylated and methylated 601 DNA raise further questions about the biological relevance of the cryo-EM 601 structures.

      As depicted in Figure 6 and described in the Discussion, we clearly indicated that both methylation-sensitive and methylation-insensitive pathways exist to deposit H2A.Z within the genome. We also directly stated in our Discussion that a substantial proportion of H2A.Z colocalizes with DNA methylation both in our study as well as in previous reports, which is of major interest for future study. Additionally, we further discussed how the absence of transcription in Xenopus eggs is a likely reason for the more limited effect of DNA methylation restricting H2A.Z deposition in our egg extract system.

      As noted in our response to (2), the lack of a clear impact on our 601L structures implies that this is due to the extraordinarily strong artificial nucleosome positioning capacity of the 601 sequence and its variants. Since 601 is heavily used in chromatin biology, including within DNA methylation research, such negative data are still useful to include and publish.

      (7) The SRCAP depletion is insufficiently validated i.e., the antibody-mediated depletion of SRCAP lacks quantitative verification. A minimum of three biological replicates with quantification is required to substantiate the claims.

      We are willing to address this concern. However, please note that our data showed that methylation-dependent H2A.Z deposition is almost completely erased upon SRCAP depletion, indicating functionally effective depletion. The specificity of the custom antibody against Xenopus SRCAP was verified by mass spectrometry. Additionally, we have obtained the same effect using another commercially available SRCAP antibody, though we did not include this preliminary result in our original manuscript. Due to its relatively low abundance and high molecular weight, SRCAP western blot signals are weak, making it challenging to quantify the degree of depletion. We also believe that the value of quantification in this context, with the points noted above, is rather limited. In the past, our lab has published papers on depleting the H3T3 kinase Haspin from Xenopus egg extracts (Ghenoiu et al., 2013; Kelly et al., 2010) but were never able to detect Haspin via western blot. This protein was only detected by mass spectrometry specifically on nucleosome array beads with H3K9me3 (Jenness et al., 2018). However, depletion of Haspin was readily monitored by erasure of H3T3ph, the enzymatic product of Haspin. In these experiments, it was impossible, and not critical, to quantitatively monitor the depletion of Haspin protein in order to investigate its molecular functions. Similarly, in this current study, the important fact is that depletion of SRCAP suppressed methylation-sensitive H2A.Z deposition and quantifying the degree of SRCAP depletion would not have a major impact on this conclusion.

      (8) It appears that the role of p400-Tip60 has been completely overlooked. This complex is the second major H2A.Z deposition complex. Because p400 exhibits DNA methylation-insensitive binding (Supplementary Figure 14), it may account for the deposition of H2A.Z onto methylated DNA. This possibility is highly significant and must be addressed by repeating the key experiments in Figure 5 following p400-Tip60 depletion.

      We are aware that the Tip60 complex is a very likely candidate for mediating DNA methylation-insensitive H2A.Z deposition, which is why we tested whether DNA binding of p400 is methylation sensitive. Therefore, the reviewer's statement that we "completely overlooked" Tip60-C’s role does not fairly report on our efforts. We wished to test the potential contribution of Tip60-C, but, unfortunately, the antibodies we currently have available to us were not successful in depleting the complex from egg extract. Since we had no direct experimental evidence indicating the role Tip60-C plays, we decided to take a conservative approach to our model and leave the methylation-insensitive pathway as mediated by something still unidentified. While further investigating Tip60-C’s contribution to this pathway is of definite value, we do not believe that it impacts our major conclusion that SRCAP-C is the main mediator responsible for H2A.Z deposition on unmethylated DNA and thus remains a subject for future study.

      (9) The manuscript repeatedly states that H2A.Z nucleosomes are intrinsically unstable; however, this is an oversimplification. Although some DNA unwrapping is observed, multiple studies show that H3/H4 tetramer-H2A.Z/H2B interactions are more stable (important recent studies include the following: DOI: 10.1038/s41594-021-00589-3; 10.1038/s41467-021-22688-x; and reviewed in 10.1038/s41576-024-00759-1).

      We understand that the H2A.Z stability field is highly controversial. We have introduced the many conflicting reports that have been published in the field but can further expand on the controversies if desired. We also understand that the term “nucleosome stability” is broad and encompasses many physical aspects. As noted in a prior response, we will better specify our use of the term within the manuscript. In our assays, we are most focused on the DNA wrapping stability of the nucleosome and have consistently seen in our hands that H2A.Z nucleosomes are much more open and accessible compared to canonical H2A on satellite II-derived sequences, regardless of methylation status. However, we do understand that many groups have observed the opposite findings while others have obtained results similar to us. We reported on our findings of the general H2A.Z stability with the hopes to help clarify some of the field’s controversies.

      In summary, the current manuscript does not present a convincing mechanistic explanation for the antagonism between DNA methylation and H2A.Z. The observation that H2A.Z can substantially coexist with DNA methylation in sperm pronuclei, perhaps, should be the conceptual focus.

      We appreciate this reviewer’s advice. However, please note that the first author who led this project has already successfully defended their PhD thesis primarily based on this project, making it impractical and unrealistic to completely change the focus of this manuscript to include an entirely new avenue of research. We believe that our data provide important insights into the mechanisms by which H2A.Z is excluded from methylated DNA, particularly via the DNA methylation-sensitive binding of SRCAP-C, which has never been described before. We agree that many questions are still left unanswered, including the exact molecular mechanism behind how DNA methylation prevents SRCAP-C binding. We have preliminary data that suggest none of the known DNA-binding modules of SRCAP-C, including ZNHIT1, by themselves can explain this sensitivity. This implies that domain dissection in the context of the holo-SRCAP complex is required to fully address this question. We believe this represents a very exciting future avenue of study; however, it does not negate our finding that SRCAP-C itself is important for maintaining the DNA methylation/H2A.Z antagonism. Therefore, we respectfully disagree with this reviewer's summary statement, which misleadingly undermines the impact of our work.

      Reviewer #3 (Public review):

      Summary:

      Histone variant H2A.Z is evolutionarily conserved among various species. The selective incorporation and removal of histone variants on the genome play crucial roles in regulating nuclear events, including transcription. Shih et al. aimed to address antagonistic mechanisms between histone variant H2A.Z deposition and DNA methylation. To this end, the authors reconstituted H2A.Z nucleosomes in vitro using methylated or unmethylated human satellite II DNA sequence and examined how DNA methylation affects H2A.Z nucleosome structure and dynamics. The cryo-EM analysis revealed that DNA methylation induces a more open conformation in H2A.Z nucleosomes. Consistent with this, their biochemical assays showed that DNA methylation subtly increases restriction enzyme accessibility in H2A.Z nucleosomes compared with canonical H2A nucleosomes. The authors identified genome-wide profiles of H2A.Z and DNA methylation using genomic assays and found their unique distribution between Xenopus sperm pronuclei and fibroblast cells. Using Xenopus egg extract systems, the authors showed SRCAP complex, the chromatin remodelers for H2A.Z deposition, preferentially deposit H2A.Z on unmethylated DNA.

      Strengths:

      The study is solid, and most conclusions are well-supported. The experiments are rigorously performed, and interpretations are clear. The study presents a high-resolution cryo-EM structure of human H2A.Z nucleosome with methylated DNA. The discovery that the SRCAP complex senses DNA methylation is novel and provides important mechanistic insight into the antagonism between H2A.Z and DNA methylation.

      We are grateful that this reviewer recognizes the importance of our study.

      Weaknesses:

      The study is already strong, and most conclusions are well supported. However, it can be further strengthened in several ways.

      (1) It is difficult to interpret how DNA methylation alters the orientation of the H4 tail and leads to the additional density on the acidic patch. The data do not convincingly support whether DNA methylation enhances interactions with H2A.Z mono-nucleosomes, nor whether this effect is specific to methylated H2A.Z nucleosomes.

      The altered H4 tail orientation and extra density seen on the acidic patch were incidental findings that we thought could be interesting for the field to be aware of but decided not to follow up on as there were other structural differences that were more directly related to our central question. We do believe that the above two differences are linked to each other because we used a highly purified and homogenous sample for cryo-EM analysis and the H4 tail/acidic patch interaction is a well characterized contact that mediates inter-nucleosome interactions. Additionally, other groups have reported that the presence of DNA methylation causes condensation of both chromatin and bare DNA (cited within our manuscript), though the mechanics behind this phenomenon remain to be elucidated. We believed that our structure data may also align with those findings. However, the reviewer is fair in pointing out that we do not provide further experimental evidence in verifying the existence of these increased interactions. We can revise our writing to clarify that these points are currently hypotheses rather than validated results.

      (2) It remains unclear whether DNA methylation alters global H2A.Z nucleosome stability or primarily affects local DNA end flexibility. Moreover, while the authors showed locus-specific accessibility by HinfI digestion, an unbiased assay such as MNase digestion would strengthen the conclusions.

      We would like to thank the reviewer for bringing up these issues. Although our current data cannot explicitly clarify these possibilities, we favor an idea that DNA methylation specifically alters histone to DNA contacts and that this effect is felt globally across the entire nucleosome rather than only at specific locations. The intrinsic flexibility of linker DNA ends means that that region tends to exhibit the greatest differences under different physical influences, hence the focus on characterizing that area; flexibility of a thread on a spool is most pronounced at the ends. However, we also found that the DNA backbone of H2A.Z on methylated DNA had a lower local resolution compared to its unmethylated counterpart, despite that structure having a higher global resolution, which suggested to us that DNA positioning along the nucleosome is overall weaker under the presence of DNA methylation. This is corroborated by the increased population of open/shifted structures in our classification analysis. The reviewer raises a fair point about the use of a specific restriction enzyme versus MNase. We agree that our accessibility assay is highly influenced by the position of the restriction site and have previously seen that moving the cut site too close to the linker DNA end will abolish any DNA methylation-dependent differences. We did initially attempt an MNase digestion-based assay, but the data were not as reproducible as with the use of a specific restriction enzyme. We do not know the reason behind this irreproducibility though we believe that the processivity of MNase could make it difficult to capture subtle effects like those induced by DNA methylation on already highly accessible H2A.Z nucleosomes. Overall, while we believe that DNA methylation does exert a physical effect, its subtlety may explain the many contradictory studies present within the DNA methylation and nucleosome stability field.

      References

      Berta, D.G., H. Kuisma, N. Valimaki, M. Raisanen, M. Jantti, A. Pasanen, A. Karhu, J. Kaukomaa, A. Taira, T. Cajuso, S. Nieminen, R.M. Penttinen, S. Ahonen, R. Lehtonen, M. Mehine, P. Vahteristo, J. Jalkanen, B. Sahu, J. Ravantti, N. Makinen, K. Rajamaki, K. Palin, J. Taipale, O. Heikinheimo, R. Butzow, E. Kaasinen, and L.A. Aaltonen. 2021. Deficient H2A.Z deposition is associated with genesis of uterine leiomyoma. Nature. 596:398–403.

      Capurso, D., H. Xiong, and M.R. Segal. 2012. A histone arginine methylation localizes to nucleosomes in satellite II and III DNA sequences in the human genome. BMC Genomics. 13:630.

      Chodavarapu, R.K., S. Feng, Y.V. Bernatavichute, P.Y. Chen, H. Stroud, Y. Yu, J.A. Hetzel, F. Kuo, J. Kim, S.J. Cokus, D. Casero, M. Bernal, P. Huijser, A.T. Clark, U. Kramer, S.S. Merchant, X. Zhang, S.E. Jacobsen, and M. Pellegrini. 2010. Relationship between nucleosome positioning and DNA methylation. Nature. 466:388–392.

      Choy, J.S., S. Wei, J.Y. Lee, S. Tan, S. Chu, and T.H. Lee. 2010. DNA methylation increases nucleosome compaction and rigidity. J Am Chem Soc. 132:1782–1783.

      Chua, E.Y., D. Vasudevan, G.E. Davey, B. Wu, and C.A. Davey. 2012. The mechanics behind DNA sequence-dependent properties of the nucleosome. Nucleic Acids Res. 40:6338–6352.

      Collings, C.K., P.J. Waddell, and J.N. Anderson. 2013. Effects of DNA methylation on nucleosome stability. Nucleic Acids Res. 41:2918–2931.

      Davey, C., S. Pennings, and J. Allan. 1997. CpG methylation remodels chromatin structure in vitro. J Mol Biol. 267:276–288.

      Davey, C.S., S. Pennings, C. Reilly, R.R. Meehan, and J. Allan. 2004. A determining influence for CpG dinucleotides on nucleosome positioning in vitro. Nucleic Acids Res. 32:4322–4331.

      Funabiki, H., I.E. Wassing, Q. Jia, J.D. Luo, and T. Carroll. 2023. Coevolution of the CDCA7-HELLS ICF-related nucleosome remodeling complex and DNA methyltransferases. Elife. 12.

      Ghenoiu, C., M.S. Wheelock, and H. Funabiki. 2013. Autoinhibition and polo-dependent multisite phosphorylation restrict activity of the histone h3 kinase haspin to mitosis. Mol Cell. 52:734–745.

      Jenness, C., S. Giunta, M.M. Muller, H. Kimura, T.W. Muir, and H. Funabiki. 2018. HELLS and CDCA7 comprise a bipartite nucleosome remodeling complex defective in ICF syndrome. Proc Natl Acad Sci U S A. 115:E876–E885.

      Jimenez-Useche, I., J. Ke, Y. Tian, D. Shim, S.C. Howell, X. Qiu, and C. Yuan. 2013. DNA methylation regulated nucleosome dynamics. Sci Rep. 3:2121.

      Jimenez-Useche, I., D. Shim, J. Yu, and C. Yuan. 2014. Unmethylated and methylated CpG dinucleotides distinctively regulate the physical properties of DNA. Biopolymers. 101:517–524.

      Kelly, A.E., C. Ghenoiu, J.Z. Xue, C. Zierhut, H. Kimura, and H. Funabiki. 2010. Survivin reads phosphorylated histone H3 threonine 3 to activate the mitotic kinase Aurora B. Science. 330:235–239.

      Li, S., Y. Peng, D. Landsman, and A.R. Panchenko. 2022. DNA methylation cues in nucleosome geometry, stability and unwrapping. Nucleic Acids Res. 50:1864–1874.

      Minary, P., and M. Levitt. 2014. Training-free atomistic prediction of nucleosome occupancy. Proc Natl Acad Sci U S A. 111:6293–6298.

      Ngo, T.T., J. Yoo, Q. Dai, Q. Zhang, C. He, A. Aksimentiev, and T. Ha. 2016. Effects of cytosine modifications on DNA flexibility and nucleosome mechanical stability. Nat Commun. 7:10813.

      Nishiyama, A., L. Yamaguchi, J. Sharif, Y. Johmura, T. Kawamura, K. Nakanishi, S. Shimamura, K. Arita, T. Kodama, F. Ishikawa, H. Koseki, and M. Nakanishi. 2013. Uhrf1-dependent H3K23 ubiquitylation couples maintenance DNA methylation and replication. Nature. 502:249–253.

      Osakabe, A., F. Adachi, Y. Arimura, K. Maehara, Y. Ohkawa, and H. Kurumizaka. 2015. Influence of DNA methylation on positioning and DNA flexibility of nucleosomes with pericentric satellite DNA. Open Biol. 5.

      Perez, A., C.L. Castellazzi, F. Battistini, K. Collinet, O. Flores, O. Deniz, M.L. Ruiz, D. Torrents, R. Eritja, M. Soler-Lopez, and M. Orozco. 2012. Impact of methylation on the physical properties of DNA. Biophys J. 102:2140–2148.

      Segal, E., Y. Fondufe-Mittendorf, L. Chen, A. Thastrom, Y. Field, I.K. Moore, J.P. Wang, and J. Widom. 2006. A genomic code for nucleosome positioning. Nature. 442:772–778.

      Wassing, I.E., A. Nishiyama, R. Shikimachi, Q. Jia, A. Kikuchi, M. Hiruta, K. Sugimura, X. Hong, Y. Chiba, J. Peng, C. Jenness, M. Nakanishi, L. Zhao, K. Arita, and H. Funabiki. 2024. CDCA7 is an evolutionarily conserved hemimethylated DNA sensor in eukaryotes. Sci Adv. 10:eadp5753.

      Zilberman, D., D. Coleman-Derr, T. Ballinger, and S. Henikoff. 2008. Histone H2A.Z and DNA methylation are mutually antagonistic chromatin marks. Nature. 456:125–129.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      The Reviewer structured their review such that their first two recommendations specifically concerned the two major weaknesses they viewed in the initial submission. For clarity and concision, we have copied their recommendations to be placed immediately following their corresponding points on weaknesses.

      Strengths:

      Studying prediction error from the lens of network connectivity provides new insights into predictive coding frameworks. The combination of various independent datasets to tackle the question adds strength, including two well-powered fMRI task datasets, resting-state fMRI interpreted in relation to behavioral measures, as well as EEG-fMRI.

      Weaknesses:

      Major:

      (R1.1) Lack of multiple comparisons correction for edge-wise contrast:

      The analysis of connectivity differences across three levels of prediction error was conducted separately for approximately 22,000 edges (derived from 210 regions), yet no correction for multiple comparisons appears to have been applied. Then, modularity was applied to the top 5% of these edges. I do not believe that this approach is viable without correction. It does not help that a completely separate approach using SVMs was FDR-corrected for 210 regions.

      [Later recommendation] Regarding the first major point: To address the issue of multiple comparisons in the edge-wise connectivity analysis, I recommend using the Network-Based Statistic (NBS; Zalesky et al., 2010). NBS is well-suited for identifying clusters (analogous to modules) of edges that show statistically significant differences across the three prediction error levels, while appropriately correcting for multiple comparisons.

      Thank you for bringing this up. We acknowledge that our modularity analysis does not evaluate statistical significance. Originally, the modularity analysis was meant to provide a connectome-wide summary of the connectivity effects, whereas the classification-based analysis was meant to address the need for statistical significance testing. However, as the reviewer points out, it would be better if significance were tested in a manner more analogous to the reported modules. As they suggest, we updated the Supplemental Materials (SM) to include the results of Network-Based Statistic analysis (SM p. 1-2):

      “(2.1) Network-Based Statistic

      Here, we evaluate whether PE significantly impacts connectivity at the network level using the Network-Based Statistic (NBS) approach.[1] NBS relied on the same regression data generated for the main-text analysis, whereby a regression is performed examining the effect of PE (Low = –1, Medium = 0, High = +1) on connectivity for each edge. This was done across the connectome, and for each edge, a z-score was computed. For NBS, we thresholded edges to |Z| > 3.0, which yielded one large network cluster, shown in Figure S3. The size of the cluster – i.e., number of edges – was significant (p < .05) per a permutation-test using 1,000 random shuffles of the condition data for each participant, as is standard.[1] These results demonstrate that the networklevel effects of PE on connectivity are significant. The main-text modularity analysis converts this large cluster into four modules, which are more interpretable and open the door to further analyses”.

      We updated the Results to mention these findings before describing the modularity analysis (p. 8-9):

      “After demonstrating that PE significantly influences brain-wide connectivity using Network-Based Statistic analysis (Supplemental Materials 2.1), we conducted a modularity analysis to study how specific groups of edges are all sensitive to high/low-PE information.”

      (R1.2) Lack of spatial information in EEG:

      The EEG data were not source-localized, and no connectivity analysis was performed. Instead, power fluctuations were averaged across a predefined set of electrodes based on a single prior study (reference 27), as well as across a broader set of electrodes. While the study correlates these EEG power fluctuations with fMRI network connectivity over time, such temporal correlations do not establish that the EEG oscillations originate from the corresponding network regions. For instance, the observed fronto-central theta power increases could plausibly originate from the dorsal anterior cingulate cortex (dACC), as consistently reported in the literature, rather than from a distributed network. The spatially agnostic nature of the EEG-fMRI correlation approach used here does not support interpretations tied to specific dorsal-ventral or anterior-posterior networks. Nonetheless, such interpretations are made throughout the manuscript, which overextends the conclusions that can be drawn from the data.

      [Later recommendation] Regarding the second major point: I suggest either adopting a source-localized EEG approach to assess electrophysiological connectivity or revising all related sections to avoid implying spatial specificity or direct correspondence with fMRI-derived networks. The current approach, which relies on electrode-level power fluctuations, does not support claims about the spatial origin of EEG signals or their alignment with specific connectivity networks.

      We thank the reviewer for this important point, which allows us to clarify the specific and distinct contributions of each imaging modality in our study. Our primary goal for Study 3 was to leverage the high temporal resolution of EEG to identify the characteristic frequency at which the fMRI-defined global connectivity states fluctuate. The study was not designed to infer the spatial origin of these EEG signals, a task for which fMRI is better suited and which we addressed in Studies 1 and 2.

      As the reviewer points out, fronto-central theta is generally associated with the dACC. We agree with this point entirely. We suspect that there is some process linking dACC activation to the identified network fluctuations – some type of relationship that does not manifest in our dynamic functional connectivity analyses – although this is only a hypothesis and one that is beyond the present scope.

      We updated the Discussion to mention these points and acknowledge the ambiguity regarding the correlation between network fluctuation amplitude (fMRI) and Delta/Theta power (EEG) (p. 24):

      “We specifically interpret the fMRI-EEG correlation as reflecting fluctuation speed because we correlated EEG oscillatory power with the fluctuation amplitude computed from fMRI data. Simply correlating EEG power with the average connectivity or the signed difference between posterior-anterior and ventral-dorsal connectivity yields null results (Supplemental Materials 6), suggesting that this is a very particular association, and viewing it as capturing fluctuation amplitude provides a parsimonious explanation. Yet, this correlation may be interpreted in other ways. For example, resting-state Theta is also a signature of drowsiness,[2] which may correlate with PE processing, but perhaps should be understood as some other mechanism. Additionally, Theta is widely seen as a sign of dorsal anterior cingulate cortex activity,3 and it is unclear how to reconcile this with our claims about network fluctuations. Nonetheless, as we show with simulations (Supplemental Materials 5), a correlation between slow fMRI network fluctuations and fast EEG Delta/Theta oscillations is also consistent with a common global neural process oscillating rapidly and eliciting both measures.”

      Regarding source-localization, several papers have described known limitations of this strategy for drawing precise anatomical inferences,[4–6] and this seems unnecessary given that our fMRI analyses already provide more robust anatomical precision. We intentionally used EEG in our study for what it measures most robustly: millisecond-level temporal dynamics.

      (R1.2a)Examples of problematic language include:

      Line 134: "detection of network oscillations at fast speeds" - the current EEG approach does not measure networks.

      This is an important issue. We acknowledge that our EEG approach does not directly measure fMRI-defined networks. Our claim is inferential, designed to estimate the temporal dynamics of the large-scale fMRI patterns we identified. The correlation between our fMRI-derived fluctuation amplitude (|PA – VD|) and 3-6 Hz EEG power provides suggestive evidence that the transitions between these network states occur at this frequency, rather than being a direct measurement of network oscillations.

      To support the validity of this inference, we performed two key analyses (now in Supplemental Materials). First, a simulation study provides a proof-of-concept, confirming our method can recover the frequency of a fast underlying oscillator from slow fMRI and fast EEG data. Second, a specificity analysis shows the EEG correlation is unique to our measure of fluctuation amplitude and not to simpler measures like overall connectivity strength. These analyses demonstrate that our interpretation is more plausible than alternative explanations.

      Overall, we have revised the manuscript to be more conservative in the language employed, such as presenting alternative explanations to the interpretations put forth based on correlative/observational evidence (e.g., our modifications above described in our response to comment R1.2). In addition, we have made changes throughout the report to state the issues related to reverse inference more explicitly and to better communicate that the evidence is suggestive – please see our numerous changes described in our response to comment R3.1. For the statement that the reviewer specifically mentioned here, we revised it to be more cautious (p. 7):

      “Although such speed outpaces the temporal resolution of fMRI, correlating fluctuations in dynamic connectivity measured from fMRI data with EEG oscillations can provide an estimate of the fluctuations’ speed. This interpretation of a correlation again runs up against issues related to reverse inference but would nonetheless serve as initial suggestive evidence that spontaneous transitions between network states occur rapidly.”

      (R1.2b) Line 148: "whether fluctuations between high- and low-PE networks occur sufficiently fast" - this implies spatial localization to networks that is not supported by the EEG analysis.

      Building on our changes described in our immediately prior response, we adjusted our text here to say our analyses searched for evidence consistent with the idea that the network fluctuations occur quickly rather than searching for decisive evidence favoring this idea (p. 7-8):

      “Finally, we examined rs-fMRI-EEG data to assess whether we find parallels consistent with the high/low-PE network fluctuations occurring at fast timescales suitable for the type of cognitive operations typically targeted by PE theories.”

      (R1.2c) Line 480: "how underlying neural oscillators can produce BOLD and EEG measurements" - no evidence is provided that the same neural sources underlie both modalities.

      As described above, these claims are based on the simulation study demonstrating that this is a possibility, and we have revised the manuscript overall to be clearer that this is our interpretation while providing alternative explanations.

      Reviewer #2 (Public review):

      Strengths:

      Clearly, a lot of work and data went into this paper, including 2 task-based fMRI experiments and the resting state data for the same participants, as well as a third EEG-fMRI dataset. Overall, well written with a couple of exceptions on clarity, as per below, and the methodology appears overall sound, with a couple of exceptions listed below that require further justification. It does a good job of acknowledging its own weakness.

      Weaknesses:

      (R2.1) The paper does a good job of acknowledging its greatest weakness, the fact that it relies heavily on reverse inference, but cannot quite resolve it. As the authors put it, "finding the same networks during a prediction error task and during rest does not mean that the networks' engagement during rest reflects prediction error processing". Again, the authors acknowledge the speculative nature of their claims in the discussion, but given that this is the key claim and essence of the paper, it is hard to see how the evidence is compelling to support that claim.

      We thank the reviewer for this comment. We agree that reverse inference is a fundamental challenge and that our central claim requires a particularly high bar of evidence. While no single analysis resolves this issue, our goal was to build a cumulative case that is compelling by converging on the same conclusion from multiple, independent lines of evidence.

      For our investigation, we initially established a task-general signature of prediction error (PE). By showing the same neural pattern represents PE in different contexts, we constrain the reverse inference, making it less likely that our findings are a task-specific artifact and more likely that they reflect the core, underlying process of PE. Building on this, our most compelling evidence comes from linking task and rest at the individual level. We didn't just find the same general network at rest; we showed that an individual’s unique anatomical pattern of PE-related connectivity during the task specifically predicts their own brain's fluctuation patterns at rest. This highly specific, person-by-person correspondence provides a direct bridge between an individual's task-evoked PE processing and their intrinsic, resting-state dynamics. Furthermore, these resting-state fluctuations correlate specifically with the 3-6 Hz theta rhythm—a well-established neural marker for PE.

      While reverse inference remains a fundamental limitation for many studies on resting-state cognition, the aspects mentioned above, we believe, provide suggestive evidence, favoring our PE interpretation. Nonetheless, we have made changes throughout the manuscript to be more conservative in the language we use to describe our results, to make it clear what claims are based on correlative/observational evidence, and to put forth alternative explanations for the identified effects. Please find our numerous changes detailed in our response to comment R3.1.

      (R2.2) Given how uncontrolled cognition is during "resting-state" experiments, the parallel made with prediction errors elicited during a task designed for that effect is a little difficult to make. How often are people really surprised when their brains are "at rest", likely replaying a previously experienced event or planning future actions under their control? It seems to be more likely a very low prediction error scenario, if at all surprising.

      We (and some others) take a broad interpretation of PE and believe it is often more intuitive to think about PE minimization in terms of uncertainty rather than “surprise”; the word “surprise” usually implies a sudden emotive reaction from the violation of expectations, which is not useful here.

      When planning future actions, each step of the plan is spurred by the uncertainty of what is the appropriate action given the scenario set up by prior steps. Each planned step erases some of that uncertainty. For example, you may be mentally simulating a conversation, what you will say, and what another person will say. Each step of this creates uncertainty of “what is the appropriate response?” Each reasoning step addresses contingencies. While planning, you may also uncover more obvious forms of uncertainty, sparking memory retrieval to finish it. A resting-state participant may think to cook a frozen pizza when they arrive home, but be uncertain about whether they have any frozen pizzas left, prompting episodic memory retrieval to address this uncertainty. We argue that every planning step or memory retrieval can be productively understood as being sparked by uncertainty/surprise (PE), and the subsequent cognitive response minimizes this uncertainty.

      We updated the Introduction to include a paragraph near the start providing this explanation (p. 3-4):

      “PE minimization may broadly coordinate brain functions of all sorts, including abstract cognitive functions. This includes the types of cognitive processes at play even in the absence of stimuli (e.g., while daydreaming). While it may seem counterintuitive to associate this type of cognition with PE – a concept often tied to external surprises – it has been proposed that the brain's internal generative model is continuously active.[12–14] Spontaneous thought, such as planning a future event or replaying a memory, is not a passive, low-PE process. Rather, it can be seen as a dynamic cycle of generating and resolving internal uncertainty. While daydreaming, you may be reminded of a past conversation, where you wish you had said something different. This situation contains uncertainty about what would have been the best thing to say. Wondering about what you wish you said can be viewed as resolving this uncertainty, in principle, forming a plan if the same situation ever arises again in the future. Each iteration of the simulated conversation repeatedly sparks and then resolves this type of uncertainty.”

      (R2.3)The quantitative comparison between networks under task and rest was done on a small subset of the ROIs rather than on the full network - why? Noting how small the correlation between task and rest is (r=0.021) and that's only for part of the networks, the evidence is a little tenuous. Running the analysis for the full networks could strengthen the argument.

      We thank the reviewer for this opportunity to clarify our method. A single correlation between the full, aggregated networks would be conceptually misaligned with what we aimed to assess. To test for a personspecific anatomical correspondence, it is necessary to examine the link between task and rest at a granular level. We therefore asked whether the specific parts of an individual's network most responsive to PE during the task are the same parts that show the strongest fluctuations at rest. Our analysis, performed iteratively across all 3,432 possible ROI subsets, was designed specifically to answer this question, which would be obscured by an aggregated network measure.

      We appreciate the reviewer's concern about the modest effect size (r = .021). However, this must be contextualized, as the short task scan has very low reliability (.08), which imposes a severe statistical ceiling on any possible task-rest correlation. Finding a highly significant effect (p < .001) in the face of such noisy data, therefore, provides robust evidence for a genuine task-rest correspondence.

      We updated the Discussion to discuss this point (p. 22-23):

      “A key finding supporting our interpretation is the significant link between individual differences in task-evoked PE responses and resting-state fluctuations. One might initially view the effect size of this correspondence (r = .021) as modest. However, this interpretation must be contextualized by the considerable measurement noise inherent in short task-fMRI scans; the split-half reliability of the task contrast was only .08. This low reliability imposes a severe statistical ceiling on any possible task-rest correlation. Therefore, detecting a highly significant (p < .001) relationship despite this constraint provides robust evidence for a genuine link. Furthermore, our analytical approach, which iteratively examined thousands of ROI subsets rather than one aggregated network, was intentionally granular. The goal was not simply to correlate two global measures, but to test for a personspecific anatomical correspondence – that is, whether the specific parts of an individual's network most sensitive to PE during the task are the same parts that fluctuate most strongly at rest. An aggregate analysis would obscure this critical spatial specificity. Taken together, this granular analysis provides compelling evidence for an anatomically consistent fingerprint of PE processing that bridges task-evoked activity and spontaneous restingstate dynamics, strengthening our central claim.”

      (R2.4) Looking at the results in Figure 2C, the four-quadrant description of the networks labelled for low and high PE appears a little simplistic. The authors state that this four-quadrant description omits some ROIs as motivated by prior knowledge. This would benefit from a more comprehensive justification.Which ROIs are excluded, and what is the evidence for exclusion?

      Our four-quadrant model is a principled simplification designed to distill the dominant, large-scale connectivity patterns from the complex modularity results. This approach focuses on coherent, well-documented anatomical streams while setting aside a few anatomically distant and disjoint ROIs that were less central to the main modules. This heuristic additionally unlocks more robust and novel analyses.

      The two low-PE posterior-anterior (PA) pathways are grounded in canonical processing streams. (i) The OCATL connection mirrors the ventral visual stream (the “what” pathway), which is fundamental for object recognition and is upregulated during the smooth processing of expected stimuli. (ii) The IPL-LPFC connection represents a core axis of the dorsal attention stream and the Fronto-Parietal Control Network (FPCN), reflecting the maintenance of top-down cognitive control when information is predictable; the IPL-LPFC module excludes ROIs in the middle temporal gyrus, which are often associated with the FPCN but are not covered here.

      In contrast, the two high-PE ventral-dorsal (VD) pathways reflect processes for resolving surprise and conflict. (i) The OC-IPL connection is a classic signature of attentional reorienting, where unexpected sensory input (high PE) triggers a necessary shift in attention; the OC-IPL module excludes some ROIs that are anterior to the occipital lobe and enter the fusiform gyrus and inferior temporal lobe. (ii) The ATL-LPFC connection aligns with mechanisms for semantic re-evaluation, engaging prefrontal control regions to update a mental model in the face of incongruent information.

      Beyond its functional/anatomical grounding, this simplification provides powerful methodological and statistical advantages. It establishes a symmetrical framework that makes our dynamic connectivity analyses tractable, such as our “cube” analysis of state transitions, which required overlapping modules. Critically, this model also offers a statistical safeguard. By ensuring each quadrant contributes to both low- and high-PE connectivity patterns, we eliminate confounds like region-specific signal variance or global connectivity. This design choice isolates the phenomenon to the pattern of connectivity itself (posterior-anterior vs. ventral-dorsal), making our interpretation more robust.

      We updated the end of the Study 1A results (p. 10-11):

      “Some ROIs appear in Figure 2C but are excluded from the four targeted quadrants (Figures 2C & 2D) – e.g., posterior inferior temporal lobe and fusiform ROIs are excluded from the OC-IPL module, and middle temporal gyrus ROIs are excluded from the IPL-LPFC modules. These exclusions, in favor of a four-quadrant interpretation, are motivated by existing knowledge of prominent structural pathways among these quadrants. This interpretation is also supported by classifier-based analyses showing connectivity within each quadrant is significantly influenced by PE (Supplemental Materials 2.2), along with analyses of single-region activity showing that these areas also respond to PE independently (Supplemental Materials 3). Hence, we proceeded with further analyses of these quadrants’ connections, which summarize PE’s global brain effects.

      “This four-quadrant setup also imparts analytical benefits. First, this simplified structure may better generalize across PE tasks, and Study 1B would aim to replicate these results with a different design. Second, the four quadrants mean that each ROI contributes to both the posterior-anterior and ventral-dorsal modules, which would benefit later analyses and rules out confounds such as PE eliciting increased/decreased connectivity between an ROI and the rest of the brain. An additional, less key benefit is that this setup allows more easily evaluating whether the same phenomena arise using a different atlas (Supplemental Materials Y).”

      (R2.5) The EEG-fMRI analysis claiming 3-6Hz fluctuations for PE is hard to reconcile with the fact that fMRI captures activity that is a lot slower, while some PEs are as fast as 150 ms. The discussion acknowledges this but doesn't seem to resolve it - would benefit from a more comprehensive argument.

      We thank the reviewer for raising this important point, which allows us to clarify the logic of our multimodal analysis. Our analysis does not claim that the fMRI BOLD signal itself oscillates at 3-6 Hz. Instead, it is based on the principle that the intensity of a fast neural process can be reflected in the magnitude of the slow BOLD response. It’s akin to using a long-exposure photograph to capture a fast-moving object; while the individual movements are blurred, the intensity of the blur in the photo serves as a proxy for the intensity of the underlying motion. In our case, the magnitude of the fMRI network difference (|PA – VD|) acts as the "blur," reflecting the intensity of the rapid fluctuations between states within that time window.

      Following this logic, we correlated this slow-moving fMRI metric with the power of the fast EEG rhythms, which reflects their amplitude. To bridge the different timescales, we averaged the EEG power over each fMRI time window and convolved it with the standard hemodynamic response function (HRF) – a crucial step to align the timing of the neural and metabolic signals. The resulting significant correlation specifically in the 3-6 Hz band demonstrates that when this rhythm is stronger, the fMRI data shows a greater divergence between network states. This allows us to infer the characteristic frequency of the underlying neural fluctuations without directly measuring them at that speed with fMRI, thus reconciling the two timescales.

      Reviewer #3 (Public review):

      Bogdan et al. present an intriguing and timely investigation into the intrinsic dynamics of prediction error (PE)-related brain states. The manuscript is grounded in an intuitive and compelling theoretical idea: that the brain alternates between high and low PE states even at rest, potentially reflecting an intrinsic drive toward predictive minimization. The authors employ a creative analytic framework combining different prediction tasks and imaging modalities. They shared open code, which will be valuable for future work.

      (R3.1) Consistency in Theoretical Framing

      The title, abstract, and introduction suggest inconsistent theoretical goals of the study.

      The title suggests that the goal is to test whether there are intrinsic fluctuations in high and low PE states at rest. The abstract and introduction suggest that the goal is to test whether the brain intrinsically minimizes PE and whether this minimization recruits global brain networks. My comments here are that a) these are fundamentally different claims, and b) both are challenging to falsify. For one, task-like recurrence of PE states during resting might reflect the wiring and geometry of the functional organization of the brain emerging from neurobiological constraints or developmental processes (e.g., experience), but showing that mirroring exists because of the need to minimize PE requires establishing a robust relationship with behavior or showing a causal effect (e.g., that interrupting intrinsic PE state fluctuations affects prediction).

      The global PE hypothesis-"PE minimization is a principle that broadly coordinates brain functions of all sorts, including abstract cognitive functions"-is more suitable for discussion rather than the main claim in the abstract, introduction, and all throughout the paper.

      Given the above, I recommend that the authors clarify and align their core theoretical goals across the title, abstract, introduction, and results. If the focus is on identifying fluctuations that resemble taskdefined PE states at rest, the language should reflect that more narrowly, and save broader claims about global PE minimization for the discussion. This hypothesis also needs to be contextualized within prior work. I'd like to see if there is similar evidence in the literature using animal models.

      Thank you for bringing up this issue. We have made changes throughout the paper to address these points. First, we have omitted reference to a “global PE hypothesis” from the Abstract and Introduction, in favor of structuring the Introduction in terms of a falsifiable question (p. 4):

      “We pursued this goal using three studies (Figure 1) that collectively targeted a specific question: Do the taskdefined connectivity signatures of high vs. low PE also recur during rest, and if so, how does the brain transition between exhibiting high/low signatures?”

      We made changes later in the Introduction to clarify that the investigation is based on correlative evidence and requires interpretations that may be debated (p. 5-7):

      “Although this does not entirely address the reverse inference dilemma and can only produce correlative evidence, the present research nonetheless investigates these widely speculated upon PE ideas more directly than any prior work.

      Although such speed outpaces the temporal resolution of fMRI, correlating fluctuations in dynamic connectivity measured from fMRI data with EEG oscillations can provide an estimate of the fluctuations’ speed. This interpretation of a correlation again runs up against issues related to reverse inference but would nonetheless serve as initial suggestive evidence that spontaneous transitions between network states occur rapidly.

      Second, we examined the recruitment of these networks during rs-fMRI, and although the problems related to reverse inference are impossible to overcome fully, we engage with this issue by linking rs-fMRI data directly to task-fMRI data of the same participants, which can provide suggestive evidence that the same neural mechanisms are at play in both.”

      We made changes throughout the Results now better describing the results as consistent with a hypothesis rather than demonstrating it (p. 12-19):

      “In other words, we essentially asked whether resting-state participants are sometimes in low PE states and sometimes in high PE states, which would be consistent with spontaneous PE processing in the absence of stimuli.

      These emerging states overlap strikingly with the previous task effects of PE, suggesting that rs-fMRI scans exhibit fluctuations that resemble the signatures of low- and high-PE states. 

      To be clear, this does not entirely dissuade concerns about reverse inference, which would require a type of causal manipulation that is difficult (if not impossible) to perform in a resting state scan. Nonetheless, these results provide further evidence consistent with our interpretation that the resting brain spontaneously fluctuates between high/low PE network states.

      These patterns are most consistent with a characteristic timescale near 3–6 Hz for the amplitude of the putative high/low-PE fluctuations. This is notably consistent with established links between PE and Delta/Theta and is further consistent with an interpretation in which these fluctuations relate to PE-related processing during rest.”

      We have also made targeted edits to the Discussion to present the findings in a more cautious way, more clearly state what is our interpretation, and provide alternative explanations (p. 19-26):

      “The present research conducted task-fMRI, rs-fMRI, and rs-fMRI-EEG studies to clarify whether PE elicits global connectivity effects and whether the signatures of PE processing arise spontaneously during rest. This investigation carries implications for how PE minimization may characterize abstract task-general cognitive processes. […] Although there are different ways to interpret this correlation, it is consistent with high/low PE states generally fluctuating at 3-6 Hz during rest. Below, we discuss these three studies’ findings.

      Our rs-fMRI investigation examined whether resting dynamics resemble the task-defined connectivity signatures of high vs. low PE, independent of the type of stimulus encountered. The resting-state analyses indeed found that, even at rest, participants’ brains fluctuated between strong ventral-dorsal connectivity and strong posterior-anterior connectivity, consistent with shifts between states of high and low PE. This conclusion is based on correlative/observational evidence and so may be controversial as it relies on reverse inference.

      These patterns resemble global connectivity signatures seen in resting-state participants, and correlations between fMRI and EEG data yield associations, consistent with participants fluctuating between high-PE (ventral-dorsal) and low-PE (posterior-anterior) states at 3-6 Hz. Although definitively testing these ideas is challenging, given that rs-fMRI is defined by the absence of any causal manipulations, our results provide evidence consistent with PE minimization playing a role beyond stimulus process.”

      (R3.2) Interpretation of PE-Related Fluctuations at Rest and Its Functional Relevance. It would strengthen the paper to clarify what is meant by "intrinsic" state fluctuations. Intrinsic might mean taskindependent, trait-like, or spontaneously generated. Which do the authors mean here? Is the key prediction that these fluctuations will persist in the absence of a prediction task?

      Of the three terms the reviewer mentioned, “spontaneous” and “task-independent” are the most accurate descriptors. We conceptualize these fluctuations as a continuous background process that persists across all facets of cognition, without requiring a task explicitly designed to elicit prediction error – although we, along with other predictive coding papers, would argue that all cognitive tasks are fundamentally rooted in PE mechanisms and thus anything can be seen as a “prediction task” (see our response to comment R2.2 for our changes to the Introduction that provide more intuition for this point). The proposed interactions can be seen as analogous to cortico-basal-thalamic loops, which are engaged across a vast and diverse array of cognitive processes.

      The prior submission only used the word “intrinsic” in the title. We have since revised it to “spontaneous,” which is more specific than “intrinsic,” and we believe clearer for a title than “task-independent” (p. 1): “Spontaneous fluctuations in global connectivity reflect transitions between states of high and low prediction error”

      We have also made tweaks throughout the manuscript to now use “spontaneously” throughout (it now appears 8 times in the paper).

      Regardless of the intrinsic argument, I find it challenging to interpret the results as evidence of PE fluctuations at rest. What the authors show directly is that the degree to which a subset of regions within a PE network discriminates high vs. low PE during task correlates with the magnitude of separation between high and low PE states during rest. While this is an interesting relationship, it does not establish that the resting-state brain spontaneously alternates between high and low PE states, nor that it does so in a functionally meaningful way that is related to behavior. How can we rule out brain dynamics of other processes, such as arousal, that also rise and fall with PE? I understand the authors' intention to address the reverse inference concern by testing whether "a participant's unique connectivity response to PE in the reward-processing task should match their specific patterns of resting-state fluctuation". However, I'm not fully convinced that this analysis establishes the functional role of the identified modules to PE because of the following:

      Theoretically, relating the activities of the identified modules directly to behavior would demonstrate a stronger functional role.

      (R3.2a) Across participants: Do individuals who exhibit stronger or more distinct PE-related fluctuations at rest also perform better on tasks that require prediction or inference? This could be assessed using the HCP prediction task, though if individual variability is limited (e.g., due to ceiling effects), I would suggest exploring a dataset with a prediction task that has greater behavioral variance.

      This is a good idea, but unfortunately difficult to test with our present data. The HCP gambling task used in our study was not designed to measure individual differences in prediction or inference and likely suffers from ceiling effects. Because the task outcomes are predetermined and not linked to participants' choices, there is very little meaningful behavioral variance in performance to correlate with our resting-state fluctuation measure.

      While we agree that exploring a different dataset with a more suitable task would be ideal, given the scope of the existing manuscript, this seems like it would be too much. Although these results would be informative, they would ultimately still not be a panacea for the reverse inference issues.

      Or even more broadly, does this variability in resting state PE state fluctuations predict general cognitive abilities like WM and attention (which the HCP dataset also provides)? I appreciate the inclusion of the win-loss control, and I can see the intention to address specificity. This would test whether PE state fluctuations reflect something about general cognition, but also above and beyond these attentional or WM processes that we know are fluctuating.

      This is a helpful suggestion, motivating new analyses: We measured the degree of resting-state fluctuation amplitude across participants and correlated it with the different individual differences measures provided with the HCP data (e.g., measures of WM performance). We computed each participant’s fluctuation amplitude measure as the average absolute difference between posterior-anterior and ventral-dorsal connectivity; this is the average of the TR-by-TR fMRI amplitude measure from Study 3. We correlated this individual difference score with all of the ~200 individual difference measures provided with the HCP dataset (e.g., measures of intelligence or personality). We measured the Spearman correlation between mean fluctuation amplitude with each of those ~200 measures, while correcting for multiple hypotheses using the False Discovery Rate approach.[18]

      We found a robust negative association with age, where older participants tend to display weaker fluctuations (r = -.16, p < .001). We additionally find a positive association with the age-adjusted score on the picture sequence task (r = .12, p<sub>corrected</sub> = .03) and a negative association with performance in the card sort task (r = -.12, p<sub>corrected</sub> = 046). It is unclear how to interpret these associations, without being speculative, given that fluctuation amplitude shows one positive association with performance and one negative association, albeit across entirely different tasks.  We have added these correlation results as Supplemental Materials 8 (SM p. 11):

      “(8) Behavioral differences related to fluctuation amplitude 

      To investigate whether individual differences in the magnitude of resting-state PE-state fluctuations predict general cognitive abilities, we correlated our resting-state fluctuation measure with the cognitive and demographic variables provided in the HCP dataset.

      (8.1) Methods

      For each of the 1,000 participants, we calculated a single fluctuation amplitude score. This score was defined as the average absolute difference between the time-varying posterior-anterior (PA) and ventral-dorsal (VD) connectivity during the resting-state fMRI scan (the average of the TR-by-TR measure used for Study 3). We then computed the Spearman correlation between this score and each of the approximately 200 individual difference measures provided in the HCP dataset. We corrected for multiple comparisons using the False Discovery Rate (FDR) approach.

      (8.2) Results

      The correlations revealed a robust negative association between fluctuation amplitude and age, indicating that older participants tended to display weaker fluctuations (r = -.16, p<sub>corrected</sub> < .001). After correction, two significant correlations with cognitive performance emerged: (i) a positive association with the age-adjusted score on the Picture Sequence Memory Test (r = .12, p<sub>corrected</sub> = .03), (ii) a negative association with performance on the Card Sort Task (r = -.12, p<sub>corrected</sub> = .046). As greater fluctuation amplitude is linked to better performance on one task but worse performance on another, it is unclear how to interpret these findings.”

      We updated the main text Methods to direct readers to this content (p. 39-40):

      “(4.4.3) Links between network fluctuations and behavior

      We considered whether the extent of PE-related network expression states during resting-state is behaviorally relevant. We specifically investigated whether individual differences in the overall magnitude of resting-state fluctuations could predict individual difference measures, provided with the HCP dataset. This yielded a significant association with age, whereby older participants tended to display weaker fluctuations. However, associations with cognitive measures were limited. A full description of these analyses is provided in Supplemental Materials 8.”

      (R3.2b) Within participants: Do momentary increases in PE-network expression during tasks relate to better or faster prediction? In other words, is there evidence that stronger expression of PE-related states is associated with better behavioral outcomes?

      This is a good question that probes the direct behavioral relevance of these network states on a trial-by-trial basis. We agree with the reviewer's intuition; in principle, one would expect a stronger expression of the low-PE network state on trials where a participant correctly and quickly gives a high likelihood rating to a predictable stimulus.

      Following this suggestion, we performed a new analysis in Study 1A to test this. We found that while network expression was indeed linked to participants’ likelihood ratings: higher likelihood ratings correspond to stronger posterior-anterior connectivity, whereas lower ratings correspond to stronger ventral-dorsal connectivity (Connectivity-Direction × likelihood, β [standardized] = .28, p = .02). Yet, this is not a strong test of the reviewer’s hypothesis, and different exploratory analyses of response time yield null results (p > .05). We suspect that this is due to the effect being too subtle, so we have insufficient statistical power. A comparable analysis was not feasible for Study 1B, as its design does not provide an analogous behavioral measure of trialby-trial prediction success.

      (R3.3) A priori Hypothesis for EEG Frequency Analysis.

      It's unclear how to interpret the finding that fMRI fluctuations in the defined modules correlate with frontal Delta/Theta power, specifically in the 3-6 Hz range. However, in the EEG literature, this frequency band is most commonly associated with low arousal, drowsiness, and mind wandering in resting, awake adults, not uniquely with prediction error processing. An a priori hypothesis is lacking here: what specific frequency band would we expect to track spontaneous PE signals at rest, and why? Without this, it is difficult to separate a PE-based interpretation from more general arousal or vigilance fluctuations.

      This point gets to the heart of the challenge with reverse inference in resting-state fMRI. We agree that an interpretation based on general arousal or drowsiness is a potential alternative that must be considered. However, what makes a simple arousal interpretation challenging is the highly specific nature of our fMRI-EEG association. As shown in our confirmatory analyses (Supplemental Materials 6), the correlation with 3-6 Hz power was found exclusively with the absolute difference between our two PE-related network states (|PA – VD|)—a measure of fluctuation amplitude. We found no significant relationship with the signed difference (a bias toward one state) or the sum (the overall level of connectivity). This specificity presents a puzzle for a simple drowsiness account; it seems less plausible that drowsiness would manifest specifically as the intensity of fluctuation between two complex cognitive networks, rather than as a more straightforward change in overall connectivity. While we cannot definitively rule out contributions from arousal, the specificity of our finding provides stronger evidence for a structured cognitive process, like PE, than for a general, undifferentiated state. 

      We updated the Discussion to make the argument above and also to remind readers that alternative explanations, such as ones based on drowsiness, are possible (p. 24):

      “We specifically interpret the fMRI-EEG correlation as reflecting fluctuation speed because we correlated EEG oscillatory power with the fluctuation amplitude computed from fMRI data. Simply correlating EEG power with the average connectivity or the signed difference between posterior-anterior and ventral-dorsal connectivity yields null results (Supplemental Materials 6), suggesting that this is a very particular association, and viewing it as capturing fluctuation amplitude provides a parsimonious explanation. Yet, this correlation may be interpreted in other ways. For example, resting-state Theta is also a signature of drowsiness,[2] which may correlate with PE processing, but perhaps should be understood as some other mechanism.”

      (R3.4) Significance Assessment

      The significance of the correlation above and all other correlation analyses should be assessed through a permutation test rather than a single parametric t-test against zero. There are a few reasons: a) EEG and fMRI time series are autocorrelated, violating the independence assumption of parametric tests;

      Standard t-tests can underestimate the true null distribution's variance, because EEG-fMRI correlations often involve shared slow drifts or noise sources, which can yield spurious correlations and inflating false positives unless tested against an appropriate null.

      Building a null distribution that preserves the slow drifts, for example, would help us understand how likely it is for the two time series to be correlated when the slow drifts are still present, and how much better the current correlation is, compared to this more conservative null. You can perform this by phase randomizing one of the two time courses N times (e.g., N=1000), which maintains the autocorrelation structure while breaking any true co-occurrence in patterns between the two time series, and compute a non-parametric p-value. I suggest using this approach in all correlation analyses between two time series.

      This is an important statistical point to clarify, and the suggested analysis is valuable. The reviewer is correct that the raw fMRI and EEG time series are autocorrelated. However, because our statistical approach is a twolevel analysis, we reasoned that non-independence at the correlation-level would not invalidate the higher-level t-test. The t-test’s assumption of independence applies to the individual participants' coefficients, which are independent across participants. Thus, we believe that our initial approach is broadly appropriate, and its simplicity allows it to be easily communicated.

      Nonetheless, the permutation-testing procedure that the Reviewer describes seems like an important analysis to test, given that permutation-testing is the gold standard for evaluating statistical significance, and it could guarantee that our above logic is correct. We thus computed the analysis as the reviewer described. For each participant, we phase-randomized the fMRI fluctuation amplitude time series. Specifically, we randomized the Fourier phases of the |PA–VD| series (within run), while retaining the original amplitude spectrum; inverse transforms yielded real surrogates with the same power spectrum. This was done for each participant once per permutation. Each participant’s phase-randomized data was submitted to the analysis of each oscillatory power band as originally, generating one mean correlation for each band. This was done 1,000 times.

      Across the five bands, we find that the grand mean correlation is near zero (M<sub>r</sub> = .0006) and the 97.5<sup>th</sup> percentile critical value of the null distribution is r = ~.025; this 97.5<sup>th</sup> percentile corresponds to the upper end of a 95% confidence interval for a band’s correlation; the threshold minimally differs across bands (.024 < rs < .026). Our original correlation coefficients for Delta (M<sub>r</sub> = .042) and Theta (M<sub>r</sub> = .041), which our conclusions focused on, remained significant (p ≤ .002); we can perform family-wise error-rate correction by taking the highest correlation across any band for a given permutation, and the Delta and Theta effects remain significant (p<sub>FWE</sub>corrected ≤ .003); previously Reviewer comment R1.4c requested that we employ family-wise error correction.

      These correlations were previously reported in Table 1, and we updated the caption to note what effects remain significant when evaluated using permutation-testing and with family-wise error correction (p. 19):

      “The effects for Delta, Theta, Beta, and Gamma remain significant if significance testing is instead performed using permutation-testing and with family-wise error rate correction (p<sub>corrected</sub> < .05).”

      We updated the Methods to describe the permutation-testing analysis (p. 43):

      “To confirm the significance of our fMRI-EEG correlations with a non-parametric approach, we performed a group-level permutation-test. For each of 1,000 permutations, we phase-randomized the fMRI fluctuation amplitude time series. Specifically, we randomized the Fourier phases of the |PA–VD| series (within run), while retaining the original amplitude spectrum; inverse transforms yielded real surrogates with the same power spectrum. This procedure breaks the true temporal relationship between the fMRI and EEG data while preserving its structure. We then re-computed the mean Spearman correlation for each frequency band using this phase-randomized data. We evaluated significance using a family-wise error correction approach that accounts for us analyzing five oscillatory power bands. We thus create a null distribution composed of the maximum correlation value observed across all frequency bands from each permutation. Our observed correlations were then tested for significance against this distribution of maximums.”

      (R3.5) Analysis choices

      If I'm understanding correctly, the algorithm used to identify modules does so by assigning nodes to communities, but it does not itself restrict what edges can be formed from these modules. This makes me wonder whether the decision to focus only on connections between adjacent modules, rather than considering the full connectivity, was an analytic choice by the authors. If so, could you clarify the rationale? In particular, what justifies assuming that the gradient of PE states should be captured by edges formed only between nearby modules (as shown in Figure 2E and Figure 4), rather than by the full connectivity matrix? If this restriction is instead a by-product of the algorithm, please explain why this outcome is appropriate for detecting a global signature of PE states in both task and rest.

      We discuss this matter in our response to comment R2.(4).

      When assessing the correspondence across task-fMRI and rs-fMRI in section 2.2.2, why was the pattern during task calculated from selecting a pair of bilateral ROIs (resulting in a group of eight ROIs), and the resting state pattern calculated from posterior-anterior/ventral-dorsal fluctuation modules? Doesn't it make more sense to align the two measures? For example, calculating task effects on these same modules during task and rest?

      We thank the reviewer for this question, as it highlights a point in our methods that we could have explained more clearly. The reviewer is correct that the two measures must be aligned, and we can confirm that they were indeed perfectly matched.

      For the analysis in Section 2.2.2, both the task and resting-state measures were calculated on the exact same anatomical substrate for each comparison. The analysis iteratively selected a symmetrical subset of eight ROIs from our larger four quadrants. For each of these 3,432 iterations, we computed the task-fMRI PE effect (the Connectivity Direction × PE interaction) and the resting-state fluctuation amplitude (E[|PA – VD|]) using the identical set of eight ROIs. The goal of this analysis was precisely to test if the fine-grained anatomical pattern of these effects correlated within an individual across the task and rest states. We will revise the text in Section 2.2.2 to make this direct alignment of the two measures more explicit.

      Recommendations for authors:

      Reviewer #1 (Recommendations for authors):

      (R1.3) Several prior studies have described co-activation or connectivity "templates" that spontaneously alternate during rest and task states, and are linked to behavioral variability. While they are interpreted differently in terms of cognitive function (e.g., in terms of sustained attention: Monica Rosenberg; alertness: Catie Chang), the relationship between these previously reported templates and those identified in the current study warrants discussion. Are the current templates spatially compatible with prior findings while offering new functional interpretations beyond those already proposed in the literature? Or do they represent spatially novel patterns?

      Thank you for this suggestion. Broadly, we do not mean to propose spatially novel patterns but rather focus on how these are repurposed for PE processing. In the Discussion, we link our identified connectivity states to established networks (e.g., the FPCN). We updated this paragraph to mention that these patterns are largely not spatially novel (p. 20):

      “The connectivity patterns put forth are, for the most part, not spatially novel and instead overlap heavily with prior functional and anatomical findings.”

      Regarding the specific networks covered in the prior work by Rosenberg and Chang that the reviewer seems to be referring to, [7,8] this research has emphasized networks anchored heavily in sensorimotor, subcortical– cerebellar, and medial frontal circuits, and so mostly do not overlap with the connectivity effects we put forth.

      (R1.4) Additional points:

      (R1.4a) I do not think that the logic for taking the absolute difference of fMRI connectivity is convincing. What happens if the sign of the difference is maintained ?

      Thank you for pointing out this area that requires clarification. Our analysis targets the amplitude of the fluctuation between brain states, not the direction. We define high fluctuation amplitude as moments when the brain is strongly in either the PA state (PA > VD) or the VD state (VD > PA). The absolute difference |PA – VD| correctly quantifies this intensity, whereas a signed difference would conflate these two distinct high-amplitude moments. Our simulation study (Supplemental Materials, Section 5) provides the theoretical validation for this logic, showing how this absolute difference measure in slow fMRI data can track the amplitude of a fast underlying neural oscillator.

      When the analysis is tested in terms of the signed difference, as suggested by the Reviewer, the association between the fMRI data and EEG power is insignificant for each power band (ps<sub>uncorrected</sub> ≥ .47). We updated Supplemental Materials 6 to include these results. Previously, this section included the fluctuation amplitude (fMRI) × EEG power results while controlling for: (i) the signed difference between posterior-anterior and ventral-dorsal connectivity, (ii) the sum of posterior-anterior and ventral-dorsal connectivity, and (iii) the absolute value of the sum of posterior-anterior and ventral-dorsal connectivity. For completeness, we also now report the correlation between each EEG power band and each of those other three measures (SM, p. 9)

      “We additionally tested the relationship between each of those three measures and the five EEG oscillation bands. Across the 15 tests, there were no associations (ps<sub>uncorrected</sub>  ≥ .04); one uncorrected p-value was at p = .044, although this was expected given that there were 15 tests. Thus, the association between EEG oscillations and the fMRI measure is specific to the absolute difference (i.e., amplitude) measure.”

      (R1.4b) Reasoning of focus on frontal and theta band is weak, and described as "typical" (line 359) based on a single study.

      Sorry about this. There is a rich literature on the link between frontal theta and prediction error,[3,9–11] and we updated the Introduction to include more references to this work (p. 18): “The analysis was first done using power averaged across frontal electrodes, as these are the typical focus of PE research on oscillations.[3,9–11]”

      We have also updated the Methods to cite more studies that motivate our electrode choice (p. 41): “The analyses first targeted five midline frontal electrodes (F3, F1, Fz, F2, F4; BioSemi64 layout), given that this frontal row is typically the focus of executive-function PE research on oscillations.[9–11]”

      (R1.4c) No correction appears to have been applied for the association between EEG power and fMRI connectivity. Given that 100 frequency bins were collapsed into 5 canonical bands, a correction for 5 comparisons seems appropriate. Notably, the strongest effects in the delta and theta bands (particularly at fronto-central electrodes) may still survive correction, but this should be explicitly tested and reported.

      Thanks for this suggestion. We updated the Table 1 caption to mention what results survive family-wise error rate correction – as the reviewer suggests, the Delta/Theta effects would survive Bonferroni correction for five tests, although per a later comment suggesting that we evaluate statistical significance with a permutationtesting approach (comment R3.4), we instead report family-wise error correction based on that. The revised caption is as follows (p. 19):

      “The effects for Delta, Theta, Beta, and Gamma remain significant if significance testing is instead performed using permutation-testing and with family-wise error rate correction (p<sub>corrected</sub> < .05).”

      (R1.4d) Line 135. Not sure I understand what you mean by "moods". What is the overall point here?

      The overall argument is that the fluctuations occur rapidly rather than slowly. By slow “moods” we refer to how a participant could enter a high anxiety state of >10 seconds, linked to high PE fluctuations, and then shift into a low anxiety state, linked to low PE fluctuations. We argue that this is not occurring. Regardless, we recognize that referring to lengths of time as short as 10 seconds or so is not a typical use of the word “mood” and is potentially ambiguous, so we have omitted this statement, which was originally on page 6: “Identifying subsecond fluctuations would broaden the relevance of the present results, as they rule out that the PE states derive from various moods.”

      (R1.4e) Line 100. "Few prior PE studies have targeted PE, contrasting the hundreds that have targeted BOLD". I don't understand this sentence. It's presumably about connectivity vs activity?

      Yes, sorry about this typo. The reviewer is correct, and that sentence was meant to mention connectivity. We corrected (p. 5): “Few prior PE studies have targeted connectivity, contrasting the hundreds that have targeted BOLD.”

      (R1.4f) Line 373: "0-0.5Hz" in the caption is probably "0-50Hz".

      Yes, this was another typo, thank you. We have corrected it (p. 19): “… every 0.5 Hz interval from 0-50 Hz.”

      Reviewer #2 (Recommendations for authors):

      (R2.6) (Page 3) When referring to the "limited" hypothesis of local PE, please clarify in what sense is it limited. That statement is unclear.

      Thank you for pointing out this text, which we now see is ambiguous. We originally use "limited" to refer to the hypothesis's constrained scope – namely, that PE is relevant to various low-level operations (e.g., sensory processing or rewards) but the minimization of PE does not guide more abstract cognitive processes. We edited this part of the Introduction to be clearer (p. 3)

      “It is generally agreed that the brain uses PE mechanisms at neuronal or regional levels,[15,16] and this idea has been useful in various low-level functional domains, including early vision [15] and dopaminergic reward processing.[17] Some theorists have further argued that PE propagates through perceptual pathways and can elicit downstream cognitive processes to minimize PE.”

      (R2.7) (Page 5) "Few prior PE have targeted PE"... this statement appears contradictory. Please clarify.

      Sorry about this typo, which we have corrected (p. 5):

      “Few prior PE studies have targeted connectivity, contrasting the hundreds that have targeted BOLD.”

      (R2.8) What happened to the data of the medium PE condition in Study 1A?

      The medium PE condition data were not excluded. We modeled the effect of prediction error on connectivity using a linear regression across the three conditions, coding them as a continuous variable (Low = -1, Medium = 0, High = +1). This approach allowed us to identify brain connections that showed a linear increase or decrease in strength as a function of increasing PE. This linear contrast is a more specific and powerful way to isolate PErelated effects than a High vs. Low contrast. We updated the Results slightly to make this clearer (p. 8-9):

      “In the fMRI data, we compared the three PE conditions’ beta-series functional connectivity, aiming to identify network-level signatures of PE processing, from low to high. […] For the modularity analysis, we first defined a connectome matrix of beta values, wherein each edge’s value was the slope of a regression predicting that edge’s strength from PE (coded as Low = -1, Medium = 0, High = +1; Figure 2A).”

      (R2.9) (Page 15) The point about how the dots in 6H follow those in 6J better than those in 6I is a little subjective - can the authors provide an objective measure?

      Thank you for pointing out this issue. The visual comparison using Figure 6 was not meant as a formal analysis but rather to provide intuition. However, as the reviewer describes, this is difficult to convey. Our formal analysis is provided in Supplemental Materials 5, where we report correlation coefficients between a very large number of simulated fMRI data points and EEG data points corresponding to different frequencies. We updated this part of the Results to convey this (p. 16-17):

      “Notice how the dots in Figure 6H follow the dots in Figure 6J (3 Hz) better than the dots in Figure 6I (0.5 Hz) or Figure 6K (10 Hz); this visual comparison is intended for illustrative purposes only, and quantitative analyses are provided in Supplemental Materials 5.”

      References

      (1) Zalesky, A., Fornito, A. & Bullmore, E. T. Network-based statistic: identifying differences in brain networks. Neuroimage 53, 1197–1207 (2010)

      (2) Strijkstra, A. M., Beersma, D. G., Drayer, B., Halbesma, N. & Daan, S. Subjective sleepiness correlates negatively with global alpha (8–12 Hz) and positively with central frontal theta (4–8 Hz) frequencies in the human resting awake electroencephalogram. Neuroscience letters 340, 17–20 (2003).

      (3) Cavanagh, J. F. & Frank, M. J. Frontal theta as a mechanism for cognitive control. Trends in cognitive sciences 18, 414–421 (2014).

      (4) Grech, R. et al. Review on solving the inverse problem in EEG source analysis. Journal of neuroengineering and rehabilitation 5, 25 (2008)

      (5) Palva, J. M. et al. Ghost interactions in MEG/EEG source space: A note of caution on inter-areal coupling measures. Neuroimage 173, 632–643 (2018).

      (6) Koles, Z. J. Trends in EEG source localization. Electroencephalography and clinical Neurophysiology 106, 127–137 (1998).

      (7) Rosenberg, M. D. et al. A neuromarker of sustained attention from whole-brain functional connectivity. Nature neuroscience 19, 165–171 (2016).

      (8) Goodale, S. E. et al. fMRI-based detection of alertness predicts behavioral response variability. elife 10, e62376 (2021).

      (9) Cavanagh, J. F. Cortical delta activity reflects reward prediction error and related behavioral adjustments, but at different times. NeuroImage 110, 205–216 (2015)

      (10) Hoy, C. W., Steiner, S. C. & Knight, R. T. Single-trial modeling separates multiple overlapping prediction errors during reward processing in human EEG. Communications Biology 4, 910 (2021).

      (11) Neo, P. S.-H., Shadli, S. M., McNaughton, N. & Sellbom, M. Midfrontal theta reactivity to conflict and error are linked to externalizing and internalizing respectively. Personality neuroscience 7, e8 (2024).

      (12) Friston, K. J. The free-energy principle: a unified brain theory? Nature reviews neuroscience 11, 127–138 (2010)

      (13) Feldman, H. & Friston, K. J. Attention, uncertainty, and free-energy. Frontiers in human neuroscience 4, 215 (2010).

      (14) Friston, K. J. et al. Active inference and epistemic value. Cognitive neuroscience 6, 187–214 (2015).

      (15) Rao, R. P. & Ballard, D. H. Predictive coding in the visual cortex: a functional interpretation of some extraclassical receptive-field effects. Nature neuroscience 2, 79–87 (1999)

      (16) Walsh, K. S., McGovern, D. P., Clark, A. & O’Connell, R. G. Evaluating the neurophysiological evidence for predictive processing as a model of perception. Annals of the new York Academy of Sciences 1464, 242– 268 (2020)

      (17) Niv, Y. & Schoenbaum, G. Dialogues on prediction errors. Trends in cognitive sciences 12, 265–272 (2008).

      (18) Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal statistical society: series B (Methodological) 57, 289–300 (1995).

    1. Author response:

      The following is the authors’ response to the previous reviews

      eLife Assessment

      This valuable study combines a computational language model, i.e., HM-LSTM, and temporal response function (TRF) modeling to quantify the neural encoding of hierarchical linguistic information in speech, and addresses how hearing impairment affects neural encoding of speech. The analysis has been significantly improved during the revision but remain somewhat incomplete - The TRF analysis should be more clearly described and controlled. The study is of potential interest to audiologists and researchers who are interested in the neural encoding of speech.

      We thank the editors for the updated assessment. In the revised manuscript, we have added a more detailed description of the TRF analysis on p. of the revised manuscript. We have also updated Figure 1 to better visualize the analyses pipeline. Additionally, we have included a supplementary video to illustrate the architecture of the HM-LSTM model, the ridge regression methods using the model-derived features, and mTRF analysis using the acoustic envelop and the binary rate models.

      Public Reviews:

      Reviewer #1 (Public review):

      About R squared in the plots:

      The authors have used a z-scored R squared in the main ridge regression plots. While this may be interpretable, it seems non-standard and overly complicated. The authors could use a simple Pearson r to be most direct and informative (and in line with similar work, including Goldstein et al. 2022 which they mentioned). This way the sign of the relationships is preserved.

      We did not use Pearson’s r as in Goldstein et al. (2022) because our analysis did not involve a train-test split, which was a key aspect of their approach. Specifically, Goldstein et al. (2022) divided their data into training and testing sets, trained a ridge regression model on the training set, and then used the trained model to predict neural responses on the test set. They calculated Pearson’s r to assess the correlation between the predicted and observed neural responses, making the correlation coefficient (r) their primary measure of model performance. In contrast, our analysis focused on computing the model fitting performance (R²) of the ridge regression model for each sensor and time point for each subject. At the group level, we conducted one-sample t-tests with spatiotemporal cluster-based correction on the R² values to identify sensors and time windows where R² values were significantly greater than baseline. We established the baseline by normalizing the R² values using Fisher z-transformation across sensors within each subject. We have added this explanation on p.13 of the revised manuscript.

      About the new TRF analysis:

      The new TRF analysis is a necessary addition and much appreciated. However, it is missing the results for the acoustic regressors, which should be there analogous to the HM-LSTM ridge analysis. The authors should also specify which software they have utilized to conduct the new TRF analysis. It also seems that the linguistic predictors/regressors have been newly constructed in a way more consistent with previous literature (instead of using the HM-LSTM features); these specifics should also be included in the manuscript (did it come from Montreal Forced Aligner, etc.?). Now that the original HM-LSTM can be compared to a more standard TRF analysis, it is apparent that the results are similar.

      We used the Python package Eelbrain (https://eelbrain.readthedocs.io/en/r0.39/auto_examples/temporal-response-functions/trf_intro.html) to conduct the multivariate temporal response function (mTRF) analyses. As we previously explained in our response to R3, we did not apply mTRF to the acoustic features due to the high dimensionality of the input. Specifically, our acoustic representation consists of a 130-dimensional vector sampled every 10 ms throughout the speech stimuli (comprising a 129-dimensional spectrogram and a 1dimensional amplitude envelope). This led to interpreting the 130-dimensional TRF estimation difficult to interpret. A similar constraint applied to the hidden-layer activations from our HMLSTM model for the five linguistic features. After dimensionality reduction via PCA, each still resulted in 150-dimensional vectors. To address this, we instead used binary predictors marking the offset of each linguistic unit (phoneme, syllable, word, phrase, sentence). Since our speech stimuli were computer-synthesized, the phoneme and syllable boundaries were automatically generated. The word boundaries were manually annotated by a native Mandarin as in Li et al. (2022). The phrase boundaries were automatically annotated by the Stanford parser and manually checked by a native Mandarin speaker. These rate models are represented as five distinct binary time series, each aligned with the timing of the corresponding linguistic unit, making them well-suited for mTRF analysis. Although the TRF results from the 1-dimensional rate predictors and the ridge regression results from the high-dimensional HM-LSTM-derived features are similar, they encode different things: The rate regressors only encode the timing of linguistic unit boundaries, while the model-derived features encode the representational content of the linguistic input. Therefore, we do not consider the mTRF analyses to be analogous to the ridge regression analyses. Rather, these results complement each other and both provide informative results into the neural tracking of linguistic structures at different levels for the attended and unattended speech.

      Since the TRF result for the continuous acoustic features also concerns R2, we have added an mTRF analysis where we fitted the one-dimensional speech envelope to the EEG. We extracted the envelope at 10 ms intervals for both attended and unattended speech and computed mTRFs independently for each subject and sensor using a basis of 50 ms Hamming windows spanning –100 ms to 300 ms relative to envelope onset. The results showed that in hearing-impaired participants, attended speech elicited a significant cluster in the bilateral temporal regions from 270 to 300 ms post-onset (t = 2.40, p = 0.01, Cohen’s d = 0.63). Unattended speech elicited an early cluster in right temporal and occipital regions from –100 ms to –80 ms (t = 3.07, p = 0.001, d = 0.83). Normal-hearing participants showed significant envelope tracking in the left temporal region at 280–300 ms after envelope onset (t = 2.37, p = 0.037, d = 0.48), with no significant cluster for unattended speech. These results further suggest that hearing-impaired listeners may have difficulty suppressing unattended streams. We have added the new TRF results for envelope to Figure S3 and the “mTRF results for attended and unattended speech” on p.7 and the “mTRF analysis” in Material and Methods of the revised manuscript.

      The authors' wording about this suggests that these new regressors have a nonzero sample at each linguistic event's offset, not onset. This should also be clarified. As the authors know, the onset would be more standard, and using the offset has implications for understanding the timing of the TRFs, as a phoneme has a different duration than a word, which has a different duration from a sentence, etc.

      In our rate‐model mTRF analyses, we initially labelled linguistic boundaries as “offsets” because our ridge‐regression with HM-LSTM features was aligned to sentence offsets rather than onsets. However, since each offset coincides with the next unit’s onset—and our regressors simply mark these transition points as 1—the “offset” and “onset” models yield identical mTRFs. To avoid confusion, we have relabeled “offset” as “boundary” in Figure S2.

      As discussed in our prior responses, this design was based on the structure of our input to the HM-LSTM model, where each input consists of a pair of sentences encoded in phonemes, such as “t a_1 n əŋ_2 f ei_1 <sep> zh ə_4 sh iii_4 f ei_1 j ii_1” (“It can fly <sep> This is an airplane”). The two sentences are separated by a special <sep> token, and the model’s objective is to determine whether the second sentence follows the first, similar to a next-sentence prediction task. Since the model processes both sentences in full before making a prediction, the neural activations of interest should correspond to the point at which the entire sentence has been processed by humans. To enable a fair comparison between the model’s internal representations and brain responses, we aligned our neural analyses with the sentence offsets, capturing the time window after the sentence has been fully perceived by the participant. Thus, we extracted epochs from -100 to +300 ms relative to each sentence offset, consistent with our model-informed design.

      We understand that phonemes, syllables, words, phrases, and sentences differ in their durations. However, the five hidden activity vectors extracted from the model are designed to capture the representations of these five linguistic levels across the entire sentence. Specifically, for a sentence pair such as “It can fly <sep> This is an airplane,” the first 2048-dimensional vector represents all the phonemes in the two sentences (“t a_1 n əŋ_2 f ei_1 <sep> zh ə_4 sh iii_4 f ei_1 j ii_1”), the second vector captures all the syllables (“ta_1 nəŋ_2 fei_1 <sep> zhə_4 shiii_4 fei_1jii_1”), the third vector represents all the words, the fourth vector captures the phrases, and the fifth vector represents the sentence-level meaning. In our dataset, input pairs consist of adjacent sentences from the stimuli (e.g., Sentence 1 and Sentence 2, Sentence 2 and Sentence 3, and so on), and for each pair, the model generates five 2048-dimensional vectors, each corresponding to a specific linguistic level. To identify the neural correlates of these model-derived features—each intended to represent the full linguistic level across a complete sentence—we focused on the EEG signal surrounding the completion of the second sentence rather than on incremental processing. Accordingly, we extracted epochs from -100 ms to +300 ms relative to the offset of the second sentence and performed ridge regression analyses using the five model features (reduced to 150 dimensions via PCA) at every 50 ms across the epoch. We have added this clarification on p.12 of the revised manuscript.

      About offsets:

      TRFs can still be interpretable using the offset timings though; however, the main original analysis seems to be utilizing the offset times in a different, more confusing way. The authors still seem to be saying that only the peri-offset time of the EEG was analyzed at all, meaning the vast majority of the EEG trial durations do not factor into the main HM-LSTM response results whatsoever. The way the authors describe this does not seem to be present in any other literature, including the papers that they cite. Therefore, much more clarification on this issue is needed. If the authors mean that the regressors are simply time-locked to the EEG by aligning their offsets (rather than their onsets, because they have varying onsets or some such experimental design complexity), then this would be fine. But it does not seem to be what the authors want to say. This may be a miscommunication about the methods, or the authors may have actually only analyzed a small portion of the data. Either way, this should be clarified to be able to be interpretable.

      We hope that our response in RE4, along with the supplementary video, has helped clarify this issue. We acknowledge that prior studies have not used EEG data surrounding sentence offsets to examine neural responses at the phoneme or syllable levels. However, this is largely due to a lack of model that represent all linguistic levels across an entire sentence. There is abundant work comparing model predictors with neural data time-locked to offsets because they mark the point at which participants has already processed the relevant information (Brennan, 2016; Brennan et al., 2016; Gwilliams et al., 2024, 2025). Similarly, in our model– brain alignment study, our goal is to identify neural correlates for each model-derived feature. If we correlate model activity with EEG data aligned to sentence onsets, we would be examining linguistic representations at all levels (from phoneme to sentence) of the whole sentence at the time when participants have not heard the sentence yet. Although this limits our analysis to a subset of the data (143 sentences × 400 ms windows × 4 conditions), it targets the exact moment when full-sentence representations emerge against background speech, allowing us to examine each model-derived feature onto its neural signature. We have added this clarification on p.12 of the revised manuscript.

      Reviewer #2 (Public review):

      This study presents a valuable finding on the neural encoding of speech in listeners with normal hearing and hearing impairment, uncovering marked differences in how attention to different levels of speech information is allocated, especially when having to selectively attend to one speaker while ignoring an irrelevant speaker. The results overall support the claims of the authors, although a more explicit behavioural task to demonstrate successful attention allocation would have strengthened the study. Importantly, the use of more "temporally continuous" analysis frameworks could have provided a better methodology to assess the entire time course of neural activity during speech listening. Despite these limitations, this interesting work will be useful to the hearing impairment and speech processing research community. The study compares speech-in-quiet vs. multi-talker scenarios, allowing to assess within-participant the impact that the addition of a competing talker has on the neural tracking of speech. Moreover, the inclusion of a population with hearing loss is useful to disentangle the effects of attention orienting and hearing ability. The diagnosis of high-frequency hearing loss was done as part of the experimental procedure by professional audiologists, leading to a high control of the main contrast of interest for the experiment. Sample size was big, allowing to draw meaningful comparisons between the two populations.

      We thank you very much for your appreciation of our research and we have now added a more description of the mTRF analyses on p.13-14 of the revised manuscript.

      An HM-LSTM model was employed to jointly extract speech features spanning from the stimulus acoustics to word-level and phrase-level information, represented by embeddings extracted at successive layers of the model. The model was specifically expanded to include lower level acoustic and phonetic information, reaching a good representation of all intermediate levels of speech. Despite conveniently extracting all features jointly, the HMLSTM model processes linguistic input sentence-by-sentence, and therefore only allows to assess the corresponding EEG data at sentence offset. If I understood correctly, while the sentence information extracted with the HM-LSTM reflects the entire sentence - in terms of its acoustic, phonetic and more abstract linguistic features - it only gives a condensed final representation of the sentence. As such, feature extraction with the HM-LSTM is not compatible with a continuous temporal mapping on the EEG signal, and this is the main reason behind the authors' decision to fit a regression at nine separate time points surrounding sentence offsets.

      Yes, you are correct. As explained in RE4, the model generates five hidden-layer activity vectors, each intended to represent all the phonemes, syllables, words, phrases within the entire sentence (“a condensed final representation”). This is the primary reason we extract EEG data surrounding the sentence offsets—this time point reflects when the full sentence has been processed by the human brain. We assume that even at this stage, residual neural responses corresponding to each linguistic level are still present and can be meaningfully analyzed.

      While valid and previously used in the literature, this methodology, in the particular context of this experiment, might be obscuring important attentional effects impacted by hearing-loss. By fitting a regression only around sentence-final speech representations, the method might be overlooking the more "online" speech processing dynamics, and only assessing the permanence of information at different speech levels at sentence offset. In other words, the acoustic attentional bias between Attended and Unattended speech might exist even in hearing-impaired participants but, due to a lower encoding or permanence of acoustic information in this population, it might only emerge when using methodologies with a higher temporal resolution, such as Temporal Response Functions (TRFs). If a univariate TRF fit simply on the continuous speech envelope did not show any attentional bias (different trial lengths should not be a problem for fitting TRFs), I would be entirely convinced of the result. For now, I am unsure on how to interpret this finding.

      We agree and we have added the mTRF results using the rate models for the 5 linguistic levels in the prior revision. The rate model aligns with the boundaries of each linguistic unit at each level. As explained in RE3, the rate regressors encode the timing of linguistic unit boundaries, while the model-derived features encode the representational content of the linguistic input. The mTRF results showed similar patterns to those observed using features from our HM-LSTM model with ridge regression (see Figure S2). These results complement each other and both provide informative results into the neural tracking of linguistic structures at different levels for the attended and unattended speech.

      We have also added TRF results fitting the envelope of attended and unattended speech at every 10 ms to the whole 10-minute EEG data at every 10 ms. Our results showed that in hearing-impaired participants, attended speech elicited a significant cluster in the bilateral temporal regions from 270 to 300 ms post-onset (t = 2.40, p = 0.01, Cohen’s d = 0.63). Unattended speech elicited an early cluster in right temporal and occipital regions from –100 ms to –80 ms (t = 3.07, p = 0.001, d = 0.83). Normal-hearing participants showed significant envelope tracking in the left temporal region at 280–300 ms after envelope onset (t = 2.37, p = 0.037, d = 0.48), with no significant cluster for unattended speech. These results further suggest that hearing-impaired listeners may have difficulty suppressing unattended streams. We have added the new TRF results for envelope to Figure S3 and the “mTRF results for attended and unattended speech” on p.7 and the “mTRF analysis” in Material and Methods of the revised manuscript.

      Despite my doubts on the appropriateness of condensed speech representations and singlepoint regression for acoustic features in particular, the current methodology allows the authors to explore their research questions, and the results support their conclusions. This work presents an interesting finding on the limits of attentional bias in a cocktail-party scenario, suggesting that fundamentally different neural attentional filters are employed by listeners with highfrequency hearing loss, even in terms of the tracking of speech acoustics. Moreover, the rich dataset collected by the authors is a great contribution to open science and will offer opportunities for re-analysis.

      We sincerely thank you again for your encouraging comments regarding the impact of our study.

      Reviewer #3 (Public review):

      Summary:

      The authors aimed to investigate how the brain processes different linguistic units (from phonemes to sentences) in challenging listening conditions, such as multi-talker environments, and how this processing differs between individuals with normal hearing and those with hearing impairments. Using a hierarchical language model and EEG data, they sought to understand the neural underpinnings of speech comprehension at various temporal scales and identify specific challenges that hearing-impaired listeners face in noisy settings.

      Strengths:

      Overall, the combination of computational modeling, detailed EEG analysis, and comprehensive experimental design thoroughly investigates the neural mechanisms underlying speech comprehension in complex auditory environments. The use of a hierarchical language model (HM-LSTM) offers a data-driven approach to dissect and analyze linguistic information at multiple temporal scales (phoneme, syllable, word, phrase, and sentence). This model allows for a comprehensive neural encoding examination of how different levels of linguistic processing are represented in the brain. The study includes both single-talker and multi-talker conditions, as well as participants with normal hearing and those with hearing impairments. This design provides a robust framework for comparing neural processing across different listening scenarios and groups.

      Weaknesses:

      The analyses heavily rely on one specific computational model, which limits the robustness of the findings. The use of a single DNN-based hierarchical model to represent linguistic information, while innovative, may not capture the full range of neural coding present in different populations. A low-accuracy regression model-fit does not necessarily indicate the absence of neural coding for a specific type of information. The DNN model represents information in a manner constrained by its architecture and training objectives, which might fit one population better than another without proving the non-existence of such information in the other group. It is also not entirely clear if the DNN model used in this study effectively serves the authors' goal of capturing different linguistic information at various layers. More quantitative metrics on acoustic/linguistic-related downstream tasks, such as speaker identification and phoneme/syllable/word recognition based on these intermediate layers, can better characterize the capacity of the DNN model.

      We agree that, before aligning model representations with neural data, it is essential to confirm that the model encodes linguistic information at multiple hierarchical levels. This is the purpose of our validation analysis: We evaluated the model’s representations across five layers using a test set of 20 four-syllable sentences in which every syllable shares the same vowel—e.g., “mā ma mà mǎ” (mother scolds horse), “shū shu shǔ shù” (uncle counts numbers; see Table S1). We hypothesized that the activity in the phoneme and syllable layer would be more similar than other layers for same-vowel sentences. The results confirmed our hypothesis: Hidden-layer activity for same-vowel sentences exhibited much more similar distributions at the phoneme and syllable levels compared to those at the word, phrase and sentence levels Figure 3C displays the scatter plot of the model activity at the five linguistic levels for each of the 20 4-syllable sentences, post dimension reduction using multidimensional scaling (MDS). We used color-coding to represent the activity of five hidden layers after dimensionality reduction. Each dot on the plot corresponds to one test sentence. Only phonemes are labeled because each syllable in our test sentences contains the same vowels (see Table S1).The plot reveals that model representations at the phoneme and syllable levels are more dispersed for each sentence, while representations at the higher linguistic levels—word, phrase, and sentence—are more centralized. Additionally, similar phonemes tend to cluster together across the phoneme and syllable layers, indicating that the model captures a greater amount of information at these levels when the phonemes within the sentences are similar.

      Apart from the DNN model, we also included the rate models which simply mark 1 at each unit boundaries across the 5 levels. We performed mTRF analyses with these rate models and found similar patterns to our ridge‐regression results with the DNN: (see Figure S2). This provides further evidence that the model reliably captures information across all five hierarchical levels.

      Since EEG measures underlying neural activity in near real-time, it is expected that lower-level acoustic information, which is relatively transient, such as phonemes and syllables, would be distributed throughout the time course of the entire sentence. It is not evident if this limited time window effectively captures the neural responses to the entire sentence, especially for lower-level linguistic features. A more comprehensive analysis covering the entire time course of the sentence, or at least a longer temporal window, would provide a clearer understanding of how different linguistic units are processed over time.

      We agree that lower-level linguistic features may be distributed throughout the whole sentence, however, using the entire sentence duration was not feasible, as the sentences in the stimuli vary in length, making statistical analysis challenging. Additionally, since the stimuli consist of continuous speech, extending the time window would risk including linguistic units from subsequent sentences. This would introduce ambiguity as to whether the EEG responses correspond to the current or the following sentence. Additionally, our model activity represents a “condensed final representation” at the five linguistic levels for the whole sentence, rather than incrementally during the sentence. We think the -100 to 300 ms time window relative to each sentence offset targets the exact moment when full-sentence representations are comprehended and a “condensed final representation” for the whole sentence across five linguistic level have been formed in the brain. We have added this clarification on p.13 of the revised manuscript.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Here are some specifics and clarifications of my public review:

      Initially I was interpreting the R squared as a continuous measure of predicted EEG relative to actual EEG, based on an encoding model, but this does not appear to be correct. Thank you for pointing out that the y axis is z-scored R squared in your main ridge regression plots. However, I am not sure why/how you chose to represent this that way. It seems to me that a simple Pearson r would be most informative here (and in line with similar work, including Goldstein et al. 2022 that you mentioned). That way you preserve the sign of the relationships between the regressors and the EEG. With R squared, we have a different interpretation, which is maybe also ok, but I also don't see the point of z-scoring R squared. Another possibility is that when you say "z-transformed" you are referring to the Fisher transformation; is that the case? In the plots you say "normalized", so that sounds like a z-score, but this needs to be clarified; as I say, a simple Pearson r would probably be best.

      We did not use Pearson’s r, as in Goldstein et al. (2022), because our analysis did not involve a train-test split, which was central to their approach. In their study, the data were divided into training and testing sets, and a ridge regression model was trained on the training set. They then used the trained model to predict neural responses on the held-out test set, and calculated Pearson’s r to assess the correlation between the predicted and observed neural responses. As a result, their final metric of model performance was the correlation coefficient (r). In contrast, our analysis is more aligned with standard temporal response function (TRF) approaches. We did not perform a train-test split; instead, we computed the model fitting performance (R²) of the ridge regression model at each sensor and time point for each subject. At the group level, we conducted one-sample t-tests with spatiotemporal cluster-based correction on the R² values to determine which sensors and time windows showed significantly greater R² values than baseline. To establish a baseline, we z-scored the R² values across sensors and time points, effectively centering the distribution around zero. This normalization allowed us to interpret deviations from the mean R² as meaningful increases in model performance and provided a suitable baseline for the statistical tests. We have added this clarification on p.13 of the revised manuscript.

      Thank you for doing the TRF analysis, but where are the acoustic TRFs, analogous to the acoustic results for your HM-LSTM ridge analyses? And what tools did you use to do the TRF analysis? If it is something like the mTRF MATLAB toolbox, then it is also using ridge regression, as you have already done in your original analysis, correct? If so, then it is pretty much the same as your original analysis, just with more dense timepoints, correct? This is what I meant by referring to TRFs originally, because what you have basically done originally was to make a 9-point TRF (and then the plots and analyses are contrasts of pairs of those), with lags between -100 and 300 ms relative to the temporal alignment between the regressors and the EEG, I think (more on this below).

      Also with the new TRF analysis, you say that the regressors/predictors had "a value of 1 at each unit boundary offset". So this means you re-made these predictors to be discrete as I and reviewer 3 were mentioning before (rather than using the HM-LSTM model layer(s)), and also, that you put each phoneme/word/etc. marker at its offset, rather than its onset? I'm also confused as to why you would do this rather than the onset, but I suppose it doesn't change the interpretation very much, just that the TRFs are slid over by a small amount.

      We used the Python package Eelbrain (https://eelbrain.readthedocs.io/en/r0.39/auto_examples/temporal-response-functions/trf_intro.html) to conduct the multivariate temporal response function (mTRF) analyses. As we previously explained in our response to Reviewer 3, we did not apply mTRF to the acoustic features due to the high dimensionality of the input. Specifically, our acoustic representation consists of a 130-dimensional vector sampled every 10 ms throughout the speech stimuli (comprising a 129-dimensional spectrogram and a 1-dimensional amplitude envelope). This renders the 130 TRF weights to the acoustic features uninterpretable. However, we have now added TRF results from the 1- dimension envelope to the attended and unattended speech at every 10 ms.

      A similar constraint applied to the hidden-layer activations from our HM-LSTM model for the five linguistic features. After dimensionality reduction via PCA, each still resulted in 150-dimensional vectors, further preventing their use in mTRF analyses. To address this, we instead used binary predictors marking the offset of each linguistic unit (phoneme, syllable, word, phrase, sentence). These rate models are represented as five distinct binary time series, each aligned with the timing of the corresponding linguistic unit, making them well-suited for mTRF analysis. It is important to note that these rate predictors differ from the HM-LSTMderived features: They encode only the timing of linguistic unit boundaries, not the content or representational structure of the linguistic input. Therefore, we do not consider the mTRF analyses to be equivalent to the ridge regression analyses based on HM-LSTM features

      For onset vs. offset, as explained RE4, we labelled them “offsets” because our ridge‐regression with HM-LSTM features was aligned to sentence offsets rather than onsets (see RE4 and RE15 below for the rationale of using sentence offset). However, since each unit offset coincides with the next unit’s onset—and the rate model simply mark these transition points as 1—the “offset” and “onset” models yield identical mTRFs. To avoid confusion, we have relabeled “offset” as “boundary” in Figure S2.

      I'm still confused about offsets generally. Does this maybe mean that the EEG, and each predictor, are all aligned by aligning their endpoints, which are usually/always the ends of sentences? So e.g. all the phoneme activity in the phoneme regressor actually corresponds to those phonemes of the stimuli in the EEG time, but those regressors and EEG do not have a common starting time (one trial to the next maybe?), so they have to be aligned with their ends instead?

      We chose to use sentence offsets rather than onsets based on the structure of our input to the HM-LSTM model, where each input consists of a pair of sentences encoded in phonemes, such as “t a_1 n əŋ_2 f ei_1 <sep> zh ə_4 sh iii_4 f ei_1 j ii_1” (“It can fly <sep> This is an airplane”). The two sentences are separated by a special <sep> token, and the model’s objective is to determine whether the second sentence follows the first, similar to a next-sentence prediction task. Since the model processes both sentences in full before making a prediction, the neural activations of interest should correspond to the point at which the entire sentence has been processed. To enable a fair comparison between the model’s internal representations and brain responses, we aligned our neural analyses with the sentence offsets, capturing the time window after the sentence has been fully perceived by the participant. Thus, we extracted epochs from -100 to +300 ms relative to each sentence offset, consistent with our modelinformed design. If we align model activity with EEG data aligned to sentence onsets, we would be examining linguistic representations at all levels (from phoneme to sentence) of the whole sentence at the time when participants have not heard the sentence yet. By contrast, aligning to sentence offsets ensures that participants have constructed a full-sentence representation.

      We understand that it is a bit confusing why the regressor of each level is not aligned to their own offsets in the data. The hidden-layer activations of the HM-LSTM model corresponding to the five linguistic levels (phoneme, syllable, word, phrase, sentence) are consistently 150-dimensional vectors after PCA reduction. As a result, for each input sentence pair, the model produces five distinct hidden-layer activations, each capturing the representational content associated with one linguistic level for the whole sentence. We believe our -100 to 300 ms time window relative to sentence offset reflects a meaningful period during which the brain integrates and comprehends information across multiple linguistic levels.

      Being "time-locked to the offset of each sentence at nine latencies" is not something I can really find in any of the references that you mentioned, regarding the offset aspect of this method. Can you point me more specifically to what you are trying to reference with that, or further explain? You said that "predicting EEG signals around the offset of each sentence" is "a method commonly employed in the literature", but the example you gave of Goldstein 2022 is using onsets of words, which is indeed much more in line with what I would expect (not offsets of sentences).

      You are correct that Goldstein (2022) aligned model predictions to onsets rather than offsets; however, many studies in the literature also align model predictions with unit offsets. typically because they mark the point at which participants has already processed the relevant information (Brennan, 2016; Brennan et al., 2016; Gwilliams et al., 2024, 2025). Similarly, in our study, we aim to identify neural correlates for each model-derived feature. If we correlate model activity with EEG data aligned to sentence onsets, we would be examining linguistic representations at all levels (from phoneme to sentence) of the whole sentence at the time when participants have not heard the sentence yet. By contrast, aligning to sentence offsets ensures that participants have constructed a full-sentence representation. Although this limits our analysis to a subset of the data (143 sentences × 400 ms windows × 4 conditions), it targets the exact moment when full-sentence representations emerge against background speech, allowing us to examine each model-derived feature onto its neural signature. We have added this clarification on p.12 of the revised manuscript.

      This new sentence does not make sense to me: "The regressors are aligned to sentence offsets because all our regressors are taken from the hidden layer of our HM-LSTM model, which generates vector representations corresponding to the five linguistic levels of the entire sentence".

      Thank you for the suggestion. We hope our responses in RE4, 15 and 16, along with our supplementary video have now clarified the issue. We have deleted the sentence and provided a more detailed explanation on p.12 of the revised manuscript: The regressors are aligned to sentence offsets because our goal is to identify neural correlates for each model-derived feature of a whole sentence. If we align model activity with EEG data time-locked to sentence onsets, we would be finding neural responses to linguistic levels (from phoneme to sentence) of the whole sentence at the time when participants have not processed the sentence yet. By contrast, aligning to sentence offsets ensures that participants have constructed a full-sentence representation. Although this limits our analysis to a subset of the data (143 sentences × 2 sections × 400 ms windows), it targets the exact moment when full-sentence representations emerge against background speech, allowing us to examine each model-derived feature onto its neural signature. We understand that phonemes, syllables, words, phrases, and sentences differ in their durations. However, the five hidden activity vectors extracted from the model are designed to capture the representations of these five linguistic levels across the entire sentence Specifically, for a sentence pair such as “It can fly <sep> This is an airplane,” the first 2048dimensional vector represents all the phonemes in the two sentences (“t a_1 n əŋ_2 f ei_1 <sep> zh ə_4 sh iii_4 f ei_1 j ii_1”), the second vector captures all the syllables (“ta_1 nəŋ_2 fei_1 <sep> zhə_4 shiii_4 fei_1jii_1”), the third vector represents all the words, the fourth vector captures the phrases, and the fifth vector represents the sentence-level meaning. In our dataset, input pairs consist of adjacent sentences from the stimuli (e.g., Sentence 1 and Sentence 2, Sentence 2 and Sentence 3, and so on), and for each pair, the model generates five 2048dimensional vectors, each corresponding to a specific linguistic level. To identify the neural correlates of these model-derived features—each intended to represent the full linguistic level across a complete sentence—we focused on the EEG signal surrounding the completion of the second sentence rather than on incremental processing. Accordingly, we extracted epochs from -100 ms to +300 ms relative to the offset of the second sentence and performed ridge regression analyses using the five model features (reduced to 150 dimensions via PCA) at every 50 ms across the epoch.

      More on the issue of sentence offsets: In response to reviewer 3's question about -100 - 300 ms around sentence offset, you said "Using the entire sentence duration was not feasible, as the sentences in the stimuli vary in length, making statistical analysis challenging. Additionally, since the stimuli consist of continuous speech, extending the time window would risk including linguistic units from subsequent sentence." This does not make sense to me, so can you elaborate? It sounds like you are actually saying that you only analyzed 400 ms of each trial, but that cannot be what you mean.

      Yes, we analyzed only the 400 ms window surrounding each sentence offset. Although this represents just a subset of our data (143 sentences × 400 ms × 4 conditions), it precisely captures when full-sentence representations emerge against background speech. Because our model produces a single, condensed representation for each linguistic level over the entire sentence—rather than incrementally—we think it is more appropriate to align to the period surrounding sentence offsets. Additionally, extending the window (e.g. to 2 seconds) would risk overlapping adjacent sentences, since sentence lengths vary. Our focus is on the exact period when integrated, level-specific information for each sentence has formed in the brain, and our results already demonstrate different response patterns to different linguistic levels for the two listener groups within this interval. We have added this clarification on p.13 of the revised manuscript.

      In your mTRF analysis, you are now saying that the discrete predictors have "a value of 1" at each of the "boundary offsets", and those TRFs look very similar to your original plots. It sounds to me like you should not be referring to time zero in your original ridge analysis as "sentence offset". If what you mean is that sentence offset time is merely how you aligned the regressors and EEG in time, then your time zero still has a standard, typical TRF interpretation. It is just the point in time, or lag, at which the regressor(s) and EEG are aligned. So activity before zero is "predictive" and activity after zero is "reactive", to think of it crudely. So also in the text, when you say things like "50-150 ms after the sentence offsets", I think this is not really what you mean. I think you are referring to the lags of 50 - 150 ms, relative to the alignment of the regressor and the EEG.

      Thank you very much for the explanation. We agree that, in our ridge‐regression time course, pre zero lags index “predictive” processing and post-zero lags index “reactive” processing. Unlike TRF analysis, we applied ridge regression to our high-dimensional model features at nine discrete lags around the sentence offset. At each lag, we tested whether the regression score exceeded a baseline defined as the mean regression score across all lags. For example, finding a significantly higher regression score between 50 and 150 ms suggests that our regressor reliably predicted EEG activity in that time window. So here time zero refers to the precise moment of the sentence offset—not the the alignment of the regressor and the EEG.

      I look forward to discussing how much of my interpretation here makes sense or doesn't, both with the authors and reviewers.

      Thank you very much for these very constructive feedback and we hope that we have addressed all your questions.

    1. mutualidad

      Me gusta un montón la noción de mutualidad. Expresa mejor ese "ser con otros" que la palabra colaboración donde está intrínseca la cuestión de la tarea y el fin.

    2. para un internet que nos ayude a inventar modos de encontrarnos entre los diferentes,

      Esta parte, junto con la descripción del compost como lugar de encuentro multiespecie, me interpela. Creo que los jardines nos sirven para conectar desde otras formas de relacionamiento, desde los intereses y las ideas de la/s otra/s persona/s. Este diálogo desde lo divergente permite escapar a la automatización de la conversación, de las frases hechas y la fórmula de construcción de la IA.

    3. nuevas formas de computar el mundo.

      Destaco solamente esto para señalar las posibilidades de uno de esos juegos de palabras que nos gustan tanto, y pensar los jardines digitales como un proceso que va desde la computadora a la compostadora. Entendiendo que la computación es algo más estrictamente cuantitativo y la compostadora trataría de aportar matices cualitativos al ecosistema digital

    4. El compost, con lo que tiene de cuidado de la tierra, es lo que nos enseña a hacer alquimia de lo digital, de su binarismo ontológico, y hacer emerger, a partir de ahí, nuevas formas de computar el mundo.

      El cuidado de lo digital desde el descuido, lo que nace fuera de la observación meditada y funcionalista.

    5. –pero bueno ese es, por ahora, otro tema–

      Será bonito ver cuándo se escriba el post sobre la importancia del compost en Asturias y esa mención sea un enlace a esa nueva entrada. Ganas de leerlo.

    1. Quedé y olvidéme,el rostro recliné sobre el Amado;cesó todo, y dejéme,dejando mi cuidadoentre las azucenas olvidado.

      Que poderoso. Que por un momento las normas de la vida no existen solo este amor.

    2. secreta escala

      No entiendo perfectamente qué está pasando, pero se ve como hay alguien que se está colando para encontrar Sor Juana. ¿A mejor un amor secreto? Está hablando de amor y a mejor tenía de ser secreto por ser religiosa.

    3. oscura

      Hipótesis: las cosas de la noche están fuera del normal. Me recuerdo a unas de las obras de Lorca en que los pensamientos y deseos están iluminados en la noche. Noche oscura del alma se ve como un poema que abre los ojos de los problemas de día con las acciones de la noche.

    4. prisión o quizás unpoco después de salir

      No quiero ser muy rígido, pero digo que hay una gran diferencia en lo que está escrito durante su tiempo en prisión y después porque la emoción está más cruda en el momento. Sin embargo, con todo de esta época, información va lento y casi nunca van a tener los datos específicos. Con esto, es interesante ver si realimente hay una diferencia en su literatura durante y después de prisión.

    1. Author response:

      The following is the authors’ response to the original reviews

      eLife Assessment

      This is a valuable polymer model that provides insight into the origin of macromolecular mixed and demixed states within transcription clusters. The well-performed and clearly presented simulations will be of interest to those studying gene expression in the context of chromatin. While the study is generally solid, it could benefit from a more direct comparison with existing experimental data sets as well as further discussion of the limits of the underlying model assumptions.

      We thank the editors for their overall positive assessment. In response to the Referees’ comments, we have addressed all technical points, including a more detailed explanation of the methodology used to extract gene transcription from our simulations and its analogy with real gene transcription. Regarding the potential comparison with experimental data and our mixing–demixing transition, we have added new sections discussing the current state of the art in relevant experiments. We also clarify the present limitations that prevent direct comparisons, which we hope can be overcome with future experiments using the emerging techniques.

      Reviewer #1 (Public Review):

      This manuscript discusses from a theory point of view the mechanisms underlying the formation of specialized or mixed factories. To investigate this, a chromatin polymer model was developed to mimic the chromatin binding-unbinding dynamics of various complexes of transcription factors (TFs).

      The model revealed that both specialized (i.e., demixed) and mixed clusters can emerge spontaneously, with the type of cluster formed primarily determined by cluster size. Non-specific interactions between chromatin and proteins were identified as the main factor promoting mixing, with these interactions becoming increasingly significant as clusters grow larger.

      These findings, observed in both simple polymer models and more realistic representations of human chromosomes, reconcile previously conflicting experimental results. Additionally, the introduction of different types of TFs was shown to strongly influence the emergence of transcriptional networks, offering a framework to study transcriptional changes resulting from gene editing or naturally occurring mutations.

      Overall I think this is an interesting paper discussing a valuable model of how chromosome 3D organisation is linked to transcription. I would only advise the authors to polish and shorten their text to better highlight their key findings and make it more accessible to the reader.

      We thank the Referee for carefully reading our manuscript and recognizing its scientific value. As suggested, we tried to better highlight our key findings and make the text more accessible while addressing also the comments from the other Referees.

      Reviewer #2 (Public Review):

      Summary:

      With this report, I suggest what are in my opinion crucial additions to the otherwise very interesting and credible research manuscript ”Cluster size determines morphology of transcription factories in human cells”.

      Strengths:

      The manuscript in itself is technically sound, the chosen simulation methods are completely appropriate the figures are well-prepared, the text is mostly well-written spare a few typos. The conclusions are valid and would represent a valuable conceptual contribution to the field of clustering, 3D genome organization and gene regulation related to transcription factories, which continues to be an area of most active investigation.

      Weaknesses:

      However, I find that the connection to concrete biological data is weak. This holds especially given that the data that are needed to critically assess the applicability of the derived cross-over with factory size is, in fact, available for analysis, and the suggested experiments in the Discussion section are actually done and their results can be exploited. In my judgement, unless these additional analysis are added to a level that crucial predictions on TF demixing and transcriptional bursting upon TU clustering can be tested, the paper is more fitted for a theoretical biophysics venue than for a biology journal such as eLife.

      We thank the Reviewer for their positive assessment of the soundness of our work and its contribution to the field. We have added a paragraph to the Conclusions highlighting the current state of experimental techniques and outlining near-term experiments that could be extended to test our predictions. We also emphasise that our analysis builds on state-of-the-art polymer models of chromatin and on quantitative experimental datasets, which we used both to build the model construction and to validate its outcomes (gene activity). We hope this strengthened link to experiment will catalyse further studies in the field.

      Major points:

      (1) My first point concerns terminology.The Merriam-Webster dictionary describes morphology as the study of structure and form. In my understanding, none of the analyses carried out in this study actually address the form or spatial structuring of transcription factories. I see no aspects of shape, only size. Unless the authors want to assess actual shapes of clusters, I would recommend to instead talk about only their size/extent. The title is, by the same argument, in my opinion misleading as to the content of this study.

      We agree with the Referee that the title could be misleading. In our study we characterized clusters size, that is a morphological descriptor, and cluster composition that isn’t morphology per se but used in the community in a broader sense. Nevertheless to strength the message we have changed the title in: “Cluster size determines internal structure of transcription factories in human cells”

      (2) Another major conceptual point is the choice of how a single TF:pol particle in the model relates to actual macromolecules that undergo clustering in the cell. What about the fact that even single TF factories still contain numerous canonical transcription factors, many of which are also known to undergo phase separation? Mediator, CDK9, Pol II just to name a few. This alone already represents phase separation under the involvement of different species, which must undergo mixing. This is conceptually blurred with the concept of gene-specific transcription factors that are recruited into clusters/condensates due to sequencespecific or chromatin-epigenetic-specific affinities. Also, the fact that even in a canonical gene with a ”small” transcription factory there are numerous clustering factors takes even the smallest factories into a regime of several tens of clustering macromolecules. It is unclear to me how this reality of clustering and factory formation in the biological cell relates to the cross-over that occurs at approximately n=10 particles in the simulations presented in this paper.

      This is a good point. However in our case we can either look at clustering transcription factors or transcription units. In an experimental situation, transcription units could be “coloured”, or assigned different types, by looking at different cell types, so that they can be classified as housekeeping, or cell-type independent, or cell-type specific. This is similar to how DHS can be clustered. In this way the mixing or demixing state can be identified by looking at the type of transcription unit, removing any ambiguity due to the fact that the same protein may participate in different TF complexes..

      (3) The paper falls critically short in referencing and exploiting for analysis existing literature and published data both on 3D genome organization as well as the process of cluster formation in relation to genomic elements. In terms of relevant literature, most of the relevant body of work from the following areas has not been included:

      (i) mechanisms of how the clustering of Pol II, canonical TFs, and specific TFs is aided by sequence elements and specific chromatin states

      (ii) mechanisms of TF selectivity for specific condensates and target genomic elements

      (iii) most crucially, existing highly relevant datasets that connect 3D multi-point contacts with transcription factor identity and transcriptional activity, which would allow the authors to directly test their hypotheses by analysis of existing data

      Here, especially the data under point (iii) are essential. The SPRITE method (cited but not further exploited by the authors), even in its initial form of publication, would have offered a data set to critically test the mixing vs. demixing hypothesis put forward by the authors. Specifically, the SPRITE method offers ordered data on k-mers of associated genomic elements. These can be mapped against the main TFs that associate with these genomic elements, thereby giving an account of the mixed / demixed state of these k-mer associations. Even a simple analysis sorting these associations by the number of associated genomic elements might reveal a demixing transition with increasing association size k. However, a newer version of the SPRITE method already exists, which combines the k-mer association of genomic elements with the whole transcriptome assessment of RNAs associated with a particular DNA k-mer association. This can even directly test the hypotheses the authors put forward regarding cluster size, transcriptional activation, correlation between different transcription units’ activation etc.

      To continue, the Genome Architecture Mapping (GAM) method from Ana Pombo’s group has also yielded data sets that connect the long-range contacts between gene-regulatory elements to the TF motifs involved in these motifs, and even provides ready-made analyses that assess how mixed or demixed the TF composition at different interaction hubs is. I do not see why this work and data set is not even acknowledged? I also strongly suggest to analyze, or if they are already sufficiently analyzed, discuss these data in the light of 3D interaction hub size (number of interacting elements) and TF motif composition of the involved genomic elements.

      Further, a preprint from the Alistair Boettiger and Kevin Wang labs from May 2024 also provides direct, single-cell imaging data of all super-enhancers, combined with transcription detection, assessing even directly the role of number of super-enhancers in spatial proximity as a determinant of transcriptional state. This data set and findings should be discussed, not in vague terms but in detailed terms of what parts of the authors’ predictions match or do not match these data.

      For these data sets, an analysis in terms of the authors’ key predictions must be carried out (unless the underlying papers already provide such final analysis results). In answering this comment, what matters to me is not that the authors follow my suggestions to the letter. Rather, I would want to see that the wealth of available biological data and knowledge that connects to their predictions is used to their full potential in terms of rejecting, confirming, refining, or putting into real biological context the model predictions made in this study.

      References for point (iii):

      - RNA promotes the formation of spatial compartments in the nucleus https://www.cell.com/cell/fulltext/S0092-8674(21)01230-7?dgcid=raven_jbs_etoc_email

      - Complex multi-enhancer contacts captured by genome architecture mapping https://www.nature.com/articles/nature21411

      - Cell-type specialization is encoded by specific chromatin topologies https://www.nature.com/articles/s41586-021-04081-2

      - Super-enhancer interactomes from single cells link clustering and transcription https://www.biorxiv.org/content/10.1101/2024.05.08.593251v1.full

      For point (i) and point (ii), the authors should go through the relevant literature on Pol II and TF clustering, how this connects to genomic features that support the cluster formation, and also the recent literature on TF specificity. On the last point, TF specificity, especially the groups of Ben Sabari and Mustafa Mirx have presented astonishing results, that seem highly relevant to the Discussion of this manuscript.

      We appreciate the Reviewer’s insightful suggestion that a comparison between our simulation results and experimental data would strengthen the robustness of our model. In response, we have thoroughly revised the literature on multi-way chromatin contacts, with particular attention to SPRITE and GAM techniques. However, we found that the currently available experimental datasets lack sufficient statistical power to provide a definitive test of our simulation predictions, as detailed below.

      As noted by the Reviewer, SPRITE experiments offer valuable information on the composition of highorder chromatin clusters (k-mers) that involve multiple genomic loci. A closer examination of the SPRITE data (e.g., Supplementary Material from Ref. [1]) reveals that the majority of reported statistics correspond to 3-mers (three-way contacts), while data on larger clusters (e.g., 8-mers, 9-mers, or greater) are sparse. This limitation hinders our ability to test the demixing-mixing transition predicted in our simulations, which occurs for cluster sizes exceeding 10.

      Moreover, the composition of the k-mers identified by SPRITE predominantly involves genomic regions encoding functional RNAs—such as ITS1 and ITS2 (involved in rRNA synthesis) and U3 (encoding small nucleolar RNA)—which largely correspond to housekeeping genes. Conversely, there is little to no data available for protein-coding genes. This restricts direct comparison to our simulations, where the demixing-mixing transition depends critically on the interplay between housekeeping and tissue-specific genes.

      Similarly, while GAM experiments are capable of detecting multi-way chromatin contacts, the currently available datasets primarily report three-way interactions [2,3].

      In summary, due to the limited statistical data on higher-order chromatin clusters [4], a quantitative comparison between our simulation results and experimental observations is not currently feasible. Nevertheless, we have now briefly discussed the experimental techniques for detecting multi-way interactions in the revised manuscript to reflect the current state of the field, mentioning most of the references that the Reviewer suggested.

      (4) Another conceptual point that is a critical omission is the clarification that there are, in fact, known large vs. small transcription factories, or transcriptional clusters, which are specific to stem cells and ”stressed cells”. This distinction was initially established by Ibrahim Cisse’s lab (Science 2018) in mouse Embryonic Stem Cells, and also is seen in two other cases in differentiated cells in response to serum stimulus and in early embryonic development:

      - Mediator and RNA polymerase II clusters associate in transcription-dependent condensates https://www.science.org/doi/10.1126/science.aar4199

      - Nuclear actin regulates inducible transcription by enhancing RNA polymerase II clustering https://www.science.org/doi/10.1126/sciadv.aay6515

      - RNA polymerase II clusters form in line with surface condensation on regulatory chromatin https://www.embopress.org/doi/full/10.15252/msb.202110272

      - If ”morphology” should indeed be discussed, the last paper is a good starting point, especially in combination with this additional paper: Chromatin expansion microscopy reveals nanoscale organization of transcription and chromatin https://www.science.org/doi/10.1126/science.ade5308

      We thank the Reviewer for pointing out the discussion about small and large clusters observed in stressed cells. Our study aims to provide a broader mechanistic explanation on the formation of TF mixed and demixed clusters depending on their size. However, to avoid to generate confusion between our terminology and the classification that is already used for transcription factories in stem and stressed cells, we have now added some comments and references in the revised text.

      (5) The statement scripts are available upon request is insufficient by current FAIR standards and seems to be non-compliant with eLife requirements. At a minimum, all, and I mean all, scripts that are needed to produce the simulation outcomes and figures in the paper, must be deposited as a publicly accessible Supplement with the article. Better would be if they would be structured and sufficiently documented and then deposited in external repositories that are appropriate for the sharing of such program code and models.

      We fully agree with the Reviewer. We have now included in the main text a link to an external repository containing all the codes required to reproduce and analyze the simulations.

      Recommendations for the authors:

      Minor and technical points

      (6) Red, green, and yellow (mix of green and red) is a particularly bad choice of color code, seeing that red-green blindness is the most common color blindness. I recommend to change the color code.

      We appreciate the Reviewer’s thoughtful comment regarding color accessibility. We fully agree that red–green combinations can pose challenges for color-blind readers. In our figures, however, we chose the red–green–yellow color scheme deliberately because it provides strong contrast and intuitive representation for different TF/TU types. To ensure accessibility, we optimized brightness and saturation within red-green schemes and we carefully verified that the chosen hues are distinguishable under the most common forms of color vision deficiency, i.e. trichromatic color blindness, using color-blindness simulation tools (e.g., Coblis).

      How is the dispersing effect of transcriptional activation and ongoing transcription accounted for or expected to affect the model outcome? This affects both transcriptional clusters (they tend to disintegrate upon transcriptional activation) as well as the large scale organization, where dispersal by transcription is also known.

      We thank the Reviewer for this very insightful question. The current versions of both our toy model and the more complex HiP-HoP model do not incorporate the effects of RNA Polymerase elongation. Our primary goal was to develop a minimalisitc framework that focuses on investigating TF clusters formation and their composition. Nevertheless, we find that this straightforward approach provides a good agreement between simulations and Hi-C and GRO-seq experiments, lending confidence to the reliability of our results concerning TF cluster composition.

      We fully agree, however, that the effects of transcription elongation are an interesting topic for further exploration. For example, modeling RNA Polymerases as active motors that continually drive the system out of equilibrium could influence the chromatin polymer conformation and the structure of TF clusters. Additionally, investigating how interactions between RNA molecules and nuclear proteins, such as SAF-A, might lead to significant changes in 3D chromatin organization and, consequently, transcription [5], is also in intriguing prospect. Although we do not believe that the main findings of our study, particularly regarding cluster composition and mixed-demixed transition, would be impacted by transcription elongation effects, we recognize the importance of this aspect. As such, we have now included some comments in the Conclusions section of the revised manuscript.

      “and make the reasonable assumption that a TU bead is transcribed if it lies within 2.25 diameters (2.25σ) of a complex of the same colour; then, the transcriptional activity of each TU is given by the fraction of time that the TU and a TF:pol lie close together.” How is that justified? I do not see how this is reasonable or not, if you make that statement you must back it up.

      As pointed out by the Referee, we consider a TU to be active if at least one TF is within a distance 2.25σ from that TU. This threshold is a slightly larger than the TU-TF interaction cutoff distance, r<sub>c</sub> \= 1.8σ between TFs and TUs. The rationale for this choice is to ensure that, in the presence of a TU cluster surrounded by TFs, TUs that are not directly in contact with a TF are still considered active. Nonetheless, we find that using slightly different thresholds, such as 1.8σ or 1.1σ, leads to comparable results, as shown in Fig. S11, demonstrating the robustness of our analysis.

      Clearly, close proximity in 1D genomic space favours formation of similarly-coloured clusters. This is not surprising, it is what you built the model to do. Should not be presented as a new insight, but rather as a check that the model does what is expected.

      We believed that this sentence already conveyed that the formation of single-color clusters driven by 1D genomic proximity is not a surprising outcome. However, we have now slightly rephrased it to better emphasize that this is not a novel insight.

      That said, we would like to highlight that while 1D genomic proximity facilitates the formation of clusters of the same color, the unmixed-to-mixed transition in cluster composition is not easily predictable solely from the TU color pattern. Furthermore, in simulations of real chromosomes, where TU patterns are dictated by epigenetic marks, the complexity of these patterns makes it challenging—if not impossible—to predict cluster composition based solely on the input data of our model.

      “…how closely transcriptional activities of different TUs correlate…” Please briefly state over what variable the correlation is carried out, is it cross correlation of transcription activity time courses over time? Would be nice to state here directly in the main text to make it easier for the reader.

      We have now included a brief description in the revised manuscript explaining how the transcriptional correlations were evaluated and how the correlation matrix was constructed.

      “The second concerns how expression quantitative trait loci (eQTLs) work. Current models see them doing so post-transcriptionally in highly-convoluted ways [11, 55], but we have argued that any TU can act as an eQTL directly at the transcriptional level [11].” This text does not actually explain what eQTLs do. I think it should, in concise words.

      We agree with the Referee’s suggestion. We have revised the sentence accordingly and now provide a clear explanation of eQTLs upon their first mention. The revised paragraph now reads as follows:

      “The second concerns how expression quantitative trait loci (eQTLs)—genomic regions that are statistically associated with variation in gene expression levels—function. While current models often attribute their effects to post-transcriptional regulation through complex mechanisms [6,7], we have previously argued that any transcriptional unit (TU) can act as an eQTL by directly influencing gene expression at the transcriptional level [7]. Here, we observe individual TUs up-regulating or down-regulating the activity of others TUs – hallmark behaviors of eQTLs that can give rise to genetic effects such as “transgressive segregation” [8]. This phenomenon refers to cases in which alleles exhibit significantly higher or lower expression of a target gene, and can be, for instance, caused by the creation of a non-parental allele with a specific combination of QTLs with opposing effects on the target gene.”

      “In the string with 4 mutations, a yellow cluster is never seen; instead, different red clusters appear and disappear (Fig. 2Eii)…” How should it be seen? You mutated away most of the yellow beads. I think the kymograph is more informative about the general model dynamics, not the effects of mutations. Might be more appropriate to place a kymograph in Figure 1.

      We agree with the Referee that the kymograph is the most appropriate graphical representation for capturing the effects of mutations. Panel 2E already refers to the standard case shown in Figure 1. We have now clarified this both in the caption and in the main text. In addition, we have rephrased the sentence—which was indeed misleading—as follows:

      “From the activity profiles in Fig. 2C, we can observe that as the number of mutations increases, the yellow cluster is replaced by a red cluster, with the remaining yellow TUs in the region being expelled (Fig. 2B(ii)). This behavior is reflected in the dynamics, as seen by comparing panels E(i) and E(ii): in the string with four mutations, transcription of the yellow TUs is inhibited in the affected region, while prominent red stripes—corresponding to active, transcribing clusters—emerge (Fig. 2E(ii)).” We hope that the comparison is now immediately clear to the reader.

      “…but this block fragments in the string with 4 mutations…” I don’t know or cannot see what is meant by ”fragmentation” in the correlation matrix.

      With the sentence “this block fragments in the string with 4 mutations” we mean that the majority of the solid red pixels within the black box become light-red or white once the mutations are applied. We have now added a clarification of this point in the revised manuscript.

      “Fig. 3D shows the difference in correlation between the case with reduced yellow TFs and the case displayed in Fig. 1E.” Can you just place two halves of the different matrices to be compared into the same panel? Similar to Fig. S5. Will be much easier to compare.

      We thank the Referee for this suggestion. We tried to implement this modification, and report the modified figure below (Author response image 1). As we can see, in the new figure it is difficult to spot the details we refer to in the main text, therefore we prefer to keep the original version of the figure.

      Author response image 1.

      Heatmap comparing activity correlations of TUs in the random string under normal conditions (top half) and with reduced yellow-TF concentration (bottom half).

      What is the omnigenic model? It is not introduced.

      We thank the Reviewer for highlighting this important point. The omnigenic model, first introduced by Boyle et al in Ref. [6], was proposed to explain how complex traits, including disease risk, are influenced by a vast number of genes. Accordingly to this model, the genetic basis of a trait is not limited to a small set of core genes whose expression is directly related to the trait, but also includes peripheral genes. The latter, although not directly involved in controlling the trait, can influence the expression of core genes through gene regulatory networks, thereby contributing to the overall genetic influence on the trait. We have now added a few lines in the revised manuscript to explain this point.

      “Additionally, blue off-diagonal blocks indicate repeating negative correlations that reflect the period of the 6-pattern.” How does that look in a kymograph? Does this mean the 6 clusters of same color steal the TFs from the other clusters when they form?

      The intuition of the Referee is indeed correct. The finite number of TFs leads to competition among TUs of the same colour, resulting in anticorrelation:when a group of six nearby TUs of a given colour is active, other, more distant TUs of the same colour are not transcribing due to the lack of available TFs. As the Referee suggested,this phenomenon is visible in the kymograph showing TU activity. In Author response image 2, it can be observed that typically there is a single TU cluster for each of the three colours (yellow, green, and red). These clusters can be long-lived (e.g., the yellow cluster at the center of the kymograph) or may destroy during the simulation (e.g., the red cluster at the top of the kymograph, which dissolves at t ∼ 600 × 10<sup>5</sup> τ<sub>B</sub>). In the latter case, TFs of the corresponding colour are released into the system and can bind to a different location, forming a new cluster (as seen with the red cluster forming at the bottom of the kymograph for t > 600 × 10<sup>5</sup> τ<sub>B</sub>). This point is further discussed at the point 2.30 of this Reply where additional graphical material is provided.

      Author response image 2.

      Kymograph showing the TU activity during a typical run in the 6-pattern case. Each row reports the transcriptional state of a TU during one simulation. Black pixels correspond to inactive TUs, red (yellow, green) pixels correspond to active red (yellow, green) TUs.

      “Conversely, negative correlations connect distant TUs, as found in the single-color model…” But at the most distal range, the negative correlation is lost again! Why leave this out? Your correlation curves show the same , equilibration towards no correlation at very long ranges.

      As highlighted in Figure 5Ai, long-range negative correlations (grey segments) predominantly connect distant TUs of the same colour. This is quantified in Figure 5Bi: restricting to same-colour TUs shows that at large genomic separations the correlation is almost entirely negative, with small fluctuations at distances just below 3000 kbp where sampling is sparse; we therefore avoid further interpretation of this regime.

      “These results illustrate how the sequence of TUs on a string can strikingly affect formation of mixed clusters; they also provide an explanation of why activities of human TUs within genomic regions of hundreds of kbp are positively correlated [60].” This is a very nice insight.

      We thank the Reviewer for the very supportive comment.

      “To quantify the extent to which TFs of different colours share clusters, we introduce a demixing coefficient, θ<sub>dem</sub> (defined in Fig. 1).” This is not defined in Fig. 1 or anywhere else here in the main text.

      We thank the Referee for pointing this out. For a given cluster, the demixing coefficient is defined as

      where n is the number of colors, i indexes each color present in the model, and x<sub>i,max</sub> the largest fraction of TFs of the same i-th color in a single TF cluster.

      The demixing coefficient is defined in the Methods section; therefore, we have replaced defined in Fig. 1 with see Methods for definition.

      “Mixing is facilitated by the presence of weakly-binding beads, as replacing them with non-interacting ones increases demixing and reduces long-range negative correlations (Figure S3). Therefore, the sequence of strong and weak binding sites along strings determines the degree of mixing, and the types of small-world network that emerge. If eQTLs also act transcriptionally in the way we suggest [11], we predict that down-regulating eQTLs will lie further away from their targets than up-regulating ones.” Going into these side topics and minke points here is super distracting and waters down the message. Maybe first deal with the main conclusions on mixed vs demixed clusters in dependence on the strong and specific binding site patterns, before dealing with other additional points like the role of weak binding sites.

      Thank you for the suggestion. We now changed the paragraph to highlight the main results. The new paragraph is as follows. “These results on activity correlation and TF cluster composition suggest that, if eQTLs act transcriptionally as expected [7], down-regulating eQTLs are likely to be located further from their target genes than up-regulating ones. In addition, it is important to note that mixing is promoted by the presence of weakly binding beads; replacing these with non-interacting ones leads to increased demixing and a reduction in long-range negative correlations (Figure S3). More generally, our findings indicate that the presence of multiple TF colors offers an effective mechanism to enrich and fine-tune transcriptional regulation.”

      “…provides a powerful pathway to enrich and modulate transcriptional regulation.” Before going into the possible meaning and implications of the results, please discuss the results themselves first.

      See previous point.

      Figure 5B. Does activation typically coincide with spatial compaction of the binding sites into a small space or within the confines of a condensate? My guess would be that colocalization of the other color in a small space is what leads to the mixing effect?

      As the Reviewer correctly noted, the activity of a given TU is indeed influenced by the presence of nearby TUs of the same color, since their proximity facilitates the recruitment of additional TFs and enhances the overall transcriptional activity. In this context, the mixing effect is certainly affected by the 1D arrangement of TUs along the chromatin fiber. As emphasized in the revised manuscript, when domains of same-color TUs are present (as in the 6-pattern string), the degree of demixing is greater compared to the case where TUs of different colors alternate and large domains are absent (as in the 1-pattern string). This difference in the demixing parameter as a function of the 1D TU arrangement is clearly visible in Fig. S2B.

      “…euchromatic regions blue, and heterochromatic ones grey.” Please also explain what these color monomers mean in terms of non specific interactions with the TFs.

      Generally, in our simulation approach we assume euchromatin regions to be more open and accessible to transcription factors, whereas heterochromatin corresponds to more compacted chromatin segments [9]. To reflect this, we introduce weak, non-specific interactions between euchromatin and TFs, while heterochromatin interacts with TFs only thorugh steric effects. To clarify this point, we have now slightly revised the caption of Fig.6.

      “More quantitatively, Spearman’s rank correlation coefficient is 3.66 10<sup>−1</sup>, which compares with 3.24 10<sup>−1</sup> obtained previously using a single-colour model [11].” This comparison does not tell me whether the improvement in model performance justifies an additional model component. There are other, likelihood based approaches to assess whether a model fits better in a relevant extent by adding a free model parameter. Can these be used for a more conclusive comparison? Besides, a correlation of 0.36 does not seem so good?

      We understand the Reviewer’s concern that the observed increase in the activity correlation may not appear to provide strong evidence for the improvement of the newly introduced model. However, within the context of polymer models developed to study realistic gene transcription and chromatin organization, this type of correlation analysis is a widely accepted approach for model validation. Experimental data commonly used for such validation include Hi-C maps, FISH experiments, and GRO-seq data [10,11]. The first two are typically employed to assess how accurately the model reproduces the 3D folding of chromatin; a comparison between experimental and simulated Hi-C maps is provided in the Supplementary Information (Fig. S5), showing a Pearson correlation of 0.7. GRO-seq or RNA-seq data, on the other hand, are used to evaluate the model’s ability to predict gene transcription levels. To date, the highest correlation for transcriptional activity data has been achieved by the HiP-HoP model at a resolution of 1 kbp [10], reporting a Spearman correlation of 0.6. Therefore, the correlation obtained with our 2-color model represents a good level of agreement when compared with the more complex HiP-HoP model. In this context, the observed increase in correlation—from 0.324 to 0.366—can be regarded as a modest yet meaningful improvement.

      “…consequently, use of an additional color provides a statisticallysignificant improvement (p-value < 10<sup>−6</sup>, 2-sided t-test).” I do not follow this argument. Given enough simulation repeats, any improvement, no matter how small, will lead to statistically significant improvements.

      We agree that this sentence could be misleading. We have now rephrased it in a clearer manner specifying that each of the two correlation values is statistically significant alone, while before we were wrongly referring to the significance of the improvement.

      “Additionally, simulated contact maps show a fair agreement with Hi-C data (Figure S5), with a Pearson correlation r ∼ 0.7 (p-value < 10<sup>−6</sup>, 2-sided t-test).” Nice!

      We thank the Reviewer for the positive comment.

      “Because we do not include heterochromatin-binding proteins, we should not however expect a very accurate reproduction of Hi-C maps: we stress that here instead we are interested in active chromatin, transcription and structure only as far as it is linked to transcription.” Then why do you not limit your correlation assessment to only these regions to show that these are very well captured by your model?

      We thank the Reviewer for this insightful comment. Indeed, we could have restricted our investigation to active chromatin regions, as done in our previous works [11,12]. However, our intention in this section of the manuscript was to clarify that the current model is relatively simple and therefore not expected to achieve a very high level of agreement between experimental and simulated Hi-C maps. Another important limitation of the two color model described in the section is the absence of active loop extrusion mediated by SMC proteins, which is known to play a central role in establishing TADs boundaries. Consequently, even if our analysis were limited to active chromatin regions, the agreement with experimental Hi-C maps would still remain lower than that obtained with more comprehensive models, such as HiP-HoP, that we use later in the last section of the paper. We have now added a comment in the revised manuscript explicitly noting the lack of active loop extrusion in our 2-color model.

      “We also measure the average value of the demixing coefficient, θ<sub>dem</sub> (Materials and Methods). If θ<sub>dem</sub> = 1, this means that a cluster contains only TFs of one colour and so is fully demixed; if θ<sub>dem</sub> = 0, the cluster contains a mixture of TFs of all colors in equal number, and so is maximally mixed.” Repetitive.

      We have now rephrased the sentence in a more concise way.

      “…notably, this is similar to the average number of productivelytranscribing pols seen experimentally in a transcription factory [6].” That seems a bit fast and loose. The number of Polymerases can differ depending on state, type of factory, gene etc. and vary between anything from to a few hundreds of Polymerase complexes depending on definition of factory, and what is counted as active. Also, one would think that polymerases only make up a small part of the overall protein pool that constitutes a condensate, so it is unclear whether this is a pertinent estimate.

      Here we refer to the average size of what is normally referred to as a PolII factory, not a generic nuclear condensate. These are the clusters which arise in our simulations. These structures emerge through microphase separation and have been well characterised, for instance see [13] for a recent review. For these structures while there is a distribution the average is well defined and corresponds to a size of about 100 nm, which is very much in line with the size of the clusters we observe, both in terms of 3D diameter and number of participating proteins. Because of the size, the number of active complexes which can contribute cannot be significantly more than ∼ 10. These estimates are, we note, very much in line with super-resolution measurements of SAF-A clusters [14], which are associated with active transcription and hence it is reasonable to assume they colocalise with RNA and polymerase clusters.

      “Conversely, activities of similar TUs lying far from each other on the genetic map are often weakly negatively correlated, as the formation of one cluster sequesters some TFs to reduce the number available to bind elsewhere.” This point is interesting, and I strongly suspect that this indeed happening. But I don’t think it was shown in the analysis of the simulation results in sufficient clarity. We need direct assessment of this sequestration, currently it’s only indirectly inferred.

      Indeed, this is the mechanism underlying the emergence of negative long-range correlations among TU activity values. As the Reviewer correctly pointed out, the competition for a finite number of TFs was only indirectly inferred in the original manuscript. To address this, we have now included a new figure explicitly illustrating this effect. In Fig. S12, we show the kymograph of active TUs (left panel), as in Fig. 2E(i) of the main text, alongside a new kymograph depicting the number of green TFs within a sphere of radius 10σ centered on each green TU (right panel). For simplicity, we focus here only on green TUs and TFs. It can be observed that, during the initial part of the simulation, green TFs are localized near genomic position ∼ 2000(right panel), where green TUs are transcriptionally active (left panel). Toward the end of the simulation, TUs near genomic position ∼ 500 become active, coinciding with the relocation of TFs to this region and the depletion of the previous one.

      In the definition for the demixing coefficient (equation 1), what does the index i stand for?

      Here i is an index denoting each of the colors present in the model. We have now specified the meaning of i after Eq. 1.

      Reviewer 3 (Public Review):

      In this work, the authors present a chromatin polymer model with some specific pattern of transcription units (TUs) and diffusing TFs; they simulate the model and study TFclustering, mixing, gene expression activity, and their correlations. First, the authors designed a toy polymer with colored beads of a random type, placed periodically (every 30 beads, or 90kb). These colored beads are considered a transcription unit (TU). Same-colored TUs attract with each other mediated by similarly colored diffusing beads considered as TFs. This led to clustering (condensation of beads) and correlated (or anti-correlation) ”gene expression” patterns. Beyond the toy model, when authors introduce TUs in a specific pattern, it leads to emergence of specialized and mixed cluster of different TFs. Human chromatin models with realistic distribution of TUs also lead to the mixing of TFs when cluster size is large.

      Strengths.

      This is a valuable polymer model for chromatin with a specific pattern of TUs and diffusing TF-like beads. Simulation of the model tests many interesting ideas. The simulation study is convincing and the results provide solid evidence showing the emergence of mixed and demixed TF clusters within the assumptions of the model.

      Weaknesses.

      Weakness of the work: The model has many assumptions. Some of the assumptions are a bit too simplistic. Concerns about the work are detailed below:

      We thank the Referee for this overall positive evaluation.

      We thank the Referee for this important observation. The way we The authors assume that when the diffusing beads (TFs) are near a TU, the gene expression starts. However, mammalian gene expression requires activation by enhancer-promoter looping and other related events. It is not a simple diffusion-limited event. Since many of the conclusions are derived from expression activity, will the results be affected by the lack of looping details?

      We do not need to assume promoter-enhancer contact, this emerges naturally through the bridging-induced phase separation and indeed is a key strength of our model. Even though looping is not assumed as key to transcriptional initiation, in practice the vast majority of events in which a TF is near a TU are associated with the presence of a cluster where regulatory elements are looped. So transcription in our case is associated with the bridging-induced phase separation, and there is no lack of looping, looping is naturally associated with transcription, and this is an emergent property of the model (not an assumption), which is an important feature of our model. Accordingly, both contact maps and transcriptional activity are well predicted by our model, both in the version described here and in the more sophisticated single-colour HiP-HoP model [10] (an important ingredient of which is the bridging-induced phase separation).

      Authors neglect protein-protein interactions. Without proteinprotein interactions, condensate formation in natural systems is unlikely to happen.

      We thank the Reviewer for pointing out the absence of protein-protein interactions in our simulations. While we acknowledge this limitation, we would like to emphasize that experimental studies have not observed nuclear proteins forming condensates at physiological concentrations in the absence of DNA or chromatin. For example, studies such as Ryu et al. [15] and Shakya et al. [16] show that protein-protein interactions alone are insufficient to drive condensate formation in vivo. Instead, the presence of a substrate, such as DNA or chromatin, is essential to favor and stabilize the formation of protein clusters.

      In our simulations, we propose that protein liquid-liquid phase separation (LLPS) is driven by the presence of both strong and weak attractions between multivalent protein complexes and the chromatin filament. As stated in our manuscript, the mechanism leading to protein cluster formation is the bridging induced attraction. This mechanism involves a positive feedback loop, where protein binding to chromatin induces a local increase in chromatin density, which then attracts more proteins, further promoting cluster formation.

      While we acknowledge that adding protein-protein interactions could be incorporated into our simulations, we believe this would need to be a weak interaction to remain consistent with experimental data. Additionally, incorporating such interactions would not alter the conclusions of our study.

      What is described in this paper is a generic phenomenon; many kinds of multivalent chromatin-binding proteins can form condensates/clusters as described here. For example, if we replace different color TUs with different histone modifications and different TFs with Hp1, PRC1/2, etc, the results would remain the same, wouldn’t they? What is specific about transcription factor or transcription here in this model? What is the logic of considering 3kb chromatin as having a size of 30 nm? See Kadam et al. (Nature Communications 2023). Also, DNA paint experimental measurement of 5kb chromatin is greater than 100 nm (see work by Boettiger et al.).

      We thank the Reviewer for this important observation, which we now address. To begin, we consider the toy model introduced in the first part of the manuscript, where TUs are randomly positioned rather than derived from epigenetic data. As the Reviewer points out, in this simplified context, our results reflect a generic phenomenon: the composition of clusters depends primarily on their size, independent of the specific types of proteins involved. However, the main goal of our work is to gain insights into apparently contradictory experimental findings, which show that some transcription factories consist of a single type of transcription factors, while other contain multiple types. This led us to focus on TF clusters and their role in transcriptional regulation and co-regulation of distant genes. Therefore, in the second part of the manuscript, we use DNase I hypersensitive site (DHS) data to position TUs based on predicted TF binding sites, providing a more biological framework. In both the toy model and the more realistic HiP-HoP model, we observe a size-dependent transition in cluster composition. However, we refrain from generalizing these results to clusters composed of other protein complexes, such as HP1 and PRC, as their binding is governed by distinct epigenetic marks (e.g. H3K927me3 and H3K27me3), which exhibit different genomic distributions compared to DHS marks.

      Finally, the mapping of 3kb to 30nm is an estimate which does not significantly impact our conclusions. The relationship between genomic distance (in kbp) and spatial distance (in nm) is highly dependent on the degree of chromatin compaction, which can vary across cell types and genomic context. As such, providing an exact conversion is challenging [17]. For example, in a previous work based on the HiP-HoP model [12] we compared simulated and experimental FISH measurements and found that 1kbp typically corresponds to 15 − 20nm, implying that 3kbp could span 60nm. Nevertheless, we emphasize that varying this conversion factor does not affect the core results or conclusions of our study. We have now included a clarification in the revised SI to highlight this point.

      Recommendations for the authors:

      Other points.

      Figure 1(D) caption says 2.25σ = 1.6 nanometer. Is this a typo? Sigma is 30nm.

      Yes, it was. As 1σ ∼ 30nm, we have 2.25σ = 2.25 · 30 nm = 67.2 nm ∼ 6.7 × 10<sup>−8</sup>m. We have now corrected the caption.

      Page 6, column 2nd, 3rd para, it is written that θ<sub>dem</sub> (”defined in Fig.1”). There is no θ<sub>dem</sub> defined in Fig.1, is there? I can see it defined in Methods but not in Fig. 1.

      Correct, we replaced (defined in Fig.1) with (see Methods for definition).

      Page 6, column 2, 4th para: what does “correlations overlap and correlations diverge mean”?

      With reference to the plots from Fig. 5B, correlation overlap and diverge simply refers to the fact that same-colour (red curves) and different-colour (blue curves) correlation trends may or may not overlap on each other. We have now clarified this point.

      What is the precise definition of correlation in Fig 5B (Y-axis)?

      In Fig.5B, correlation means Pearson correlation. We have now specified this point in the revised text and in the caption of Fig.5.

      References

      (1) S. A. Quinodoz, J. W. Jachowicz, P. Bhat, N. Ollikainen, A. K. Banerjee, I. N. Goronzy, M. R. Blanco, P. Chovanec, A. Chow, Y. Markaki et al., “Rna promotes the formation of spatial compartments in the nucleus,” Cell, vol. 184, no. 23, pp. 5775–5790, 2021.

      (2) R. A. Beagrie, A. Scialdone, M. Schueler, D. C. Kraemer, M. Chotalia, S. Q. Xie, M. Barbieri, I. de Santiago, L.-M. Lavitas, M. R. Branco et al., “Complex multi-enhancer contacts captured by genome architecture mapping,” Nature, vol. 543, no. 7646, pp. 519–524, 2017.

      (3) R. A. Beagrie, C. J. Thieme, C. Annunziatella, C. Baugher, Y. Zhang, M. Schueler, A. Kukalev, R. Kempfer, A. M. Chiariello, S. Bianco et al., “Multiplex-gam: genome-wide identification of chromatin contacts yields insights overlooked by hi-c,” Nature Methods, vol. 20, no. 7, pp. 1037–1047, 2023.

      (4) L. Liu, B. Zhang, and C. Hyeon, “Extracting multi-way chromatin contacts from hi-c data,” PLOS Computational Biology, vol. 17, no. 12, p. e1009669, 2021.

      (5) R.-S. Nozawa, L. Boteva, D. C. Soares, C. Naughton, A. R. Dun, A. Buckle, B. Ramsahoye, P. C. Bruton, R. S. Saleeb, M. Arnedo et al., “Saf-a regulates interphase chromosome structure through oligomerization with chromatin-associated rnas,” Cell, vol. 169, no. 7, pp. 1214–1227, 2017.

      (6) E. A. Boyle, Y. I. Li, and J. K. Pritchard, “An expanded view of complex traits: from polygenic to omnigenic,” Cell, vol. 169, no. 7, pp. 1177–1186, 2017.

      (7) C. Brackley, N. Gilbert, D. Michieletto, A. Papantonis, M. Pereira, P. Cook, and D. Marenduzzo, “Complex small-world regulatory networks emerge from the 3d organisation of the human genome,” Nat. Commun., vol. 12, no. 1, pp. 1–14, 2021.

      (8) R. B. Brem and L. Kruglyak, “The landscape of genetic complexity across 5,700 gene expression traits in yeast,” Proceedings of the National Academy of Sciences, vol. 102, no. 5, pp. 1572– 1577, 2005.

      (9) M. Chiang, C. A. Brackley, D. Marenduzzo, and N. Gilbert, “Predicting genome organisation and function with mechanistic modelling,” Trends in Genetics, vol. 38, no. 4, pp. 364–378, 2022.

      (10) M. Chiang, C. A. Brackley, C. Naughton, R.-S. Nozawa, C. Battaglia, D. Marenduzzo, and N. Gilbert, “Genome-wide chromosome architecture prediction reveals biophysical principles underlying gene structure,” Cell Genomics, vol. 4, no. 12, 2024.

      (11) A. Buckle, C. A. Brackley, S. Boyle, D. Marenduzzo, and N. Gilbert, “Polymer simulations of heteromorphic chromatin predict the 3d folding of complex genomic loci,” Mol. Cell, vol. 72, no. 4, pp. 786–797, 2018.

      (12) G. Forte, A. Buckle, S. Boyle, D. Marenduzzo, N. Gilbert, and C. A. Brackley, “Transcription modulates chromatin dynamics and locus configuration sampling,” Nature Structural & Molecular Biology, vol. 30, no. 9, pp. 1275–1285, 2023.

      (13) P. R. Cook and D. Marenduzzo, “Transcription-driven genome organization: a model for chromosome structure and the regulation of gene expression tested through simulations,” Nucleic acids research, vol. 46, no. 19, pp. 9895–9906, 2018.

      (14) M. Marenda, D. Michieletto, R. Czapiewski, J. Stocks, S. M. Winterbourne, J. Miles, O. C. Flemming, E. Lazarova, M. Chiang, S. Aitken et al., “Nuclear rna forms an interconnected network of transcription-dependent and tunable microgels,” BioRxiv, pp. 2024–06, 2024.

      (15) J.-K. Ryu, C. Bouchoux, H. W. Liu, E. Kim, M. Minamino, R. de Groot, A. J. Katan, A. Bonato, D. Marenduzzo, D. Michieletto et al., “Bridging-induced phase separation induced by cohesin smc protein complexes,” Science advances, vol. 7, no. 7, p. eabe5905, 2021.

      (16) A. Shakya, S. Park, N. Rana, and J. T. King, “Liquid-liquid phase separation of histone proteins in cells: role in chromatin organization,” Biophysical journal, vol. 118, no. 3, pp. 753–764, 2020.

      (17) A.-M. Florescu, P. Therizols, and A. Rosa, “Large scale chromosome folding is stable against local changes in chromatin structure,” PLoS computational biology, vol. 12, no. 6, p. e1004987, 2016.

    1. Además, se genera una imagen sobrevalorada del desempeño, lo que deteriora la confiabilidad de las notas como un indicador de competencias (Schorr, 2025). Así, obtener buenas calificaciones independientemente de las características del curso desincentiva la asistencia a clases, debido a que se puede considerar una actividad prescindible para aprobar las materias. De esta manera, se genera una sensación de injusticia en las notas entre aquellos estudiantes comprometidos con sus deberes académicos y quienes cumplen con lo justo, pues las calificaciones no distinguen esto. De hecho, como el estándar de calificación es elevado, si un estudiante no alcanza una nota de acuerdo a ese estándar, esto genera estrés en los universitarios, puesto que cualquier nota por debajo de este umbral es considerada como un fracaso (Schorr, 2025).

      Revisaría la redacción de este párrafo. Me parece que se repite muchas veces una o dos ideas y que se podrían presentar de manera más concisa.

    2. llegando al punto de que los universitarios pueden ausentarse a las evaluaciones regulares de los cursos y solamente presentarse al exámen final, o bien que la asistencia obligatoria pierda este carácter.

      En general, creo que hace falta citas en este párrafo.

      Sobre esta frase en particular, ¿hay alguna evaluación desde Pregrado sobre la implementación de las adecuaciones curriculares?

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      In this manuscript, Aghabi et al. present a comprehensive characterization of ZFT, a metal transporter located at the plasma membrane of the eukaryotic parasite Toxoplasma gondii. The authors provide convincing evidence that ZFT plays a crucial role in parasite fitness, as demonstrated by the generation of a conditional knockdown mutant cell line, which exhibits a marked impact on mitochondrial respiration, a process dependent on several iron-containing proteins. Consistent with previous reports, the authors also show that disruption of mitochondrial metabolism leads to conversion into the persistent bradyzoite stage. The study then employed advanced techniques, such as inductively coupled plasma-mass spectrometry (ICP-MS) and X-ray fluorescence microscopy (XFM), to demonstrate that ZFT depletion results in reduced parasite-associated metals, particularly iron and zinc. Additionally, the authors show that ZFT expression is modulated by the availability of these metals, although defects in the transporter could not be compensated for by exogenous addition of iron or zinc. 

      While the manuscript does not directly investigate the transport function of ZFT through biochemical assays, the authors indirectly support the notion that ZFT can transport zinc by demonstrating its ability to compensate for a lack of zinc transport in a yeast heterologous system. Furthermore, phenotypic analyses suggest defects in iron availability, particularly with regard to Fe-S mitochondrial proteins and mitochondrial function. Overall, the manuscript provides a solid, well-rounded argument for ZFT's role in metal transport, using a combination of complementary approaches. Although direct biochemical evidence for the transporter's substrate specificity and transport activity is lacking, the converging evidence, including changes in metal concentrations upon ZFT depletion, yeast complementation data, and phenotypic changes linked to iron deficiency, presents a convincing case. Some aspects of the results may appear somewhat unbalanced, particularly since iron transport could not be confirmed through heterologous complementation, while zinc-related phenotypes in the parasites have not been thoroughly explored (which is challenging given the limited number of zinc-dependent proteins characterized in Toxoplasma). Nevertheless, given that metal acquisition remains largely uncharacterized in Toxoplasma, this manuscript provides an important first step in identifying a metal transporter in these parasites, and the data presented are generally convincing and insightful. 

      We thank the reviewer for their assessment and would like to highlight that we now add direct biochemical characterisation in the new Figure 8, supporting our hypothesis and confirming iron transport by this protein.

      Reviewer #2 (Public review): 

      Summary: 

      The intracellular pathogen Toxoplasma gondii scavenges metal ions such as iron and zinc to support its replication; however, mechanistic studies of iron and zinc uptake are limited. This study investigates the function of a putative iron and zinc transporter, ZFT. In this paper, the authors provide evidence that ZFT mediates iron and zinc uptake by examining the regulation of ZFT expression by iron and zinc levels, the impact of altered ZFT expression on iron sensitivity, and the effects of ZFT depletion on intracellular iron and zinc levels in the parasite. The effects of ZFT depletion on parasite growth are also investigated, showing the importance of ZFT function for the parasite. 

      Strengths: 

      A key strength of the study is the use of multiple complementary approaches to demonstrate that ZFT is involved in iron and zinc uptake. Additionally, the authors build on their finding that loss of ZFT impairs parasite growth by showing that ZFT depletion induces stage conversion and leads to defects in both the apicoplast and mitochondrion. 

      Weaknesses: 

      (1) Excess zinc was shown not to alter ZFT expression, but a cation chelator (TPEN) did lead to decreased expression. While TPEN is often used to reduce zinc levels, does it have any effect on iron levels? Could the reduction in ZFT after TPEN treatment be due to a reduction in the level of iron or another cation?

      WE thank the reviewers for this comment, we agree that TPEN is a fairly unspecific cation chelator so to determine if its effects are due to removal of zinc or other cations we treated with TPEN and either zinc or iron. Co-incubation of TPEN and zinc prevented ZFT depletion, while TPEN+FAC had no effect compared to TPEN alone (new Figure 6h and i), strongly suggesting the effects on ZFT abundance are linked to zinc and not just iron.  

      (2) ZFT expression was found to be dynamic depending on the size of the vacuole, based on mean fluorescence intensity measurements. Looking at protein levels by Western blot at different times during infection would strengthen this finding. 

      We show here that ZFT expression is highly dynamic, depending both the iron status of the host cell and the number of parasites/vacuole. However, validating this finding by western would be complex due to the highly unsynchronised nature of parasite replication and the large number (5x10<sup>6</sup> - 1x10<sup>7</sup>cells) of parasites required to visualise ZFT. Further, we show that ZFT is apparently internalised prior to degradation. For this reason, we have not attempted to validate this finding by western blotting at this time.

      (3) ZFT localization remained at the parasite periphery under low iron conditions. However, in the images shown in Figure S1c, larger vacuoles (containing 4-8 parasites) are shown for the untreated conditions, and single parasite-containing vacuoles are shown for the low iron condition. As ZFT localization is predominantly at the basal end of the parasite in larger PV and at the parasite periphery for smaller vacuoles, it would be better to compare vacuoles of similar size between the untreated and low-iron conditions.

      The reviewer brings up a good point, the concentration of iron chelator that we used here does not enable parasite replication, making an assessment of changes in localisation challenging. To address this, have new data using a much lower concentration of chelator (20 mM), which is still expected to impact the parasites (Hanna et al, 2025), but allows for replication. In this low iron environment, ZFT localisation remained significantly more peripheral (Fig. S1d,e), supporting our hypothesis that ZFT localisation is iron dependent, independent of vacuolar stage.

      Reviewer #3 (Public review): 

      Summary:

      Aghabi et al set out to characterize a T. gondii transmembrane protein with a ZIP domain, termed ZFT. The authors investigate the consequences of ZFT downregulation and overexpression for parasite fitness. Downregulation of ZFT causes defects in the parasite's endosymbiotic organelles, the apicoplast and the mitochondrion. Specifically, lack of ZFT causes a decrease in mitochondrial respiration, consistent with its role as an iron transporter. This impact on the mitochondria appears to trigger partial differentiation to bradyzoites. The authors furthermore demonstrate that expression of TgZFT can rescue a yeast mutant lacking its zinc transporter and perform an array of direct metal ion measurements, including X-ray fluorescence microscopy and inductively coupled mass spectrometry (ICP-MS). These reveal reduced metal ions in parasites depleted in ZFT. Overall, the data by Aghabi et al. reveal that ZFT is a major metal ion transporter in T. gondii, importing iron and zinc for diverse essential processes. 

      Strengths:

      This study's strength lies in the thorough characterization of the transporter. The authors combine a number of techniques to measure the impact of ZFT depletion, ranging from the direct measurement of metal ions to determining the consequences for the parasite's metabolism (mitochondrial respiration), as well as performing a yeast mutant complementation. This work is very thorough and clearly presented, leaving little doubt about this protein's function. 

      Weaknesses:

      This study offers no major novel insights into the biology of T. gondii. The transporter was already annotated as a zinc transporter (ToxoDB), was deemed essential (PMID: 27594426), and localized to the plasma membrane (PMID: 33053376). This study mostly confirms and validates these previous datasets. The authors identify three other proteins with a ZIT domain. Particularly, the role of TGME49_225530 is intriguing, as it is likely fitness-conferring (score: -2.8, PMID: 27594426) and has no subcellular localization assigned. Characterizing this protein as well, revealing its localization, and identifying if and how these transporters coordinate metal ion transport would have been worthwhile. 

      We agree that the work presented here validates the previous datasets, and if that was all we had done, we agree that the biological insights would be limited. However, we have gone significantly beyond the predictions, demonstrating dynamic localisation changes, iron-mediated regulation, the lack of substrate-based complementation and validating transport activity of both zinc and iron. Although in silico predictions and screens can be informative, it remains important to validate biological functions experimentally. While we agree that characterisation of TGME49_225530 (as well as the other two annotated ZIP proteins) would be interesting, and will certainly form part of our future plans, it is significantly beyond the scope of the presented manuscript.

      Another weakness is the data related to the impact of ZFT downregulation on the apicoplast in Figure 4. The authors show that downregulation of ZFT causes an increase in elongated apicoplasts (Figure 4d). The subsequent panels seem to show that the parasites present a dramatic growth defect at that time point. This growth arrest can directly explain the elongated apicoplast, but does not allow any conclusion about an impact on the organelle. In any case, an assessment of 'delayed death' as presented in Figure 4c seems futile, since the many other processes affected by zinc and iron depletion likely cause a rapid death, masking any potential delayed death.

      To address this point, we agree that given the importance of iron and zinc to the parasite that we cannot differentiate the death of the parasite due to apicoplast defects from death from other causes and we have modified the discussion to reflect this, as below.

      “However, given the delayed phenotype typically seen upon apicoplast disruption, we cannot determine if this is a direct effect of ZFT, or a downstream consequence of metal depletion”

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Specific Comments: 

      (1) The background on the typical sequence features that would identify Toxoplasma ZIP homologues should be expanded and clarified. While these proteins are likely quite divergent and may lack many conserved features, the manuscript currently does not provide enough detail to assess how similar (or different) TgZIPs are from well-characterized family members. Additionally, the justification for focusing on TGGT1_261720 (ZFT) over TGGT1_225530, as stated in the first paragraph of the results section, seems unclear. There is no predictive data supporting a potential plasma membrane localization for TGGT1_225530 (yet this cannot be excluded), and TGGT1_225530 appears to have more canonical metal-binding motifs. I believe that the fact that only TGGT1_261720 is iron-regulated should be sufficient justification for its selection, and this point could be emphasized more clearly. Furthermore, the discussion mentions a leucine residue that may be associated with broad substrate specificity, but this is not addressed in the initial comparative sequence analysis. These residues and the HK motif are not actually addressed in the Gyimesi et al. reference currently mentioned; thus this could be clarified and updated with references (such as PMID: 31914589) that provide more recent insights into key residues involved in metal selectivity in ZIP transporters.

      We thank you for this comment, to address these points:

      We agree that the iron-mediated regulation is sufficient for our focus on ZFT and have clarified the text to reflect this, as described above.

      We have also updated the references as suggested, our apologies for this oversight.

      We have further expanded the discussion, especially with reference to our new results using heterologous expression in oocytes (please see above).

      (2) Figure 1D, Figure 2A, C, H, Figure 3D, Figure 6F, H, corresponding text and paragraph 2 of the Discussion: It seems that most of the "non-specific bands" annotated in Figure 1D, which are lower molecular weight products, are not present in the parental cell line, suggesting they may not be non-specific after all. These bands also vary depending on the cell line (e.g., promoter used, see Figures 2H and 3D) or experimental conditions (e.g., iron excess or depletion). Given the dynamic localization of ZFT during intracellular development, it may be worth exploring whether these lower molecular weight bands represent degraded forms of TgZFT, possibly corresponding to the basally-clustered signal observed by immunofluorescence, with only the full-length protein associating with the plasma membrane. This possibility should be investigated or at least discussed further.

      While the lower bands are not present in the parental, we do see them in other HA-tagged lines, especially when the expression of the tagged protein is low, seen below (Author response image 1). We don’t currently have an explanation for these, but we can confirm that they do not change in abundance in parallel with the full length protein, supporting our hypothesis that these bands are an artefact of the anti-HA antibody in our system. Although ZFT is clearly degraded (e.g. Fig. 1g), we currently do not believe these bands are ZFT c-terminal degradation products.

      Author response image 1.

      Western blot of ZFT-3HA<sub>zft</sub> and another HA-tagged unrelated cytosolic protein, demonstrating that the lower bands are most likely nonspecific.

      (3) It is unfortunate that ZFT could not complement a yeast iron transporter mutant cell line, as this would have provided a strong argument for ZFT's role in iron transport. The manuscript does not provide much detail about the Δfet2/3 yeast mutant line. Fet3 is the ferroxidase subunit, while Ftr1 is the permease subunit of the high-affinity iron transport complex in yeast. Fet2, however, appears to be Saccharomyces cerevisiae's VPS41 homolog. Therefore, is Δfet2/3 the most appropriate mutant to use, or would another mutant line (e.g., ΔFtr1) be a better choice? Additionally, while Figure 7 suggests a decrease in metal uptake upon ZFT depletion, it would be useful to test whether overexpression of ZFT leads to enhanced metal incorporation, perhaps using a FerroOrange assay. 

      We thank the reviewer for their comments, which we have answered below:

      The Δfet2/3 yeast mutant was a typo and has been corrected, or apologies, we did use the  Δfet3/4 mutant line, based on previous successful experiments involving plant metal transporters (e.g  (DiDonato et al., 2004)).

      Unfortunately, we were unable to perform the FerroOrange assay in the overexpression line as this line is endogenously fluorescent in the same channel as FerroOrange.

      However, as detailed above we have now added significant new data, confirming our hypothesis that ZFT is an iron/zinc transporter through heterologous expression in Xenopus oocytes in the new figure 8. This provides direct evidence of transport of iron, and evidence that zinc can inhibit this transport, consistent with our hypothesis.  

      (4) The annotation of the blot in Figure 2H suggests that overexpressed ZFT-TY can only be detected in the absence of heat denaturation. However, this is not addressed in the text. Does heat denaturation also affect the detection of ZFT-3HA or the lower molecular weight products? This should be clarified in the manuscript. 

      Interestingly, ZFT is detectable after boiling at 95° C for 5 minutes when expressed at endogenous (or near endogenous) levels in the ZFT-3HA<sub>sag1</sub> and ZFT-3HA<sub>zft</sub> tagged parasite lines. However, overexpression of ZFT leads to a loss of detection via western blot when boiled, although the protein is detectable without heat denaturation.

      A possible explanation for this is that overexpression of protein may cause ZFT to miss-fold, making the protein more prone to aggregation following boiling, rendering the protein insoluble and unable to enter the gel. Moreover, heat aggregation can sometimes mask the epitope tags on the protein that is required for the antibody to be recognised, possibly explaining by ZFT is undetectable when overexpressed and exposed to boiling conditions, as has previously been observed for other transmembrane proteins (e.g. (Tsuji, 2020)).

      We have clarified this in the results section, although we do not have a full explanation for this, we consider it important to share for others who may be looking at expression of these proteins.

      (5) Figure 3G: It might be helpful to include an uncropped gel profile to allow readers to visualize that the main product does indeed correspond to a potential dimeric form in the native PAGE. 

      This has now been added in Figure S3e, thank you for this suggestion.

      (6) The investigation of the impact of ZFT depletion on the apicoplast could be improved. The authors suggest that ZFT knockdown inhibits apicoplast replication based on a modest increase in elongated organelles, but the term "delayed death" is not appropriate in that case, as it is typically linked to a loss of the organelle. This is not observed here and is also illustrated by the unchanged CPN60 processing profile. So, clearly, there seems to be no strong morphological effect on the apicoplast early on after ZFT depletion. On the other hand, the authors dismiss any impact on TgPDH-E2 lipoylation (which is iron-dependent) based on the fact that the lipoylated form of the protein is still detected by Western blot. However, closer inspection of the blot in Figure 4B suggests that the intensity of the annotated TgPDH-E2 signal is reduced compared to the -ATc condition (although there might be differences in protein loading, as indicated by the control) or even with the mitochondrial 2-oxoglutarate dehydrogenase-E2, whose lipoylation is presumably iron-independent (see PMID: 16778769). This experiment should be repeated, and the results quantified properly in case something was missed, and the duration of depletion conditions perhaps extended further. Of note, it would also be worthwhile to revisit size estimations, as the displayed profiles seem inconsistent with the typical sizes of lipoylated proteins detected with the anti-lipoyl antibody (e.g., ~100 kDa for PDH-E2, ~60 kDa for branched-chain 2-oxo acid dehydrogenase, and ~40 kDa 2-oxoglutarate dehydrogenase).

      We thank the reviewer for this comment. We agree that there is no strong defect on the apicoplast in the first lytic cycle and we have modified the language to remove reference to delayed death, as given the magnitude of changes associated with loss of iron and zinc, we cannot be certain about the role of the apicoplast.

      Based on this suggestion, we have now quantified the levels of lipoylation of PDH-E2, BDCK-E2 and OGDH-E2 and now include this in Figure S4b, c, d. Supporting our other results, we do not see a significant change in PDH-E2 lipolyation upon ZFT knockdown. However, although OGDH-E2 lipoylation is unchanged (Figure S4c) interestingly we do see a significant increase in BDCK-E2 lipoylation (Figure S4d). This process is not expected to be directly iron related, as mitochondrial lipoylation is through scavenging rather than synthesis however, speaks to the larger mitochondrial disruption that we see. We now consider this further in the discussion.

      For the sizes, we thank the reviewer for bringing this up, our apologies this was due to an error in the annotation, and we have now corrected this in the figure.

      (7) In the third paragraph of the discussion, the authors mention the inability to complement ZFT loss by adding exogenous metals. One argument is the potential lack of metal access to the parasitophorous vacuole (PV). Although largely unexplored, this point could be expanded further in the discussion, as the issue of metal transport to the parasite involves not only the parasite plasma membrane but also the PV membrane. Additionally, the authors mention the absence of functional redundancy in transporters, but it would be helpful to discuss potential stage-specific or differential expression of other ZIP candidates. Transcriptomic data available on Toxodb.org could provide useful insights into this, and experimental approaches, such as RT-PCR, could be used to assess the expression of these candidates in the absence of ZFT. 

      On the issue of metals crossing the PV membrane, we agree that while we do not currently know mechanisms of metal transport within the infected host cell, we do have experimental confirmation that the concentration and form of the metals that we are using can impact the parasites. We show that metal treatment inhibits parasites growth (e.g. Figure 3k-n, Figure 6a-d) and we can detect the increased metals through our experiments using FerroOrange and FluroZine (Figure 7a, c). In these experiments, parasites were treated intracellularly and so we can confirm that, regardless of the mechanism, iron and zinc can reach the parasite. While entry of metals across the PV is an intriguing question, it is beyond the scope of the present work which focuses on the role of the selected transporter.

      We agree that a more detailed discussion of the other ZIP transporters is warranted. We have extended this section of the discussion although for now, we cannot determine the role of the other ZIP transporters in Toxoplasma.

      (8) In the discussion, the authors mention that « Inhibition of respiration has previously been linked to bradyzoite conversion ». To strengthen their point, the authors could mention that mitochondrial Fe-S mutants, as well as mutants affecting mitochondrial translation or the mitochondrial electron transport chain, also initiate bradyzoite conversion (PMID: 34793583). This would reinforce the connection between mitochondrial dysfunction and stage conversion. 

      This is an excellent point and we have added this to the discussion as follows:

      “Inhibition of mitochondrial Fe-S biogenesis or mitochondrial respiration have both previously been linked to bradyzoite conversion (Pamukcu et al., 2021; Tomavo and Boothroyd, 1995), however we do not yet know the signalling factors linking iron, zinc or mitochondrial function to bradyzoite differentiation”.

      (9) As a general comment on manuscript formatting, providing page and line numbers would significantly improve the manuscript's readability and allow reviewers to more easily reference specific sections. This would help address the minor issues of typos (e.g., multiple occurrences of "promotor"). I suggest a careful read-through to correct these issues. 

      We thank the reviewer for this comment and in the resubmitted version we have corrected these issues. 

      Reviewer #2 (Recommendations for the authors): 

      (1) In the alignment (Figure 1a), the BPZIP sequence is from which organism (genus, species)? It would be helpful to include this information in the figure legend.

      Apologies for this oversight, this figure and section have been reworked and the species name (Bordetella bronchiseptica) added.

      (2) In reference to Figure 1a, the authors state, "Interestingly, all parasite ZIP-domain proteins examined have a HK motif at the M2 metal binding". I was wondering if by "all" the authors mean Toxoplasma and Plasmodium falciparum (shown in Figure 1a) or did the authors also look at other apicomplexan parasites such as Cryptosporidium or Neospora? Is this a general feature of apicomplexan parasites? 

      We looked at this, and the HK motif in the M2 binding site is conserved in Neospora Cryptosporidium, and even the digenic gregarine Porospora cf. gigantea. However, in the more distantly related Chromera we find a HH motif at the same position. This suggests that the HK motif is present in the Apicomplexa, but not conserved in the free-living Alveolata. Although we cannot speculate on the role of this motif currently, its role in metal import in Apicomplexa does deserve future scrutiny. To reflect this finding we have modified Figure 1a and the text.

      (3) In Figure 1e, to better visualize the ZFT-3HA staining at the basal pole, it would be better to omit the DAPI staining from the merged image. It is difficult to see the ZFT staining in the image of the large vacuole.

      We have removed the DAPI from this image to improve clarity.

      (4) Based on the "delayed-death" phenotype of the apicoplast, it is not surprising that no defects were observed in CPN60 processing or protein lipoylation. Have the authors considered measuring these phenotypes after a further round of growth (as was done for visualizing apicoplast morphology)? 

      We agree that changes in apicoplast function are often only seen in the second round of replication. However, here we wanted to check if ZFT depletion led to immediate changes in function of the organelle, which was not the case. It is highly likely that after the second round, we would see significant defects in the apicoplast function, however given the immediate importance of iron and zinc to many processes within the parasite, we believe that these experiments would be complicated to interpret.

      (5) Depleting ZFT led to a reduction in expression levels for the mitochondrial Fe-S protein SDHB but not for a cytosolic Fe-S protein. Is it expected that less intracellular iron (via depleted ZFT) would differentially affect mitochondrial versus cytosolic Fe-S proteins? 

      Previous studies (e.g., Maclean et al., 2024; Renaud et al., 2025) have shown that upon direct inhibition of the cytosolic Fe-S pathway, ABCE1 is fairly stable and levels can persist for 2-3 days post treatment. However, our recent work has shown that rapid and acute depletion of iron directly (though treatment with a chelator) can lead to ABCE1 levels decreasing within 24h (Hanna et al., 2025). In the case of ZFT knockdown, due to the more gradual reduction in iron levels seen (e.g. Figure 7j) we believe the parasites are prioritising key Fe-S pathways (e.g. essential proteostasis through ABCE1), probably while remodelling metabolism (as seen in our Seahorse assays). However, there are many proteins expected to be directly impacted by iron and zinc restriction that these parasites experience, and different protein classes are expected to behave differently in these conditions.

      Reviewer #3 (Recommendations for the authors): 

      (1) Is the effect on the plaque size between T7S4-ZFT (-aTc) in regular and 'high iron' conditions significant? The authors show convincingly that the plaque size is smaller due to the swapped promoter and the resulting overexpression of ZFT. But is the effect aggravated in high iron? This would be expected if excess iron were the problem.

      The plaque sizes are significantly smaller in the T7S4-ZFT line under high iron compared to the untreated condition, and compared to the parental untreated line. However, if we normalise plaque size to untreated conditions for both lines, there is not a significant change in plaque size in high iron between the parental and T7S4-ZFT. This is possibly due to the concentration of iron used (200 mM), which may not be optimal to see this effect, or the time taken for plaque assays (6-7 days), which may allow the excess iron to be stored by the host cells, changing the effective concentration of parasite exposure.

      (2) I struggle to understand the intracellular growth assay in Figure 5b. Here, T7S4-ZFT parasites show 25 % of vacuoles with more than 8 parasites (labelled 8+). But such large vacuoles are not observed in the parental strain. It appears as if the inducible strain grows faster even though it was earlier shown to have a fitness defect (see Figure 3j). Can you please clarify?

      This is a result of rapid growth of the parental line, some vacuoles in this line lysed and initiated a new round of replication at this time point while we saw no evidence at any timepoint that ZFT-depleted parasites were able to lyse the host cell. However, the initial (24-48h post ATc addition) replication rate of the ZFT KD remains similar to the parental. In this panel, we wanted to emphasize that the major phenotype we see upon ZFT depletion is vacuole disorganisation, which we believe is linked to the start of differentiation into bradyzoites.

      (3) Did the authors perform an IFA in addition to the Western blot to localize the 2nd Ty-tagged ZFT copy? It seems important to validate that the protein correctly localizes to the plasma membrane. 

      We have done so and now include these data in Figure S2b. Overexpression of ZFT-Ty localises to internal structures (probably vesicles) with some signal at the periphery, however, this limited expression at the periphery is sufficient to mediate the phenotypes that we see.

      (4) First sentence of the abstract and introduction: The authors speak of metabolism and cellular respiration as though they are two different processes. Is respiration not part of metabolism? 

      This is an excellent point, we wanted to distinguish mitochondrial respiration  from general cellular metabolism, but this was not clear. We have now changed this in the introduction to the below:

      “Iron, and other transition metals such as zinc, manganese and copper, are essential nutrients for almost all life, playing vital roles in biological processes such as DNA replication, translation, and metabolic processes including mitochondrial respiration (Teh et al., 2024)”

      (5) 2nd paragraph of the introduction: toxoplasmosis is written capitalized but should be lower case.

      This has been corrected.

      (6) Figure 4j legend: change 'shits parasites to a more quiescent stage' to 'shifts parasites'.

      This has been corrected, our apologies.

      (7) Please correct the following sentence: 'These data demonstrate ZFT depletion leads to the expression of the bradyzoite-specific markers BAG1 and DBL.' DBL is not expressed by the parasite. It is a lectin that binds to the sugars in the cyst wall.

      We have now modified this in the text. The sentence now reads: “These data show that ZFT depletion leads to the expression of the bradyzoite marker BAG1 and the production of the cyst wall, as detected by DBL”.

      (8) In the section on yeast complementation with TgZFT, the authors write: 'Based on this success, we also attempted to complement...'. Please consider changing 'Success' to something more neutral.

      We have modified the text to now read: “Based on these results, we also attempted to complement”…

      (9) In the discussion, the authors write: 'We see a delayed phenotype on the apicoplast, suggesting that metal import is also required in this organelle, although no apicoplast metal transporters have yet been identified.' Please consider the study Plasmodium falciparum ZIP1 Is a Zinc-Selective Transporter with Stage-Dependent Targeting to the Apicoplast and Plasma Membrane in Erythrocytic Parasites (PMID: (38163252).

      We thank the reviewer for the note and have modified the text to include this and the reference. Please see below:

      “Iron is known to be required in the apicoplast (Renaud et al., 2022), zinc also may be required, as the fitness-conferring Plasmodium zinc transporter ZIP1 is transiently localised to the apicoplast (Shrivastava et al., 2024), although the functional relevance of this localisation has not yet been established”.

      (10) The authors write: 'Iron is known to be required in the apicoplast (Renaud et al., 2022), although a potential role for zinc in this organelle has not yet been established.' The role for zinc in the apicoplast may not have been shown formally, but surely among its hundreds of proteins, and those involved in replication and transcription, there are some that depend on zinc...?

      Yes, we agree it would make sense, however multiple searches using ToxoDB and the datasets from Chen et al (2025) were unable to find any apicoplast-localised proteins with zinc-binding domains. We cannot exclude that zinc is in the apicoplast, and the results from Plasmodium (Shrivastava et al., 2024) may suggest that is, however currently we do not have any evidence for its role within this organelle.

      References

      DiDonato, R.J., Roberts, L.A., Sanderson, T., Eisley, R.B., Walker, E.L., 2004. Arabidopsis Yellow Stripe-Like2 (YSL2): a metal-regulated gene encoding a plasma membrane transporter of nicotianamine-metal complexes. Plant J 39, 403–414. https://doi.org/10.1111/j.1365-313X.2004.02128.x

      Hanna, J.C., Shikha, S., Sloan, M.A., Harding, C.R., 2025. Global translational and metabolic remodelling during iron deprivation in Toxoplasma gondii. https://doi.org/10.1101/2025.08.11.669662

      Maclean, A.E., Sloan, M.A., Renaud, E.A., Argyle, B.E., Lewis, W.H., Ovciarikova, J., Demolombe, V., Waller, R.F., Besteiro, S., Sheiner, L., 2024. The Toxoplasma gondii mitochondrial transporter ABCB7L is essential for the biogenesis of cytosolic and nuclear iron-sulfur cluster proteins and cytosolic translation. mBio 15, e00872-24. https://doi.org/10.1128/mbio.00872-24

      Pamukcu, S., Cerutti, A., Bordat, Y., Hem, S., Rofidal, V., Besteiro, S., 2021. Differential contribution of two organelles of endosymbiotic origin to iron-sulfur cluster synthesis and overall fitness in Toxoplasma. PLoS Pathog 17, e1010096. https://doi.org/10.1371/journal.ppat.1010096

      Renaud, E.A., Maupin, A.J.M., Berry, L., Bals, J., Bordat, Y., Demolombe, V., Rofidal, V., Vignols, F., Besteiro, S., 2025. The HCF101 protein is an important component of the cytosolic iron–sulfur synthesis pathway in Toxoplasma gondii. PLoS Biol 23, e3003028. https://doi.org/10.1371/journal.pbio.3003028

      Shrivastava, D., Jha, A., Kabrambam, R., Vishwakarma, J., Mitra, K., Ramachandran, R., Habib, S., 2024. Plasmodium falciparum ZIP1 Is a Zinc-Selective Transporter with Stage-Dependent Targeting to the Apicoplast and Plasma Membrane in Erythrocytic Parasites. ACS Infect. Dis. 10, 155–169. https://doi.org/10.1021/acsinfecdis.3c00426

      Teh, M.R., Armitage, A.E., Drakesmith, H., 2024. Why cells need iron: a compendium of iron utilisation. Trends in Endocrinology & Metabolism 35, 1026–1049. https://doi.org/10.1016/j.tem.2024.04.015 Tomavo, S., Boothroyd, J.C., 1995. Interconnection between organellar functions, development and drug resistance in the protozoan parasite, Toxoplasma gondii. International Journal for Parasitology 25, 1293–1299. https://doi.org/10.1016/0020-7519(95)00066-B.

    1. Briefing : Renouer avec l'Autorité à l'École avec Jean-Pierre Bellon

      Source : Extraits de "Instant Canopé : renouer avec l'autorité à l'école avec Jean Pierre Bellon"

      Date : Journée nationale de lutte contre le harcèlement scolaire (veille) 2024

      Intervenants :

      • Sophie Courau : Directrice d'ESF sciences humaines, éditrice des ouvrages de Jean-Pierre Bellon.

      • Jean-Pierre Bellon : Professeur de philosophie, pionnier de la lutte contre le harcèlement scolaire en France, auteur de "Renouer avec l'autorité à l'école".

      • Public : Professionnels de l'éducation (enseignants, chefs d'établissement, CPE, directeurs d'école).

      Synthèse :

      Cette discussion avec Jean-Pierre Bellon met en lumière les liens étroits entre la crise de l'autorité à l'école, le harcèlement scolaire et le chahut en classe.

      Bellon, fort de son expérience d'enseignant et de son travail sur le harcèlement, propose dix mesures concrètes pour restaurer un climat scolaire apaisé, insistant sur la nécessité d'une autorité bienveillante alliant courtoisie et fermeté.

      Il critique le manque de formation des enseignants sur la gestion des incivilités et des classes difficiles, le flou entourant la définition et la hiérarchisation des infractions scolaires, et l'inefficacité perçue des sanctions actuelles, notamment dans le premier degré.

      Des propositions sont faites pour repenser les punitions, les sanctions (notamment l'exclusion temporaire), l'utilisation des téléphones portables, l'architecture des établissements et les relations entre l'école et les familles, en plaidant pour des protocoles clairs, une approche collective et une verticalité institutionnelle renforcée.

      Thèmes principaux et Idées clés :

      Lien entre Manque d'Autorité, Harcèlement et Chahut : Bellon établit un lien direct entre la défaillance de l'autorité et les phénomènes de harcèlement et de chahut.

      Les élèves victimes de harcèlement témoignent que la situation est "le pire" dans les classes des professeurs en difficulté.

      Le chahut, tout comme le harcèlement, est un phénomène de groupe, rendant les sanctions individuelles inefficaces et potentiellement contre-productives en provoquant la coalition du groupe.

      Le manque d'autorité se manifeste par le chahut, dont la réalité dans les classes françaises est confirmée par des enquêtes internationales comme PISA. PISA 2022 révèle qu'un lycéen sur deux considère qu'il y a trop de bruit en classe pour entendre le professeur, une situation qualifiée d'"injustice scolaire gigantesque".

      Citation : "Le lien est direct moi le lien que j'ai vu d'abord entre le le harcèlement et et les classes dites dit difficile il était le suivant tous les élèves victime de brimade tous les élèves victimes de harcèlement faisaient tous le même constat c'est que c'était dans la classe du professeur en difficulté lui-même que la situation était le pire."

      Citation : "le harcèlement comme le Chahu sont deux symptômes d'une défaillance de l'autorité tout de même."

      Citation : "PISA 2022 nous apprend qu'un lycéen sur de considère qu'il y a trop de bruit dans sa classe au point qu'il n'entend pas ce que dit le professeur imaginez l'injustice que cela représente."

      La Crise de l'Autorité à l'École, Symptôme d'une Crise Sociétale :

      Bellon reconnaît que l'école n'échappe pas à une crise de l'autorité plus vaste qui touche la société.

      Cependant, il estime que l'école est un lieu où il est possible "d'essayer de faire quelque chose" pour rétablir l'autorité, en raison du contact quotidien et prolongé (de 8h à 17h) avec les jeunes en formation.

      Propositions pour Rétablir l'Autorité :

      Allier Courtoisie et Fermeté : S'inspirant d'Hannah Arendt, l'autorité se situe entre la force/contrainte et la persuasion/négociation.

      Il s'agit de donner des injonctions claires et fermes, tout en maintenant la courtoisie.

      L'objectif est de ne laisser à l'élève récalcitrant que le choix du refus d'obtempérer, qui peut alors être traité formellement.

      Citation : "cette alliance entre courtoisie et fermeté ne laisse à l'élève contrevenant si jeose dire qu'une seule porte de sortie c'est le refus d'obtempéré."

      Citation : "je pense qu'il convient absolument de rétablir au sein de des établissements scolaires des règles de courtoisie de civilité."

      Nécessité de la Formation des Enseignants : Il existe un "scandale" concernant le manque de formation des enseignants à la gestion des comportements difficiles (élèves arrogants, insultants, menaçants). Contrairement à d'autres métiers en contact avec le public, les enseignants ne disposent pas de protocoles de réaction.

      Citation : "le défaut de formation des enseignants il est quand même criant c'est un purure scandale."

      Citation : "on nous a pas formé à un protocole sur comment je réagis lorsque j'ai face à moi un élève arrogant un élève insultant un élève menaçant et cetera."

      Définir et Hiérarchiser les Infractions Scolaires : Il n'existe pas de liste claire et hiérarchisée des incidents scolaires non délictuels.

      Bellon a tenté de le faire (56 incidents listés) pour permettre aux enseignants de savoir à quoi s'attendre et de faire la distinction entre les incidents mineurs et graves.

      Le manque de hiérarchie conduit à des sanctions disproportionnées (ex: lettre d'excuse pour une insulte grave).

      Citation : "on s'était jamais penché sur qu'est-ce que c'est qu'un incident scolaire j'ai essayé de le faire j'ai essayer rédiger la liste des incidents scolaires c'est-à-dire tout ce qui ne devrait pas se faire se produire dans une classe dans un établissement dans une école et cetera je suis arrivé à 56 inincid."

      Citation : "Avouez quand même que oublier son livre et insulter un professeur c'est pas tout à fait de même nature."

      Tolérance Zéro et Signalement Systématique : Toutes les infractions, même mineures, doivent être systématiquement signalées.

      Cela permettrait d'informer l'opinion, les parents et les élèves, et de s'accorder sur une "échelle" de gravité et de sanctions appropriées ("un barème").

      Citation : "non pas forcément mais elles doivent être systématiquement signalé il faut en laisser passer aucune faut une tolérance zéro à cet égard il faut un signalement systématique de toutes les infractions."

      Réformer le Système de Sanctions (Punitions vs Sanctions) :

      La distinction actuelle entre punitions (données par tout personnel) et sanctions (données par le chef d'établissement) est jugée désordonnée et inefficace.

      Les enseignants sont mis en difficulté en devant gérer seuls les punitions.

      Bellon suggère de supprimer le pouvoir discrétionnaire de punition des enseignants.

      Citation : "ce désordre de la distinction entre les punitions et les sanctions c'est quelque chose qui a fait son temps qui n'a plus de sens."

      Citation : "je suggère qu'on leur retire ce pouvoir discrétionnaire dont ils ne savent pas quoi faire d'ailleurs et qu' est met à risque parce que l'occasion de la sanction en classe c'est un risque authentique pour les professeurs."

      Confier la Proposition de Sanctions à une Commission : Il est proposé de créer une commission (composée de différents professionnels de l'établissement) qui examinerait tous les incidents signalés et ferait une proposition de sanction au chef d'établissement.

      Cela permettrait une plus grande cohérence et déchargerait l'enseignant.

      Adapter la Sanction dans le Primaire : La situation dans le premier degré est décrite comme un "désordre absolu" en matière de sanctions, avec un manque criant de "vie scolaire" et de lieux dédiés pour gérer les élèves perturbateurs. La solitude des professeurs des écoles est soulignée.

      Repenser l'Exclusion Temporaire (Exclusion Internée) : L'exclusion sèche est paradoxale et inefficace, surtout pour les élèves en difficulté.

      L'exclusion devrait être "internée", c'est-à-dire que l'élève reste dans l'établissement mais avec des contraintes (horaires décalés, lieu spécifique, travail différent).

      Il s'agit d'une pratique héritée des Bénédictins (règle de Saint-Benoît). La sanction doit être "frustrante" (enlever quelque chose) et "signifiante" (verbalisée).

      Citation : "renvoyer un garçon ou une fille chez lui ça peut pas marcher ben c'est bien précisément le contraire qu'il faut faire."

      Citation : "l'idée de que l'exclusion soit internée c'est-à-dire que l'élève a l'obligation devenenir en cours mais il pourra peut-être pas faire exactement les mêmes choses."

      Citation : "pour une sanction pour qu'elle soit éducative faut qu'elle soit dite faut qu'elle soit mise en mot faut qu'elle soit verbalisée... il faut qu'aussi la sanction elle soit frustrante."

      Gérer l'Utilisation du Téléphone Portable : Il n'y a pas de "ligne claire" en France sur ce sujet.

      Bellon préconise une règle simple et ferme : téléphone éteint dans le sac au fond de la pièce, comme lors d'un examen.

      Toute utilisation entraîne un signalement et une sanction. Le téléphone en classe représente un risque pour les enseignants.

      L'exemple d'un incident où une enseignante est insultée après avoir demandé à un élève de ranger son téléphone illustre cette difficulté.

      Citation : "sur la question du téléphone portable il faut quand même avoir une ligne claire en France on n'a pas de ligne clair."

      Citation : "franchement les téléphones portables ils n'ont rien à faire à l'école."

      Aménager l'Espace et le Temps Scolaire (Architecture des Établissements) : L'architecture actuelle des établissements peut être propice aux problèmes d'autorité (ex: salle des profs inadaptée, cours de récréation regroupant tous les élèves).

      Bellon suggère de repenser les espaces pour les adapter aux besoins (bureaux pour les enseignants, plusieurs cours de récréation, lieux dédiés pour les sanctions).

      Ce point est considéré comme un projet à plus long terme ("le lycée Hannah Arendt").

      Citation : "tous les architectes scolaires d'ailleurs j'observent que les architectes scolaires c'est pas toujours ils interrogent pas toujours les professeurs pour c'est quand même énorme ça quand on construit une maison en général on s'intéresse à aux habitants là non pas trop."

      Améliorer les Relations École-Familles : Les relations entre l'école et les parents sont souvent tendues, avec des directeurs d'école et enseignants confrontés à des incivilités et des contestations (44% des directeurs d'école insultés selon une étude).

      Bellon recommande l'établissement de protocoles pour accueillir les familles et gérer les situations difficiles.

      Il souligne que les professionnels du primaire sont plus exposés en l'absence de sas d'accueil.

      Citation : "vous pointez les mauvaises relations qu'entretiennent trop souvent les parents avec l'institution scolaire et vous citez notamment une étude de George futinos selon laquelle 44 % des directeurs d'école ont déclaré avoir été insultés par des parents d'élèves."

      Citation : "il faut vraiment là encore avoir un protocole pour réagir face aux incivilités."

      Principes de la Sanction Éducative :

      S'inspirant d'Éric Prerat, Bellon insiste sur trois aspects d'une sanction éducative :

      • Signifiante : Elle doit être dite, verbalisée, prononcée avec gravité et solennité, en distinguant l'infraction de la personne.

      • Frustrante : Elle doit enlever quelque chose à l'élève (un droit, un avantage, la participation à une activité).

      • Réparatrice : Elle doit inclure une dimension de réparation, en lien avec le préjudice causé à la vie collective ou aux personnes (excuses, réparation matérielle).

      Citation : "Eric prerat dit que pour une sanction pour qu'elle soit éducative faut qu'elle soit dite faut qu'elle soit mise en mot faut qu'elle soit verbalisée."

      Citation : "il faut qu'aussi la sanction elle soit frustrante faut qu'on m'enlève quelque chose."

      Citation : "il faut que la sanction elle est une dimension réparatrice."

      Nécessité d'une Approche Collective et Institutionnelle :

      Les directeurs d'école, en particulier, manquent de pouvoir hiérarchique et de protection institutionnelle face aux contestations des parents.

      L'idée de rattacher les écoles aux collèges (proposée par un ancien ministre) aurait pu offrir une direction institutionnelle et une vie scolaire dans le primaire.

      La contestation des parents est également vue comme un symptôme d'une forte inquiétude quant à l'avenir de leurs enfants, qui se manifeste par une contestation systématique de la moindre sanction.

      Il est crucial qu'il y ait une "verticalité" et une "protection systématique" de la part de l'institution (rectorat, dasen, chefs d'établissement) pour soutenir les enseignants face aux contestations et aux pressions.

      Les failles institutionnelles peuvent être exploitées par des "adversaires de l'école".

      Citation : "il faudrait renforcer le pouvoir hiérarchique des directeurs du Prim."

      Citation : "il faut une verticalité il faut qu'il y ait des choses qui ne se négocient pas."

      Certaines règles devraient être non négociables : contenu des enseignements, règles de civilité/courtoisie, respect absolu des personnes, tenue correcte.

      Les enseignants sont des modèles pour les élèves et doivent également faire preuve d'élégance et de distinction.

      Le travail de rétablissement de l'autorité et de gestion des difficultés doit être "construit collectivement" au sein des établissements, en s'inspirant des erreurs et des bonnes pratiques.

      Le dispositif SCORE (inspiré de la préoccupation partagée) pour les classes difficiles est présenté comme un exemple d'approche collective et axée sur la recherche de solutions par les élèves eux-mêmes.

      Points d'action suggérés par Jean-Pierre Bellon :

      Développer des formations spécifiques pour les enseignants sur la gestion des comportements difficiles et l'application de protocoles.

      Établir une liste claire et hiérarchisée des infractions scolaires.

      Mettre en place un système de signalement systématique de toutes les infractions.

      Supprimer le pouvoir discrétionnaire de punition des enseignants et le confier, ainsi que la proposition de sanction, à une commission dédiée.

      Repenser et adapter les sanctions dans le premier degré, notamment en explorant l'idée d'espaces et d'organisation permettant des sanctions frustrantes (ex: récréation décalée).

      Généraliser l'exclusion temporaire "internée". Instaurer une règle claire et ferme concernant l'utilisation des téléphones portables en classe.

      Engager une réflexion à long terme sur l'architecture des établissements pour mieux gérer l'espace et le temps scolaire.

      Développer des protocoles d'accueil et de gestion des relations avec les familles, notamment dans le primaire, et renforcer le soutien institutionnel aux professionnels de terrain.

      Construire collectivement au sein des établissements des chartes ou des règles non négociables concernant le comportement, le respect et la tenue.

      Utiliser des dispositifs collectifs comme SCORE pour gérer les classes difficiles.

      Conclusion :

      Jean-Pierre Bellon propose une approche globale pour faire face à la crise de l'autorité à l'école, en identifiant les liens entre ce phénomène, le harcèlement et le chahut.

      Ses propositions visent à professionnaliser la gestion des incivilités et des conflits, à clarifier les règles et les conséquences des manquements, à repenser les sanctions pour les rendre plus éducatives, et à renforcer la protection et le soutien institutionnel des professionnels de l'éducation.

      L'accent est mis sur la nécessité d'une action collective au sein des établissements et d'une réaffirmation de la verticalité institutionnelle pour faire face aux contestations et garantir un climat scolaire apaisé.

    1. Note d'information : Éco-délégués, le pouvoir d'agir

      Résumé Exécutif

      Cette note d'information synthétise les perspectives et les analyses issues du podcast "Éco-délégués : donnons-leur le pouvoir d'agir".

      Le document met en lumière le rôle complexe des éco-délégués, les attentes élevées placées en eux — qualifiés de "héros ordinaires" — et les multiples facettes de leur engagement.

      Les motivations des élèves sont profondes, allant du désir d'agir pour la planète, souvent nourri par une certaine éco-anxiété, à un sentiment de responsabilité et à l'influence de leur entourage.

      L'analyse révèle que l'efficacité du dispositif repose de manière critique sur l'accompagnement des adultes. Un écueil majeur, l'« adultisme », où les projets sont imposés par les adultes, doit être évité au profit d'une approche qui laisse les élèves proposer, construire et piloter leurs propres initiatives.

      Le rôle du référent est de trouver un équilibre délicat entre l'écoute, le soutien logistique et l'impulsion, afin de transformer les idées en actions concrètes.

      Les projets menés varient considérablement, des éco-gestes classiques (tri des déchets) à des transformations ambitieuses de l'établissement (végétalisation de la cour, création de zones de bien-être), s'étendant au-delà de l'écologie pour englober l'ensemble des Objectifs de Développement Durable (ODD), comme l'égalité filles-garçons.

      Cependant, de nombreux obstacles freinent leur action : la lenteur administrative ("le temps des adultes"), les contraintes financières, le manque de reconnaissance par les pairs et l'absence d'un temps institutionnalisé pour leurs activités.

      La dynamique dépend fortement de la gouvernance de l'établissement, décrite comme des "montagnes russes".

      Malgré ces défis, l'engagement en tant qu'éco-délégué est profondément formateur. Il développe la confiance en soi, le sens de la citoyenneté et le "pouvoir d'agir" des élèves.

      Ce dispositif transforme également les adultes impliqués, modifiant leur regard sur les élèves et leurs propres pratiques pédagogiques, et a le potentiel de catalyser un changement positif à l'échelle de l'établissement et du territoire.

      --------------------------------------------------------------------------------

      Analyse Détaillée

      1. Portrait de l'Éco-délégué : Motivations et Identité

      Les Motivations de l'Engagement

      L'engagement des élèves en tant qu'éco-délégués est mû par un ensemble de motivations profondes, identifiées par la chercheuse Eveline Bois :

      Agir pour la planète : La motivation principale est la volonté de "sauver le monde", de le changer à leur échelle.

      Les élèves expriment une conscience aiguë de la dégradation de l'environnement et de l'urgence climatique, ce qui peut générer de l'éco-anxiété. L'action devient alors un moyen de la combattre.

      Sentiment de responsabilité : Les jeunes se sentent responsables de l'avenir et perçoivent l'établissement scolaire comme une bonne échelle pour commencer à agir.

      Influences externes : La famille et les amis jouent un rôle significatif. Des élèves s'engagent pour suivre l'exemple de parents impliqués dans des associations (ex: "zéro déchet") ou pour partager une expérience avec leurs camarades.

      Utilité et participation : Comme l'exprime Laur, éco-déléguée depuis quatre ans, le désir de "se rendre utile à la vie au collège" et de s'investir est un moteur important.

      Méthodes de Recrutement et Profils

      Le mode de désignation des éco-délégués influence la dynamique du groupe. Sandrine Aoussour, enseignante référente, a opté dans son collège pour un système basé sur le volontariat, ouvert tout au long de l'année. Ce choix vise à :

      • Garantir d'avoir des élèves "réellement motivés".

      • Éviter la compétition inhérente à une élection.

      • Créer un "noyau vraiment d'élèves motivés" tout en permettant une flexibilité (possibilité de rejoindre ou de quitter le groupe).

      2. Le Rôle Crucial de l'Accompagnement Adulte

      L'Écueil de l'« Adultisme »

      Eveline Bois met en garde contre l'« adultisme », une tendance des adultes à concevoir des dispositifs pour les élèves en minimisant leur capacité à faire des choix et des propositions.

      Le témoignage du professeur Raphaël Grass est emblématique : il a commencé par apporter lui-même les projets avant de réaliser qu'ils ne correspondaient pas aux attentes des élèves et de leur "donner la parole".

      Conséquence : Un décalage se crée entre les désirs des élèves (sauver le monde) et les actions qu'on leur propose (installer un cendrier devant le lycée).

      Solution : L'accompagnement doit évoluer pour faire confiance aux élèves et leur laisser l'initiative.

      Le Référent : un Équilibriste entre Soutien et Autonomie

      Le rôle de l'enseignant ou du CPE référent est central et complexe. Il ne s'agit pas de diriger mais de faciliter.

      Partir des préoccupations des élèves : Sandrine Aoussour insiste sur l'importance de partir des idées des élèves (créer un potager, un "coin zen") et d'aider à les concrétiser en trouvant des solutions (budgets participatifs, partenariats).

      Proposer sans imposer : L'adulte peut aussi être force de proposition (installation d'une ruche via une fondation), mais ces propositions sont soumises aux élèves.

      Besoin d'adultes "entreprenants" : Du point de vue de l'éco-déléguée Laur, les élèves attendent des adultes qu'ils soient encore plus proactifs pour les aider à réaliser leurs projets les plus ambitieux, comme la végétalisation de la cour.

      Les témoignages d'anciens éco-délégués de lycée confirment ce besoin d'un équilibre : ils préconisent une "instance autonome avec une certaine flexibilité", où ils peuvent travailler seuls pour libérer la parole, tout en bénéficiant de l'accompagnement des adultes pour les aspects logistiques et financiers.

      3. Des Éco-gestes à la Transformation Durable

      Le Spectre des Actions

      Les projets menés par les éco-délégués couvrent un large éventail, de l'action symbolique à la transformation structurelle de l'établissement.

      | Type d'Action | Exemples Concrets du Podcast | | --- | --- | | Éco-gestes classiques | Tri des bouchons, du papier, lutte contre le gaspillage alimentaire, ramassage des poubelles de tri. | | Amélioration de l'environnement scolaire | Installation d'un "coin zen", de plantes dans les classes, d'un apiscope, d'hôtels à insectes. | | Projets ambitieux et structurels | Création d'un potager, d'une zone de biodiversité, projet de végétalisation de la cour de récréation. | | Sensibilisation et citoyenneté | Campagnes de sensibilisation dans les classes, organisation d'une "manif au collège pour le climat". | | Actions sociales (ODD) | Recherche de sponsors pour un distributeur de protections périodiques, aménagement de la cour pour une meilleure égalité filles-garçons, collecte pour le Secours populaire. |

      Dépasser la "Vitrine Verte"

      Eveline Bois souligne le risque que les projets ne soient qu'une "vitrine verte". Le passage à une transformation durable dépend de plusieurs facteurs :

      1. La qualité de l'accompagnement : Un bon accompagnement permet de dépasser les actions de surface pour s'attaquer à des projets de plus grande ampleur.

      2. L'élargissement des thématiques : L'engagement va au-delà de l'écologie stricte pour inclure les 17 ODD, comme le bien-être animal ou l'égalité des genres.

      Sandrine Aoussour cite l'exemple d'un projet de réaménagement de la cour initié par les filles pour contrer l'occupation de l'espace par les garçons.

      3. Le frottement au réel : Les projets ambitieux confrontent les élèves aux réalités du monde adulte : recherche de financements (devis, sponsors), complexité des règles (mobilier urbain), et temporalité administrative.

      4. Obstacles et Limites à l'Action

      L'engagement des éco-délégués se heurte à des difficultés systémiques et culturelles.

      Le Temps et l'Argent :

      Le "temps des adultes" : Les élèves découvrent la lenteur des processus de décision et de mise en œuvre, ce qui peut être une source de frustration.   

      Le financement : La recherche de fonds est un obstacle majeur. Les élèves réalisent que les projets ont un coût élevé (ex : "un banc ça coûte extrêmement cher").   

      L'emploi du temps : Il n'y a pas de temps institutionnel dédié. Les réunions ont lieu sur la pause méridienne, après les cours ou, plus rarement, sur le temps de classe, ce qui pose des questions d'organisation et d'équité.

      Les Freins Institutionnels et Sociaux :

      La gouvernance : Le soutien de la direction est crucial mais fluctuant. Sandrine Aoussour parle de "montagnes russes" selon les équipes de direction en place.  

      Le manque de reconnaissance : Les éco-délégués peuvent souffrir d'un manque de reconnaissance de la part de leurs camarades ("vous servez à quoi, il y a déjà les délégués").  

      La valorisation : La question de la valorisation de leur engagement (par exemple, sur le dossier scolaire) reste à creuser pour éviter le désengagement.

      5. Le "Pouvoir d'Agir" : Impacts et Bénéfices

      Malgré les obstacles, le dispositif, lorsqu'il fonctionne bien, a un impact profondément positif sur tous les acteurs.

      Pour les élèves :

      Développement personnel : Gain de confiance en soi, joie de partager et de réaliser des projets collectifs.  

      Développement de compétences : Prise de parole en public, gestion de projet, argumentation, etc.    ◦ Développement citoyen : Le dispositif est un apprentissage concret de la citoyenneté. Certains élèves poursuivent leur engagement en dehors du collège (ex: au Secours populaire).  

      Sentiment d'empowerment : "Les jeunes interrogés qui se sentent libres et à qui on fait confiance entretiennent un fort sentiment du pouvoir agir" (Eveline Bois).

      Pour les enseignants et l'établissement :

      Épanouissement professionnel : Les référents parlent de "joie" et de "contact privilégié" avec les élèves.  

      Transformation des pratiques : L'engagement en tant que référent modifie le regard des adultes sur le potentiel des élèves et peut transformer leurs pratiques de classe.   

      Dynamique collective : Un projet réussi peut rayonner et impliquer toute la communauté éducative (gestionnaires, direction, agents), devenant un véritable projet d'établissement.

      Concepts Clés et Inspirations

      De la responsabilité de surface à la responsabilité intégrale : Eveline Bois cite la chercheuse Luce Sauvé pour distinguer deux approches de l'écocitoyenneté :

      1. Responsabilité de surface : Limitée aux "bons gestes" et à une vision normative (écocivisme).

      2. Responsabilité intégrale : Implique une "réflexion critique, un pouvoir d'agir et la participation à la vie démocratique".

      L'objectif est de tendre vers cette seconde approche.

      Le bonheur comme projet collectif : Sandrine Aoussour s'inspire d'un rapport de l'UNESCO pour souligner que le bonheur à l'école est un projet communautaire.

      La "joie de se découvrir capable d'être au service d'un collectif" est un levier d'apprentissage puissant.

      L'empouvoirement : Ce néologisme résume l'objectif final du dispositif : donner réellement du pouvoir aux élèves et aux enseignants pour qu'ils deviennent les moteurs du changement.

    1. Droits de l'enfant : Transformer l'École de l'intérieur

      Synthèse exécutive

      Ce document de synthèse analyse les stratégies et les impacts de l'intégration des droits de l'enfant au cœur du fonctionnement de l'école.

      Basée sur les témoignages d'experts et de praticiens, l'analyse révèle que si la France a ratifié la Convention Internationale des Droits de l'Enfant (CIDE) depuis 1990, son application reste inégale, notamment en ce qui concerne le droit à l'expression et à la participation des élèves.

      L'approche préconisée dépasse le simple enseignement théorique des droits pour les incarner dans la posture des adultes, les relations interpersonnelles et l'organisation même de l'établissement.

      Le programme "École amie des droits de l'enfant" de l'UNICEF sert de modèle central, illustrant une démarche qui vise un changement de culture profond et durable.

      Cette méthode s'appuie sur un diagnostic participatif, l'implication de toute la communauté éducative (enseignants, élèves, personnels, parents) et l'utilisation d'outils concrets comme la "marche exploratoire" pour évaluer l'environnement scolaire du point de vue de l'enfant.

      Les bénéfices identifiés sont significatifs : amélioration notable du climat scolaire, renforcement du respect de soi et des autres, et développement précoce des compétences citoyennes.

      Les données issues d'expériences internationales démontrent une augmentation du sentiment de sécurité et de l'écoute perçue par les élèves, ainsi que de leur capacité à influencer les décisions qui les concernent.

      Cependant, la mise en œuvre se heurte à des défis majeurs, tels que la prévalence de l' "adultisme" – la tendance des adultes à décider à la place des enfants – et la perception d'une surcharge de travail pour les enseignants.

      La clé du succès réside dans un engagement sur le temps long, considérant ces programmes non comme une initiative ponctuelle mais comme un investissement fondamental pour former des citoyens actifs et responsables.

      État des lieux des droits de l'enfant dans le système éducatif français

      La Convention Internationale des Droits de l'Enfant (CIDE) : Un cadre juridique sous-appliqué

      La CIDE, adoptée par les Nations Unies en 1989 et ratifiée par la France en 1990, constitue le socle juridique des droits de l'enfant.

      Ce texte de 54 articles protège les individus de 0 à 18 ans et couvre l'ensemble de leurs droits fondamentaux.

      Cependant, selon Valérie Becket, professeure en sciences de l'éducation, l'application de cette convention en France est "inégale selon les domaines".

      La France n'est pas considérée comme un "bon élève", particulièrement sur les enjeux d'expression et de participation.

      Des enquêtes comparatives à l'échelle européenne montrent que, malgré l'existence de dispositifs comme les conseils d'école ou les conseils d'enfants, un décalage persiste entre les droits permis et le ressenti réel des enfants, plaçant parfois la France en bas du classement.

      Julie Zarlot, de l'UNICEF France, précise que si la France est exemplaire dans certains domaines comme le droit global à la santé ou à l'éducation, des manques subsistent pour certains enfants qui n'ont pas un accès suffisant à l'école, à la santé ou à la protection.

      La perception des droits à l'école

      L'environnement scolaire présente des tensions inhérentes à l'application des droits de l'enfant. Richard Côtier, directeur d'école, souligne que "l'organisation prend le pas sur le respect de chacun".

      La focalisation sur les objectifs d'apprentissage peut parfois occulter la nécessité de garantir les droits fondamentaux des élèves.

      L'équilibre Droits/Devoirs : Une réaction fréquente des adultes (enseignants, parents) à l'évocation des droits de l'enfant est la question des devoirs.

      La réponse apportée est que le droit de l'un implique le devoir pour l'autre de le respecter. "Le devoir, c'est le devoir de respecter les droits de tous, y compris les siens propres et ceux des autres."

      Écart de perception : Les diagnostics menés en amont des projets révèlent souvent un décalage entre la perception de l'école par les élèves, qui la vivent de l'intérieur, et celle de leurs parents, qui sont à l'extérieur.

      Cette différence justifie la nécessité de recueillir le point de vue de toutes les parties prenantes.

      Le programme "École amie des droits de l'enfant" : Une approche transformative

      Philosophie et approche pédagogique

      Le programme de l'UNICEF est présenté comme une démarche de prévention positive.

      Plutôt que de se concentrer sur la lutte contre des problèmes (comme le harcèlement) par une approche "par la négative", il vise à "motiver tout le monde pour faire en sorte que les droits de tous soient respectés".

      L'approche pédagogique de l'UNICEF, qualifiée d'"approche par les droits", repose sur trois piliers :

      1. Apprendre sur les droits : Acquérir la connaissance de la CIDE.

      2. Apprendre par les droits : Expérimenter les droits dans la pratique quotidienne, via la posture de l'enseignant et le fonctionnement de l'école.

      3. Apprendre pour les droits : Devenir capable de défendre ses propres droits et ceux des autres.

      L'objectif est un "changement de comportement" et un "renforcement des capacités" des adultes comme des enfants. Il ne s'agit pas simplement d'un apport de connaissances, mais d'une transformation profonde du fonctionnement de l'école.

      Mise en œuvre concrète à l'école L. Martine

      L'école dirigée par Richard Côtier, engagée dans le programme depuis un an et demi, illustre cette mise en œuvre.

      Comité de Pilotage (Copil) : Un comité a été créé pour piloter le projet, rebaptisé "Conseil de vie citoyenne" pour préparer les élèves au collège.

      Sa particularité est d'inclure un large éventail d'acteurs : élèves, enseignants, AVS, personnel d'entretien, animateurs du périscolaire.

      Ce lieu permet de "penser tous avec nos regards différents le fonctionnement de l'école du point de vue des droits".

      La "Marche Exploratoire" : Cet outil concret consiste à parcourir l'école en se posant des questions spécifiques sous l'angle d'un droit (ex: la sécurité).

      Les élèves et adultes observent et analysent des lieux précis pour déterminer s'ils s'y sentent en sécurité, si les adultes sont perçus comme un secours potentiel, etc.

      Cette démarche permet d'objectiver le diagnostic initial en se basant sur la perception et le vécu de l'enfant.

      Une approche modeste et progressive : La première phase a consisté à assurer une formation sur les droits de l'enfant dans toutes les classes et à mettre en place les structures participatives.

      L'accent est mis sur la modestie des objectifs annuels pour assurer leur réalisation concrète et maintenir la confiance dans le processus.

      Le temps long comme condition du succès

      Richard Côtier insiste sur le fait que la transformation d'une culture scolaire est un processus long.

      Il considère la durée de trois ans du programme UNICEF comme "juste la piqûre, juste le vaccin". Selon lui, il faudra "peut-être encore 5, 10 ans derrière" pour qu'un établissement puisse affirmer avoir durablement intégré cette culture.

      Le but est de créer une dynamique pérenne où la communauté éducative constate un changement profond et irréversible dans son fonctionnement.

      Impacts, défis et généralisation

      Impacts mesurables sur le climat scolaire et les élèves

      L'expérience du Royaume-Uni, où le programme existe depuis plus de dix ans dans 4500 écoles, fournit des données quantitatives sur son impact.

      | Indicateur d'impact (Évaluation au Royaume-Uni) | Chiffres clés | | --- | --- | | Amélioration du respect de soi et des autres | 93 % des enfants | | Augmentation du sentiment de sécurité à l'école | \+ 5 % | | Augmentation du sentiment d'être écouté à l'école | \+ 5 % | | Augmentation de la capacité à influencer les décisions | \+ 14 % des enfants | | Augmentation du sentiment d'être respecté par les adultes/pairs | \+ 11 % | | Connaissance supérieure de leurs droits | \+ 37 % des enfants |

      Au-delà des chiffres, l'impact qualitatif est la formation de futurs citoyens qui ne sont pas "relativement passifs", mais qui ont expérimenté que leur parole peut avoir un effet sur le monde qui les entoure et que le changement nécessite un engagement collectif.

      Surmonter les freins et les obstacles

      L' "Adultisme" et la peur de la contestation : Valérie Becket identifie un frein majeur dans la tendance des adultes à voir les "risques" (désordre, contestation) de la participation des élèves plutôt que les "bénéfices" à long terme.

      Cette posture, qui consiste à décider "à la place de l'enfant", peut priver ce dernier d'expériences nécessaires à son développement.

      La charge de travail des enseignants : La crainte que ces programmes représentent une "couche" supplémentaire de travail est une objection fréquente. Julie Zarlot répond que les outils pédagogiques de l'UNICEF sont conçus pour être directement liés aux programmes scolaires, permettant aux enseignants de "piocher" dans différentes disciplines pour illustrer les droits "sans en avoir l'air".

      Au-delà du primaire : Application au collège et au lycée

      Valérie Becket note que l'enseignement secondaire dispose déjà de nombreuses structures de participation (Conseil de la Vie Collégienne, Conseil de la Vie Lycéenne, délégués). Cependant, leur existence ne garantit pas une meilleure application des droits ni une meilleure écoute des élèves.

      Elle suggère que des outils comme la "marche exploratoire" seraient très pertinents pour les adolescents afin d'analyser leur vécu de l'établissement.

      Surtout, elle insiste sur la nécessité de créer des "passerelles" entre le primaire et le secondaire pour assurer une continuité. Sans cela, un élève habitué à participer et à être écouté risque de subir un choc ("patatra") en arrivant dans un environnement où il "ne peut plus rien dire".

      Citations clés

      Valérie Becket, sur le droit le plus important à travailler à l'école : "Le droit d'avoir un point de vue."

      Richard Côtier, sur la nécessité d'un engagement sur le long terme : "Le programme UNICEF par exemple il est prévu sur 3 ans et moi je pense que 3 ans c'est juste la piqûre, c'est juste le vaccin.

      Ce n'est pas le temps qu'il va falloir pour construire un système où vraiment on aura pris en compte ce phénomène là."

      Julie Zarlot, sur l'intégration des droits dans le quotidien : "On peut parler des droits de l'enfant et les rendre quotidien, effectif, presque sans en avoir l'air."

      Richard Côtier, sur le risque de ne pas favoriser la participation : "Le risque de faire grandir des élèves qui sont pas dans la participation [...] ça veut dire que on fait des enfants qui sont relativement passifs, qui laissent prendre les autres des initiatives parce que finalement on leur demande pas leur avis."

    1. Santé Mentale à l'École : État des Lieux, Enjeux et Stratégies

      Résumé Exécutif

      Ce document de synthèse analyse l'état critique de la santé mentale des élèves dans le système éducatif français, en s'appuyant sur les témoignages d'experts de terrain.

      Il met en lumière une crise croissante, caractérisée par une augmentation des troubles dépressifs, des addictions et des tentatives de suicide chez les jeunes.

      Face à ce phénomène, l'institution scolaire, bien que consciente de l'enjeu, peine à déployer une réponse à la hauteur, confrontée à un manque de ressources humaines (médecins, infirmières scolaires) et à un déficit de formation généralisée des personnels.

      Dans ce contexte, les enseignants se retrouvent en première ligne, agissant comme des "sentinelles" essentielles mais souvent démunies.

      Deux stratégies d'action complémentaires émergent :

      d'une part, la structuration de l'intervention par des formations certifiées en premiers secours en santé mentale, comme le protocole "AÉRÉ" de PSSM France, qui vise à donner aux adultes un cadre d'action sécurisé.

      D'autre part, le développement de projets pédagogiques holistiques, à l'image de l'initiative "On se bouge", qui intègrent le bien-être et le "vivre ensemble" au cœur des apprentissages, améliorant ainsi la qualité de vie des élèves et des équipes.

      La conclusion est claire : une approche combinant la formation des adultes, la création d'un environnement scolaire bienveillant et l'implication des jeunes eux-mêmes est indispensable pour transformer l'école en un lieu promoteur de santé mentale.

      --------------------------------------------------------------------------------

      I. Le Constat : Une Crise de Santé Mentale Croissante chez les Jeunes

      A. L'Ampleur du Phénomène

      La question de la santé mentale des jeunes n'est plus un sujet tabou et s'impose avec une urgence inédite, au point d'être désignée "grande cause nationale pour 2025".

      Les professionnels de l'éducation dressent un constat alarmant, corroboré par de multiples études.

      Une souffrance inédite : Damien Duran, IAPR Établissement et Vie Scolaire, témoigne : « Je travaille dans l'éducation nationale depuis 1979 [...] et je n'ai jamais vu autant de jeunes en souffrance dans les établissements scolaires : de jeunes dépressifs, ayant des troubles du comportement, des addictions diverses et beaucoup de tentatives de suicide. »

      Une visibilité accrue : Anaïs Mangin, professeure d'EPS, observe que les élèves « vont de plus en plus mal et le montrent corporellement ou même via les réseaux sociaux ».

      Une problématique précoce : La crise ne se limite pas aux adolescents.

      Les phénomènes de harcèlement et les troubles du comportement sont de plus en plus présents et "massifiés" dès le premier degré (école maternelle et élémentaire).

      B. Les Facteurs Aggravants Identifiés

      Plusieurs facteurs sociétaux et relationnels sont identifiés comme contribuant à la détérioration de la santé mentale des élèves.

      L'impact des réseaux sociaux : La frontière entre la sphère scolaire et la sphère privée s'est estompée, privant les jeunes de moments de répit.

      Selon Anaïs Mangin, « les jeunes n'ont pas de pause en fait ».

      La défiance envers les adultes : Un point jugé fondamental par Damien Duran est la « défiance croissante à l'égard des adultes ».

      Cette perte de confiance constitue « le terreau de l'agressivité » et une marque d'inquiétude face à l'avenir.

      II. La Réponse Institutionnelle : Entre Prise de Conscience et Manque de Moyens

      L'Éducation Nationale reconnaît l'ampleur du défi, mais sa capacité d'action reste limitée par des contraintes structurelles et un manque de préparation historique.

      A. Une Institution en Difficulté

      Selon Damien Duran, la réponse institutionnelle est actuellement « faible par rapport à l'ampleur du phénomène ». Plusieurs points de friction sont soulignés :

      Pénurie de personnel qualifié : L'institution fait face à un manque criant de médecins scolaires, difficiles à recruter, et à un nombre insuffisant d'infirmières, qui ne sont pas présentes dans tous les établissements.

      Déficit de formation : La majorité des personnels n'a pas été formée pour aborder ces problématiques.

      Des plans de formation se déploient progressivement, sur le modèle du programme Phare contre le harcèlement, mais cela « prend du temps ».

      B. Le Rôle Central mais Complexe des Enseignants

      Les enseignants sont au cœur du dispositif de repérage et de premier soutien, un rôle qu'ils assument avec engagement malgré les difficultés.

      Des "sentinelles" en première ligne : Les professeurs d'EPS, comme Anaïs Mangin, se perçoivent comme des "sentinelles" capables de détecter un mal-être par l'expression corporelle des élèves, souvent avant leurs collègues.

      Le défi de l'intégration : La principale difficulté pour les enseignants est de trouver le temps d'intégrer la prévention et le soutien en santé mentale aux exigences de leurs programmes scolaires.

      Un engagement massif : Malgré ces obstacles, Damien Duran souligne l'implication remarquable des enseignants, qui sont « massivement présents » dans les formations sur le harcèlement et la santé mentale, y compris hors temps scolaire.

      III. Stratégies d'Action : Formation et Projets de Terrain

      Pour répondre à cette crise, deux approches complémentaires se dessinent : la formation structurée des personnels et la mise en place de projets pédagogiques innovants.

      A. Le Secourisme en Santé Mentale : Structurer l'Intervention

      L'objectif est de permettre à chaque adulte d'intervenir "à bon escient", en évitant les maladresses qui peuvent aggraver une situation.

      Le Protocole PSSM France ("AÉRÉ") : Issu d'une méthode australienne, ce programme de formation de deux jours propose un protocole d'intervention en quatre étapes pour la prise en charge d'une personne en difficulté.

      Approcher la personne, évaluer et assister en cas de crise.  

      Écouter activement et sans jugement.    ◦ Réconforter et informer sur les aides existantes.  

      Aller vers des professionnels (médecin, psychologue, etc.).

      Le Protocole du Ministère (DGESCO) : Ce protocole est décrit comme plus "organisationnel". Il vise à identifier et cartographier les ressources et partenaires disponibles dans et hors de l'établissement pour orienter les équipes.

      Les deux démarches sont vues comme parfaitement complémentaires.

      B. L'Exemple du Projet "On se bouge" : Une Approche Holistique

      Le projet mené par Anaïs Mangin au collège Croix de Metz est un exemple concret d'une école qui prend soin de ses élèves en intégrant le bien-être aux apprentissages.

      | Caractéristiques du Projet "On se bouge" | Description | | --- | --- | | Concept de base | "Apprendre autrement" en associant l'EPS à d'autres disciplines (histoire-géo, maths, physique) lors de sorties hebdomadaires. | | Objectifs atteints | Renforcer le "vivre ensemble" et créer un fort sentiment d'appartenance (à la classe, à une petite équipe). | | Actions sur la santé mentale | Création de six ateliers sportifs sur l'estime de soi, la confiance et la gestion des émotions, en partenariat avec le Centre Médico-Psychologique (CMP) local. | | Impact | Amélioration notable du bien-être des élèves, mais aussi des enseignants qui travaillent en équipe et partagent la charge de travail. |

      C. L'Importance des Gestes Quotidiens et de la Formation des Jeunes

      Au-delà des programmes structurés, l'amélioration de la santé mentale passe par des actions simples et par l'implication directe des élèves.

      Le pouvoir des micro-interactions : Anaïs Mangin insiste sur l'impact de gestes simples comme dire "bonjour", "bonne journée" ou "as-tu bien dormi ?", qui peuvent amorcer une relation de confiance et changer le déroulement de la journée d'un élève.

      Former les jeunes : Damien Duran plaide pour la formation des jeunes eux-mêmes au secourisme en santé mentale.

      Son anecdote d'une élève de 6e en pleurs, inaccessible pour lui (l'adulte) mais accompagnée par une camarade, illustre que les pairs sont souvent les mieux placés pour apporter un premier soutien.

      IV. Perspectives et Recommandations

      A. Changer de Paradigme : De "Climat Scolaire" à "Qualité de Vie au Travail"

      Damien Duran propose une évolution sémantique et conceptuelle :

      « Je pense qu'on devrait s'intéresser à la qualité de vie au travail des élèves et [...] faire le parallèle avec la qualité de vie au travail des personnels. »

      Cette approche positionne le bien-être comme une condition structurelle et non comme une simple "ambiance", reconnaissant que la souffrance des personnels et celle des élèves sont interconnectées.

      B. Les Bénéfices de la Formation pour les Personnels

      La formation en secourisme en santé mentale offre des avantages concrets et profonds, tant sur le plan professionnel que personnel.

      Réduction de l'anxiété et gain de compétence : Elle permet de se sentir "moins inquiet" face à une situation de crise et de savoir comment réagir.

      Un changement de posture : Damien Duran raconte comment sa formation lui a permis de "switcher" de la panique à l'action lors d'une tentative de suicide de sa voisine, en appliquant un raisonnement structuré.

      Une aide pour soi et pour les autres : La formation change le regard sur autrui, renforce l'empathie et donne des outils pour mieux accompagner ses collègues et ses proches.

      C. Ressources Clés

      Les intervenants ont partagé plusieurs ressources pour approfondir le sujet et passer à l'action :

      1. Le Cartable des compétences psychosociales : Une plateforme proposant des outils et des activités pratiques (jeux, exercices de 10 à 30 minutes) pour les enseignants, utilisables en classe pour travailler sur la gestion des conflits ou d'autres compétences.

      2. Le site de PSSM France (pssmfrance.fr) : Pour s'informer sur les modules de formation en premiers secours en santé mentale, accessibles aux personnels ou à titre privé.

      3. Le Protocole Santé Mentale des Élèves (DGESCO) : Document officiel ("Du repérage à la prise en charge") qui doit être disponible et renseigné dans chaque établissement, recensant les partenaires locaux et la marche à suivre en cas de crise.

    1. Dossier de Synthèse : Le Jeu Libre comme Outil Pédagogique Essentiel à l'École

      Synthèse Exécutive

      Ce document synthétise les perspectives d'experts sur le rôle fondamental du jeu libre dans le développement de l'enfant et sa mise en œuvre en milieu scolaire.

      L'analyse révèle que le jeu libre est une activité essentielle, souvent mal comprise et sous-évaluée, qui participe directement à la construction de soi, au développement de la pensée et à l'acquisition de compétences transversales.

      Les points critiques à retenir sont les suivants :

      1. Nature du Jeu Libre : Le jeu, par définition, est libre. Il est caractérisé par la décision du joueur, l'établissement d'un cadre de "second degré" (distinct de la réalité), l'absence de conséquences réelles ("frivolité"), une organisation interne et une incertitude quant à son issue.

      Il se distingue radicalement des "jeux" éducatifs structurés qui sont en réalité des formes de travail déguisé avec des objectifs et des attentes externes.

      2. Rôle de l'Enseignant : La posture professionnelle requise est celle d'un observateur disponible et d'un architecte du cadre ludique, et non celle d'un intervenant directif.

      L'enseignant doit agir sur l'environnement (aménagement de l'espace, choix des objets, règles de fonctionnement) pour permettre au jeu d'advenir, plutôt que de diriger l'activité des enfants.

      3. Bénéfices Pédagogiques : Bien que 80% des processus à l'œuvre dans le jeu soient invisibles, ses bénéfices sont profonds.

      Le jeu libre favorise la concentration, la socialisation, la créativité, la libération de la parole et l'expérimentation sans crainte de l'échec.

      Il constitue un espace d'expression émotionnelle crucial et un terrain d'observation privilégié pour déceler les besoins des élèves, notamment ceux à besoins éducatifs particuliers.

      4. Mise en Pratique : L'instauration du jeu libre en classe repose sur un aménagement réfléchi : des espaces bien définis, des objets de qualité et réalistes en quantité raisonnable, et une préparation soignée qui valorise l'activité.

      Le rangement devient un acte pédagogique à part entière.

      5. Portée Universelle : Le besoin et les bienfaits du jeu libre ne se limitent pas à l'école maternelle.

      Il est tout aussi pertinent et nécessaire à l'école élémentaire, offrant un espace d'exploration et de complexification des apprentissages adapté à chaque âge.

      En conclusion, redonner sa place au jeu libre à l'école n'est ni un luxe ni une perte de temps, mais une stratégie pédagogique fondamentale qui, en faisant confiance à l'enfant, permet l'émergence d'apprentissages informels profonds et la construction d'un rapport positif à l'école.

      --------------------------------------------------------------------------------

      1. Définir le Jeu Libre : Une Affaire Sérieuse

      Le concept de "jeu libre" souffre d'une mécompréhension fondamentale, souvent perçu à tort comme une simple récréation.

      Les experts soulignent qu'il s'agit d'une activité essentielle au développement de l'être humain.

      1.1. La Nature Intrinsèque du Jeu

      Selon Nadège Aberbuche, ludo-pédagogue, le terme "jeu libre" est un pléonasme.

      S'appuyant sur les travaux du sociologue Gilles Brougère, elle affirme que le jeu, par définition, est libre.

      Il n'est pas une simple activité, mais un cadre spécifique que le joueur décide de créer et d'habiter.

      Citation clé : "On rate l'essentiel [...] c'est-à-dire que le jeu participe à la construction de soi, à la construction de la pensée, donc ça n'est pas que pour se distraire et s'amuser, c'est absolument essentiel au développement de l'être humain." - Nadège Aberbuche

      1.2. Les Cinq Caractéristiques du Jeu (selon Gilles Brougère)

      Pour clarifier ce qu'est le jeu, cinq caractéristiques principales sont identifiées :

      1. Le Second Degré : Le jeu n'est pas la réalité. Le joueur adhère à un cadre fictionnel, une réalité alternative, pour la durée du jeu.

      2. La Prise de Décision : Le jeu n'existe que par les décisions des joueurs.

      Ils décident de tout : entrer dans le jeu, en définir les contours, et même en sortir à tout moment.

      3. La Frivolité : Ce qui se passe dans le jeu n'a pas de conséquences directes sur la réalité du joueur.

      Cette caractéristique est cruciale car elle autorise l'exploration, le tâtonnement, l'erreur et l'invention sans pression ni enjeu réel.

      4. Les Mécanismes d'Organisation : Tout jeu, même le plus simple, est structuré. Les joueurs définissent des règles, des rôles, des scénarios et des limites.

      5. L'Incertitude : L'issue du jeu n'est jamais connue à l'avance, ce qui en fait son "sel" et motive les joueurs à recommencer.

      2. La Confusion Fondamentale : Jeu contre Travail Déguisé

      Un obstacle majeur à la mise en place du jeu libre à l'école est la confusion entre le jeu authentique et les activités d'apprentissage ludifiées.

      2.1. L'instrumentalisation du Jeu

      Cécile Beautier Richard, enseignante en toute petite section, observe que de nombreux enseignants utilisent le mot "jeu" pour désigner des activités avec des objectifs pédagogiques précis (travailler les couleurs, les mathématiques) et parfois même une évaluation.

      Le point de vue de l'enseignant : Il s'agit d'un "travail" visant l'acquisition de compétences définies dans les programmes.

      Le point de vue de l'enfant : L'enfant, qui sait intuitivement ce qu'est jouer, peut se sentir "trompé" lorsque l'activité annoncée comme un jeu se révèle être un exercice scolaire.

      Ce sentiment peut conduire à un désinvestissement de l'enfant vis-à-vis du jeu lui-même.

      2.2. La Nécessité de la Clarté

      Les intervenantes s'accordent sur l'importance d'être clair avec les enfants.

      Il n'y a pas de honte à proposer des "ateliers" ou du "travail", car les enfants ont un désir naturel d'apprendre.

      La distinction sémantique et conceptuelle est essentielle pour préserver l'intégrité et la puissance du jeu libre.

      3. L'Importance Capitale du Jeu Libre pour le Développement de l'Enfant

      Le jeu libre est un espace-temps où se déroulent des apprentissages informels, invisibles mais cruciaux.

      3.1. Un Laboratoire Cognitif et Émotionnel

      Cécile Beautier Richard illustre ce point avec l'exemple d'un élève de 3 ans manipulant des aimants pendant 15 minutes en totale concentration.

      Citation clé : "80 % du jeu de l'enfant [...] n'est pas visible en fait à l'œil nu. [...] je ne sais pas ce qui se passait dans sa tête [...] visiblement il a l'air de se passer 1000 connexions à la seconde dans son cerveau et c'est super." - Cécile Beautier Richard

      Ce temps, qui peut sembler improductif, est en réalité un moment de construction intense de la pensée, de la représentation spatiale et d'autres compétences non-identifiables sur le moment.

      3.2. Un Espace d'Expression et de Transformation

      Libération de la parole : En grande section, des enfants peu locuteurs dans un cadre formel se mettent à parler abondamment lorsqu'ils jouent librement, distribuant les rôles et créant des scénarios complexes.

      Expression des émotions : Le jeu permet de "faire semblant" et d'exprimer des émotions ou des pulsions (colère, agressivité) de manière symbolique et sans conséquence.

      Nadège Aberbuche insiste sur le fait que jouer à la bagarre ou à la guerre est un exutoire nécessaire qui, en étant autorisé dans le "faux", peut prévenir des passages à l'acte dans le "vrai".

      Il est crucial de ne pas confondre ces jeux symboliques avec des activités dangereuses (comme le "jeu du foulard") qui ne sont pas des jeux.

      Inclusion : Le jeu libre est particulièrement bénéfique pour les élèves à besoins éducatifs particuliers.

      Comme le souligne Cédric Guerro, directeur du Centre national de formation au métier du jeu et du jouet, le jeu "accepte l'autre tel qu'il est", sans les exigences parfois écrasantes des situations d'apprentissage formelles.

      4. La Posture Professionnelle de l'Enseignant : De l'Intervention à l'Observation

      Le succès du jeu libre dépend entièrement de la posture de l'adulte.

      4.1. La Métaphore du "Culbuto"

      Cédric Guerro propose la métaphore du "culbuto" (jouet qui revient toujours à sa base) pour décrire la posture de l'enseignant.

      Sa position de base doit être celle de l'observateur disponible.

      Toute intervention doit être une réponse à une observation et à l'interprétation d'un besoin, et non une action par défaut.

      4.2. Agir sur le Cadre, pas sur l'Enfant

      L'enseignant doit se concentrer sur la création et le maintien d'un cadre propice au jeu. Ce cadre comprend :

      • L'aménagement de l'espace.

      • Le choix et la disposition des objets.

      • Les règles de fonctionnement claires (distinction entre "faire semblant" et "faire pour de vrai").

      En agissant sur ce cadre, l'enseignant influence indirectement et positivement le comportement des enfants, leur permettant de développer leur jeu en autonomie et en sécurité.

      4.3. Un Nouveau Regard sur l'Élève

      L'observation du jeu libre permet de découvrir les élèves sous un autre jour, de voir émerger des compétences (concentration, socialisation, leadership) insoupçonnées dans un cadre scolaire classique.

      L'enseignant voit alors "l'enfant plus que l'élève".

      5. Mise en Pratique en Classe : Aménager un Environnement Propice

      La mise en place du jeu libre n'est pas une improvisation mais le résultat d'un travail pédagogique rigoureux en amont.

      5.1. L'Aménagement de l'Espace

      Cécile Beautier Richard donne plusieurs conseils concrets :

      Se mettre à hauteur d'enfant pour concevoir les espaces.

      Définir clairement les zones de jeu (par exemple avec des morceaux de lino de couleurs différentes).

      Ne pas surcharger les espaces.

      5.2. Le Choix des Objets

      Privilégier la qualité à la quantité. Des objets réalistes, fonctionnels et en bon état sont essentiels.

      Une poêle doit être à la taille des aliments factices, une poupée ne doit pas être cassée.

      Organiser de manière logique et accessible. Éviter d'empiler les puzzles ; les objets doivent être facilement préhensibles.

      5.3. Le Rangement comme Acte Pédagogique

      Le temps passé à ranger et à préparer l'espace de jeu après le départ des enfants (30-40 minutes par jour pour Cécile Beautier Richard) est fondamental.

      • Cela valorise l'activité aux yeux de l'enfant.

      • Cela donne envie de jouer le lendemain.

      • Cela constitue la préparation de la séance, au même titre que la préparation d'un atelier dirigé.

      Citation clé : "Le jeu libre c'est pas ce qu'on fait quand les enfants ils ont plus rien à faire [...] Non, il faut le considérer comme un atelier à part entière." - Cécile Beautier Richard

      6. Au-delà de la Maternelle : Le Jeu Libre pour Tous les Âges

      L'importance du jeu libre ne s'arrête pas aux portes de l'école élémentaire. Nadège Aberbuche qualifie ce combat de "même combat" pour tous les niveaux.

      • À la ludothèque "Les enfants du jeu", des classes jusqu'au CM2 sont accueillies.

      • Les élèves plus âgés se réapproprient des espaces de jeu de "petits" (bacs à sable, etc.), mais pour y mener des expérimentations plus complexes, adaptées à leur développement cognitif. Il ne s'agit pas d'une régression.

      • C'est une occasion rare pour les enseignants du primaire de voir leurs élèves jouer, une activité qui a largement disparu des cours de récréation, souvent au profit de tensions et de violences.

      --------------------------------------------------------------------------------

      7. Recommandations et Ressources

      Les expertes proposent des ressources pour les enseignants souhaitant se lancer ou approfondir leur pratique du jeu libre.

      | Type | Titre | Auteur / Source | Description | Recommandé par | | --- | --- | --- | --- | --- | | Livre (Théorique) | Jouer/Apprendre | Gilles Brougère | Une référence pour comprendre la distinction et les passerelles entre éducation formelle et informelle. | Cécile Beautier Richard | | Documents Pédagogiques | Jouer et apprendre | Eduscol | Des documents très bien faits (cadrage général, volets par type de jeu, vidéos) pour se lancer. | Cécile Beautier Richard | | Livre (Psychologie) | Libre pour apprendre | Peter Grey | Un ouvrage d'un psychologue américain qui requestionne la notion d'apprentissage, avec un chapitre important sur le jeu. | Nadège Aberbuche | | Film | Permis de jouer | \- | Un film tourné à la ludothèque "Les enfants du jeu", centré sur le jeu symbolique des enfants d'âge élémentaire, avec des témoignages d'enseignants. | Nadège Aberbuche |

    1. Le Conseil de Classe Participatif : Analyse d'une Initiative Pédagogique

      Résumé Exécutif

      Ce document de synthèse analyse le projet de "conseil de classe participatif" mis en œuvre par Émilie Roger, professeure de SVT au collège de la Largue.

      Né du constat de l'inefficacité et du manque d'engagement suscités par les conseils de classe traditionnels, ce dispositif vise à transformer cette instance en un outil pédagogique centré sur l'élève.

      En s'appuyant sur les sciences cognitives et la métacognition, le projet prépare les élèves de 6ème à auto-évaluer leurs compétences, à formuler un bilan personnel et à définir des objectifs de progression.

      Les principaux résultats montrent un engagement accru des élèves, qui développent une conscience juste de leurs points forts et de leurs difficultés.

      Le format, bien que lourd sur le plan organisationnel, génère des moments d'échange d'une grande richesse, valorisant l'élève et renforçant le dialogue pédagogique.

      Les défis majeurs résident dans la logistique complexe, qui limite son déploiement à un seul niveau, et dans la nécessité d'un suivi régulier pour ancrer les objectifs fixés.

      L'implication des parents, expérimentée ponctuellement, est identifiée comme un levier majeur pour décupler l'impact du dispositif.

      1. Contexte et Genèse du Projet

      L'initiative du conseil de classe participatif a été développée en réponse à une double insatisfaction concernant le format traditionnel de cette instance.

      Le Constat d'Inefficacité

      Émilie Roger, en tant que professeure de SVT participant à de nombreux conseils de classe, a identifié plusieurs limites au format classique :

      Rôle passif des enseignants : Hormis le professeur principal, les autres enseignants assistent principalement à une "lecture d'appréciation globale" avec très peu d'échanges pédagogiques de fond.

      Absence de focus sur les compétences : Les discussions sont rarement centrées sur les compétences de l'élève et les moyens de les améliorer.

      Manque d'impact sur l'élève : Le déclencheur du projet fut la révélation qu'un élève n'avait même pas lu les conseils formulés sur son bulletin scolaire.

      L'Objectif de Transformation

      Face à ce constat, l'objectif était clair : "comment finalement transformer un conseil de classe classique en quelque chose qui pourrait être utile à l'élève où l'élève pourrait s'engager dans son évaluation de son parcours et pouvoir s'asseoir dessus pour progresser".

      Le projet vise à rendre l'élève acteur de son évaluation et de sa progression.

      2. Le Dispositif du Conseil de Classe Participatif

      Le projet se décompose en une phase de préparation rigoureuse et un déroulement spécifique, repensé pour maximiser l'interaction individuelle.

      La Phase de Préparation

      Avant chaque conseil, trois à quatre séances sont organisées, généralement sur les heures de "devoir fait" ou de "vie de classe", pour préparer les élèves.

      Cette préparation inclut :

      1. Introduction aux Compétences : Explication de ce qu'est une compétence, comment elle est évaluée et comment atteindre les meilleurs niveaux de maîtrise.

      2. Auto-positionnement : L'élève est invité à se positionner sur les différentes compétences évaluées.

      3. Construction du Bilan : Les élèves apprennent à construire leur propre bilan, en identifiant leurs points forts et les points à améliorer pour la période suivante.

      Le Déroulement Concret

      La session du conseil de classe participatif dure au total entre 1h30 et 1h45.

      Session Plénière (15 minutes) : Un bilan global est présenté par les élèves délégués, puis par la professeure principale.

      Ateliers par Pôles : La classe est ensuite divisée en deux équipes équilibrées (ex: "pôle scientifique" et "pôle français").

      Entretiens Individuels (7 minutes par élève) : Chaque élève présente son bilan personnel aux enseignants du pôle.

      Pour une classe de 30, chaque pôle gère environ 15 élèves.

      3. Fondements Pédagogiques et Approche Cognitive

      Le projet est explicitement ancré dans les apports des sciences cognitives, visant à doter l'élève d'une meilleure compréhension de ses propres mécanismes d'apprentissage.

      Formation en Neuroéducation : L'initiatrice du projet a obtenu un diplôme en neuroéducation et s'est formée auprès de l'association "Apprendre et former avec les sciences cognitives".

      Éducation au fonctionnement du cerveau : L'objectif est de former l'élève sur son propre cerveau : comment il apprend, mémorise et maintient son attention.

      Développement de la Métacognition : L'approche consiste à amener l'élève à réfléchir sur ses propres processus d'apprentissage.

      Il est encouragé à s'auto-évaluer face à une tâche ("Est-ce que c'est facile, difficile ?"), et si elle est difficile, à identifier les stratégies à mettre en place ("Quelle aide tu pourrais demander pour justement atteindre tes objectifs ?").

      4. Résultats, Impacts et Témoignages

      Le dispositif a produit des effets significatifs sur l'engagement, la lucidité et la confiance des élèves.

      L'Engagement et la Prise de Conscience

      Le principal bénéfice observé est une prise de conscience par les élèves de leurs propres difficultés et de leurs capacités à progresser.

      Impact émotionnel sur l'enseignante : Émilie Roger témoigne être systématiquement impressionnée, au point d'avoir "envie de pleurer", en voyant "les plus timides qui osent parler, qui osent dire leur fragilité".

      Transformation des élèves "difficiles" : Même les élèves souvent perçus comme perturbateurs parviennent à verbaliser leurs difficultés (ex: le bavardage), ce qui est considéré comme une victoire pédagogique majeure.

      Le fait qu'ils "s'expriment" sur leurs défis est vu comme "magnifique".

      La Justesse de l'Auto-évaluation

      Il est noté que les élèves font preuve d'une grande lucidité. Il y a rarement une différence entre leur auto-évaluation et les appréciations des enseignants sur le bulletin.

      Témoignages d'Élèves

      Les extraits de dialogues illustrent la capacité des élèves à analyser leur parcours et à se projeter.

      | Thème | Citation de l'élève | Contexte / Analyse | | --- | --- | --- | | Effort et Motivation | "Ce qu'il faut savoir c'est qu'il aime pas l'école en fait. \[...\] Il fait d'énormes efforts pour réussir sans avoir forcément la motivation derrière ça." | Un élève exprime son manque d'intérêt pour certains sujets, tout en fournissant un travail important. | | Identification des Difficultés | "J'ai plus de difficultés à mémoriser l'histoire-géo \[...\] J'ai du mal à redire ce que j'ai appris." | L'élève distingue un problème de mémorisation d'un problème de compréhension. | | Fierté et Résilience | "Je suis fière d'avoir réussi à m'organiser pour réviser pour les contrôles, de ne pas avoir baissé les bras alors que c'est difficile pour moi." | Une élève met en avant sa capacité d'organisation et sa persévérance face à la difficulté. | | Définition d'Objectifs | "Mon objectif pour l'année prochaine serait de rester plus concentré. \[...\] Me mettre pas avec des personnes que j'aime bien forcément à côté." | Un élève identifie le bavardage comme sa difficulté et propose une stratégie concrète pour y remédier. | | Stratégies d'Entraide | "Quand eux \[les copains\] ils t'aident, est-ce que tu arrives mieux à comprendre ? - Oui un petit peu." | Une élève reconnaît que travailler avec ses pairs l'aide à mieux comprendre les exercices de mathématiques. |

      5. Défis, Limites et Perspectives

      Malgré son succès pédagogique, le dispositif fait face à des obstacles importants qui freinent son expansion.

      Les Contraintes Organisationnelles

      La "plus grosse difficulté" est d'ordre logistique.

      Gestion du temps : Le format se déroule sur des créneaux de cours (généralement 15h-17h), ce qui oblige à "libérer des classes" et à réorganiser les emplois du temps des enseignants et des élèves.

      Limitation au niveau 6ème : En raison de cette complexité, le projet est actuellement cantonné aux classes de 6ème. L'équipe pédagogique souhaiterait l'étendre au niveau 3ème, où il serait pertinent pour l'orientation, mais cela n'est pas réalisable pour le moment.

      La Question du Suivi Post-Conseil

      "L'après est plus difficile" et reste un point en cours d'amélioration.

      L'oubli étant "biologique", il est nécessaire de rappeler régulièrement aux élèves leurs objectifs et de les interroger sur les moyens qu'ils mettent en œuvre pour les atteindre, afin d'ancrer durablement la progression.

      Le Potentiel de l'Implication Parentale

      Une expérience a été menée il y a deux ans en faisant venir les parents pour qu'ils écoutent le bilan de leur enfant et échangent avec l'équipe.

      Cette formule est décrite comme "le top du top", car elle combine l'engagement de l'enfant et l'écoute du parent dans une démarche de "valorisation de l'élève".

      L'Adoption par l'Équipe Enseignante

      Le projet est activement soutenu et mis en place par trois professeurs principaux, qui seront quatre l'année prochaine.

      D'autres collègues sont plus réticents, non pas pour des raisons pédagogiques, mais principalement à cause du "beaucoup de temps" que l'organisation requiert.

    1. Synthèse sur l'Éducation à la Citoyenneté Numérique : S'appuyer sur les Pratiques des Jeunes

      Résumé Exécutif

      Ce document de synthèse analyse les perspectives et stratégies d'éducation à la citoyenneté numérique, basées sur les interventions d'experts en sociologie, en éducation au numérique et d'un praticien en milieu scolaire.

      L'idée centrale est un changement de paradigme : passer d'une approche "riscocentrée", focalisée sur la protection et l'interdiction, à une posture d'accompagnement qui s'appuie sur les pratiques réelles et les centres d'intérêt des jeunes.

      Les intervenants soulignent que les jeunes utilisent le numérique pour des raisons profondes liées à la construction identitaire, à la régulation du stress et à la recherche de réponses que les adultes ne fournissent pas toujours.

      Pour être efficaces, les éducateurs doivent adopter une posture d'empathie, de légitimation des cultures numériques des jeunes et de co-construction des savoirs.

      L'objectif final est de développer leur réflexivité, leur esprit critique et leur pouvoir d'agir, en les aidant à comprendre les mécanismes des plateformes, leurs droits, leurs devoirs et le potentiel émancipateur du numérique, plutôt que de se limiter à une posture de méfiance.

      --------------------------------------------------------------------------------

      1. Redéfinir la Citoyenneté Numérique au-delà des Risques

      Le point de départ de la discussion est le constat que la notion de citoyenneté numérique est souvent perçue par les adultes à travers le prisme de l'inquiétude et de la protection.

      Les intervenants s'accordent sur la nécessité d'élargir cette vision.

      Une Définition Élargie : La définition du Conseil de l'Europe est citée comme un modèle, incluant des dimensions positives telles que l'inclusion, la créativité, l'empathie et la participation active.

      Faire "avec eux" plutôt que "pour eux" : Il y a une prise de conscience croissante de l'importance d'impliquer les jeunes dans la construction de leur citoyenneté numérique.

      Un Vocabulaire Inadapté : Selon Nicolas Bourgeon, professeur documentaliste, le terme "citoyenneté numérique" est un jargon institutionnel qui ne résonne pas chez les élèves.

      L'approche efficace consiste à "utiliser leur mot à eux".

      Priorités pour l'Éducation au Numérique

      Chaque intervenant définit une priorité pour l'éducation à la citoyenneté numérique :

      | Intervenant | Organisation | Priorité | Citation Clé | | --- | --- | --- | --- | | Axel Dein | Directrice, Internet sans crainte | Comprendre | "Comprendre l'espace numérique dans lequel on évolue, comprendre les services qu'on utilise, comprendre les algorithmes pour être un utilisateur éclairé." | | Jocelyn Lachance | Sociologue, Crédat | Valoriser | "Ce qu'on oublie souvent, c'est que la plupart des jeunes se comportent quand même bien à l'heure du numérique et la question c'est en tant qu'adulte qu'est-ce qu'on est capable de valoriser les bonnes pratiques." | | Nicolas Bourgeon | Professeur Documentaliste | S'adapter | "Ce sont des mots qui appartiennent au vocabulaire plutôt institutionnel et l'approche que j'essaie d'avoir bah d'utiliser leur mot à eux." |

      2. Changer le Regard des Adultes sur les Pratiques Numériques des Jeunes

      Une critique fondamentale adressée à l'approche actuelle est le regard que les adultes portent sur les usages numériques des jeunes, souvent teinté de méconnaissance et de fantasmes.

      Le Regard "Riscocentré" et ses Limites

      Jocelyn Lachance identifie que l'intérêt des adultes pour les pratiques des jeunes est souvent "riscocentré", se concentrant sur les aspects délétères.

      Cette focalisation a plusieurs conséquences négatives :

      Elle occulte les bénéfices : Les jeunes utilisent le numérique pour des raisons essentielles à leur développement : construction de l'identité, gestion de questions existentielles, socialisation.

      Elle crée un décalage : Les jeunes ont l'impression que les adultes "passent à côté de ce qui est l'essentiel pour eux", à savoir le sens et les avantages qu'ils trouvent en ligne.

      La Solitude des Jeunes et l'Indisponibilité des Adultes

      Un thème récurrent est le sentiment de solitude des jeunes face au numérique.

      Manque d'accompagnement : Selon Axel Dein, les jeunes "sont extrêmement seuls" et "n'identifient pas les adultes autour d'eux comme des personnes qui sont susceptibles de les accompagner".

      Le numérique comme palliatif : Jocelyn Lachance confirme que les jeunes vont chercher en ligne ce qu'ils ne trouvent pas auprès des adultes.

      Une recherche sur l'usage de l'IA par les jeunes montre qu'ils s'en servent pour obtenir "une réponse structurée et rassurante" lorsqu'ils perçoivent les adultes comme indisponibles ou que le sujet est délicat (sexualité, mort).

      La Question de l'Interdiction

      L'interdiction est une pratique éducative structurante, mais son application au numérique soulève des questions complexes.

      Jocelyn Lachance met en garde contre une approche simpliste :

      1. Le Sens : Les adultes doivent s'interroger sur leurs motivations réelles derrière une interdiction.

      2. L'Efficacité et le Déplacement : Interdire l'accès à un espace peut pousser les jeunes vers un autre espace potentiellement moins sécurisé.

      3. La Perte de Bénéfices : L'interdiction peut supprimer des pratiques bénéfiques pour les jeunes, comme la régulation du stress.

      L'exemple d'un lycée québécois interdisant les smartphones est parlant : les élèves ont révélé qu'ils utilisaient leur téléphone pour écouter de la musique et s'isoler afin de gérer leur stress avant les examens.

      3. De la Prévention à l'Accompagnement : S'appuyer sur les Pratiques Réelles

      La deuxième partie de la discussion se concentre sur les méthodes pour passer d'une posture de simple prévention à un véritable accompagnement, en partant des usages concrets des jeunes.

      La Co-construction et l'Immersion

      Internet sans crainte, dirigé par Axel Dein, développe des ressources (serious games, scénarios interactifs) en impliquant directement les jeunes.

      Le rôle des panels de jeunes : Ils sont essentiels pour assurer la justesse et l'authenticité des ressources.

      Les jeunes poussent souvent les scénarios à être plus intenses pour refléter la réalité ("Mais là c'est trop tiède ce qu'on vit c'est plus intense c'est plus dur que ça.").

      Susciter l'esprit critique : L'objectif n'est pas de donner des leçons de manière descendante, mais de "les amener à prendre du recul, à se questionner".

      Ces séances collectives permettent une "autorégulation" bienveillante entre pairs.

      Partir des Centres d'Intérêt des Élèves

      Nicolas Bourgeon mène un projet avec des élèves de 6ème sur les influenceurs, un sujet qui les passionne. La démarche est la suivante :

      1. Point de départ : Les élèves choisissent un influenceur qu'ils apprécient.

      2. Analyse guidée : Ils décryptent le modèle économique (économie de l'attention, monétisation), les partenariats commerciaux (encadrés par la loi de 2023) et les techniques pour capter l'audience.

      3. Prise de conscience : Ce travail leur fait réaliser que lorsqu'ils consultent du contenu, "ils créent de la valeur". Les élèves identifient facilement les circuits financiers (produits, boutiques, microdons).

      4. La Citoyenneté Numérique en Action : Vers l'Émancipation

      La dernière partie explore les moyens de donner aux jeunes un réel pouvoir d'agir (empowerment) et de développer leur réflexivité.

      L'Expérience "Digital Practice Awareness" (DPA)

      Une recherche menée par Mélina Solari Landa auprès de lycéens offre des enseignements clés :

      La primauté du désir : "Le désir est le meilleur moteur de l'usage des adolescents."

      Le besoin de socialiser et les émotions l'emportent sur une évaluation rationnelle des risques, même lorsque les jeunes sont informés de l'utilisation de leurs données.

      Difficulté avec la temporalité et la distance : Les jeunes ont du mal à percevoir comment leurs actions en ligne actuelles peuvent avoir des conséquences à long terme ou affecter des personnes à une échelle globale.

      L'inefficacité des approches prescriptives : Les logiques restrictives ne permettent pas de développer la réflexivité.

      Développer la Réflexivité et la Confiance

      L'objectif de réflexivité : Pour Jocelyn Lachance, le but est d'amener les jeunes à réfléchir à ce qu'ils vivent et ressentent avant qu'une situation problématique ne survienne.

      Un "carnet de déconnexion" accompagné est plus efficace qu'un simple défi.

      Le risque de briser la confiance : Une approche trop axée sur les risques peut être contre-productive.

      Une jeune fille, après avoir reçu de la prévention, n'a pas osé parler à ses parents d'une expérience sur une application de rencontre par peur de se faire gronder.

      Légitimer leur culture : Pour Nicolas Bourgeon, établir la confiance passe par la reconnaissance de la légitimité de la "culture geek" des élèves.

      Éduquer aux Droits, aux Devoirs et au Pouvoir d'Agir

      Axel Dein insiste sur la nécessité de former les jeunes à la compréhension de leurs droits et devoirs en ligne, car leur première activité numérique est souvent sociale.

      Internet sans crainte a développé une mallette pédagogique qui aborde trois axes :

      1. Comprendre ses droits et devoirs.

      2. Comprendre le rapport à l'autre en ligne (limites public/privé, liberté d'expression).

      3. Comment le numérique donne le pouvoir d'agir.

      S'inspirer des Codes Numériques

      Jocelyn Lachance suggère de s'inspirer des raisons du succès des YouTubeurs et Twitchers auprès des jeunes. Ces créateurs donnent le sentiment de créer un espace sécurisant où :

      • Les jeunes sentent que les discussions partent d'eux ("on peut poser les vraies questions").

      • Leur parole est comprise et valorisée.

      • Leur culture n'est pas "délégitimée".

      L'enjeu pour l'adulte est de s'interroger sur sa propre posture :

      "Est-ce que moi je suis personnellement dans une posture qui délégitime les pratiques numériques et qui fait un espèce de mouvement de répulsion par rapport aux jeunes ?"

    1. Améliorer l'Engagement des Élèves en Collège et Lycée : Synthèse des Stratégies de Hassan Nassiri

      Résumé Exécutif

      Ce document synthétise les stratégies et les réflexions partagées par Hassan Nassiri, professeur et formateur, pour améliorer l'engagement des élèves dans le second degré.

      L'approche préconisée repose sur quatre leviers d'action fondamentaux :

      Ritualiser les cours pour créer un cadre sécurisant,

      Varier les supports et les modalités pour maintenir l'attention,

      Donner des responsabilités pour impliquer les élèves, et

      Valoriser la progression plutôt que la seule performance.

      Pour les élèves les plus réticents, la méthode consiste à identifier les causes de leur décrochage et à proposer des "entrées progressives" via des micro-tâches pour créer des premiers succès.

      La création d'une dynamique de classe collective, à travers des projets interdisciplinaires et une charte de classe co-construite, est également essentielle.

      La posture de l'enseignant est déterminante : elle doit incarner une alchimie entre exigence et bienveillance, en établissant un cadre clair tout en offrant des encouragements constants.

      La gestion de l'erreur doit être dédramatisée, celle-ci étant présentée comme une étape nécessaire à l'apprentissage.

      Enfin, il est crucial de ne pas rester isolé et de s'appuyer sur l'équipe pédagogique (collègues, CPE, direction) pour gérer les situations complexes et assurer une cohérence éducative.

      1. Introduction et Contexte

      Hassan Nassiri, professeur en établissement à mi-temps et formateur pour le Réseau Canopé et l'inspection académique, aborde la question centrale de l'engagement des élèves en collège et lycée.

      Fort de son expérience, notamment en lycée professionnel, il partage des gestes professionnels concrets et des retours de terrain destinés à aider les enseignants, particulièrement les débutants, à "ne laisser personne sur le bord de la route".

      La problématique principale est de savoir comment mettre en activité tous les élèves, y compris ceux qui semblent les moins coopératifs, afin de créer et de pérenniser une dynamique de classe positive tout au long de l'année.

      Ses conseils s'appliquent aussi bien aux classes dédoublées (12-15 élèves) qu'aux classes à effectif plus lourd (24-30 élèves et plus).

      2. Fondements Théoriques et Pédagogiques

      Pour nourrir sa réflexion, Hassan Nassiri s'appuie sur plusieurs références clés qui soulignent l'importance de la pédagogie et de l'organisation dans la gestion de classe :

      François Dubet ("Les lycéens") : Cet ouvrage analyse finement le rapport des élèves au travail scolaire, montrant la grande variété des profils et l'influence de leur histoire personnelle sur leur engagement.

      Philippe Merieu ("La pédagogie différenciée") : Nassiri retient de Merieu l'idée fondamentale d'adapter les dispositifs pédagogiques pour que chaque élève trouve sa place et que personne ne se sente exclu.

      Ressources Eduscol et Réseau Canopé : Ces ressources rappellent un principe essentiel : "la gestion de classe, ce n'est pas que de la discipline, c'est avant tout de l'organisation et de la pédagogie".

      3. Les Quatre Levier Fondamentaux de l'Engagement

      Hassan Nassiri identifie quatre leviers concrets pour transformer ces idées en actions en classe.

      a. Ritualiser

      Instaurer des rituels en début et en fin de cours permet de créer un cadre rassurant pour les élèves, notamment les plus effacés.

      Début de séance : Commencer par une "question flash" ou un "mot d'actualité".

      Fin de séance : Terminer par un rapide tour de table pour synthétiser ce qui doit être retenu.

      b. Varier

      Pour éviter la routine et l'ennui, il est crucial de varier les supports et les modalités de travail.

      Alternance des supports : Combiner des supports écrits traditionnels ("la bonne vieille méthode du papier") avec des outils numériques (quiz, etc.). Hassan Nassiri insiste sur l'importance de faire écrire les élèves, estimant qu'ils "n'écrivent pas assez".

      Alternance des activités : Le but est de casser la routine durant l'heure de cours pour capter l'attention.

      Travaux de groupe : Cette modalité est jugée "très intéressante" pour responsabiliser les élèves et impliquer ceux qui sont plus effacés ou timides.

      c. Donner (des responsabilités)

      Attribuer des rôles spécifiques aux élèves, notamment dans le cadre des travaux de groupe, modifie radicalement leur implication.

      Exemples de rôles : Gardien du temps, rapporteur, responsable du matériel.

      Impact : "Quand tes élèves se sentent utiles, leur implication change." Cela fonctionne particulièrement bien pour les élèves timides.

      Pédagogie de projet : Mettre les élèves en projet les rend "vraiment acteurs de leur formation".

      Des exemples concrets incluent la création d'une mini-entreprise ou l'organisation de mobilités internationales (Erasmus).

      d. Valoriser

      Ce levier est jugé "très, très important". Il s'agit de valoriser la progression des élèves et pas uniquement leur performance finale.

      Signal fort : Féliciter un élève en difficulté non seulement pour une bonne réponse, mais aussi pour une démarche claire ou une progression par rapport à la séance précédente.

      Message transmis : "Cela montre que l'effort compte autant que le résultat."

      Impact sur l'élève : L'encouragement régulier et la reconnaissance de l'effort boostent l'élève et renforcent sa confiance.

      4. Stratégies pour les Élèves Réticents et en Retrait

      Face aux élèves qui résistent à l'engagement, Hassan Nassiri propose une approche ciblée.

      Identifier la cause : Il faut se demander ce qui motive le refus (peur de l'échec, rejet de l'école, historique personnel). "Il y a toujours une explication."

      Proposer des "entrées progressives" : Commencer par une "micro-tâche" simple, par exemple en binôme, puis augmenter progressivement la difficulté.

      L'objectif est de "créer un premier succès, même petit", pour encourager l'élève.

      Accompagnement personnalisé : Profiter des moments en demi-groupe pour s'approcher physiquement des élèves les plus timides, s'asseoir à côté d'eux pour les rassurer et les accompagner de manière individualisée.

      Valoriser la participation : L'élève doit sentir qu'il a le droit à l'erreur. Il faut l'encourager pour sa tentative, même s'il se trompe.

      Lui confier une mission simple, comme expliquer une réponse au tableau, le rend visible et le valorise.

      5. La Force du Collectif : Créer une Dynamique de Classe

      Au-delà des actions individuelles, il est primordial de construire une culture de classe collective.

      Charte de classe : Élaborer une charte avec les élèves sur les valeurs et les attitudes à adopter.

      Cette démarche, bien que chronophage, les rend "complètement acteurs de leur apprentissage".

      Projets communs : Lancer des projets interdisciplinaires permet de travailler avec d'autres collègues, de ne pas rester seul, et de montrer aux élèves les liens entre les disciplines.

      Cela crée du lien tant pour les élèves que pour les enseignants.

      6. Thèmes Spécifiques Abordés (Session Q&A)

      | Thème | Stratégies et Conseils | | --- | --- | | Gestion des binômes | Il n'y a pas de "formule miracle" (faibles ensemble vs. mixité). L'enseignant connaît ses élèves et doit adapter la composition. Hassan Nassiri privilégie les groupes par affinité et insiste : "il ne faut jamais imposer les binômes", sauf cas exceptionnel. La supervision de l'enseignant est clé. | | Gestion de l'erreur | L'erreur doit être dédramatisée et valorisée comme un moteur d'apprentissage. Il faut affirmer aux élèves : "vous avez le droit de vous tromper. L'erreur n'est pas négative, justement l'erreur permet d'avancer". L'erreur peut aussi provenir d'un manque de clarté dans les consignes de l'enseignant. | | Engagement inter-matières | Pour contrer la tendance des élèves à négliger les matières à faible coefficient, il faut leur expliquer, en s'appuyant sur le référentiel du baccalauréat, que "toutes les matières comptent". Ce discours doit être porté par toute l'équipe pédagogique pour être efficace. | | Usage du numérique | L'alternance papier/numérique est essentielle, car le "tout numérique" peut lasser les élèves. Pour les outils comme Kahoot en collège (sans smartphone autorisé), l'enseignant doit fixer un cadre et des règles très claires avant de lancer l'activité et sanctionner si elles ne sont pas respectées pour garantir sa crédibilité. | | Gestion des classes agitées | Face à une classe très remuante (26 élèves et plus) : ne pas rester seul. S'appuyer sur l'équipe (collègues, CPE, direction), identifier les meneurs, les interroger en individuel pour comprendre leur comportement, et utiliser des outils comme le plan de classe. | | Intelligence Artificielle (IA) | Il est préconisé d'adopter une approche proactive : plutôt que d'interdire, il faut accompagner les élèves. Cela passe par une séance dédiée pour leur apprendre à "rédiger un prompt" et à utiliser l'IA de manière "raisonnée", en comprenant les réponses générées. La fixation d'un cadre par l'enseignant est impérative. |

      7. La Posture Enseignante : Clé de Voûte de l'Engagement

      La réussite de ces stratégies repose fondamentalement sur la posture de l'enseignant.

      L'alchimie de l'exigence et de la bienveillance : C'est le principe central. Il faut donner un cadre clair et être exigeant, mais toujours accompagné d'encouragements constants et d'une écoute bienveillante.

      Accessibilité et crédibilité : Soyez accessible et tenez parole. "Quand vous dites quelque chose, bah faites-le", car ne pas le faire fait perdre toute crédibilité.

      La gentillesse ne doit pas être perçue comme de la faiblesse, mais comme une partie d'un cadre respecté.

      S'appuyer sur l'équipe : Il est essentiel de collaborer avec les collègues expérimentés et surtout avec le ou la CPE, qui a une connaissance fine des élèves et peut apporter une aide précieuse sur l'aspect psychologique.

      La liberté pédagogique : Hassan Nassiri conclut en rappelant que la "fameuse liberté pédagogique" est un atout précieux qui permet aux enseignants de mettre en place ces stratégies et de donner tout son sens à leur métier.

    1. Micro-violences et Micro-attentions en Milieu Éducatif : Analyse et Perspectives

      Synthèse Exécutive

      Ce document de synthèse analyse les concepts de micro-violences et de micro-attentions en milieu éducatif, en s'appuyant sur l'expertise de Laurent Muller, maître de conférences en sciences de l'éducation, et de Lucie Perrin, inspectrice de l'Éducation nationale (faisant fonction d'IEN).

      Les micro-violences sont définies comme des gestes, paroles, attitudes ou oublis quotidiens, souvent banalisés et passant sous les radars, qui dégradent la personne à petit feu.

      Elles ne sont pas seulement interpersonnelles mais aussi institutionnelles, découlant d'une logique qui privilégie les intérêts de l'institution sur ceux des usagers.

      L'impact de ces "presque-riens" est considérable car ils heurtent des besoins psychiques fondamentaux et universels (autonomie, appartenance, compétence), particulièrement chez des élèves en pleine construction identitaire.

      La prise de conscience par les enseignants est un processus complexe, souvent freiné par un sentiment de jugement ou de culpabilité, qui peut mener au déni.

      Les facteurs systémiques, tels que la culture de conformité à l'autorité (l'état agentique de Milgram), la gestion du temps collectif au détriment du temps individuel, et la reproduction sociale par des enseignants "survivants" du système scolaire, entretiennent ces pratiques.

      En contrepoint, les micro-attentions — un sourire, un mot bienveillant, une écoute active — sont présentées comme des outils puissants pour prévenir et restaurer le lien éducatif.

      Des stratégies concrètes sont proposées, comme la Communication Non-Violente, la création d'espaces de parole pour les élèves et la nécessité pour les enseignants de prendre soin de leurs propres besoins avec le soutien de l'institution.

      La transformation des pratiques passe par une posture d'humilité, une analyse réflexive et une volonté de "perdre du temps" pour en gagner sur le plan des apprentissages et du bien-être.

      --------------------------------------------------------------------------------

      1. Définition et Impact des Micro-violences Éducatives

      1.1. Nature et Caractéristiques des Micro-violences

      Les micro-violences sont décrites comme des "presque-riens qui ne sont pas des riens". Il s'agit de violences banalisées, normalisées et souvent invisibles, qui prennent la forme de :

      Paroles : Remarques blessantes, humour humiliant, expressions toutes faites. Exemples cités : "Hélène, ne te leurre pas, tu ne feras jamais de science", "c'est pas grave, c'était pour rire".

      Attitudes : Regards qui éteignent, souffles exaspérés, postures de supériorité.

      Gestes : Classer les copies par ordre de notes.

      Oublis et silences : Ne pas dire bonjour, ignorer un élève, créer des silences qui excluent.

      Selon Laurent Muller, ces actes dégradent la personne "à petit feu" et ne doivent pas être confondus avec la notion de "micro-agression", qui est plus subjective.

      L'objectivité de la micro-violence réside dans sa capacité à heurter des besoins psychiques universels.

      1.2. La Double Dimension : Interpersonnelle et Institutionnelle

      Les micro-violences ne se limitent pas aux interactions entre enseignants et élèves.

      Elles possèdent une dimension institutionnelle profonde.

      Violence institutionnelle : Laurent Muller, citant Eliane Corbet, la définit comme le fait de "privilégier l'intérêt de l'institution sur l'intérêt des usagers".

      Logique biopolitique : Au sens de Michel Foucault, il s'agit d'une "gestion des flux de population qui sert à normaliser les corps et les pensées".

      Les enseignants et les directions peuvent eux-mêmes être victimes de cette logique systémique.

      Cette double dimension explique pourquoi les enseignants peuvent être à la fois auteurs et victimes de micro-violences, pris dans des logiques qui les dépassent.

      1.3. L'Impact sur les Élèves : Le Heurt des Besoins Psychiques

      L'impact puissant des micro-violences, même subtiles, s'explique par deux facteurs principaux :

      1. L'âge des élèves : Ils sont en pleine construction identitaire, ce qui les rend particulièrement vulnérables.

      2. Le heurt des besoins psychiques : Considérés comme des "nutriments psychiques", leur non-satisfaction produit une dégradation de l'état psychique.

      Laurent Muller s'appuie sur les travaux de Deci et Ryan pour identifier trois besoins fondamentaux et universels :

      | Besoin Psychique | Description | Conséquence du Heurt | | --- | --- | --- | | Autonomie | Besoin de se sentir à l'origine de ses propres actions. | Sentiment d'aliénation, perte de motivation intrinsèque. | | Appartenance | Besoin de se sentir respecté, reconnu, accueilli, en lien. | Isolement, qui est un facteur majeur de morbidité. | | Compétence | Besoin de se sentir efficace et capable d'agir sur son environnement. | Sentiment d'échec, dévalorisation, décrochage. |

      Lucie Perrin confirme que partir des besoins de l'élève est essentiel pour créer les conditions favorables à l'apprentissage.

      2. La Prise de Conscience : Un Processus Délicat

      2.1. Réactions des Enseignants et Obstacles

      Lors des formations, Lucie Perrin observe que les enseignants sont souvent "étonnés" et "bouche bée" face à la liste des violences pédagogiques ordinaires (recensées par Christophe Marcellier), car "ils se reconnaissent".

      Cette reconnaissance peut entraîner deux réactions problématiques :

      Le sentiment d'être jugé : Les enseignants peuvent se sentir accusés, ce qui entrave la réflexion.

      La culpabilisation : Laurent Muller avertit que la culpabilité "risque de conduire au déni" et de renforcer les mécanismes de défense.

      L'objectif n'est pas de culpabiliser mais de responsabiliser, c'est-à-dire de "reprendre des marges de liberté" pour éviter d'entretenir le cycle de la violence.

      2.2. Le Rôle du Langage et de l'Humour

      Des automatismes de langage, analysés par Hannah Arendt dans le contexte du cas Eichmann, fonctionnent comme des "mécanismes de défense" qui invisibilisent la souffrance de l'autre et autorisent à "faire mal pour faire faire".

      | Type d'Expression | Exemples | Fonction | | --- | --- | --- | | Anticipation positive | "C'est pour ton bien", "Tu me remercieras plus tard" | Justifier une action douloureuse par un bénéfice futur. | | Version accusatoire | "C'est à moi que ça fait mal" | Inverser la culpabilité. | | Fatalisme | "C'est la vie", "On n'a pas le choix" | Se déresponsabiliser en invoquant une force supérieure. | | Minimisation | "On n'en est pas mort", "Moi aussi, je suis passé par là" | Nier l'impact du ressenti de l'autre. | | Exagération/Ironie | "C'est bon, t'exagères", "Mon pauvre chou, tu fais ta princesse" | Ridiculiser l'émotion de l'autre. | | Verdict de facilité | "Allez-y, c'est facile" (ajouté par Lucie Perrin) | Créer une pression et un sentiment d'incompétence chez l'élève en difficulté. |

      L'humour est un vecteur particulièrement puissant, car il permet de "détruire l'autre en l'accusant de manquer d'humour s'il ne rigole pas à l'humiliation qu'il est en train de subir".

      2.3. Stratégies de Conscientisation

      Pour prendre conscience de ces gestes sans se filmer, plusieurs pistes sont évoquées :

      Reconnaître l'écart entre intention et action : Accepter que de bonnes intentions ne garantissent pas des pratiques bienveillantes.

      L'analyse réflexive : Se remémorer les micro-violences subies et celles que l'on a pu commettre.

      Inviter des collègues en classe : Obtenir un regard extérieur sur ses pratiques.

      Donner la parole aux élèves : Leur permettre d'exprimer leur ressenti, comme l'a expérimenté Laurent Muller.

      3. Les Facteurs Systémiques d'Entretien des Micro-violences

      3.1. Conformisme et Soumission à l'Autorité

      Laurent Muller s'appuie sur les travaux de Stanley Milgram sur la "conversion à l'état agentique" pour expliquer une tendance au conformisme dans l'Éducation nationale.

      Dans cet état, un individu ne se sent plus à l'origine de son action et devient un "agent d'exécution" d'une volonté extérieure jugée légitime.

      Cela conduit à une "culture de la reproduction des attitudes".

      Ce phénomène est renforcé par le fait que les enseignants sont des "survivants du système scolaire" et donc porteurs d'un "biais particulier" qui les incline à reproduire les normes qui ont assuré leur propre succès.

      3.2. L'Influence de la Forme Scolaire

      La structure même de l'école ("forme scolaire") est un terreau fertile pour les micro-violences.

      La gestion du temps : La priorité donnée au temps collectif (finir les programmes) sur le temps propre de chaque élève est une source majeure de micro-violence.

      Comme le dit Rousseau cité par L. Muller, le paradoxe de l'éducation est de "savoir en perdre [du temps]".

      La taille des classes : Une classe de 30 ou 35 élèves rend la prise en compte des besoins individuels extrêmement difficile, favorisant une approche normalisatrice.

      L'espace : Lucie Perrin évoque la posture de l'enseignant "systématiquement debout face à ses élèves" comme un geste sécurisant pour lui, mais qui peut instaurer une distance.

      Le contexte de l'enseignement spécialisé (SEGPA), avec des effectifs réduits, montre a contrario que lorsque les conditions le permettent, la création de lien et l'attention aux besoins individuels deviennent prioritaires.

      4. Stratégies de Transformation : Les Micro-attentions

      4.1. Le Pouvoir des Micro-attentions

      Face aux micro-violences, les micro-attentions sont les "véritables petits moteurs du lien".

      Elles préviennent et peuvent restaurer la relation.

      Exemples : "Je t'écoute", "Tu as raison de dire ça", un bonjour et un sourire à l'accueil, une main sur l'épaule, un mot sympathique.

      L'importance de l'accueil : Pour Lucie Perrin, tout se joue dans les premières minutes.

      Un "bonjour" et un "sourire" peuvent "instaurer un climat de confiance et mettre les élèves dans de bonnes conditions".

      4.2. Outils et Postures

      Plusieurs approches sont proposées pour cultiver une pédagogie de la micro-attention :

      La Communication Non-Violente (CNV) : Développée par Marshall Rosenberg, elle propose un processus pour clarifier les pratiques langagières violentes.

      Laurent Muller précise que ce n'est pas une "solution mécanique" ou "miraculeuse" et qu'elle doit être "irriguée par une culture éthique de l'attention".

      Donner du temps et la parole aux élèves : Consacrer 10 minutes en début de cours pour demander aux élèves comment ils vont n'est pas du temps perdu, mais un investissement qui facilite les apprentissages en créant un climat de bien-être.

      La posture d'humilité : Lucie Perrin insiste sur la nécessité d'être prudent et humble, de reconnaître que l'on a pu soi-même commettre des erreurs, et de contextualiser les réactions des enseignants, qui font face à des adolescents aux vécus parfois complexes.

      4.3. Restaurer la Relation et Soutenir les Enseignants

      Lorsqu'une micro-violence a été commise, il est possible d'agir.

      Restaurer, non réparer : Laurent Muller préfère le terme "restaurer" ou "raccommoder" à "réparer", car il s'agit du vivant et non d'un mécanisme.

      La reconnaissance et les excuses : Le processus de restauration commence par "la reconnaissance explicite de ce qui a été fait" et le fait de "présenter simplement ses excuses".

      C'est en mettant des mots (M-O-T-S) que l'on peut soigner les maux (M-A-U-X).

      Le soutien institutionnel : Pour que les enseignants puissent prodiguer des micro-attentions, il est crucial que "l'institution puisse également soutenir les enseignants".

      La bienveillance doit commencer par soi-même : les enseignants doivent pouvoir prendre soin de leurs propres besoins pour pouvoir s'occuper de ceux de leurs élèves.

      5. Inspirations et Références Clés

      Pour approfondir la réflexion et l'action, les intervenants proposent les pistes suivantes :

      Laurent Muller :

      La psychologie humaniste : Les travaux de Carl Rogers et Marshall Rosenberg (fondateur de la CNV).  

      L'écoute des élèves : "Ils ont tout à nous apprendre par rapport à cette question-là."

      Lucie Perrin :

      Les travaux de Rebecca Shankland : Spécialiste du bien-être à l'école.  

      La qualité du temps passé à l'école : Reconnaître que les élèves voient parfois plus leurs enseignants que leur famille, et que ce temps doit être de qualité, empreint de bienveillance.

    1. Développer les Compétences Psychosociales à l'École : Synthèse de la Table Ronde "Osons la Communication NonViolente"

      Résumé Exécutif

      Ce document de synthèse analyse les points clés de la table ronde organisée par le Réseau Canopé autour de l'ouvrage "Développer les compétences psychosociales à l'école - Osons la Communication NonViolente".

      Confrontée à un contexte sociétal anxiogène et violent, l'école doit opérer une révolution éducative en intégrant pleinement les compétences psychosociales (CPS) et socio-émotionnelles.

      Loin d'être une simple opinion, cette approche repose sur des fondements scientifiques solides issus des neurosciences, qui démontrent le lien direct entre la sécurité affective, l'empathie et le développement cérébral optimal pour l'apprentissage.

      La Communication NonViolente (CNV) est présentée comme un levier majeur de cette transformation. Elle offre des outils concrets pour réguler les émotions, gérer les conflits et changer fondamentalement la posture des adultes.

      Il ne s'agit pas de renoncer à l'autorité, mais de la redéfinir comme une auctoritas inspirante, basée sur la confiance et le respect, plutôt que sur un pouvoir coercitif.

      Le point de départ de ce changement réside dans la formation des adultes de la communauté éducative (enseignants, personnels de direction, administratifs), qui, par leur propre exemplarité et leur capacité à l'auto-empathie, deviennent des modèles pour les élèves.

      La systématisation de formations volontaires, inter-catégorielles et leur intégration dans la formation initiale sont identifiées comme des conditions essentielles pour répondre à l'urgence actuelle tout en construisant une vision éducative durable.

      --------------------------------------------------------------------------------

      1. Contexte et Urgence : L'École Face aux Défis Sociétaux

      La discussion s'est ouverte sur le constat d'une pression croissante sur le système scolaire, qui doit naviguer entre des crises multiples et la nécessité de maintenir un climat apaisé.

      Le Reflet des Tensions Mondiales : Christophe Kéréro, Recteur de Paris, souligne que l'école, en tant que "reflet de la société", subit de plein fouet les agressions et la violence d'un contexte international et sociétal "extrêmement complexe".

      Les élèves, réceptacles de ces tensions (géostratégiques, climatiques), vivent dans un environnement anxiogène qui impacte leur construction en tant qu'individus et citoyens.

      L'Impératif d'un Climat Scolaire Apaisé : Face à ce constat, l'institution est sommée de garantir la sérénité dans les établissements.

      Cela s'inscrit dans un cadre plus large de lutte contre des phénomènes comme le harcèlement, mais ne peut faire abstraction des "fractures" qui traversent la société.

      La Double Temporalité : Un défi majeur réside dans la gestion d'une double temporalité. D'une part, une "société très impatiente" demande des résultats rapides face à l'urgence.

      D'autre part, le développement des compétences psychosociales est un "travail sur le temps long", s'étalant sur "une voire deux générations". L'enjeu pour l'Éducation nationale est donc de "gérer à la fois l'urgence et le travail sur le temps long".

      2. Le Lien Indissociable entre Émotions et Apprentissages : Fondements Scientifiques

      L'intégration des émotions à l'école, souvent perçue comme une perturbation, est en réalité un prérequis fondamental pour l'apprentissage, soutenu par des décennies de recherche scientifique.

      Les Neurosciences Affectives et Sociales comme Base : Catherine Gueguen, pédiatre, insiste sur le fait que l'importance des compétences émotionnelles et sociales "ne sont pas des opinions ni des croyances, c'est fondé sur des recherches scientifiques".

      Ces recherches montrent que l'empathie favorise le développement global du cerveau, à la fois intellectuel et affectif.

      Le Rôle Crucial de l'Empathie dans le Développement Cérébral : Des études précises sont citées pour étayer ce propos :

      ◦ Une étude hollandaise a montré que chez les enfants de 7 ans dont les parents sont empathiques, "toute la substance grise du cerveau se développe" avec un "épaississement du cortex préfrontal".  

      ◦ La bienveillance et l'empathie développent le cortex orbitofrontal, siège de fonctions humaines essentielles : l'empathie, la gestion des émotions, la capacité à faire des choix et le sens éthique et moral.  

      ◦ À l'inverse, les punitions et humiliations (physiques ou verbales) entravent le développement de ce cortex.

      L'Adulte comme Modèle par Imprégnation : La formation doit prioritairement concerner les adultes de la communauté éducative, car "les enseignants sont des modèles très puissants pour les enfants". Une fois les enseignants formés, les enfants "vont imiter par imprégnation".

      3. La Communication NonViolente (CNV) comme Levier de Transformation

      La CNV est présentée comme une approche pratique et profonde pour mettre en œuvre le développement des compétences socio-émotionnelles.

      Un Changement de Regard : des Comportements aux Besoins : Catherine Schmid-Gherardi explique que le principe fondamental de la CNV est de comprendre que "toute parole et tout comportement sert à nourrir un besoin".

      Cette prise de conscience permet de "beaucoup moins prendre les paroles et les comportements contre nous" et de voir derrière une maladresse une tentative de prendre soin de soi.

      La Régulation Émotionnelle pour Libérer l'Espace d'Apprentissage : Lorsque l'enfant est submergé par une émotion, "ça prend toute la place et [...] il n'y a pas d'espace pour que les apprentissages se fassent".

      Accueillir et nommer l'émotion (la sienne ou celle de l'élève) permet de la libérer et de rendre "l'espace à nouveau ouvert pour les apprentissages". La CNV lie l'émotion à un besoin, ce qui permet à l'enfant de devenir "proactif" et autonome.

      La Gestion des Conflits et la Désescalade : La CNV transforme la gestion des conflits en déplaçant le focus de la recherche du "coupable" ("qui a commencé ?") vers l'écoute empathique de deux individus en souffrance.

      En accueillant les émotions et besoins de chacun, on "désamorce les tensions" et on amène les élèves à "trouver des solutions par eux-mêmes".

      4. Redéfinir l'Autorité : De la Puissance sur l'Autre à l'« Auctoritas »

      La table ronde a unanimement rejeté l'idée que l'empathie serait incompatible avec l'autorité, proposant au contraire une vision plus mature et efficace de celle-ci.

      Compatibilité entre Empathie et Cadre : Catherine Gueguen précise que l'empathie "n'a strictement rien à voir avec le laxisme".

      L'adulte doit transmettre des valeurs, savoir dire non et rappeler ce qui est "permis ou interdit", mais il peut le faire "en comprenant les émotions et les besoins de l'enfant et sans l'humilier".

      Équilibre entre Verticalité et Horizontalité : Patrice Noy, enseignant, témoigne de la recherche d'un équilibre entre la "verticalité" (être garant du cadre et transmettre un savoir) et une "horizontalité" pour créer "une relation à l'élève qui lui permet un épanouissement de l'autonomie".

      L'Enseignant comme Figure d'Inspiration : Véronique Gaspard distingue le "pouvoir sur l'autre" de l'autorité que les élèves "accordent" à un adulte inspirant.

      Elle cite une autrice : "partout où il y a violence, il y a perte d'autorité". L'enjeu est de faire en sorte que les adultes deviennent inspirants et donnent envie aux jeunes de grandir à leur contact.

      5. La Posture de l'Adulte : Point de Départ du Changement

      Le succès de l'intégration des CPS et de la CNV dépend avant tout du travail que les adultes font sur eux-mêmes.

      L'Auto-Empathie et la Responsabilité Émotionnelle : Patrice Noy explique que la CNV lui a permis de prendre conscience que "mes émotions m'appartenaient" et de ne pas "en faire porter la responsabilité aux autres".

      La formation offre "ce temps entre la colère qui peut monter [...] et de voir d'où ça vient", permettant d'éviter de pénaliser l'acte pédagogique. Il souligne également l'importance de savoir s'excuser ("être désolé") après avoir "débordé".

      L'Impact de la Formation sur le Bien-Être des Enseignants : Catherine Gueguen rapporte que les études montrent que lorsque les enseignants sont formés, "ils vont beaucoup mieux, ils se sentent plus compétents [...] et ensuite ça prévient le burnout".

      L'Exemplarité au Sein de Toute la Communauté Éducative :

      Catherine Piel, ancienne personnel de direction, insiste : "si on veut que les enseignants le fassent vivre à leurs élèves, ça me semble indispensable que les personnels de direction le fassent vivre aussi à leur équipe".

      François Moutapa ajoute qu'il y a une "urgence à ce que l'ensemble des adultes de la communauté éducative, particulièrement les personnels administratifs, soit exposé et sensibilisé".

      L'impact est systémique : l'amélioration des relations entre adultes a un effet direct sur le climat scolaire et le comportement des élèves.

      6. Stratégies de Déploiement : La Formation comme Clé de Voûte

      Le déploiement de ces compétences passe par une stratégie de formation réfléchie, systémique et ambitieuse.

      Une Approche Volontaire et non Injonctive : Plusieurs intervenants, dont Véronique Gaspard, insistent sur le fait que la formation doit partir d'un "espace choisi" et non d'une injonction. Ces approches étant "bouleversantes", il est crucial que les personnes soient prêtes à s'engager dans ce processus.

      L'Importance des Formations Inter-Catégorielles : Le format privilégié est celui qui rassemble différents corps de métier d'un même établissement (enseignants, direction, administratifs, etc.).

      Patrice Noy et Catherine Schmid-Gherardi soulignent que cela crée "une entité qui partage des expériences en local" et une "qualité de relation" qui transforme la dynamique d'équipe.

      Vers une Intégration dans la Formation Initiale (INSPÉ) : Un consensus fort émerge sur la nécessité d'intégrer ces compétences "dès la formation initiale".

      Les enseignants formés plus tardivement déplorent unanimement de ne pas avoir eu ces outils plus tôt, ce qui leur aurait permis de "gagner du temps" et d'adopter directement une posture plus constructive.

      L'Écosystème de Formation : La table ronde met en lumière les initiatives et collaborations existantes pour déployer ces formations sur le territoire.

      | Organisation/Programme | Rôle et Initiatives | | --- | --- | | Réseau Canopé | Publication de l'ouvrage de référence, organisation de conférences et d'ateliers sur la plateforme Canotech et dans les ateliers Canopé (Paris, Alençon, Arras). | | Académie de Paris (Labs'Orbonne, EFC) | Partenariat fort avec Canopé pour construire des formations d'ampleur, professionnaliser les acteurs et créer un pôle de formateurs experts. | | Santé publique France | Mandaté pour le déploiement national des CPS, avec un travail de recensement et de formation par région et par académie. | | Déclic CNV | Association visant à rendre chaque académie autonome avec un pôle de formateurs qualifiés en CNV pour répondre aux demandes des personnels. |

    1. Juego de roles (role-play): Imagina que tú trabajas en uno de estos puestos de comida, y tu compañero de clase viene a comprar algo de comer. Necesitan hablar de qué quiere, cómo quiere su comida, si quiere algo más y cuánto cuesta. Intenta no usar nada de inglés al hacer la transacción. Entonces tú vas a comprar algo de tu compañero de clase de otro puesto de comida.

      ¡Hola! Bienvenido. ¿Qué quieres comer hoy?Quiero una arepa con queso, por favor.¿La quieres con un poco de salsa picante o sin salsa?Sin salsa, gracias. Yo: Muy bien. ¿Quieres algo de beber también? Compañero: Sí, un refresco, por favor. Perfecto. Son cinco dólares en total. Compañero:Aquí tiene. Gracias. ¡Gracias a ti! Que disfrutes tu comida.

      Ahora yo voy a comprar algo del puesto de salchipapas de mi compañero: Hola, quiero una porción de salchi papas, por favor. ¿Con salsa de ajo o ketchup? Con salsa de ajo, gracias. ¿Quieres algo de beber? Sí, un jugo de naranja, por favor. Muy bien. Son seis dólares en total. Aquí tiene. Gracias. ¡Gracias! Que disfrutes.

    2. Compara estos puestos de comida callejera rápida con la comida rápida en tu pueblo. ¿Cuál es más similar? ¿Cuándo comes este tipo de comida? ¿Hay días feriados o eventos cuando comes más comida callejera?

      Los puestos de comida callejera rápida que vemos en las imágenes son diferentes a la comida rápida de mi pueblo. En mi pueblo, la comida rápida suele ser hamburguesas, pizzas o papas fritas, mientras que en los puestos hispanos venden arepas, empanadas o salchipapas.

      Este tipo de comida callejera lo como cuando quiero algo rápido o diferente, como en la calle o en festivales. También como más comida callejera en días feriados o en fiestas locales, porque hay más puestos y es parte de la celebración.

    3. ¿Cuáles de las comidas en el mapa vas a comer hoy o en los próximos días? ¿En cuáles platos o recetas vas a comer esas comidas? (e.g. Voy a comer patatas en papas fritas.)

      En los próximos días voy a comer patatas en papas fritas y en puré de patatas. Voy a comer arroz en arroz con verduras. Voy a comer tomates en ensalada y en salsa de pasta. Voy a comer lechuga en una ensalada y voy a comer manzanas como merienda o en un postre.

    4. ¿Cuáles de las comidas en el mapa te gustan y cuáles no te gustan?

      En el mapa hay muchas comidas interesantes. Me gustan las patatas, el arroz, las manzanas, las naranjas y el maíz porque los como muy a menudo y tienen buen sabor. No me gustan mucho las cebollas crudas ni el azúcar en exceso, porque el sabor es muy fuerte o demasiado dulce para mí.

    5. ¿De dónde son las fresas? ¿Y el maíz? ¿Y los tomates? ¿Y las manzanas? ¿Y las naranjas? ¿Y las cebollas? ¿Y el arroz? ¿Y la lechuga? ¿Y el azúcar? ¿Y las patatas?

      Las fresas son de Europa. El maíz es de América, especialmente de México. Los tomates son de América, de la región de México y Perú. Las manzanas son de Asia, especialmente de Asia Central. Las naranjas son de Asia, principalmente de China. Las cebollas son de Asia. El arroz es de Asia. La lechuga es de la región del Mediterráneo. El azúcar es de India, donde se cultivaba la caña de azúcar. Las patatas son de América, especialmente de Perú.

    6. Imagina que vas a tener una fiesta en tu clase de español. ¿Quieres servir este plato en la fiesta?

      Sí, quiero servir este plato en la fiesta porque el gazpacho es un plato tradicional de España y es refrescante. Creo que a mis compañeros les va a gustar y es una buena forma de compartir la cultura española.

    7. ¿Tienes un plato similar en la comida tradicional de tu pueblo?

      En la comida tradicional de mi pueblo no hay un plato exactamente igual al gazpacho, pero sí tenemos sopas frías o platos con tomate que se comen en verano. Son un poco parecidos, pero el gazpacho es único por su sabor y su forma de preparación.

    8. ¿Te gustan todos los ingredientes de la receta? ¿Te va a gustar ese plato?

      Sí, me gustan casi todos los ingredientes de la receta, como los tomates, el pepino, el pimiento verde y el aceite de oliva. El ajo crudo es un poco fuerte para mí, pero creo que en general sí me va a gustar ese plato, especialmente en un día caluroso.

    9. ¿Cuál es el ingrediente principal de tu receta? ¿Puedes encontrar el ingrediente en el supermercado de tu barrio?

      El ingrediente principal de mi receta de gazpacho es el tomate. Sí, puedo encontrar tomates fácilmente en el supermercado de mi barrio, especialmente en la sección de frutas y verduras.

    1. La Métacognition : Stratégies pour des Apprentissages Réussis

      Résumé Exécutif

      Ce document de synthèse analyse les stratégies pédagogiques fondées sur la métacognition pour favoriser la réussite de tous les élèves.

      La métacognition est définie comme l'ensemble des processus par lesquels un individu régule ses propres activités cognitives, devenant ainsi le "pilote de sa cognition".

      Elle se décline en deux facettes principales : la métacognition explicite, qui est la connaissance consciente de ses propres processus d'apprentissage ("apprendre à apprendre"), et la métacognition implicite, qui repose sur les sentiments et la motivation intrinsèque.

      Face aux constats partagés de difficultés d'attention, d'oubli des savoirs et d'un manque de motivation chez les élèves, l'enseignement direct des stratégies métacognitives apparaît comme un levier puissant.

      Les approches concrètes incluent l'explication du fonctionnement du cerveau, la gestion de l'attention, la régulation de la mémorisation et le développement de la flexibilité cognitive pour résister aux automatismes.

      Un point central est la relation entre succès et motivation. Plutôt que de postuler que la motivation précède la réussite, les expériences de terrain suggèrent que c'est la réussite qui engendre la motivation et l'envie d'apprendre.

      En mettant les élèves en situation de succès, en leur proposant des tâches accessibles et en clarifiant les objectifs d'apprentissage, on crée un cercle vertueux d'engagement.

      Cette démarche ne constitue pas une révolution, mais une évolution des pratiques professionnelles vers un enseignement plus ciblé ("moins mais mieux") et un outil efficace pour lutter contre les inégalités scolaires.

      --------------------------------------------------------------------------------

      1. Fondements de la Métacognition

      La métacognition est présentée comme une méthode pédagogique efficace, s'appuyant sur la recherche, pour prévenir les difficultés scolaires et favoriser la réussite de tous les élèves.

      1.1. Définition et Capacités Clés

      La métacognition englobe l'ensemble des processus par lesquels un individu régule son apprentissage.

      Selon Frédéric Guy, chargé de mission au Cézanne, cela inclut les capacités à :

      • Réguler son attention

      • Choisir de s'informer

      • Planifier et résoudre un problème

      • Repérer et corriger ses propres erreurs

      Ces processus permettent de prédire la faisabilité d'une tâche et d'évaluer ses propres performances. Ils reposent sur quatre capacités fondamentales :

      1. Fixer des buts et identifier les actions nécessaires pour les atteindre.

      2. Détecter et identifier les erreurs pour y remédier.

      3. Évaluer ses résultats et ses conclusions.

      4. Réviser les stratégies utilisées.

      1.2. Les Deux Facettes de la Métacognition

      Il est essentiel de distinguer deux aspects complémentaires de la métacognition :

      | Type de Métacognition | Description | Caractéristiques | | --- | --- | --- | | Explicite (ou Déclarative) | L'approche classique de la "cognition sur la cognition". C'est la capacité de l'élève à verbaliser ses stratégies et ses connaissances sur l'apprentissage. | • Consciente et conceptuelle.<br>• Repose sur des méta-représentations (ex: "pour apprendre, je dois faire cela").<br>• Concerne les perceptions sur les tâches ("c'est difficile") ou sur soi ("je suis bon en maths"). | | Implicite | Une régulation qui se fait sur la base de sentiments dédiés à l'apprentissage.

      Elle est liée à la motivation et à l'évaluation intuitive de l'effort à fournir. | • Basée sur des sentiments et des intuitions.<br>• Moins consciente, plus automatique.<br>• Influence directement la motivation et l'engagement. |

      2. Pistes Pédagogiques pour la Métacognition Explicite

      L'objectif est de donner aux élèves les outils pour devenir autonomes dans leur apprentissage.

      La citation clé de Marie Bridenne, Conseillère Pédagogique, résume cette ambition :

      « Développer ses compétences métacognitives, c’est devenir pilote de sa cognition. »

      2.1. Comprendre le Fonctionnement du Cerveau

      Pour que les élèves puissent réguler leur cognition, il faut d'abord qu'ils en comprennent les mécanismes de base.

      Action : Parler du cerveau en classe, à tous les niveaux, et questionner les élèves sur leurs représentations ("A-t-on tous le même cerveau ?", "Comment fonctionne-t-il ?").

      Outils : Utilisation de ressources pédagogiques comme les ouvrages Découvrir le cerveau à l'école (Canopé), _Kididoc :

      Explore ton cerveau_, ou C'est (pas) moi, c'est mon cerveau !.

      2.2. Gérer et Adapter son Attention

      L'attention est une ressource limitée qui doit être maîtrisée.

      Action : Mettre en place des programmes attentionnels pour faire découvrir aux élèves ce qu'est l'attention, ses limites, et comment la maîtriser de façon autonome (équilibre attentionnel, retour au calme).

      Outils : Programmes structurés comme ATOLE (Apprendre l'ATtention à l'écOLE) pour les cycles 2 et 3, et ADOLE pour le collège et le lycée.

      2.3. Réguler les Processus de Mémorisation

      La mémorisation efficace repose sur trois piliers : comprendre, se questionner, répéter.

      Action : Mettre en place des routines et des outils pour structurer la mémorisation et la révision.

      Outils :

      Fiches mémo pour synthétiser les savoirs.  

      Cartes quiz rédigées par les élèves pour s'auto-interroger.  

      Boîtes de Leitner pour organiser la répétition espacée des notions.  

      Calendrier de reprises expansées pour planifier les révisions.

      2.4. Résister aux Automatismes et Être Flexible

      Apprendre, c'est acquérir des automatismes, mais c'est aussi savoir y résister pour progresser.

      Action : Entraîner les élèves à inhiber leurs réflexes pour développer de nouvelles stratégies, un regard critique et une plus grande tolérance à l'erreur.

      Exemples :

      ◦ Comprendre que la lettre "O" ne produit pas systématiquement le son [o].    ◦ Changer de procédure en calcul mental (ex: pour ajouter 9, ajouter 10 puis retirer 1).

      3. Motivation et Métacognition Implicite : Le Cercle Vertueux de la Réussite

      La motivation est indispensable à l'engagement dans les tâches. Les sources soulèvent une question fondamentale :

      « Faut-il être motivé pour vouloir apprendre et réussir ? Ou faut-il réussir pour vouloir apprendre et se motiver ? » La réponse apportée par l'expérience de terrain est que la réussite est le principal moteur de la motivation.

      3.1. Les Levier pour Vouloir Apprendre

      Pour susciter l'envie, il est crucial de créer les conditions de la réussite et du plaisir d'apprendre.

      Mettre les élèves en réussite : Les buts de performance peuvent avoir des effets délétères en cas d'échec. Il faut donc concevoir des tâches que les élèves considèrent comme accessibles.

      Développer des projets motivants : Lier les apprentissages à des projets concrets et stimulants (rallyes mathématiques, balades lexicales, projet CNR "J'y arrive !").

      S'appuyer sur les 4 piliers de la motivation :

      Intérêt : Le plaisir pris à réaliser la tâche.  

      Importance : La valeur accordée à la tâche.  

      Effort : La perception du coût en énergie.   

      Succès : Le sentiment de compétence et la réussite effective.

      3.2. Les Levier pour Pouvoir Apprendre

      Donner aux élèves la capacité d'apprendre passe par la clarification du cadre et des objectifs.

      Clarifier les objectifs d'apprentissage : Différencier l'objectif réel de la consigne.

      L'élève doit comprendre ce qu'il est en train d'apprendre (ex : non pas "colorier une carte", mais "apprendre à réaliser une carte en respectant un code de couleurs").

      Structurer le temps et les activités : Utiliser un "Menu du jour" pour rendre les objectifs de la journée visibles et explicites.

      Verbaliser les apprentissages : Instaurer un "Journal des apprentissages" où l'élève note ce qu'il a compris ("J'ai compris que...").

      Cela aide à la prise de conscience et à l'appropriation des savoirs.

      4. Mise en Œuvre Stratégique

      L'intégration de la métacognition dans les pratiques pédagogiques doit être pensée de manière systémique et progressive.

      4.1. Exemple d'une Dynamique de Circonscription (2022-2025)

      | Année | Actions Clés | Objectifs | | --- | --- | --- | | 2022-2023 | • Conférences "Talents du cerveau".<br>• Séminaire sur les neuromythes et la flexibilité. | Développement d’une culture commune autour de la métacognition. | | 2023-2024 | • Diffusion auprès des équipes (conseils de maîtres).<br>• Ateliers pratiques (F. Guilleray).<br>• Séminaire sur les pratiques évaluatives. | Acculturation des enseignants et déploiement des outils. | | 2024-2025 | • Conseil-École-Collège sur les compétences attentionnelles et mémorielles.<br>• Projet CNR "J'y arrive" (accompagné par JF Chesné).<br>• Accompagnement des enseignants débutants. | Ancrage des pratiques et suivi des effets sur les élèves. |

      4.2. Une Évolution des Pratiques Professionnelles

      L'approche métacognitive n'est « pas une révolution mais une évolution des gestes professionnels ».

      Elle invite à une rationalisation des pratiques sous le principe « MOINS MAIS MIEUX », en se concentrant sur les stratégies qui ont le plus d'impact.

      Conclusion

      Enseigner les connaissances et les stratégies métacognitives est un levier puissant pour lutter contre les inégalités éducatives et favoriser la réussite scolaire de TOUS les élèves. En leur donnant les clés pour comprendre et réguler leur propre fonctionnement cognitif, l'école leur permet de passer d'un statut d'apprenant passif à celui d'acteur autonome et conscient de ses apprentissages. Cette démarche outille les élèves pour qu'ils puissent, tout au long de leur vie, apprendre de manière plus efficace et plus sereine.

    1. , la lista no ordenada se construirá a partir de una colección de nodos, cada uno vinculado al siguiente mediante referencias explícitas. Siempre y cuando sepamos dónde encontrar el primer nodo (que contiene el primer ítem), cada ítem posterior se puede encontrar sucesivamente siguiendo los enlaces subsiguientes. Con esto en mente, la clase ListaNoOrdenada debe mantener una referencia al primer nodo. El Programa 2 muestra el constructor. Tenga en cuenta que cada objeto de la lista mantendrá una sola ref

      la otra parte de la implementación de una lista enlazada, consiste en construir esa colección de nodos cada uno vinculado, para ello se crea la clase ListaEnlazada con referencia al primer nodo, es decir a la cabeza, que contiene el primer ítem de la lista. A su vez este nodo contiene la referencia al siguiente nodo y asi sucesivamente.

    1. Author response:

      General Statements

      In this manuscript we characterize an exquisitely reproducible model of iPSC differentiation into neuroepithelial cells, use it to mechanistically study cell shape changes and planar cell polarity signaling activation during this transition, then apply it to identify patient-specific cell deficiencies in both forward and reverse genetic screens as a power tool for patient-stratification in personalized medicine. To our knowledge, we provide the first evidence of a human pathogenic mutation directly impairing apical constriction: an evolutionarily conserved behavior of epithelial cells which is the subject of intense research. 

      We are very pleased with the balanced and rigorous reviews generated through Review Commons, which we have already used to improve our manuscript. Reviewer 1 highlights that our study “is significant not only for verifying the cell behaviors necessary for neural tube closure in a human iPSC model, but also for establishing a robust assay for the functional testing of NTD-associated sequence variants.” Reviewer 2 agrees that “results are solid and convincing, the data are quantitative, and the manuscript is well written”, and that our “derivation of patient lines from amniotic fluid and execution of experiments before birth is a remarkable demonstration that points toward precision-medicine applications, while motivating rescue strategies and additional clean genetic models.” Reviewer 3 is “enthusiastic about this work and believe it represents a significant step forward in the effort to establish precision medicine approaches for diagnoses of the patient-specific causative cellular defects underlying human neural tube closure defects.” 

      Below, we have replied to each of the reviewers’ comments.

      Description of the planned revisions

      R2.2. Lines 156-166. The authors claim that changes in gene expression precede morphological changes. I am not convinced this is supported by their data. Fig. 1g (epithelial thickness) and Fig. 1k (PAX6 expression) seem to have similar dynamics. The authors can perform a cross-correlation between the two plots to see which Δt gives maximum correlation. If Δt < 0, then it would suggest that gene expression precedes morphology, as they claim. Fig. 1j shows that NANOG drops before the morphological changes, but loss of NANOG is not specific to neural differentiation and therefore should not be related to the observed morphological changes.

      We are happy to do this analysis fully in revision. Our initial analysis performing crosscorrelation between apical area and CDH2 protein in one line shows the highest crosscorrelation at Δt = -1, suggesting neuroepithelial CDH2 increases before apical area decreases. In contrast, the same analysis comparing apical area versus PAX6 shows Δt = 0, suggesting concurrence. This analysis will be expanded to include the other markers we quantified and the manuscript text amended accordingly. We are keen to undertake additional experiments to test whether these cells swap their key cadherins – CDH1 and CDH2 - before they begin to undergo morphological changes (see the response to Reviewer 3’s minor comment 1 immediately below).

      R3.1(Minor) There seems to be a critical window at day 5 of the differentiation protocol, both in terms of cell morphology and the marker panel presented in Figure 1i. Do the authors have any data spanning the hours from day 5 to 6? If not, I don't think they need to generate any, but do I think this is a very interesting window worthy of further discussion for a couple of reasons. First, several studies of mouse neural tube closure have shown that various aspects of cell remodeling are temporally separable. For example, between Grego-Bessa et al 2016 and Brooks et al 2020 we can infer that apicobasal elongation rapidly increases starting at E8.5, whereas apical surface area reduction and constriction are apparent somewhat earlier at E8.0. I think it would be interesting to see if this separability is conserved in humans. Second, is there a sense of how the temporal correlation between the pluripotent and early neural fate marker data presented here corroborate or contradict the emerging set of temporally resolved RNA seq data sets of mouse development at equivalent early neural stages?

      Cell shape analysis between days 5 and 6 has now been added (see the response to point 2.1 below). As the reviewer predicted, this is a transition point when apical area begins to decrease and apicobasal elongation begins to increase.

      We also thank the reviewer for this prompt to more closely compare our data to the previous mouse publications, which we have added to the discussion. The Grego-Bessa 2016 paper appears to show an increase in thickness between E7.75 and E8.5, but these are not statistically compared. Previous studies showed rapid apicobasal elongation during the period of neural fold elevation, when neuroepithelial cells apically constrict. This has now been added to the discussion: 

      Discussion: “In mice, neuroepithelial apicobasal thickness is spatially-patterned, with shorter cells at the midline under the influence of SHH signalling[14,77,78]. Apicobasal thickness of the cranial neural folds increases from ~25 µm at E7.75 to ~50 µm at E8.5[79]: closely paralleling the elongation between days 2 and 8 of differentiation in our protocol. The rate of thickening is non-uniform, with the greatest increase occurring during elevation of the neural folds[80], paralleled in our model by the rapid increase in thickness between days 4-6 as apical areas decrease. Elevation requires neuroepithelial apical constriction and these cells’ apical area also decreases between E7.75 and E8.5 in mice[79], but we and others have recently shown that this reduction is both region and sex-specific[14,81]. Specifically, apical constriction occurs in the lateral (future dorsal) neuroepithelium: this corresponds with the identity of the cells generated by the dual SMAD inhibition model we use[56]. More recently, Brooks et al[82] showed that the rapid reduction in apical area from E8-E8.5 is associated with cadherin switching from CDH1 (E-cadherin) to CDH2 (N-cadherin). This is also directly paralleled in our human system, which shows low-level co-expression of CDH1 and CDH2 at day 4 of differentiation, immediately before apical area shrinks and apicobasal thickness increases.”

      Prompted by the in vivo data in Brooks et al (2025)[82], we are keen to further explore the timing of CDH1/CDH2 switching versus apical constriction with new experimental data in revisions.

      R3.2(Minor) 2) Can the authors elaborate a bit more on what is known regarding apicobasal thickening and pseudo-stratification and how their work fits into the current understanding in the discussion? This is a very interesting and less well studied mechanism critical to closure, which their model is well suited to directly address. I am thinking mainly of the Grego-Bessa at al., 2016 work on PTEN, though interestingly the work of Ohmura et al., 2012 on the NUAK kinases also shows reduced tissue thickening (and apical constriction) and I am sure I have missed others. Given that the authors identify MED24 as a likely candidate for the lack of apicobasal thickening in one of their patient derived lines, is there any evidence that it interacts with any of the known players?

      We have now added further discussion on the mechanisms by which the neuroepithelium undergoes apicobasal elongation. Nuclear compaction is likely to be necessary to allow pseudostratification and apicobasal elongation. The reviewer’s comment has led us to realise that diminished chromatin compaction is a potential outcome of MED24 down-regulation in our GOSB2 patient-derived line. Figure 4D suggests the nuclei of our MED24 deficient patientderived line are less compacted than control equivalents and we propose to quantify nuclear volume in more detail to explore this possibility.

      Additionally, we have already expanded our discussion as suggested by the reviewer:

      Discussion: “Mechanistic separability of apical constriction and apicobasal elongation is consistent with biomechanical modelling of Xenopus neural tube closure showing that both are independently required for tissue bending[61]. Nonetheless, neuroepithelial apical constriction and apicobasal elongation are co-regulated in mouse models: for example, deletion of Nuak1/2[83], Cfl1[84], and Pten[79] all produce shorter neuroepithelium with larger apical areas. Neuroepithelial cells of the GOSB2 line described here, which has partial loss of MED24, similarly produces a thinner neuroepithelium with larger apical areas. Although apical areas were not analysed in mouse models of Med24 deletion, these embryos also have shorter and non-pseudostratified neuroepithelium.

      Our GOSB2 line – which retains readily detectable MED24 protein – is clearly less severe than the mouse global knockout, and the clinical features of the patient from which this line was derived are milder than the phenotype of Med24 knockout embryos[68]. Mouse embryos lacking one of Med24’s interaction partners in the mediator complex, Med1, also have thinner neuroepithelium and diminished neuronal differentiation but successfully close their neural tube[85]. As general regulators of polymerase activity, MED proteins have the potential to alter the timing or level of expression of many other genes, including those already known to influence pseudostratification or apicobasal elongation. MED depletion also causes redistribution of cohesion complexes[86] which may impact chromatin compaction, reducing nuclear volume during differentiation.”

      R3.3(Minor) 3) Is there any indication that Vangl2 is weakly or locally planar polarized in this system? Figure 2F seems to suggest not, but Supplementary Figure 5 does show at least more supracellular cable like structures that may have some polarity. I ask because polarization seems to be one of the properties that differs along the anteroposterior axis of the neural plate, and I wonder if this offers some insight into the position along the axis that this system most closely models?

      VANGL2 does not appear to be planar polarised in this system. This is similar to the mouse spinal neuroepithelium, in which apical VANGL2 is homogenous but F-actin is planar polarised (Galea et al Disease Models and Mechanisms 2018). We do observe local supracellular cablelike enrichments of F-actin in the apical surface of iPSC-derived neuroepithelial cells:

      Author response image 1.

      Preliminary identification of apical supracellular cables suggestive of local polarity. Top: F-actin staining shown in inverted grey LUT highlighting enrichment along directionally-polarised cell borders (blue arrows). Bottom: Staining orientation (blue ~ X axis, red ~ Y axis) based on OrientationJ analysis illustrating localised organisation of F-actin enrichment.

      We propose to compare the length of F-actin cables and coherency of their orientation at the start and end of neuroepithelial differentiation, and in wild-type versus VANGL2mutant epithelia.

      Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1:

      Major points

      (1) It is mentioned throughout the manuscript that 3 plates were evaluated per line. I believe these are independently differentiated plates. This detail is critical concerning rigor and reproducibility. This should be clearly stated in the Methods section and in the first description of the experimental system in the Results section for Figure 1.

      These experimental details have now been clarified. Unless otherwise stated, all findings were confirmed in three independently differentiated plates from the same line or at least one differentiation from each of three lines. 

      Methods: Unless otherwise stated, for each iPSC line three independently differentiated plates were generated and analysed, with each plate representing a separate differentiation experiment performed on different days.

      (2) For the patient-specific lines - how many lines were derived per patient?

      This has now been clarified in the methods. Microfluidic reprogramming of a small number of amniocytes produces one line per patient representing a pool of clones. Subcloning from individual cells would not be possible within the timeframe of a pregnancy. 

      Methods: For patient-specific iPSC lines, one independent iPSC line was obtained per patient following microfluidic mmRNA reprogramming.

      (3) Was the Vangl2 variant introduced by prime editing? Base editing? The details of the methods are sparse.

      We have now expanded these details:

      Methods: “VANGL2 knock-in lines were generated using CRSIPR-Cas9 homology directed repair editing by Synthego (SO-9291367-1). The guide sequence was AUGAGCGAAGGGUGCGCAAG and the donor sequence was CAATGAGTACTACTATGAGGAGGCTGAGCATGAGCGAAGGGTGTGCAAGAGGAGGGCCAGGTGGGTCCCTGGGGGAGAAGAGGAGAG.

      Sequence modification was confirmed by Sanger sequencing before delivery of the modified clones, and Sanger sequencing was repeated after expansion of the lines (Supplementary Figure 5) as well as SNP arrays (Illumina iScan, not shown) confirming genomic stability.”

      Author response image 2.

      Snapshot of Illumina iScan SNP array showing absence of chromosomal duplications or deletions in the CRISPR-modified VANGL2-knockin lines or their congenic control.

      (4) Suggested text changes.

      Some additional suggestions for improvement.

      The abstract could be more clearly written to effectively convey the study's importance. Here are some suggestions

      Line 26: Insert "apicobasal" before "elongation" - the way it is written, I initially interpreted it as anterior-posterior elongation.

      Line 29: Please specify that the lines refer to 3 different established parent iPSC lines with distinct origins and established using different reprogramming methods, plus 2 control patient-derived lines. - The reproducibility of the cell behaviors is impressive, but this is not captured in the abstract.

      Line 32: add that this mutation was introduced by CRISPR-Cas9 base/prime editing.

      The last sentence of the abstract states that the study only links apical constriction to human NTDs, but also reveals that neural differentiation and apical-basal elongation were found. The introduction could also use some editing.

      Line 71: insert "that pulls actin filaments together" after "power strokes" Line 73: "apically localized," do you mean "mediolaterally" or "radially"?

      Line 75: Can you specify that PCP components promote "mediolaterally orientated" apical constriction Lines 127: Specify that NE functions include apical basal elongation and neurodifferentiation are disrupted in patient-derived models

      All have now been corrected.

      Reviewer #2:

      Major comments:

      (1) Figure 1. The authors use F-actin to segment cell areas. Perhaps this could be done more accurately with ZO-1, as F-actin cables can cross the surface of a single cell. In any case, the authors need to show a measure of segmentation precision: segmented image vs. raw image plus a nuclear marker (DAPI, H2B-GFP), so we can check that the number of segmented cells matches the number of nuclei.

      We used ZO-1 to quantify apical areas of the VANGL2-konckin lines in Figure 3. Segmentation of neuroepithelial apical areas based on F-actin staining is commonplace in the field (e.g. in the Brooks et al 2022 paper cited by another reviewer), and is generally robust because the cell junctions are much brighter than any apical fibres not associated with the apical cortex. However, we accept that at earlier stages of differentiation there may be more apical fibres when cells are cuboidal. We have therefore repeated our analysis of apical area using ZO-1 staining as suggested, analysing a more temporally-detailed time course in one iPSC line. This new analysis confirms our finding of lack of apical area change between days 2-4 of differentiation, then progressive reduction of apical area between days 4-8, further validating our system. Including nuclear images is not helpful because of the high nuclear index of pseudostratified epithelia (e.g. see Supplementary Figure 7) which means that nuclei overlap along the apicobasal axis. Individual nuclei cannot be related to their apical surface in projected images.

      (3) Figure 2d. The laser ablation experiment in the presence of ROCK inhibitor is clear, as I can easily see the cell outlines before and after the experiment. In the absence of ROCK inhibitor, the cell edges are blurry, and I am not convinced the outline that the authors drew is really the cell boundary. Perhaps the authors can try to ablate a larger cell patch so that the change in area is more defined.

      The outlines on these images are not intended to show cell boundaries, but rather link landmarks visible at both timepoints to calculate cluster (not cell) change in area. This is as previously shown in Galea et al Nat Commun 2021 and Butler et al J Cell Sci 2019. We have now amended the visualisation of retraction to make representation of differences between conditions more intuitive. 

      (4) Figure 2d. Do the cells become thicker after recoil?

      This is unlikely because the ablated surface remains in the focal plane. Unfortunately, we are unable to image perpendicularly to the direction of ablation to test whether their apical surface moves in Z even by a very small amount. This has now been clarified in the results:

      Results: “The ablated surface remained within the focal plane after ablation, indicating minimal movement along the apical-basal axis.”

      (6) Lines 403-415. The authors report poor neural induction and neuronal differentiation in GOSB2. As far as I understand, this phenotype does not represent the in vivo situation. Thus, it is not clear to what extent the in vitro 2D model describes the human patient.

      The GOSB2 iPSC line we describe does represent the in vivo situation in Med24 knockout mouse embryos, but is clearly less severe because we are still able to detect MED24 protein expressed in this line. We do not have detailed clinical data of the patient from which this line was obtained to determine whether their neurological development is normal. However, it is well established that some individuals who have spina bifida also have abnormalities in supratentorial brain development. It is therefore likely that abnormalities in neuron differentiation/maturation are concomitant with spina bifida. Our findings in the GOSB2 line complement earlier studies which also identified deficiencies in the ability of patient-derived lines to form neurons, but were unable to functionally assess neuroepithelial cell behaviours we studied. This has now been clarified in the discussion:

      Discussion: “Neuroepithelial cells of the GOSB2 line described here, which has partial loss of MED24, similarly produces a thinner neuroepithelium with larger apical areas. Although apical areas were not analysed in mouse models of Med24 deletion, these embryos also have shorter and non-pseudostratified neuroepithelium. 

      Our GOSB2 line – which retains readily detectable MED24 protein – is clearly less severe than the mouse global knockout, and the clinical features of the patient from which this line was derived are milder than the phenotype of Med24 knockout embryos[68].

      Mouse embryos lacking one of Med24’s interaction partners in the mediator complex, Med1, also have thinner neuroepithelium and diminished neuronal differentiation but successfully close their neural tube[85].”

      (7) The experimental feat to derive cell lines from amniotic fluid and to perform experiments before birth is, in my view, heroic. However, I do not feel I learned much from the in vitro assays. There are many genetic changes that may cause the in vivo phenotype in the patient. The authors focus on MED24, but there is not enough convincing evidence that this is the key gene. I would like to suggest overexpression of MED24 as a rescue experiment, but I am not sure this is a single-gene phenotype. In addition, the fact that one patient line does not differentiate properly leads me to think that the patient lines do not strengthen the manuscript, and that perhaps additional clean mutations might contribute more.

      We appreciate the reviewer’s praise of our personalised medicine approach and fully agree that neural tube defects are rarely monogenic. The patient lines we studied were not intended to provide mechanistic insight, but rather to demonstrate the future applicability of our approach to patient care. Our vision is that every patient referred for fetal surgery of spina bifida will have amniocytes (collected as part of routine cystocentesis required before surgery) reprogrammed and differentiated into neuroepithelial cells, then neural progenitors, to help stratify their postnatal care. One could also picture these cells becoming an autologous source for future cellbased therapies if they pass our reproducible analysis pipeline as functional quality control. This has now been clarified in the discussion:

      Discussion: “The multi-genic nature of neural tube defect susceptibility, compounded by uncontrolled environmental risk factors (including maternal age and parity[102]), mean that patient-derived iPSC models are unlikely to provide mechanistic insight. They do provide personalised disease models which we anticipate will enable functional validation of genetic diagnoses for patients and their parents’ recurrence risk in future pregnancies, and may eventually stratify patients’ postnatal care. We also envision this model will enable quality control of patient-derived cells intended for future autologous cell replacement therapies, as is being developed in post-natal spinal cord injury[103]. Thus, the highly reproducible modelling platform we evaluate – which is robust to differences in iPSC reprogramming method, sex and ethnicity – represents a valuable tool for future mechanistic insights and personalised disease modelling applications.”

      Significance:

      In addition, the model was unsuccessful in one of the two patient-derived lines, which limits generalizability and weakens claims of patient-specific predictive value.

      We disagree with the reviewer that “the model was unsuccessful in one of the two patientderived lines”. The GOSB1 line demonstrated deficiency of neuron differentiation independently of neuroepithelial biomechanical function, whereas the GOSB2 line showed earlier failure of neuroepithelial function. We also do not, at this stage, make patient-specific predictive claims: this will require longer-term matching of cell model findings with patient phenotypes over the next 5-10 years.  

      Reviewer #3:

      Major comments

      (1) One of my few concerns with this work is that the relative constriction of the apical surface with respect to the basal surface is not directly quantified for any of the experiments. This worry is slightly compounded by the 3D reconstructions Figure 1h, and the observation that overall cell volume is reduced and cell height increased simultaneously to area loss. Additionally, the net impact of apical constriction in tissues in vivo is to create local or global curvature change, but all the images in the paper suggest that the differentiated neural tissues are an uncurved monolayer even missing local buckles. I understand that these cells are grown on flat adherent surfaces limiting global curvature change, but is there evidence of localized buckling in the monolayer? While I believe-along with the authors-that their phenotypes are likely failures in apical constriction, I think they should work to strengthen this conclusion. I think the easiest way (and hopefully using data they already have) would be to directly compare apical area to basal area on a cell wise basis for some number of cells. Given the heterogeneity of cells, perhaps 30-50 cells per condition/line/mutant would be good? I am open to other approaches; this just seems like it may not require additional experiments.

      As the reviewer observes, our cultures cannot bend because they are adhered on a rigid surface. The apical and basal lengths of the cultures will therefore necessarily be roughly equal in length. Some inwards bending of the epithelium is expected at the edges of the dish, but these cannot be imaged. The live imaging we show in Figure 2 illustrates that, just as happens in vivo, apical constriction is asynchronous. This means not all cells will have ‘bottle’ shapes in the same culture. We now illustrate the evolution of these shapes in more detail in Supplementary Figure 1.

      Additionally, the reviewer’s comment motivated us to investigate local buckles in the apical surface of our cultures when their apical surfaces are dilated by ROCK inhibition. We hypothesised that the very straight apical surface in normal cultures is achieved by a balance of apical cell size and tension with pressure differences at the cell-liquid interface. Consistent with our expectation, the apical surface of ROCK-inhibited cultures becomes wrinkled (Supplementary figure 4). The VANGL2-KI lines do not develop this tortuous apical surface (as shown in Figure 3), which is to be expected given their modification is present throughout differentiation unlike the acute dilation caused by ROCK inhibition.

      This new data complements our visualisation of apical constriction in live imaging, apical accumulation of phospho-myosin, and quantification of ROCK-dependent apical tension as independent lines of evidence that our cultures undergo apical constriction. 

      (2) Another slight experimental concern I have regards the difference in laser ablation experiments detailed in Figure 3h-i from those of Figure 2d-e. It seems like WT recoil values in 3h-I are more variable and of a lower average than the earlier experiments and given that it appears significance is reached mainly by impact of the lower values, can the authors explain if this variability is expected to be due to heterogeneity in the tissue, i.e. some areas have higher local tension? If so, would that correspond with more local apical constriction?

      There is no significant difference in recoil between the control lines in Figures 2 and 3, albeit the data in Figure 3 is more variable (necessitating more replicates: none were excluded). We also showed laser ablation recoil data in Supplementary Figure 10, in which we did identify a graphing error (now corrected, also no significant difference in recoil from the other control groups as shown in Author response image 3).

      Author response image 3.

      Recoil following laser ablation is not significantly different between different experiments. X axis labels indicate the figure panel each set of ablation data is shown in. Points represent an independent differentiation dish.

      (4)(Minor) I think some of the commentary on the strengths and limitations of the model found in the Results section should be collated and moved to the discussion in a single paragraph. For example, this could also briefly touch on/compare to some of the other models utilizing hiPSCs (These are mentioned briefly in the intro, but this comparison could be elaborated on a bit after seeing all the great data in this work).

      These changes have now been made:

      Discussion: “Some of these limitations, potentially including inclusion of environmental risk factors, can be addressed by using alternative iPSC-derived models[93,94]. For example, if patients have suspected causative mutations in genes specific to the surface (non-neural) ectoderm, such as GRHL2/3, 3D models described by Karzbrun et al[49] or Huang et al[95] may be informative. Characterisation of surface ectoderm behaviours in those models is currently lacking. These models are particularly useful for high-throughput screens of induced mutations[95], but their reproducibility between cell lines, necessary to compare patient samples to non-congenic controls, remains to be validated. Spinal cell identities can be generated in human spinal cord organoids, although these have highly variable morphologies[96,97]. As such, each iPSC model presents limitations and opportunities, to which this study contributes a reductionist and highly reproducible system in which to quantitatively compare multiple neuroepithelial functions.”

      (5) While the authors are generally good about labeling figures by the day post smad inhibition, in some figures it is not clear either from the images or the legend text. I believe this includes supplemental figures 2,5,6,8, and 10 (apologies if I simply missed it in one or more of them)

      These have now been added.

      (6) The legend for Figure 2 refers to a panel that is not present and the remaining panel descriptions are off by a letter. I'm guessing this is a versioning error as the text itself seems largely correct, but it may be good to check for any other similar errors that snuck in

      This has now been corrected.

      (7) The cell outlines in Figure 3d are a bit hard to see both in print and on the screen, perhaps increase the displayed intensity?

      This has now been corrected.

      Description of analyses that authors prefer not to carry out

      R2.5. Figure 3. The authors mention their previous study in which they show that Vangl2 is not cell-autonomously required for neural closure. It will be interesting to study whether this also the case in the present human model by using mosaic cultures.

      The reviewer is correct that this is one of the exciting potential future applications of our model, which will first require us to generate stable fluorescently-tagged lines (to identify those cells which lack VANGL2). We will also need to extensively analyze controls to validate that mixing fluo-tagged and untagged lines does not alter the homogeneity of differentiation, or apical constriction, independently of VANGL2 deletion. As such, the reviewer is suggesting an altogether new project which carries considerable risk and will require us to secure dedicated funding to undertake.

      R3.8(Minor) The authors show a fascinating piece of data in Supplementary Figure 1, demonstrating that nuclear volume is halved by day 8. Do they have any indication if the DNA content remains constant (e.g., integrated DAPI density)? I suppose it must, and this is a minor point in the grand scheme, but this represents a significant nuclear remodeling and may impact the overall DNA accessibility.

      We agree with the reviewer that the reduction in nuclear volume is important data both because it informs understanding of the reduction in total cell volume, and because it suggests active chromatin compaction during differentiation. Unfortunately, the thicker epithelium and superimposition of nuclei in the differentiated condition means the laser light path is substantially different, making direct comparisons of intensity uninterpretable. Additionally, the apical-most nuclei will mostly be in G2/M phase due to interkinetic nuclear migration. As such, the comparison of DAPI integrated density between epithelial morphologies would not be informative (Author response image 4).

      Author response image 4.

      Lateral views of DAPI-stained nuclei on Days 2 and 8 of differentiation. Note the rapid loss of staining intensity below the apical pseudo-row of nuclei on Day 8. This intensity change is likely due to the apical nuclei being in G2/M phase and therefore having more DNA, and rapid loss of 405nm wavelength signal at depth.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We were very pleased to see the very positive evaluation of our work by all 3 reviewers and appreciate their constructive comments and suggestions. We have now addressed all reviewers’ comments by making changes and clarifications to the manuscript.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In the present manuscript, the authors present an in-depth study on the effect of a heat-shock response on the ability of yeast to regain viability after quiescence when their ability to respire is inhibited. They nicely demonstrate that these effects correlate with the measured diffusion coefficients, providing deeper insight into the (at least partially) responsible environmental stress response and the molecular players involved. This work is an important contribution to the growing (or resurging) field of the physical properties of the cell.

      We thank this reviewer for this very positive evaluation.

      My two main comments are the following:

      • The authors determine the diffusion coefficients from the MSD, as well as further analyze them all the way up to the confinement size. As far as I can judge from the manuscript, these analyses are for 2D systems and were initially developed for processes on membranes. How does this change for 3D systems? I understand that for a straightforward qualitative comparison of apparent MSD, this assumption is acceptable, but it may deviate more strongly with the additional analyses the authors present.

      This is indeed an important point, and the reviewer is correct that the trajectories are analyzed in 2D (x,y) while the cytoplasm is a 3D environment. We fully agree that this requires careful interpretation, particularly for metrics beyond the short-lag diffusion coefficient.

      First, for the diffusion coefficient, it is well established that for isotropic 3D motion the movements in all three dimensions are independent of each other and the projected 2D MSD satisfies:

      = 4*D*τ

      Thus, estimating from the short-lag slope of the 2D MSD yields the correct diffusivity of the underlying 3D process (up to standard experimental corrections such as localization error and motion blur). This approach is therefore widely used in cytoplasmic SPT and GEM studies, including in yeast, and is not restricted to membrane diffusion [1, 2].

      Regarding confinement-related metrics derived from longer time lags, we agree that these were originally developed and most rigorously interpreted for 2D systems. In our study, these quantities are intentionally used as effective in-plane (x,y) descriptors of particle motion rather than as a full reconstruction of a 3D confinement geometry. Mapping a 2D MSD plateau to an absolute 3D confinement size depends on assumptions about geometry and isotropy and cannot be done uniquely without full 3D tracking. Nevertheless, MSD-based analyses have been successfully extended to explicitly model and quantify 3D confined diffusion in previous studies, provided that full 3D trajectories or well-defined confinement geometries are available. [2, 3]

      [1] Gómez-García, P.A., Portillo-Ledesma, S., Neguembor, M.V., Pesaresi, M., Oweis, W., Rohrlich, T., Wieser, S., Meshorer, E., Schlick, T., Cosma, M.P., Lakadamyali, M., 2021. Mesoscale Modeling and Single-Nucleosome Tracking Reveal Remodeling of Clutch Folding and Dynamics in Stem Cell Differentiation. Cell Rep. 34. https://doi.org/10.1016/j.celrep.2020.108614

      [2] Delarue, M., Brittingham, G.P., Pfeffer, S., Surovtsev, I. V., Pinglay, S., Kennedy, K.J., Schaffer, M., Gutierrez, J.I., Sang, D., Poterewicz, G., Chung, J.K., Plitzko, J.M., Groves, J.T., Jacobs-Wagner, C., Engel, B.D., Holt, L.J., 2018. mTORC1 Controls Phase Separation and the Biophysical Properties of the Cytoplasm by Tuning Crowding. Cell 174, 338-349.e20.

      [3] Lerner, J., Gómez-García, P.A., McCarthy, R.L., Liu, Z., Lakadamyali, M., Zaret, K.S., 2020. Two-parameter single-molecule analysis for measurement of chromatin mobility. STAR Protoc 1.

      Importantly, we do not assume perfect isotropy of the yeast cytoplasm. Local anisotropies are expected due to organelles, crowding heterogeneity, and cell geometry. However, the system is sufficiently close to isotropic at the length and time scales probed that the extracted confinement radius is highly reproducible across independent biological replicates. In our experiments, we observe consistent radius of confinements across three replicates, indicating that any bias introduced by partial anisotropy or projection into 2D is systematic and small.

      Based on the observed reproducibility and the finite depth of field of our measurements (~100 nm), we estimate that potential errors in the absolute values of confinement-related parameters arising from 2D projection and incomplete isotropy are on the order of We have now clarified this point explicitly in the Methods section, emphasizing that confinement parameters are effective 2D measures, that the cytoplasm is not assumed to be perfectly isotropic, and that the conclusions rely on consistent, comparative measurements obtained under identical imaging and analysis conditions. The updated Methods paragraph is as follows:

      […] Trajectory analysis: Radius of Confinement

      The radius of confinement was obtained only for the subgroup of confined trajectories. It quantifies the degree of confinement by estimating the radius of the 2D area explored by the particle in the imaging plane, which serves as a proxy measurement for the 3D volume that it explores. It was measured by fitting a circle-confined diffusion model to the TE-MSD (ensemble of all trajectories) (Wieser and Schütz, 2008).

      TE-MSD = R^2 * (1 - exp(-4*D*t_lag/R^2)) + O

      where R is the radius of confinement and D is the diffusion coefficient at short timescales. O is an offset value that comes from the localization precision limit inherent to localization-based microscopy methods.

      Trajectories were analyzed in the imaging plane (x,y), and confinement metrics were therefore derived from 2D MSDs. Although particles diffuse in a three-dimensional cytoplasmic environment, projection onto 2D does not bias estimation of the short-lag diffusion coefficient for isotropic motion, since the projected MSD follows ⟨Δr_xy²(τ)⟩ = 4Dτ. However, confinement-related parameters derived from longer lag times should be interpreted as effective in-plane descriptors of mobility rather than as a direct reconstruction of a full 3D confinement geometry. Mapping a 2D MSD plateau to an absolute 3D confinement size would require explicit assumptions about geometry or full 3D tracking. Our conclusions rely on comparative analyses performed under identical imaging and analysis conditions, and the extracted confinement radii were highly reproducible across biological replicates, indicating that any bias introduced by 2D projection or moderate anisotropy is systematic and does not affect the validity of the relative differences reported.

      • The authors show data in the supporting information where the GEMs provide larger foci after stress with longer imaging times. Could the authors provide the images of the shorter imaging times that they use? That seems a more equal comparison than Figure C. It is also unclear to me why fixed cells are used in Figure C, as well as the meaning of the x-axis. In line with this, can the authors exclude that GEMs dimerize/oligomerize after stress, and therefore display a lower diffusion coefficient?

      We are happy to include the images acquired at a shorter time interval and have done so (Fig S2A). We apologize for insufficiently explaining the GEM intensity experiment shown in Figure S2C. The fixation was done to immobilize the GEMs, since they are rapidly diffusing in live cell imaging and the diffusion speed relative to camera exposure time will impact the brightness (any movement of a particle during exposure causes the signal on the detector to become “blurred” and reduces the intensity per pixel). Hence, GEM brightness does not solely reflect the monomer or potential aggregate/multimer state, but is also affected by diffusion speed and exposure time: faster moving GEMs will generally appear dimmer than slower moving ones, since the signal detection during the acquisition time is reduced by the particle movement. Another effect is that, since GEMs are moving in live cell imaging, they have a probability of spatially overlapping, enhancing the signal levels of the single detected spots.

      We have quantified the brightness distribution in the different conditions to detect aggregation or multimerization of GEMs, which we expect to be visible as a shoulder on the Gaussian curve. The x-axis shows the intensity which we have determined for each trajectory. We chose to assess GEM intensity in the frame with the highest intensity, and to take the “Total” intensity, meaning we sum up the intensity of the pixels within the Point Spread Function (PSF) of each localization in that frame.

      To clarify these points, we have extended the description of this experiment in the Results and Methods sections:

      Results:

      [...] Additional evidence for this comes from the observation that imaging GEMs at a lower frame rate (i.e., longer exposure time of 100 ms) showed a uniformly diffuse signal in SCD, whereas distinct foci appeared under starvation conditions (Figures S2A and S2B). This might suggest that GEMs aggregate in starvation. However, imaging GEMs at a faster frame rate (used for SPT, 30 ms exposure time) shows GEMs freely diffusing in all conditions (Figure S2A). Furthermore, analyzing GEM particle intensities in fixed cells, to eliminate motion blur-induced intensity attenuation, showed uniform GEM brightness distributions in all conditions (Figure S2C). Rather than aggregates, the bright foci thus represent immobile, single GEM particles that are confined and appear brighter during long exposure times due to their confinement in low-diffusive compartments. [...]

      Methods:

      [...] Trajectory analysis: Track Total Intensity

      To assess GEM brightness, we determined the intensity of each trajectory in fixed cells. Cell fixation eliminates the motion blur-induced intensity attenuation, which would otherwise confound the GEM brightness depending on the movement speed and confinement. For each individual particle trajectory, the frame with the highest signal intensity of the localized particle was determined and the sum of the pixel intensities of the particle in that frame was calculated as the “Track Total Intensity”. In fixed cells, the GEM intensities were comparable in all conditions (Figure S2C). All GEM intensity histograms show a single, bell-shaped distribution of intensities with no indication of several GEM particles aggregating into brighter foci. [...]

      Other comments: - For the precision of the language, the authors equate ribosome content with macromolecular crowding, with the diffusion of the GEMs throughout, and this becomes more conflated in the discussion, where it is compared to viscosity and macromolecular crowding effects, e.g., translation. Is it macromolecular crowding, mesoscale crowding, nano-rheology, or ribosome crowding? What is measured precisely?

      We agree that careful and consistent nomenclature is important and thank the reviewer for bringing this point to our attention. We believe our manuscript maintains the proper distinctions of the terms diffusion, crowding and viscosity. We refer to what we study with the GEM single-particle tracking consistently as “(cytoplasmic) diffusion”. In Figure 2, we add “crowding” as an additional term since we observe a change in ribosome concentration and we affect the cytoplasmic crowdedness with a hyperosmotic shock. Our in-depth analysis of the confined and unconfined trajectory diffusion suggested that the cytoplasm is not simply globally affected by crowding or viscosity, but contains regions or compartments that trap GEM. Apart from Figure 2, we do not use the term viscosity or crowding, and we only return to “crowding” in the Discussion, either in reference to the aforementioned experiments from Figure 2 (ribosome concentration, hyperosmotic shock) or when discussing studies from cited works.

      However, we did not use the term “macromolecular crowding” consistently and simplified it to “crowding” in a few instances. To be more precise, we now specify “macromolecular crowding” instead of “crowding” wherever applicable; namely in the text referring to Figure 2, where we specifically assess macromolecular crowding.

      • In the EM images, the ribosomes seem smaller after starvation. Is that correct, and how should we interpret this? Is this due to an increased number of monosomes?

      This is an important point, and it indeed appears that in SCD some ribosomes are close together, potentially as polysomes. In SC, the ribosomes appear more distinctly separated from each other, which would be expected due to the polysome collapse that occurs in starvation. However, the apparent size of individual ribosomes is identical in both conditions. Unfortunately, the resolution is not good enough to accurately measure the sizes of the ribosomes and clearly determine their monomer/polysome state.

      • The authors refer to recent work on how biochemical reactions, such as translation, are determined by the cytoplasm. There is some older work on this, see for example in bacteria https://doi.org/10.1073/pnas.1310377110, and also in vitro here DOI: 10.1021/acssynbio.0c00330

      We thank this reviewer for pointing out these publications and have included them in this group of citations.

      • On the section of correlating diffusion and survival outcomes (bottom page 12), it is mentioned that the lowered diffusion could enhance aggregation. However, literature indicates that the opposite is true in buffer; lower diffusion reduces aggregation (also nucleation is inversely proportional to the viscosity).

      This is a valuable point and we have happily expanded on it in the Discussion section. It is true that chemical assays have demonstrated that higher viscosity and slower diffusion decrease nucleation and aggregate formation. However, in vitro studies that alter diffusion through crowding changes have revealed a complex relation between crowding and aggregation propensity. The basic idea is that the excluded volume effect decreases aggregation by stabilization of the more compact, folded state. But the opposite effect, precluded protein folding, has also been ascribed to the excluded volume effect. As of now, studies with different crowders (dextran, ficoll, PEG, etc.) demonstrated increased or reduced protein aggregation upon crowding [1, 2, 3, 4]. The variable effect on aggregation seems to be not only based on the protein that is studied, but also the properties of the crowder (charges, shape, size), the interaction of the crowder with the protein, and the mixture of crowders [5].

      Even though the relationship between crowding and protein aggregation is complex, we speculate that lower diffusion in our more crowded cells could cause protein aggregation, because these starvation conditions are known to induce the formation of protein fibrils and the condensation of mRNA and proteins.

      [1] Uversky, V.N., M. Cooper, E., Bower, K.S., Li, J., Fink, A.L., 2002. Accelerated α-synuclein fibrillation in crowded milieu. FEBS Lett. 515, 99–103. https://doi.org/10.1016/S0014-5793(02)02446-8

      [2] Munishkina, L.A., Cooper, E.M., Uversky, V.N., Fink, A.L., 2004. The effect of macromolecular crowding on protein aggregation and amyloid fibril formation. J. Mol. Recognit. 17, 456–464. https://doi.org/10.1002/jmr.699

      [3] Biswas, S., Bhadra, A., Lakhera, S., Soni, M., Panuganti, V., Jain, S., Roy, I., 2021. Molecular crowding accelerates aggregation of α-synuclein by altering its folding pathway. Eur. Biophys. J. https://doi.org/10.1007/s00249-020-01486-1

      [4] Mittal, S., Singh, L.R., 2014. Macromolecular crowding decelerates aggregation of a β-rich protein, bovine carbonic anhydrase: a case study. J. Biochem. 156, 273–282. https://doi.org/10.1093/jb/mvu039

      [5] Kuznetsova, I.M., Zaslavsky, B.Y., Breydo, L., Turoverov, K.K., Uversky, V.N., 2015. Beyond the excluded volume effects: Mechanistic complexity of the crowded milieu. Molecules 20, 1377–1409. https://doi.org/10.3390/molecules20011377

      To be more precise, we have therefore extended our Discussion section. We believe part of this additional discussion fits better in an earlier section, where we specifically discuss how the cytoplasmic properties, and specifically crowding, have been linked to filament/condensate formation. The updated paragraphs are as follows:

      [...] Additional cytoplasmic rearrangements occur upon energy depletion, including filament formation or the formation of biomolecular condensates (Narayanaswamy et al., 2009; Noree et al., 2010; Petrovska et al., 2014; Prouteau et al., 2017; Riback et al., 2017; Saad et al., 2017; Marini et al., 2020; Stoddard et al., 2020; Cereghetti et al., 2021) highlighting a broader reorganization of the cytoplasm that could further affect the diffusion of macromolecules. In turn, the amount of crowding might also influence the propensity to form condensates and filaments (Heidenreich et al., 2020). Interestingly, in vitro studies have demonstrated a complex, dual effect of crowding on protein fibrillation and aggregation, in suppressing or accelerating it (Uversky et al., 2002; Munishkina et al., 2004; Mittal and Singh, 2014; Biswas et al., 2021). This appears to be dependent not only on the protein of study, but the properties of the crowder (size, charge, shape) and the specific mixture of crowders (Kuznetsova et al., 2015). [...]

      [...] By contrast, extremely low diffusion, as seen in the absence of respiration in glucose starvation, might irreversibly impair cellular functions due to limited movement of proteins and RNA in and out of certain compartments, cellular territories and condensates. Such a model is supported by our analysis of how lower diffusion is the result of confined spaces becoming more prevalent, creating compartments that can trap macromolecules. As previously mentioned, increased crowding and reorganization of the cytoplasm have been linked to condensation and fibril formation of proteins, and, in certain in vitro contexts, accelerated aggregation. This state of crowding-induced low diffusion might therefore enhance protein aggregation or preclude the refolding of damaged proteins, which could disrupt proteostasis and lead to toxic aggregates that are a hallmark of the aging process (López-Otín et al., 2013). Together, these effects on proteins, RNA and other macromolecules likely lead to loss of cell fitness and irreversible arrest of the cells, preventing their reentry into the cell division cycle. [...]

      Reviewer #1 (Significance (Required)):

      General assessment: Strengths: It is a comprehensive study that provides a wealth of information and insight into the intricacies of a field that has received considerable attention, and its views are evolving rapidly. Weaknesses: It may suffer from some overinterpretation of diffusion data. Advance: The significant advance is that the molecular response pathway and precise molecular players are connected to the biophysical response of cells to starvation/quiescence. The dependence of diffusion on starvation has received considerable attention (Jacobs-Wagner, Cell, 2014; the current authors in eLife, 2016; and more recent investigations by Holt, Delarue, and others). Still, the authors take the next step and demonstrate how quiescence, and particularly how the history of a cell affects it, correlates strongly with the diffusion. As far as I can tell, this is new. As mentioned, the molecular insights into the pathways are exceptionally strong from my perspective. From personal experience, this work is also very important for researchers outside of the field from a practical standpoint: Do your measurements change when you stress cells by walking to a microscope? And even if you incubate them there, your measurement outcome will change. In my experience, this is a crucial point, and the cell's history is often overlooked. Audience: Broad -- biophysicists, molecular biologists, cell biologists, biotechnologists. My field of expertise: Biophysics.


      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This manuscript addresses an important and longstanding question in the field: how eukaryotic cells remodel themselves to enter and survive quiescence, particularly under nutrient limitation and energy stress. The authors combine tools from biophysics, proteomics, stress signaling, and functional genomics to propose that stress-induced cytoplasmic reorganization, rather than ATP availability per se, is critical for long-term survival when respiration is impaired. The topic is timely, the experiments are generally well executed, and the initial phenomenology is compelling. The paper begins with a set of clear and convincing figures that establish an interesting and biologically important phenotype: when cells are shifted into glucose starvation, they can survive long term only if respiration is functional. Blocking respiration with Antimycin A (AntA) severely compromises viability. One straightforward hypothesis is that this defect simply reflects a failure to generate sufficient ATP. The authors, however, show that a 30-minute heat shock (HS) before glucose withdrawal in the presence of AntA largely rescues survival, even though cellular ATP levels remain critically low. In parallel, they use very well-executed GEM single-particle tracking experiments to demonstrate that cytoplasmic particle mobility decreases markedly in glucose-starved, respiration-deficient cells, and that this diffusion defect is also rescued by the pre-HS, again without restoring ATP. Together, these initial figures strongly support the idea that stress-induced remodeling of the cytoplasm, rather than ATP levels per se, is a key determinant of whether cells can enter and maintain a viable quiescent state. The authors then propose that this protective effect of HS is mediated by induction of the environmental stress response (ESR) and by resulting changes in protein expression. To test whether new protein synthesis is required, they pre-treat cells with cycloheximide during the HS and recovery period. This treatment largely, although not completely, abrogates the beneficial effect of HS on survival and diffusion in AntA-treated, glucose-starved cells. This is a strong experiment and supports the idea that HS-induced synthesis of specific proteins is important for protection, while also hinting that some cycloheximide-insensitive or pre-existing components may contribute. To identify the relevant proteins, the authors turn to global proteomic analysis, comparing multiple conditions: glucose starvation (SC), heat shock followed by glucose starvation (HS SC), glucose starvation plus AntA (SC + AntA), and heat shock followed by glucose starvation plus AntA (HS SC + AntA), each at 1 and 20 hours. This is where, in my view, the story becomes significantly harder to follow. The text for Figure 3 relies almost entirely on GO term enrichment, with very little description of individual proteins or even basic quantitative summaries of the dataset. For example, the authors never clearly state how many proteins were robustly quantified, nor what fraction of the proteome that represents. Without this foundational information, it is difficult to evaluate the strength and generality of their conclusions. Related to this, the GO analysis in Figure 3F reports "significant" enrichment for categories such as ribosomes or translation, yet the underlying number of proteins making up these enrichments is not shown. From the volcano plots, it appears that only a very small number of proteins change in some conditions (e.g., SC 20 h), and yet GO terms appear with extremely strong q-values. This is confusing: how can such strong enrichment occur if only a handful of proteins are changing? At minimum, the authors should provide: • the number of significantly up- or down-regulated proteins in each comparison • the number of proteins contributing to each enriched GO category • the magnitude of the changes for these proteins Because the absolute number of significantly changing proteins appears small in several conditions, the current heavy reliance on GO analysis feels unwarranted and potentially misleading. In such cases, it would likely be more informative to list all differentially abundant proteins-either in supplementary materials or in a main-text table-and briefly describe the most relevant ones, rather than relying on broad category labels. Figure 3F, in particular, needs substantially more explanation. A related issue appears in Figure 3G (and the associated text), where the authors emphasize that the proteomic response to HS + AntA and the response to long-term glucose starvation are distinct. While this conclusion is plausible, the analysis also shows a subset of proteins that are upregulated in both conditions. These overlapping proteins may, in fact, represent the core protective module that enables survival in quiescence. The authors do not discuss these proteins at all; instead, they are effectively dismissed in favor of the "distinct responses" narrative. I encourage the authors to identify and discuss these overlapping proteins explicitly. Are they chaperones, proteasome components, antioxidant enzymes, or other classical stress-response factors? Even if the global proteomes differ, the overlapping subset could be highly informative about the minimal set of proteins required to stabilize the cytoplasm and support entry into quiescence. The SATAY screen is a major strength of the paper, as it moves from correlative proteomics to functional genetic analysis. The approach appears well-controlled, but key information is missing: How many unique insertions were obtained? Was the library saturating? What was the read distribution and coverage? The authors also discuss only a small subset of the screen hits. The volcano plots show many additional genes that are not addressed. What categories do these fall into? Are they informative about pathways beyond Ras/PKA and Msn2/4? Presenting a fuller analysis would strengthen the mechanistic interpretation. The parts of the SATAY analysis that are discussed are solid. The screen implicates the Ras/PKA signaling axis and Msn2/4 in survival under HS-preconditioned, respiration-deficient starvation, and the authors validate these hits with targeted survival assays. The correspondence between genetic perturbations and changes in cytoplasmic diffusion is an intriguing connection. However, the analysis stops short of identifying the downstream effector proteins that actually produce the biophysical benefits observed. The manuscript then returns to the idea that improved cytoplasmic diffusion and reduced confinement may be essential for survival. This is an appealing hypothesis, but the evidence remains correlative. It is still unclear whether biophysical rescue is the cause of improved survival or simply a downstream marker of a properly induced stress response. What remains missing is deeper integration of the proteomics and SATAY data to identify which proteins are likely responsible for the adaptive changes in cytoplasmic organization. Overexpression of promising candidates-such as chaperones or proteostasis factors found in the overlap between HS and long-term starvation responses-could help determine whether any single protein or small group of proteins can phenocopy the HS-induced rescue. Importantly, many of the comments above are intentionally broad: the manuscript does not simply require small clarifications but would benefit from substantial expansion and deepening of the analysis. The observations are compelling, but the mechanistic chain connecting ESR activation → proteomic remodeling → cytoplasmic biophysics → survival remains insufficiently developed in the current draft. Clearer quantitative reporting, fuller presentation of the data, and more thoughtful interpretation would significantly strengthen the manuscript.

      We thank reviewer 2 for this very thoughtful evaluation of our manuscript. We agree that expanding the descriptions and analysis of the presented data will improve the manuscript. Importantly, we now provide the proteomics data and the SATAY screen in an accessible format as supplementary materials. We address the individual points below.

      Summary of Major Issues That Need to Be Addressed • Quantitative clarity in the proteomics o State how many proteins were quantified. o Report the numbers of significantly changing proteins in each condition. o Identify the proteins underlying each GO term and provide effect sizes.

      We have now included a supplemental table containing label-free protein abundances for all 3308 reproducibly quantified proteins across all nine conditions (Supplemental Table S4). In addition, we added a sentence to the main text specifying both the number of reproducibly identified proteins and the approximate coverage of the yeast proteome.

      For the comparison of protein abundances between the different stress conditions and logarithmically growing SCD cells, we now indicate the number of significantly changed proteins in the legend of Figure 3E. Furthermore, we include a heatmap of standardized protein abundances for all proteins that were significantly changed in at least one stress condition (Supplemental File S1) and provide all pairwise comparison results in the supplemental table (Supplemental Table S5). This new Supplemental File S1 replaces the previous Supplemental File S1, which had a stricter cutoff, showing all proteins with an abundance change greater than 2 standard deviations.

      The information requested by the reviewer regarding GO term analysis is indeed important and was missing in the original version. We now report, for each GO term, the number of proteins in the top or bottom 10% of differentially abundant proteins and provide the corresponding effect size, calculated as the ratio of the observed to expected hits (Figure 3F).

      • Over-reliance on GO analysis o Provide explicit lists of differentially expressed proteins. o Indicate whether enrichment results are meaningful given the small number of hits.

      We appreciate this reviewer’s comment and agree that the presentation of the proteomic data in Figure 3 relies strongly on GO term enrichment, with limited description of individual proteins. Our primary goal for the proteomic analysis was to characterize the cellular response to stress at a global level rather than to focus on individual proteins or stress-specific details. We therefore intentionally opted for a broader, more coarse-grained analysis to not overcomplicate the manuscript and maintain accessibility for a broad readership.

      That said, we agree that the underlying data should be made fully accessible. We have therefore expanded the supplemental materials to include a heatmap of all proteins that were significantly changed in at least one condition (Supplemental File S1), as well as comprehensive tables reporting protein abundances and pairwise differences across all stress conditions (Supplemental Tables S4 and S5). These additions provide direct access to the protein-level data while preserving the clarity of the main text.

      With respect to GO term analysis, to avoid overinterpretation driven by small protein sets and better comparability across different conditions, we always performed the GO enrichment based on the top and bottom 10% changed proteins. This is already stated in the legend of Figure 3F and in the Methods section. We have now added the key missing parameters of the analysis to Figure 3F (see response above). Given that the analysis identifies multiple GO terms generally associated with the environmental stress response and that these terms exhibit coordinated behavior across conditions (Figure S3A), we are confident that the conclusions drawn from this analysis are robust.

      • Overlooked overlapping proteins o Analyze and discuss the subset of proteins upregulated both by HS and by long-term starvation. o These may represent the core factors enabling survival.

      Indeed, we agree that the overlapping proteins that are observed in our Figure 3G analysis should be presented. Perhaps surprisingly, these proteins (Hxt5, Sps19, Atg8, Aim17, Put1, Fmp45, YNL194C) have diverse functions and have so far not been implemented in the environmental stress response.

      In the Results section, we now mention and briefly discuss the four that are present in both time points of the HS SC +AntA condition. We now mention all of them in the figure legend.

      The modified text from the Results section is as follows:

      [...] Furthermore, the proteins that are enriched in long-term starvation (SC 20 h vs. SCD) and those enriched in pre-HS respiration-deficient starvation (HS SC +AntA 1 h vs. SCD; HS SC +AntA 20 h vs. SCD) are poorly correlated and there is only a small overlap of factors that are significantly upregulated in all conditions (Figure 3G). These proteins are Aim17, Put1, Fmp45 and YNL194C. Aim17 is a mitochondrial protein of unknown function and Put1 is a mitochondrial proline dehydrogenase. Fmp45 and YNL194C are paralogous membrane proteins involved in cell wall organization. Focusing on the broad proteomic adaptation, we looked at the Gene Ontology (GO) terms of the proteomic changes across all conditions, and observed that long-term starvation (SC 20) leads to the upregulation of a few groups of proteins, mostly involved in respiratory activity and rewiring of the metabolism (Figure S3A). [...]

      We greatly appreciate the suggestion to do an overexpression experiment. However, the overlapping proteins are not significant hits in the SATAY, suggesting that they are individually not required for the survival rescue although their overexpression might benefit survival.

      We have therefore chosen to keep a broad perspective on the proteomics results and investigate instead the SATAY results in more detail, since they inherently contain functional relevance to survival. Overall, we feel that the overexpression of those (individually or as a group) would extend beyond the scope of our current manuscript.

      • SATAY analysis needs fuller presentation o Provide insertion numbers, coverage, and basic library statistics. o Discuss additional hits beyond the Ras/PKA/Msn2/4 pathways. o Integrate SATAY results more deeply with proteomics.

      We have added the insertion numbers and genome coverage percentages to the Methods section as follows:

      [...] SATAY Screen: Analysis and Plotting

      Sequencing detected the following total unique transposon numbers: 690’935 (A1), 558’932 (HA1), and 359’935 (HA4d) unique transposons. The transposon insertions in the different genes yielded the following genome coverages: 96.3% (A1), 94.5% (HA1) and 89.3% (HA4). For each gene [...]

      We now also provide the SATAY screen data as Supplemental Table S6.

      In the Results section, we mention some additional hits from the SATAY screen (ribosome biogenesis, mitochondrial respiration) but then shift our focus to the ESR genes. We now add a comment to the ribosome biogenesis genes before going to the ESR:

      [...] The screen revealed several highly significant gene disruptions that promote or impair the HS-mediated rescue of respiration-deficient, glucose-starved cells (Figure 4A, Supplemental Table S6). The most significant gene hits that impair survival in 4 d HS SC +AntA when disrupted are involved in a variety of cellular processes, including ribosome biogenesis (e.g., ARX1, BUD22, RRP6), mitochondrial respiration (e.g., CBR1, COX23, ETR1), and ESR (e.g., MSN2, PSR2, YAP1). Intriguingly, the ribosome biogenesis genes being crucial for survival suggests that new ribosomes might have to be produced to ensure proper translational response during the HS. Notable among the ESR genes are MSN2 and, less significantly scored, MSN4, the master regulators of the ESR. [...]

      To deepen the discussion on the lack of overlap between the SATAY screen and the proteomics, we have added a sentence highlighting that the SATAY screen detected the main regulators of the ESR, and the proteomics revealed its downstream targets involved in proteostasis and other stress proteins, and therefore these two data sets do both point to the ESR as the crucial response behind the HS-induced rescue. The modified Discussion text is as follows:

      [...] Furthermore, the signaling genes that scored highly in the SATAY screen are often regulated through their activity rather than their abundance. Plausibly, their downstream target proteins are differentially expressed, whereas disrupting the regulators themselves leads to strong survival phenotypes. Similar observations have been made in other stress conditions, where fitness-relevant genes showed little overlap with genes with upregulated expression (Birrell et al., 2002; Giaever et al., 2002). Nonetheless, the SATAY screen revealed the principal regulators of the ESR while the proteomic analysis detected many of the ESR downstream targets involved in proteostasis and oxidative stress, demonstrating a functional convergence on the ESR in both data sets. [...]

      • Mechanistic depth remains limited o Clarify whether cytoplasmic biophysical rescue is causal or downstream. o Test whether overexpression of candidate proteins can mimic HS-induced protection. o Expand the discussion of potential mechanisms using insights from both datasets.

      Indeed, the specific mechanism(s) that govern the cytoplasmic properties in our conditions are currently not known, preventing us from manipulating the cytoplasmic properties and confirming a causal relationship. To uncover the mechanisms, extensive follow-up studies on ESR genes and/or proteins would be required, going beyond the scope of this manuscript. Furthermore, our ongoing follow-up studies are pointing towards redundancy of some potential regulation of the cytoplasmic diffusion, further complicating the analysis.

      The suggested overexpression experiment is addressed in a previous comment where the overlapping proteins are mentioned.

      Reviewer #2 (Significance (Required)):

      This manuscript addresses a fundamental and timely question in cell biology: how eukaryotic cells remodel themselves to enter and survive quiescence, particularly under conditions of nutrient depletion and compromised energy production. Although quiescence has been studied for decades, the mechanisms that link metabolic state, stress signaling, and the physical properties of the cytoplasm remain incompletely understood. This work brings together biophysical measurements, global proteomics, and unbiased genetic screening in an ambitious effort to illuminate how cells maintain viability when respiration-and thus efficient ATP generation-is disrupted. A key conceptual contribution of this study is the demonstration that ATP levels alone do not dictate survival during starvation. Rather, the ability of cells to mount an appropriate stress response and reorganize the cytoplasm appears to be crucial. The early figures provide compelling evidence that heat shock preconditioning can rescue both viability and cytoplasmic mobility in respiration-deficient cells, even when ATP remains low. This finding is notable because it challenges the widely held assumption that energy charge is the primary determinant of successful entry into quiescence. If strengthened by deeper mechanistic analysis, this insight could reshape how the field views energy stress and cellular dormancy. The identification of the Ras/PKA-Msn2/4 axis as a key regulatory node is also significant, as it connects quiescence survival to well-established nutrient and stress signaling pathways. The integration of a genome-wide SATAY screen adds functional depth and offers the potential to uncover specific downstream effectors that remodel the cytoplasm or stabilize cellular structures during prolonged stress. Finally, the manuscript touches on a concept that is gaining traction across many subfields of biology: that the biophysical state of the cytoplasm is a regulated and physiologically meaningful parameter, not merely a passive consequence of metabolic decline. Understanding how cells tune macromolecular crowding, diffusion, and spatial organization during quiescence could have broad implications beyond yeast, including in stem cell biology, microbial dormancy, cancer cell persistence, and aging. Overall, the questions addressed are important, and the study has the potential to make a meaningful conceptual contribution. However, realizing that impact will require clearer and deeper mechanistic analysis-particularly in the proteomics and SATAY sections-to convincingly identify the specific factors and pathways that mediate the cytoplasmic remodeling underlying survival.


      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary. Yeast haploid cells enter quiescence during nutrient deprivation, undergoing major metabolic, transcriptional and biophysical changes. In particular, quiescent cells remodel their cytoplasm, increasing macromolecular crowding and reducing diffusion. Respiration is known to be essential for entry into quiescence and long-term survival.

      In this study, the authors discovered that respiration is not intrinsically required for yeast to survive glucose-starvation-induced quiescence. In particular, they found that a short heat shock before starvation restores survival even in the absence of respiration (Antimycin A treatment), demonstrating that a stress-induced adaptation can bypass the respiratory requirement. This rescue occurs without ATP recovery and relies on de novo protein synthesis. This stress-induced adaptation also rescues quiescent-like biophysical properties of the cytoplasm (increased crowding) that are normally prevented in non-respiring cells, which are thought to be relevant for cell survival . Proteomics reveals that heat shock induces a distinct stress-response proteome enriched in proteostasis factors. A genetic screen reveals that Ras/PKA inhibition and Msn2/4 activation enable this protective reprogramming. Altogether this highlights the importance and complexity of stress adaptation to quiescence establishment.

      This is an excellent paper in all aspects. I have no major points besides the data accessibility, below.

      We thank this reviewer for this very positive evaluation.

      Main comments. - It would be nice to have the MS data available as Excel files for the community, and uploaded to repositories such as PRIDE. Description of the MS data is a bit expedited to serve the purpose of the paper (clustering to evaluate the similarity of proteomic profiles between conditions, GO term enrichment) so having the full data available might help.

      We agree that the MS data should be accessible. The label-free protein abundances for the reproducibly quantified proteins across all nine conditions (Supplemental Table S4) and the pairwise comparisons shown in Figure 3E (Supplemental Table S5) are now included as supplementary Excel files. The MS data is currently not on PRIDE but we will deposit it there upon publication of our manuscript.

      • Same thing for the SATAY screen. The data is summarized in Fig 4B but I believe that the data should be provided.

      We agree that the SATAY screen results should be accessible as well, and we have now included the data as Supplemental Table S6.

      Minor comments and questions. -I believe that in graphs, the X axis should start at 0 to avoid confusion about the strength of the effect (eg. Fig 2B)

      We thank reviewer 3 for pointing this out, and we have re-evaluated the axis limits of all plots. As suggested, we have adjusted the x-axis in Fig 2B to start at 0 to better highlight the strength of the effect. For our Radius of Confinement and %Confined Trajectories graphs, we believe adjusting the y-axis to start and end at the same values will allow better comparison across figures. However, we chose not to set those y-axes to start at 0, since our measurements lie in a range which is covered by these axes, and these plots would simply include blank space if set to start at 0.

      -I found that using imaging of GEMs at low frequency to reveal cytoplasmic crowding heterogeneity very interesting. Quiescent cells are known to accumulate many "bodies" as discussed in the text, would any of those co-localize with GEM foci?

      Indeed, the imaging at low frequency has revealed that fluorescently-tagged proteins might become trapped in certain regions of the cytoplasm, allowing their detection at conventional imaging frequencies. It is very likely that a similar effect occurs for other cytoplasmic “bodies”, which become visible not only through protein accumulation in a single body but also through low mobility. We have not performed any colocalization experiment with known “bodies” (such as P-bodies or stress granules). Therefore, we do not know if any stress-induced “bodies” are confined to the same spaces as GEMs. However, we would expect at best an incomplete colocalization based on the observation that glucose starvation-induced “bodies” are generally present in a higher percentage of cells than the GEM foci we observe, i.e. it is unlikely that all “bodies” overlap with a GEM focus. It might be interesting to perform such colocalization experiments in follow-up studies, but we feel that such an analysis would go beyond the current scope of this manuscript.

      Reviewer #3 (Significance (Required)):

      General assessment, advances in the field This is an excellent study. The key finding of this paper, ie. that heat shock can compensate for lack of respiration for entry into quiescence, challenges the current views on quiescence establishment. It describes an alternative program that contributes to cell viability upon C source depletion, with details on the proteomic changes occurring in this condition and some of the genetic basis of this pathway. The study is well designed and controlled, the conclusions are in line with the obtained results and very well discussed and placed in perspective. Experimentally, the authors combine several experimental approaches including live-cell single-particle tracking of GEM nanoparticles to quantify cytoplasmic diffusion, FIB-SEM ultrastructural imaging of the cytoplasm to measure macromolecular crowding, proteomics to map stress-induced protein changes and genome-wide SATAY transposon mutagenesis to identify genes required for survival in respiration-deficient cells. The limitations are: -we don't know how this stress program facilitates survival in the absence of restoration of ATP levels. The data suggest that protein homeostasis is involved (chaperones and proteasome up-regulated upon stress, reduced ribosomal and translation-associated proteins down-regulated in the absence of respiration) but the mechanism remains elusive. -the relationships between cytoplasmic crowding and quiescence establishment remain correlative. Yet, the authors provide another pathway to favour viability upon quiescence establishment (with HS) whose activation also displays an increased crowding and reduction of cytoplasmic movement, further consolidating this link. Both of these points are adequately discussed in the manuscript. None of these points should preclude publication of this study, in my opinion.

      Audience. This study would be of interest to researchers in the field of quiescence, biophysics, proteostasis, stress response, nutrient signaling and yeast biology.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank reviewers for the general positive feedback and insightful suggestions. Reviewers found that our study “provides a rich resource of potential E3-sensor interactions and represents a conceptual and technical advance for the field” and that our “key conclusions are convincing and interesting”. Reviewers suggested both editorial changes to improve the narrative of the manuscript and additional experiments to strengthen the conclusions of the study. We agree with both types of suggestions and decided to modify our manuscript accordingly.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): 

      The authors present a rational, AlphaFold-based strategy to systematically identify interactions between human nucleic acid sensors and SPRY-containing proteins. Their findings demonstrate that SPRY domains encode substrate-specific recognition patterns that govern immune responses: TRIM25-ZAP in antiviral defense and restricts LNP-encapsulated RNA, while Riplet-RIG-I for the IFNB1 production and restricts lipofection. They further dissect residue-level contributions to the ZAP-TRIM25 interface by integrating structural predictions with experimental validation. 

      Specific comments.  1. The title of this manuscript appears quite broad given that this study mostly focuses on just TRIM25-ZAP and Riplet-RIG-I pairs. 

      We agree that the original title was broader than the main mechanistic focus of the study. We will therefore revise the title to better reflect that the manuscript primarily dissects SPRY-domain–mediated specificity in the TRIM25-ZAP and Riplet-RIG-I interactions (identified through our AlphaFold-based screening framework), while retaining the broader screening context. Proposed new title: "SPRY domains encode ubiquitin ligase specificity for ZAP and RIG-I"

      In Figure 1b, several predicted interaction scores appear inconsistent with previously reported experimental interactions. For instance, KHNYN has been experimentally validated as a TRIM25-interacting protein, yet its interaction score is notably low in your computational results. Could the authors clarify whether this discrepancy arises because the TRIM25 SPRY domain does not significantly contribute to KHNYN binding? 

      We thank the reviewer for raising this point. To our knowledge, published data only support co-immunoprecipitation of TRIM25 and KHNYN in ZAP-deficient in cells (PMID: 31284899), but this does not by itself demonstrate a direct binary interaction, as the association could be mediated by other factors. Consistent with this, our AlphaFold-based screen predicts a low interaction score between KHNYN and TRIM25, suggesting that this may not be a direct protein-protein interaction. Nevertheless, we concede that our approach may have missed interactions that are governed by a small number of interacting residues. We added the following sentences on the limitation of this approach for such interactions in our discussion:

      • While our screen revealed novel interactions between SPRY domain containing proteins and innate immune sensors, it is plausible that certain interactions were missed. Interactions that rely on a small number of contacting residues or interactions that may be mediated by a third binding partner are likely to score poorly in our approach. Future optimization of our algorithm will improve the detection of such interactions.”*

      In Figure 2c, the authors provide intriguing examples for shared targets by SPRY proteins with quite low homology, and distinct target profiles by nearly identical SPRY domains. However, the underlying mechanisms responsible for these observations are not discussed. 

      This is an important point. At present, we cannot assign a single definitive mechanism for every example, but there are several plausible explanations consistent with our framework. First, our analysis indicates that substrate recognition is often driven by a limited subset of residues at the interaction surface, such that distinct sequences can converge on similar three-dimensional interface chemistry, while small local differences can shift binding preferences. Second, we note that although a large fraction of predicted contacting residues are within SPRY domains, other domains can also contribute to interaction and substrate recognition, which could modulate binding profiles even when SPRY sequences are near-identical. Third, the Pearson’s correlation coefficient was calculated all scores, which may include structures with low confidence scores or low interaction scores

      In Figure 3e and 3f, the authors state that the Riplet-T25 SPRY chimeric protein showed enhanced AlphaFold predicted interaction with ZAP, and validated the interaction experimentally. However, the Alphafold also predicted that an increased interaction for the T25-Riplet chimera, although this mutant failed to be co-precipitated with ZAP. How do the authors reconcile this discrepancy between prediction and experimental outcome? 

      The reviewer noticed an important, nuanced result in Fig. 3e. AlphaFold predicts that the TRIM25 chimera containing the Riplet SPRY domain (T25–Riplet) has a higher interaction score with ZAP than Riplet alone (Fig. 3e), yet this chimera is not recovered in ZAP co-immunoprecipitation (Fig. 3f). We reconcile this by emphasising that our framework uses an empirically benchmarked threshold: known SPRY–sensor interactions typically score >2.5, and we therefore adopted >2.5 as the cutoff for “high-confidence” candidate interactions. While the T25–Riplet chimera shows an increased score relative to Riplet, its score remains below this >2.5 cutoff in Fig. 3e (which reports interaction scores of the chimeras against ZAP). Therefore, the model is consistent with the experimental outcome: AlphaFold suggests some degree of interface compatibility, but not at a level we would classify as a robust/predictive interaction under our validated threshold. We clarified this point in the Results section to explicitly note that sub-threshold “increases” should be interpreted cautiously:

      Using the T25-RipletSPRY instead of the Riplet protein, predicted a higher interaction score despite the lack of specific pull-down between this chimera and ZAP; importantly, this interaction score is below our defined threshold (2.5), highlighting the importance of benchmarking predicted scores against known interactions.”

      It is curious if the authors explain why TRIM25 consistently appears as two bands in many of the presented figures. 

      We have also wondered about this observation as well. Other studies report that the double band pattern in western blots of TRIM25 (PMID: 17392790, 28060952, 21292167) and it is believed to be a product of non-degradative self-ubiquitination of TRIM25, primarily acting on the K117 residue (PMID: 21292167). We will add a brief description of this phenomenon in the figure legend.

      In Figure 4b, the authors show that treatment with a proteasome inhibitor increased RIG-I ligand-induced IFNB1 expression and propose that RIG-I may undergo rapid degradation following its interaction with Riplet. However, the evidence supporting this claim is weak. The authors should demonstrate: (1) that RIG-I is indeed degraded via the proteasome, and (2) whether RIG-I undergoes K48-linked ubiquitination. Mutational analysis of putative ubiquitination sites in RIG-I would help clarify its contribution to the observed IFN responses. 

      This is an important point and we are currently performing experiments addressing these questions. Specifically we will provide evidence of (1) whether RIG-I is degraded after activation using a combination of western blotting and pharmacological inhibition of the proteosome/translation machinery; (2) whether RIG-I goes K48- or K63-mediated ubiquitination by performing coIPs of RIG-I in the presence of HA-Ub wildtype or the commonly used HA-Ub K48 and K63 mutants (PMID: 15728840); and (3) whether lysine-to-arginine substitution of key residues impacts RIG-I ubiquitination/degradation.

      Figure 5 c-g: why do the authors show ZAP-L, but not ZAP-S? 

      Both ZAP-S and ZAP-L isoforms contain identical N-terminal domains, which is the region that interacts with TRIM25. Therefore, we assumed that the interaction between TRIM25 and ZAP-L would be similar to TRIM25-ZAP-S. However, to test this assumption, we will generate equivalent mutations in ZAP-S and perform similar co-immunoprecipitation experiments.

      Reviewer #1 (Significance (Required)): 

      This manuscript starts with the AlphaFold-based screening of interactions between human nucleic acid sensors and SPRY-containing proteins. However, the authors then just focused on TRIM25-ZAP and Riplet-RIG-I, whose interactions have been well demonstrated previously, although other protein interactions were not further explored. Also, the information on the evolutional aspects of TRIM25, ZAP, Riplet and RIG-I did not lead to clear conclusions. The differential contribution of TRIM25-ZAP and Riplet-RIG-I in LNP- and lipofectamine-transduced RNAs is interesting, although data shown in Fig.6 are expected from previous studies, and are not so relevant to other data in this study.  Therefore, the study is not well integrated, although pieces are interesting.  This study may attract researchers in innate 

      My expertise is innate immunity and RNA biology. 

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): 

      The paper describes the discovery of unknown E3-RNA sensor interactions from a large scale in silico prediction screen, as well as the clarification of previously described E3-sensor interactions. These findings extend previous work showing ancient relationships between nucleic acid sensors and RING E3s (e.g. PMID: 33373584), which also described the RIPLET-RIG-I pairing identified in the present screen. 

      The interactions focused on are shown to have functional implications for immune signaling pathways, and stability implications for the bound sensor. The argument for the screen is that E3-target interactions are often too transient to detect biochemically. While possibly true, several of the pairings are confirmed by co-IP, with either WT E3 or a catalytically deficient E3 (known elsewhere as a 'substrate trap'). 

      The key conclusions are convincing and interesting; in particular, the conserved interactions between RIPLET and RIG-I, and TRIM25 and ZAP. The hypothesis that the two E3s arose from a common ancestor is intriguing, and the use of chimeras in functional experiments suggest that the length of the coiled coil domains contributes to substrate targeting. The most interesting observation IMO is that showing that RNA vaccines can be sensed by orthogonal sensor/E3 pathways, depending on transfection method, suggesting that distinct entry routes are surveyed by different sensors. These experiments are well performed as E3 manipulation phenocopies sensor manipulation, supporting that the in silico approach will ID relevant pairings. 

      Including the PAE plots for some of the key interactions would be helpful, as a lot of the interaction confidence metrics are hidden in interaction 'scores'. Fig. 1b heatmap is presented as a row max, so it is difficult to calibrate one E3 against another. The raw data from e.g. fig. 1c would be a valuable addition. This would also help orientate future studies predicting similar protein-protein interactions. 

      We agree with the reviewer and we will provide the raw values for the interaction scores and PAE maps as supplementary data to be included in the final publication.

      Figure 1 appears to just use the isolated SPRY domain for screening - were full-length proteins used?

      The data in Figure 1 was generated using full-length proteins, but it will be interesting to test if a similar screen with SPRY domains alone can replicate the predicted interactions. We will repeat this using SPRY domain sequences.

      How many copies of the FL protein were used. TRIM5 employs a low affinity, high avidity binding method; do binding patterns change when the valency of either component is altered? The Alphafold approach perhaps selects for high affinity binders? I don't expect many more experiments to be done here, but commenting on this would be useful. __ __

      This is a rational consideration that we overlooked. We included in our discussion a comment on the limitation of this approach in the context of multimeric assemblies:

      Similarly, the oligomeric nature of some SPRY-containing proteins [22] is likely to impact the correct placement of these domains and, therefore, impact the predicted interaction score. Future optimization of our algorithm will improve the detection of such interactions.”

      The TRIM25 -Riplet PRYSPRY swap experiments in Figure 3 are very informative and powerful. Some more detail on domain boundaries used would be helpful, including AF predictions of what these chimeras look like with respect to their natural counterparts. 

      We agree with the reviewer about the need to explicitly define domain boundaries. We will include as supplemental information a comparison of the AF prediction of these chimeras in relation to the native proteins.

      While AF can place confidence metrics on domain-domain interactions, SPRY containing proteins are themselves often comprised of regions of high structural confidence (e.g. many available PDBs for RINGs, coils and SPRYs) but their relative arrangement within the molecule is unpredictable. Do isolated SPRYs show any better/worse binding to targets? 

      This is a good point as well, and this can be addressed by repeating the AlphaFold screen using only SPRY domain proteins rather than full-length protein (as described above).

      Technically, fig. 1f does not show that TRIM58 destabilises OAS1, as there is no condition with OAS1 alone. Perhaps alter the text to reflect this or repeat with the necessary control. The direction of the text is fine, as Fig. 1g provides a striking result, but 1f needs attention. 

      The reviewer raises an important consideration. To address this, we will repeat the experiment using a OAS1 alone condition, as suggested.

      Fig. 2c - for clarity, please specify the meaning of the connecting lines between the bait 'hits' in the legend. What does the correlation coefficient relate to exactly? % similarity, is this across the whole molecule, or the PRYSPRY (presumably the latter would be a more useful comparison). And it is well established that single variations in SPRY variable loops can toggle binding, so this could be better referenced in the text. It would also be helpful to see e.g. dissimilar PRYSPRYs binding a common target, as PAE plots in the supplementary. Do any shared motifs occur at the variable loops between dissimilar SPRY molecules? 

      We agree that this figure could be clearer. The connecting lines in Fig. 2c indicate protein-protein predictions with common sensors, i.e. connecting lines between the interaction score of ASH2L-MDA5 and the interaction score of TRIM51-MDA5. We only use % similarity of the SPRY domain alone, not the whole molecule. We have modified the figure legend to clarify this point and we include the PAE maps as supplementary information, as requested.

      Fig. 2i - Bat RIG-I binds both TRIM25 and Riplet? This is in contrast to the predicted directionality in 2h? 

      The reviewer astutely noted that, in Fig.2i, pulling down bat RIG-I co-immunoprecipitated with both bat Riplet and bat TRIM25, while AlphaFold predictions only suggest a Riplet-RIG-I interaction. However, while bat Riplet and bat TRIM25 express at comparable levels in the input sample, bat Riplet was far more enriched in RIG-I pulldowns than bat TRIM25. Our interpretation of this data is that, indeed, bat Riplet-RIG-I interaction is more powerful than TRIM25-Rig-I.

      Fig. 3a-b, Sup Fig. 3c,d - IFNB1 transcript normalised to 3p-hRNA transfection in control NTC cells - the presentation chosen obscures the baseline IFNB1 levels in the different siRNA transfections. What is the fold induction of IFNB1 in the different cell lines? 

      We will include the fold induction in each cell line as supplementary information.

      Fig. 3g - RLUs of EV-A71 is only tested in TRIM25 KO cells transfected with the Riplet T25 chimera. The full panel of cDNAs (parental E3s and the inverse 25-riplet swap) should be tested in parallel to confirm the effect is specific to TRIM25 PRYSPRY. 

      This is a great suggestion that will help clarify the role of different domains of TRIM25 in its antiviral activity. We are currently generating cell-line stably expressing these truncations and will perform the suggested experiment.

      Fig. 4b - time point of 3p-hRNA transfection? Y-axis label suggested normalisation to NTC - incorrect label? What is the effect of bortezomib on IFNB1 mRNA in mock treated cells? 

      We thank the reviewer for spotting this typo, we have known corrected the axis label. We harvested cellular mRNA 8h post-transfection. Bortezomib-treatment slightly reduced the background expression of IFNB1 mRNA, but this signal is very close to the detection limit that it is difficult to draw conclusions. Nevertheless, the addition of bortezomib did not increase IFNB1 mRNA expression in the absence of a stimulus.

      Fig. 4g - these experiments would benefit from an untransfected control cell to clarify how cDNA expression impacts sensor stability. 

      We agree with the reviewer and we will include an untransfected control.

      There seems to be an inverse correlation between sensor degradation and signaling output - is that the summary of Fig. 4? On the one hand, sensor degradation attenuates functional output (Fig. 4b), and the E3 that degrades the sensor is required for sensor function; on the other hand, changing coil-length in the E3 disables sensor degradation (Fig. 4g) but and enhances signaling response (Fig. 3j). Do the chimeras of panel Fig. g, h influence IFNB1 expression in the assay from Fig. 3j - this experiment would marry RIG-I expression with signal output. 

      This is an interesting experiment. We are in the process of generating a TRIM25-/- Riplet-/- cell line, which we will use to reconstitute with the chimeras mentioned above and perform the requested experiment.

      The data is generally clear. To facilitate their interpretation and for clarity, Western blots require size markers and Co-IPs should indicate the flag-/ha-epitope tags. Would make fig. 2 i-j much clearer, particularly given apparent co-migration of IgG (heavy chain?) and riplet, and the lack of control IPs. 

      We agree that contextual markings will improve the interpretation of these results. We will add size markers to the western blots in fig2 in order to improve clarity.

      The figure legends could provide more detail. 

      We will add additional experimental details (such as time points) to the figure legends.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The paper describes the discovery of unknown E3-RNA sensor interactions from a large scale in silico prediction screen, as well as the clarification of previously described E3-sensor interactions. These findings extend previous work showing ancient relationships between nucleic acid sensors and RING E3s (e.g. PMID: 33373584), which also described the RIPLET-RIG-I pairing identified in the present screen.

      The interactions focused on are shown to have functional implications for immune signaling pathways, and stability implications for the bound sensor. The argument for the screen is that E3-target interactions are often too transient to detect biochemically. While possibly true, several of the pairings are confirmed by co-IP, with either WT E3 or a catalytically deficient E3 (known elsewhere as a 'substrate trap').

      The key conclusions are convincing and interesting; in particular, the conserved interactions between RIPLET and RIG-I, and TRIM25 and ZAP. The hypothesis that the two E3s arose from a common ancestor is intriguing, and the use of chimeras in functional experiments suggest that the length of the coiled coil domains contributes to substrate targeting. The most interesting observation IMO is that showing that RNA vaccines can be sensed by orthogonal sensor/E3 pathways, depending on transfection method, suggesting that distinct entry routes are surveyed by different sensors. These experiments are well performed as E3 manipulation phenocopies sensor manipulation, supporting that the in silico approach will ID relevant pairings.

      Including the PAE plots for some of the key interactions would be helpful, as a lot of the interaction confidence metrics are hidden in interaction 'scores'. Fig. 1b heatmap is presented as a row max, so it is difficult to calibrate one E3 against another. The raw data from e.g. fig. 1c would be a valuable addition. This would also help orientate future studies predicting similar protein-protein interactions.

      Figure 1 appears to just use the isolated SPRY domain for screening - were full-length proteins used? How many copies of the FL protein were used. TRIM5 employs a low affinity, high avidity binding method; do binding patterns change when the valency of either component is altered? The Alphafold approach perhaps selects for high affinity binders? I don't expect many more experiments to be done here, but commenting on this would be useful.

      The TRIM25 -Riplet PRYSPRY swap experiments in Figure 3 are very informative and powerful. Some more detail on domain boundaries used would be helpful, including AF predictions of what these chimeras look like with respect to their natural counterparts.

      While AF can place confidence metrics on domain-domain interactions, SPRY containing proteins are themselves often comprised of regions of high structural confidence (e.g. many available PDBs for RINGs, coils and SPRYs) but their relative arrangement within the molecule is unpredictable. Do isolated SPRYs show any better/worse binding to targets?

      Technically, fig. 1f does not show that TRIM58 destabilises OAS1, as there is no condition with OAS1 alone. Perhaps alter the text to reflect this or repeat with the necessary control. The direction of the text is fine, as Fig. 1g provides a striking result, but 1f needs attention.

      Fig. 2c - for clarity, please specify the meaning of the connecting lines between the bait 'hits' in the legend. What does the correlation coefficient relate to exactly? % similarity, is this across the whole molecule, or the PRYSPRY (presumably the latter would be a more useful comparison). And it is well established that single variations in SPRY variable loops can toggle binding, so this could be better referenced in the text. It would also be helpful to see e.g. dissimilar PRYSPRYs binding a common target, as PAE plots in the supplementary. Do any shared motifs occur at the variable loops between dissimilar SPRY molecules?

      Fig. 2i - Bat RIG-I binds both TRIM25 and Riplet? This is in contrast to the predicted directionality in 2h?

      Fig. 3a-b, Sup Fig. 3c,d - IFNB1 transcript normalised to 3p-hRNA transfection in control NTC cells - the presentation chosen obscures the baseline IFNB1 levels in the different siRNA transfections. What is the fold induction of IFNB1 in the different cell lines?

      Fig. 3g - RLUs of EV-A71 is only tested in TRIM25 KO cells transfected with the Riplet T25 chimera. The full panel of cDNAs (parental E3s and the inverse 25-riplet swap) should be tested in parallel to confirm the effect is specific to TRIM25 PRYSPRY.

      Fig. 4b - time point of 3p-hRNA transfection? Y-axis label suggested normalisation to NTC - incorrect label? What is the effect of bortezomib on IFNB1 mRNA in mock treated cells?

      Fig. 4g - these experiments would benefit from an untransfected control cell to clarify how cDNA expression impacts sensor stability.

      There seems to be an inverse correlation between sensor degradation and signaling output - is that the summary of Fig. 4? On the one hand, sensor degradation attenuates functional output (Fig. 4b), and the E3 that degrades the sensor is required for sensor function; on the other hand, changing coil-length in the E3 disables sensor degradation (Fig. 4g) but and enhances signaling response (Fig. 3j). Do the chimeras of panel Fig. g, h influence IFNB1 expression in the assay from Fig. 3j - this experiment would marry RIG-I expression with signal output.

      The data is generally clear. To facilitate their interpretation and for clarity, Western blots require size markers and Co-IPs should indicate the flag-/ha-epitope tags. Would make fig. 2 i-j much clearer, particularly given apparent co-migration of IgG (heavy chain?) and riplet, and the lack of control IPs.

      The figure legends could provide more detail.

      Significance

      The paper provides a rich resource of potential E3-sensor interactions and represents a conceptual and technical advance for the field. The authors take a novel approach to identify these pairings. Several pairings are validated in CoIPs, and two pairings (T25-ZAP, RIPLET-RIG-I) are studied in detail. Many E3s - including the TRIM proteins which comprise the bulk of E3s studied here - are purported to regulate key nucleic acid sensors in the literature, but the large scale approach taken here provides evidence that the pairings are really quite specific. The findings also supports prior work showing that the PRYSPRY domain (here called the SPRY) is a functionally plastic module that through variable loops can bind a range of different protein substrates.

      The paper will be most interesting to the innate immune field, those working on nucleic acid sensing, and those looking at innate responses to RNA vaccines.

      Regulation of E3 ubiquitin ligases, viral RNA sensing

    1. La Coopération en Classe au Service des Apprentissages et du Bien-être

      Résumé Exécutif

      Ce document synthétise les interventions du webinaire organisé par la Cardie de l'Académie de Paris, portant sur le développement des habiletés à coopérer.

      La coopération est identifiée comme un levier fondamental pour renforcer l'engagement des élèves et améliorer le climat scolaire.

      Les retours d'expérience du collège Antoine Quoisevaux, couplés à l'analyse experte de Laurent Renault, soulignent que la coopération ne doit pas être un simple "supplément d'âme", mais une modalité pédagogique structurée.

      Les points clés incluent la distinction cruciale entre coopération (visant le progrès individuel par l'échange) et collaboration (visant la performance collective), l'importance de la réciprocité de l'aide pour éviter les biais de l'effet tuteur, et la nécessité de ritualiser des instances comme le conseil d'élèves pour transformer les conflits en opportunités d'apprentissage.

      Bien que chronophage, cette approche favorise la motivation et le développement de compétences psychosociales essentielles.

      --------------------------------------------------------------------------------

      I. Retours d'Expérience : Le Projet du Collège Antoine Quoisevaux

      Mis en place il y a quatre ans par Marion Saag (mathématiques) et Antoine Marteille (français), ce projet concerne des classes de 5ème dans un établissement multisecteur du 18ème arrondissement de Paris, caractérisé par une grande mixité sociale.

      1. Genèse et Méthodologie

      Le projet a évolué d'une pratique empirique vers une démarche étayée par la recherche et la formation (notamment les travaux de Laurent Renault et les ressources du lycée Jacques Feyder).

      Objectif : Associer des temps formels (conseils d'élèves) et informels (apprentissage coopératif en cours).

      Convaincre les élèves : La coopération n'est pas innée. Des activités "décrochées" de la didactique (ex: construire la plus haute tour de chamallows, marché de connaissances) sont organisées dès la rentrée pour apprendre à travailler en groupe.

      Métacognition : Chaque activité est suivie d'un temps de retour sur ce qui a fonctionné ou non, permettant aux élèves de s'interroger sur l'efficacité de leur travail collectif.

      2. Modalités de Travail en Classe

      Le travail collectif intervient généralement après une phase de réflexion individuelle ("mise en effort intellectuel"). Les enseignants font varier le tempo des séances via :

      Le binôme : Notamment pour des clôtures de séance (l'élève A explique à l'élève B ce qu'il a retenu).

      Les îlots : Groupes de quatre élèves dans des salles disposées en "L" pour faciliter la circulation.

      La classe puzzle et l'arpentage : Pour l'étude de textes.

      L'autonomie collective : Organisation spatiale spontanée pour reconstituer un récit (ex: après la projection d'un film).

      --------------------------------------------------------------------------------

      II. Le Conseil d'Élèves : Pilier du Climat de Classe

      Le conseil d'élèves se tient tous les quinze jours. C'est un espace de parole, de régulation des conflits et de recherche collective de solutions.

      1. Rôles et Responsabilités

      Pour assurer un fonctionnement démocratique et serein, les rôles tournent entre les élèves :

      | Rôle | Fonction | | --- | --- | | Président | Rappelle les règles et ouvre la séance de façon solennelle. | | Adjoint | Rappelle les décisions prises lors du conseil précédent. | | Secrétaire | Garde une trace écrite des échanges et des décisions. | | Distributeur de parole | Utilise un bâton de parole pour réguler les échanges. | | Protecteur de parole | Assure un cadre bienveillant et sécurisant. | | Observateur | Analyse la répartition de la parole (bilan genré, équité). |

      2. Structure et Contenu du Conseil

      Le conseil suit un ordre du jour ritualisé basé sur des messages écrits par les élèves :

      Remerciements et Félicitations : Valorisation de l'entraide et de l'estime de soi (ex: "Je remercie X de m'avoir expliqué les maths").

      Problèmes et Soucis : Régulation des relations entre élèves (médiation par les pairs) ou de la relation pédagogique avec les enseignants.

      Propositions : Projets de sorties, mais aussi demandes pédagogiques (ex: "Faire plus d'exposés en Histoire-Géo").

      --------------------------------------------------------------------------------

      III. Analyse Conceptuelle et Points de Vigilance

      Laurent Renault, expert en pédagogie coopérative, apporte un éclairage théorique pour "réinterroger les évidences".

      1. Coopération vs Collaboration

      Il est impératif de distinguer ces deux modalités pour éviter l'exclusion des élèves les plus fragiles :

      La Coopération (visée : Progresser) : Échange de points de vue sans obligation de production immédiate (ex: le conseil d'élèves).

      La Collaboration (visée : Performer) : Répartition des tâches pour produire un résultat (ex: une affiche). Le risque est que seuls les "concepteurs" apprennent, tandis que les autres exécutent des tâches subalternes.

      2. L'Effet Tuteur et la Réciprocité

      L'aide entre élèves n'est pas automatiquement bénéfique pour celui qui la reçoit.

      L'aidant : Progresse toujours (mémorisation, abstraction, valorisation).

      L'aidé : Peut subir l'aide comme une illusion de compréhension et intérioriser une dépendance.

      Solution : Garantir la réciprocité de l'aide. Chaque élève doit, au cours d'une période, occuper la position d'aidant sur des compétences variées (rédaction, schéma, etc.).

      3. La Posture de l'Enseignant : "Travailler à capot ouvert"

      Innover, c'est accepter une part d'humilité et de déstabilisation.

      S'effacer : Dans le conseil, l'enseignant ne doit pas être moralisateur mais garant de la sécurité de la parole.

      Gérer le "bazar" initial : La coopération peut dégrader le climat scolaire à court terme car elle fait émerger des conflits latents. Ces conflits sont des matériaux d'apprentissage pour "penser ensemble".

      Considérer l'élève comme un interlocuteur valable : S'appuyer sur son ressenti et sa motivation.

      --------------------------------------------------------------------------------

      IV. Enjeux et Perspectives

      1. Bénéfices Constatés

      Engagement : Plaisir des élèves à venir au collège et investissement accru dans les disciplines (français/mathématiques).

      Compétences psychosociales : Travail sur les trois macro-compétences définies par Santé publique France.

      Émulation : Utilisation de la motivation collective sans tomber dans la rivalité destructrice.

      2. Limites et Défis

      Aspect chronophage : Nécessite un investissement important pour mener les conseils et suivre les décisions.

      Isolement de l'équipe : Difficulté à étendre le projet au-delà du binôme initial. Un tiers de l'emploi du temps est couvert, mais une cohérence d'équipe serait préférable.

      Aménagement spatial : Importance de l'ergonomie (classes flexibles, îlots en L) pour faciliter les transitions entre travail individuel et collectif.

      3. Conclusion

      La coopération en classe ne s'improvise pas. Elle repose sur un "tâtonnement balisé" par la recherche (Sylvain Conac, Philippe Meirieu) et une organisation rigoureuse.

      L'objectif final est de passer du simple "vivre ensemble" au "penser ensemble", en respectant l'équilibre entre l'individu (le "Je") et le groupe (le "Nous").

    1. https://bafybeigi4urr6jumopybpwxfu2i5edncg4e64c2z6dgtgm2clro7ibxmpe.ipfs.dweb.link/?filename=O%20%E2%80%94%20The%20Last%20Debt.%20When%20the%20empire%E2%80%99s%20money%20lies%2C%20its%E2%80%A6%20%EF%BD%9C%20by%20Ray%20Podder%20%EF%BD%9C%20Medium.html

      Page saved with SingleFile web-indy://💻/asus/ 🧊/♖//hyperpost/~/indyweb/📓/20/26/15/o-the-last-debt

      Page saved with SingleFile url: https://raypodder.medium.com/o-the-last-debt-3c12a1d998e7 saved date: Sat Dec 27 2025 21:56:44 GMT+0100 (Central European Standard Time)

      https://raypodder.medium.com/o-the-last-debt-3c12a1d998e7

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      In this manuscript, the authors explore a novel concept: GPCR-mediated regulation of miRNA release via extracellular vesicles (EVs). They perform an EV miRNA cargo profiling approach to investigate how specific GPCR activations influence the selective secretion of particular miRNAs. Given that GPCRs are highly diverse and orchestrate multiple cellular pathways - either independently or collectively - to regulate gene expression and cellular functions under various conditions, it is logical to expect alterations in gene and miRNA expression within target cells.

      Strengths:

      The novel idea of GPCRs-mediated control of EV loading of miRNAs.

      Weaknesses:

      Incomplete findings failed to connect and show evidence of any physiological parameters that are directly related to the observed changes. The mechanical detail is lacking.

      We appreciate the reviewer's acknowledgment of the novelty of this study. We agree with the reviewer that further mechanistic insights would strengthen the manuscript. The mechanisms by which miRNA is sorted into EVs remain poorly understood. Various factors, including RNAbinding protein, sequence motifs, and cellular location, can influence this sorting process(Garcia-Martin et al., 2022; Liu & Halushka, 2025; Villarroya-Beltri et al., 2013; Yoon et al., 2015). Ago2, a key component of the RNA-induced silencing complexes, binds to miRNA and facilitates miRNA sorting. Ago2 has been found in the EVs and can be regulated by the cellular signaling pathway.  For instance, McKenzie et al. demonstrated that KRAS-dependent activation of MEK-ERK can phosphorylate Ago2 protein, thereby regulating the sorting of specific miRNAs into EVs(McKenzie et al., 2016). In the differentiated PC12 cells, Gαq activation leads to the formation of Ago2-associated granules, which selectively sequester unique transcripts(Jackson et al., 2022). Investigating GPCR, G protein, and GPCR signaling on Ago2 expression, location, and phosphorylation states could provide valuable insights into how GPCRs regulate specific miRNAs within EVs. We have expanded these potential mechanisms and future research in the discussion section (page 16-17).

      The manuscript falls short of providing a comprehensive understanding. Identifying changes in cellular and EV-associated miRNAs without elucidating their physiological significance or underlying regulatory mechanisms limits the study's impact. Without demonstrating whether these miRNA alterations have functional consequences, the findings alone are insufficient. The findings may be suitable for more specialized journals.

      Thank you for the feedback. We acknowledge that validating the target genes of the top candidate miRNAs is an important next step. In response to the reviewer's concerns, we have expanded the discussion of future research in the manuscript (page 19-20). Although this initial study is primarily descriptive, it establishes a novel conceptual link between GPCR signaling and EV-mediated communication.

      Furthermore, a critical analysis of the relationship between cellular miRNA levels and EV miRNA cargo is essential. Specifically, comparing the intracellular and EV-associated miRNA pools could reveal whether specific miRNAs are preferentially exported, a behavior that should be inversely related to their cellular abundance if export serves a beneficial function by reducing intracellular levels. This comparison is vital to strengthen the biological relevance of the findings and support the proposed regulatory mechanisms by GPCRs.

      We appreciate the valuable suggestions from the reviewer. EV miRNA and cell miRNAs may exhibit distinct profiles as miRNAs can be selectively sorted into or excluded from EVs(Pultar et al., 2024; Teng et al., 2017; Zubkova et al., 2021). Investigating the difference between cellular miRNA levels and EV miRNA cargo would provide insight into the mechanism of miRNA sorting and the functions of miRNAs in the recipient cells. The expression of the cellular miRNAs is a highly dynamic process. To accurately compare the miRNA expression levels, profiling of EV miRNA and cellular miRNA should be conducted simultaneously. However, as an exploratory study, we were unable to measure the cellular miRNAs without conducting the entire experiment again.

      Reviewer #2 (Public review):

      Summary:

      This study examines how activating specific G protein-coupled receptors (GPCRs) affects the microRNA (miRNA) profiles within extracellular vesicles (EVs). The authors seek to identify whether different GPCRs produce unique EV miRNA signatures and what these signatures could indicate about downstream cellular processes and pathological processes.

      Methods:

      (1) Used U2OS human osteosarcoma cells, which naturally express multiple GPCR types.

      (2) Stimulated four distinct GPCRs (ADORA1, HRH1, FZD4, ACKR3) using selective agonists.

      (3) Isolated EVs from culture media and characterized them via size exclusion chromatography, immunoblotting, and microscopy.

      (4) Employed qPCR-based miRNA profiling and bioinformatics analyses (e.g., KEGG, PPI networks) to interpret expression changes.

      Key Findings:

      (1) No significant change in EV quantity or size following GPCR activation.

      (2) Each GPCR triggered a distinct EV miRNA expression profile.

      (3) miRNAs differentially expressed post-stimulation were linked to pathways involved in cancer, insulin resistance, neurodegenerative diseases, and other physiological/pathological processes.

      (4) miRNAs such as miR-550a-5p, miR-502-3p, miR-137, and miR-422a emerged as major regulators following specific receptor activation.

      Conclusions:

      The study offers evidence that GPCR activation can regulate intercellular communication through miRNAs encapsulated within extracellular vesicles (EVs). This finding paves the way for innovative drug-targeting strategies and enhances understanding of drug side effects that are mediated via GPCR-related EV signaling.

      Strengths:

      (1) Innovative concept: The idea of linking GPCR signaling to EV miRNA content is novel and mechanistically important.

      (2) Robust methodology: The use of multiple validation methods (biochemical, biophysical, and statistical) lends credibility to the findings.

      (3) Relevance: GPCRs are major drug targets, and understanding off-target or systemic effects via EVs is highly valuable for pharmacology and medicine.

      Weaknesses:

      (1) Sample Size & Scope: The analysis included only four GPCRs. Expanding to more receptor types or additional cell lines would enhance the study's applicability.

      We are encouraged that the reviewer recognized the novelty, methodological rigor, and significance of our work. We recognize the limitations of our current model system and emphasize the need to test additional GPCR families and cell lines in the future studies, as detailed in the discussion section (Page 19, second paragraph).

      (2) Exploratory Nature: This study is primarily descriptive and computational. It lacks functional validation, such as assessing phenotypic effects in recipient cells, which is acknowledged as a future step.

      We appreciate the feedback. We recognize the importance of validating the function of the top candidate miRNAs in the recipient cells, and this will be included in future studies (page 19-20).  

      (3) EV heterogeneity: The authors recognize that they did not distinguish EV subpopulations, potentially confounding the origin and function of miRNAs.

      Thank you for the comment. EV isolation and purification are major challenges in EV research. Current isolation techniques are often ineffective at separating vesicles produced by different biogenetic pathways. Furthermore, the lack of specific markers to differentiate EV subtypes adds to this complexity. We recognize that the presence of various subpopulations can complicate the interpretation of EV cargos. In our study, we used a combined approach of ultrafiltration followed by size-exclusion chromatography to achieve a balance between EV purity and yield. We adhere to the MISEV (Minimal Information for Studies of Extracellular Vesicles 2023) guidelines by reporting detailed isolation methods, assessing both positive and negative protein markers, and characterizing EVs by electron microscopy to confirm vesicle structure, as well as nanoparticle tracking analysis to verify particle size distribution(Welsh et al., 2024). By following these guidelines, we can ensure the quality of our study and enhance the ability to compare our findings with other studies.

      Recommendations for the authors:

      Reviewer #2 (Recommendations for the authors):

      Suggestions for Future Research:

      (1) Functionally validate top candidate miRNAs in recipient cells.

      We acknowledge that validating the target genes of the top candidate miRNAs is a crucial next step. In response to the reviewer's concerns, we have included this in the discussion as future research in the manuscript (page 19-20).

      (2) Investigate other GPCR families and repeat in primary or disease-relevant cell lines.

      The inclusion of different GPCRs and cell lines is suggested as an area for further investigation in the discussion. (Page 19).

      (3) Apply similar approaches in in vivo models or patient samples to assess clinical relevance.

      In response to the reviewer's concerns, we have included this in the discussion as future research in the manuscript (page 19-20).

      References

      Garcia-Martin, R., Wang, G., Brandão, B. B., Zanotto, T. M., Shah, S., Kumar Patel, S., Schilling, B., & Kahn, C. R. (2022). MicroRNA sequence codes for small extracellular vesicle release and cellular retention. Nature, 601(7893), 446-451. https://doi.org/10.1038/s41586021-04234-3  

      Jackson, L., Rennie, M., Poussaint, A., & Scarlata, S. (2022). Activation of Gαq sequesters specific transcripts into Ago2 particles. Sci Rep, 12(1), 8758. https://doi.org/10.1038/s41598022-12737-w  

      Liu, X.-M., & Halushka, M. K. (2025). Beyond the Bubble: A Debate on microRNA Sorting Into Extracellular Vesicles. Laboratory Investigation, 105(2), 102206. https://doi.org/10.1016/j.labinv.2024.102206  

      McKenzie, A. J., Hoshino, D., Hong, N. H., Cha, D. J., Franklin, J. L., Coffey, R. J., Patton, J. G., & Weaver, A. M. (2016). KRAS-MEK Signaling Controls Ago2 Sorting into Exosomes. Cell  Rep, 15(5), 978-987. https://doi.org/10.1016/j.celrep.2016.03.085  

      Pultar, M., Oesterreicher, J., Hartmann, J., Weigl, M., Diendorfer, A., Schimek, K., Schädl, B., Heuser, T., Brandstetter, M., Grillari, J., Sykacek, P., Hackl, M., & Holnthoner, W. (2024).Analysis of extracellular vesicle microRNA profiles reveals distinct blood and lymphatic endothelial cell origins. J Extracell Biol, 3(1), e134. https://doi.org/10.1002/jex2.134  

      Teng, Y., Ren, Y., Hu, X., Mu, J., Samykutty, A., Zhuang, X., Deng, Z., Kumar, A., Zhang, L., Merchant, M. L., Yan, J., Miller, D. M., & Zhang, H.-G. (2017). MVP-mediated exosomal sorting of miR-193a promotes colon cancer progression. Nature Communications, 8(1), 14448. https://doi.org/10.1038/ncomms14448  

      Villarroya-Beltri, C., Gutiérrez-Vázquez, C., Sánchez-Cabo, F., Pérez-Hernández, D., Vázquez, J., Martin-Cofreces, N., Martinez-Herrera, D. J., Pascual-Montano, A., Mittelbrunn, M., & Sánchez-Madrid, F. (2013). Sumoylated hnRNPA2B1 controls the sorting of miRNAs into exosomes through binding to specific motifs. Nat Commun, 4, 2980. https://doi.org/10.1038/ncomms3980

      Welsh, J. A., Goberdhan, D. C. I., O'Driscoll, L., Buzas, E. I., Blenkiron, C., Bussolati, B., Cai, H., Di Vizio, D., Driedonks, T. A. P., Erdbrügger, U., Falcon-Perez, J. M., Fu, Q. L., Hill, A. F., Lenassi, M., Lim, S. K., Mahoney, M. G., Mohanty, S., Möller, A., Nieuwland, R., . . .Witwer, K. W. (2024). Minimal information for studies of extracellular vesicles (MISEV2023): From basic to advanced approaches. J Extracell Vesicles, 13(2), e12404. https://doi.org/10.1002/jev2.12404  

      Yoon, J. H., Jo, M. H., White, E. J., De, S., Hafner, M., Zucconi, B. E., Abdelmohsen, K., Martindale, J. L., Yang, X., Wood, W. H., 3rd, Shin, Y. M., Song, J. J., Tuschl, T., Becker, K. G., Wilson, G. M., Hohng, S., & Gorospe, M. (2015). AUF1 promotes let-7b loading on Argonaute 2. Genes Dev, 29(15), 1599-1604. https://doi.org/10.1101/gad.263749.115  

      Zubkova, E., Evtushenko, E., Beloglazova, I., Osmak, G., Koshkin, P., Moschenko, A., Menshikov, M., & Parfyonova, Y. (2021). Analysis of MicroRNA Profile Alterations in Extracellular Vesicles From Mesenchymal Stromal Cells Overexpressing Stem Cell Factor. Front Cell Dev Biol, 9, 754025. https://doi.org/10.3389/fcell.2021.754025

    1. I would like to clarify a point regarding the discovery of lymphatic-like vessels in the brain. The preprint appears to hint towards positioning it as the first report of this finding. However, the first published study on this subject was by Chang et al.(Ref 1 below). The authors reported on the presence and characteristics (diameters, origin, length, depth and regulation by stress) of deep brain lymphatic vessels. Their work was subsequently followed by two other independent studies (Ref 2 and 3). All studies were reviewed in a review article in 2024 (https://www.sciopen.com/article/10.26599/SAB.2024.9060001?issn=2709-1325).

      Therefore, the preprint would represent a fourth report—not the first—of this discovery. It is concerning that previous foundational work has been either overlooked or cited in a misleading manner.

      References 1. Chang J, Guo B, Gao Y, Li W, Tong X, Feng Y, et al. Characteristic Features of Deep Brain Lymphatic Vessels and Their Regulation by Chronic Stress. Research (Washington, DC), 2023, 6:0120.

      1. Öz E. 'Game changer' method lets scientists peer into mice. Science (New York, NY), 2023, 380(6644):443.

      2. Liu X-G, Hua Q, Peng T-T, Chang K-X, Deng C-G, Zhang J-N, et al. Histomorphological analysis of perfusion parameters and CNS lymphatic vessels in mice: an experimental method study. NeuroReport, 2024, 35(3).

    1. Author response:

      General Statements

      We are delighted that all reviewers found our manuscript to be a technical advance by providing a much sought after method to arrest budding yeast cells in metaphase of mitosis or both meiotic metaphases. The reviewers also valued our use of this system to make new discoveries in two areas. First, we provided evidence that the spindle checkpoint is intrinsically weaker in meiosis I and showed that this is due to PP1 phosphatase. Second, we determined how the composition and phosphorylation of the kinetochore changes during meiosis, providing key insights into kinetochore function and providing a rich dataset for future studies.

      The reviewers also made some extremely helpful suggestions to improve our manuscript, which we will now implement:

      (1) Improvements to the discussion throughout the manuscript. The reviewers recommended that we focus our discussion on the novel findings of the manuscript and drew out some key points of interest that deserve more attention. We fully agree with this and we will address this in a revised version.

      (2) We will add a new supplemental figure to help interpret the mass spectrometry data, to address Reviewer #3, point 4.

      (3) We are currently performing an additional control experiment to address the minor point 1 from reviewer #3. Our experiment to confirm that SynSAC relies on endogenous checkpoint proteins was missing the cell cycle profile of cells where SynSAC was not induced for comparison. We will add this control to our full revision.

      (4) In our full revision we will also include representative images of spindle morphology as requested by Reviewer #1, point 2

      Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity):

      These authors have developed a method to induce MI or MII arrest. While this was previously possible in MI, the advantage of the method presented here is that it works for MII, and chemically inducible because it is based on a system that is sensitive to the addition of ABA. Depending on when the ABA is added, they achieve a MI or MII delay. The ABA promotes dimerizing fragments of Mps1 and Spc105 that can't bind their chromosomal sites. The evidence that the MI arrest is weaker than the MII arrest is convincing and consistent with published data and indicating the SAC in MI is less robust than MII or mitosis. The authors use this system to find evidence that the weak MI arrest is associated with PP1 binding to Spc105. This is a nice use of the system.

      The remainder of the paper uses the SynSAC system to isolate populations enriched for MI or MII stages and conduct proteomics. This shows a powerful use of the system but more work is needed to validate these results, particularly in normal cells.

      Overall the most significant aspect of this paper is the technical achievement, which is validated by the other experiments. They have developed a system and generated some proteomics data that maybe useful to others when analyzing kinetochore composition at each division. Overall, I have only a few minor suggestions.

      We appreciate the reviewers’ support of our study.

      (1) In wild-type - Pds1 levels are high during M1 and A1, but low in MII. Can the authors comment on this? In line 217, what is meant by "slightly attenuated? Can the authors comment on how anaphase occurs in presence of high Pds1? There is even a low but significant level in MII.

      The higher levels of Pds1 in meiosis I compared to meiosis II has been observed previously using immunofluorescence and live imaging[1–3]. Although the reasons are not completely clear, we speculate that there is insufficient time between the two divisions to re-accumulate Pds1 prior to separase re-activation.

      We agree “slightly attenuated” was confusing and we have re-worded this sentence to read “Addition ABA at the time of prophase release resulted in Pds1securin stabilisation throughout the time course, consistent with delays in both metaphase I and II”.

      We do not believe that either anaphase I or II occur in the presence of high Pds1. Western blotting represents the amount of Pds1 in the population of cells at a given time point. The time between meiosis I and II is very short even when treated with ABA. For example, in Figure 2B, spindle morphology counts show that the anaphase I peak is around 40% at its maxima (105 min) and around 40% of cells are in either metaphase I or metaphase II, and will be Pds1 positive. In contrast, due to the better efficiency of meiosis II, anaphase II hardly occurs at all in these conditions, since anaphase II spindles (and the second nuclear division) are observed at very low frequency (maximum 10%) from 165 minutes onwards. Instead, metaphase II spindles partially or fully breakdown, without undergoing anaphase extension. Taking Pds1 levels from the western blot and the spindle data together leads to the conclusion that at the end of the time-course, these cells are biochemically in metaphase II, but unable to maintain a robust spindle. Spindle collapse is also observed in other situations where meiotic exit fails, and potentially reflects an uncoupling of the cell cycle from the programme governing gamete differentiation[3–5]. We will explain this point in a revised version while referring to representative images that from evidence for this, as also requested by the reviewer below.

      (2) The figures with data characterizing the system are mostly graphs showing time course of MI and MII. There is no cytology, which is a little surprising since the stage is determined by spindle morphology. It would help to see sample sizes (ie. In the Figure legends) and also representative images. It would also be nice to see images comparing the same stage in the SynSAC cells versus normal cells. Are there any differences in the morphology of the spindles or chromosomes when in the SynSAC system?

      This is an excellent suggestion and will also help clarify the point above. We will provide images of cells at the different stages. For each timepoint, 100 cells were scored. We have already included this information in the figure legends 

      (3) A possible criticism of this system could be that the SAC signal promoting arrest is not coming from the kinetochore. Are there any possible consequences of this? In vertebrate cells, the RZZ complex streams off the kinetochore. Yeast don't have RZZ but this is an example of something that is SAC dependent and happens at the kinetochore. Can the authors discuss possible limitations such as this? Does the inhibition of the APC effect the native kinetochores? This could be good or bad. A bad possibility is that the cell is behaving as if it is in MII, but the kinetochores have made their microtubule attachments and behave as if in anaphase.

      In our view, the fact that SynSAC does not come from kinetochores is a major advantage as this allows the study of the kinetochore in an unperturbed state. It is also important to note that the canonical checkpoint components are all still present in the SynSAC strains, and perturbations in kinetochore-microtubule interactions would be expected to mount a kinetochore-driven checkpoint response as normal. Indeed, it would be interesting in future work to understand how disrupting kinetochore-microtubule attachments alters kinetochore composition (presumably checkpoint proteins will be recruited) and phosphorylation but this is beyond the scope of this work. In terms of the state at which we are arresting cells – this is a true metaphase because cohesion has not been lost but kinetochore-microtubule attachments have been established. This is evident from the enrichment of microtubule regulators but not checkpoint proteins in the kinetochore purifications from metaphase I and II. While this state is expected to occur only transiently in yeast, since the establishment of proper kinetochore-microtubule attachments triggers anaphase onset, the ability to capture this properly bioriented state will be extremely informative for future studies. We appreciate the reviewers’ insight in highlighting these interesting discussion points which we will include in a revised version.

      Reviewer #1 (Significance):

      These authors have developed a method to induce MI or MII arrest. While this was previously possible in MI, the advantage of the method presented here is it works for MII, and chemically inducible because it is based on a system that is sensitive to the addition of ABA. Depending on when the ABA is added, they achieve a MI or MII delay. The ABA promotes dimerizing fragments of Mps1 and Spc105 that can't bind their chromosomal sites. The evidence that the MI arrest is weaker than the MII arrest is convincing and consistent with published data and indicating the SAC in MI is less robust than MII or mitosis. The authors use this system to find evidence that the weak MI arrest is associated with PP1 binding to Spc105. This is a nice use of the system.

      The remainder of the paper uses the SynSAC system to isolate populations enriched for MI or MII stages and conduct proteomics. This shows a powerful use of the system but more work is needed to validate these results, particularly in normal cells.

      Overall the most significant aspect of this paper is the technical achievement, which is validated by the other experiments. They have developed a system and generated some proteomics data that maybe useful to others when analyzing kinetochore composition at each division.

      We appreciate the reviewer’s enthusiasm for our work.

      Reviewer #2 (Evidence, reproducibility and clarity):

      The manuscript submitted by Koch et al. describes a novel approach to collect budding yeast cells in metaphase I or metaphase II by synthetically activating the spinde checkpoint (SAC). The arrest is transient and reversible. This synchronization strategy will be extremely useful for studying meiosis I and meiosis II, and compare the two divisions. The authors characterized this so-named syncSACapproach and could confirm previous observations that the SAC arrest is less efficient in meiosis I than in meiosis II. They found that downregulation of the SAC response through PP1 phosphatase is stronger in meiosis I than in meiosis II. The authors then went on to purify kinetochore-associated proteins from metaphase I and II extracts for proteome and phosphoproteome analysis. Their data will be of significant interest to the cell cycle community (they compared their datasets also to kinetochores purified from cells arrested in prophase I and -with SynSAC in mitosis).

      I have only a couple of minor comments:

      (1) I would add the Suppl Figure 1A to main Figure 1A. What is really exciting here is the arrest in metaphase II, so I don't understand why the authors characterize metaphase I in the main figure, but not metaphase II. But this is only a suggestion.

      This is a good suggestion, we will do this in our full revision.

      (2) Line 197, the authors state: “...SyncSACinduced a more pronounced delay in metaphase II than in metaphase I”. However, line 229 and 240 the authors talk about a "longer delay in metaphase <i compared to metaphase II"... this seems to be a mix-up.

      Thank you for pointing this out, this is indeed a typo and we have corrected it.

      (3) The authors describe striking differences for both protein abundance and phosphorylation for key kinetochore associated proteins. I found one very interesting protein that seems to be very abundant and phosphorylated in metaphase I but not metaphase II, namely Sgo1. Do the authors think that Sgo1 is not required in metaphase II anymore? (Top hit in suppl Fig 8D).

      This is indeed an interesting observation, which we plan to investigate as part of another study in the future. Indeed, data from mouse indicates that shugoshin-dependent cohesin deprotection is already absent in meiosis II in mouse oocytes[6], though whether this is also true in yeast is not known. Furthermore, this does not rule out other functions of Sgo1 in meiosis II (for example promoting biorientation). We will include this point in the discussion.

      Reviewer #2 (Significance):

      The technique described here will be of great interest to the cell cycle community. Furthermore, the authors provide data sets on purified kinetochores of different meiotic stages and compare them to mitosis. This paper will thus be highly cited, for the technique, and also for the application of the technique.

      Reviewer #3 (Evidence, reproducibility and clarity):

      In their manuscript, Koch et al. describe a novel strategy to synchronize cells of the budding yeast Saccharomyces cerevisiae in metaphase I and metaphase II, thereby facilitating comparative analyses between these meiotic stages. This approach, termed SynSAC, adapts a method previously developed in fission yeast and human cells that enables the ectopic induction of a synthetic spindle assembly checkpoint (SAC) arrest by conditionally forcing the heterodimerization of two SAC components upon addition of the plant hormone abscisic acid (ABA). This is a valuable tool, which has the advantage that induces SAC-dependent inhibition of the anaphase promoting complex without perturbing kinetochores. Furthermore, since the same strategy and yeast strain can be also used to induce a metaphase arrest during mitosis, the methodology developed by Koch et al. enables comparative analyses between mitotic and meiotic cell divisions. To validate their strategy, the authors purified kinetochores from meiotic metaphase I and metaphase II, as well as from mitotic metaphase, and compared their protein composition and phosphorylation profiles. The results are presented clearly and in an organized manner.

      We are grateful to the reviewer for their support.

      Despite the relevance of both the methodology and the comparative analyses, several main issues should be addressed:

      (1) In contrast to the strong metaphase arrest induced by ABA addition in mitosis (Supp. Fig. 2), the SynSAC strategy only promotes a delay in metaphase I and metaphase II as cells progress through meiosis. This delay extends the duration of both meiotic stages, but does not markedly increase the percentage of metaphase I or II cells in the population at a given timepoint of the meiotic time course (Fig. 1C). Therefore, although SynSAC broadens the time window for sample collection, it does not substantially improve differential analyses between stages compared with a standard NDT80 prophase block synchronization experiment. Could a higher ABA concentration or repeated hormone addition improve the tightness of the meiotic metaphase arrest?

      For many purposes the enrichment and extended time for sample collection is sufficient, as we demonstrate here. However, as pointed out by the reviewer below, the system can be improved by use of the 4A-RASA mutations to provide a stronger arrest (see our response below). We did not experiment with higher ABA concentrations or repeated addition since the very robust arrest achieved with the 4A-RASA mutant deemed this unnecessary.

      (2) Unlike the standard SynSAC strategy, introducing mutations that prevent PP1 binding to the SynSAC construct considerably extended the duration of the meiotic metaphase arrests. In particular, mutating PP1 binding sites in both the RVxF (RASA) and the SILK (4A) motifs of the Spc105(1-455)-PYL construct caused a strong metaphase I arrest that persisted until the end of the meiotic time course (Fig. 3A). This stronger and more prolonged 4A-RASA SynSAC arrest would directly address the issue raised above. It is unclear why the authors did not emphasize more this improved system. Indeed, the 4A-RASA SynSAC approach could be presented as the optimal strategy to induce a conditional metaphase arrest in budding yeast meiosis, since it not only adapts but also improves the original methods designed for fission yeast and human cells. Along the same lines, it is surprising that the authors did not exploit the stronger arrest achieved with the 4A-RASA mutant to compare kinetochore composition at meiotic metaphase I and II.

      We agree that the 4A-RASA mutant is the best tool to use for the arrest and going forward this will be our approach. We collected the proteomics data and the data on the SynSAC mutant variants concurrently, so we did not know about the improved arrest at the time the proteomics experiment was done. Because very good arrest was already achieved with the unmutated SynSAC construct, we could not justify repeating the proteomics experiment which is a large amount of work using significant resources. However, we will highlight the potential of the 4A-RASA mutant more prominently in our full revision.

      (3) The results shown in Supp. Fig. 4C are intriguing and merit further discussion. Mitotic growth in ABA suggest that the RASA mutation silences the SynSAC effect, yet this was not observed for the 4A or the double 4A-RASA mutants. Notably, in contrast to mitosis, the SynSAC 4A-RASA mutation leads to a more pronounced metaphase I meiotic delay (Fig. 3A). It is also noteworthy that the RVAF mutation partially restores mitotic growth in ABA. This observation supports, as previously demonstrated in human cells, that Aurora B-mediated phosphorylation of S77 within the RVSF motif is important to prevent PP1 binding to Spc105 in budding yeast as well.

      We agree these are intriguing findings that highlight key differences as to the wiring of the spindle checkpoint in meiosis and mitosis and potential for future studies, however, currently we can only speculate as to the underlying cause. The effect of the RASA mutation in mitosis is unexpected and unexplained. However, the fact that the 4A-RASA mutation causes a stronger delay in meiosis I compared to mitosis can be explained by a greater prominence of PP1 phosphatase in meiosis. Indeed, our data (Figure 4A) show that the PP1 phosphatase Glc7 and its regulatory subunit Fin1 are highly enriched on kinetochores at all meiotic stages compared to mitosis.

      We agree that the improved growth of the RVAF mutant is intriguing and points to a role of Aurora B-mediated phosphorylation, though previous work has not supported such a role [7].

      We will include a discussion of these important points in a revised version.

      (4) To demonstrate the applicability of the SynSAC approach, the authors immunoprecipitated the kinetochore protein Dsn1 from cells arrested at different meiotic or mitotic stages, and compared kinetochore composition using data independent acquisition (DIA) mass spectrometry. Quantification and comparative analyses of total and kinetochore protein levels were conducted in parallel for cells expressing either FLAG-tagged or untagged Dsn1 (Supp. Fig. 7A-B). To better detect potential changes, protein abundances were next scaled to Dsn1 levels in each sample (Supp. Fig. 7C-D). However, it is not clear why the authors did not normalize protein abundance in the immunoprecipitations from tagged samples at each stage to the corresponding untagged control, instead of performing a separate analysis. This would be particularly relevant given the high sensitivity of DIA mass spectrometry, which enabled quantification of thousands of proteins. Furthermore, the authors compared protein abundances in tagged-samples from mitotic metaphase and meiotic prophase, metaphase I and metaphase II (Supp. Fig. 7E-F). If protein amounts in each case were not normalized to the untagged controls, as inferred from the text (lines 333 to 338), the observed differences could simply reflect global changes in protein expression at different stages rather than specific differences in protein association to kinetochores.

      While we agree with the reviewer that at first glance, normalising to no tag appears to be the most appropriate normalisation, in practice there is very low background signal in the no tag sample which means that any random fluctuations have a big impact on the final fold change used for normalisation. This approach therefore introduces artefacts into the data rather than improving normalisation.

      To provide reassurance that our kinetochore immunoprecipitations are specific, and that the background (no tag) signal is indeed very low, we will provide a new supplemental figure showing the volcanos comparing kinetochore purifications at each stage with their corresponding no tag control.

      It is also important to note that our experiment looks at relative changes of the same protein over time, which we expect to be relatively small in the whole cell lysate. We previously documented proteins that change in abundance in whole cell lysates throughout meiosis[8]. In this study, we found that relatively few proteins significantly change in abundance.

      Our aim in the current study was to understand how the relative composition of the kinetochore changes and for this, we believe that a direct comparison to Dsn1, a central kinetochore protein which we immunoprecipitated is the most appropriate normalisation.

      (5) Despite the large amount of potentially valuable data generated, the manuscript focuses mainly on results that reinforce previously established observations (e.g., premature SAC silencing in meiosis I by PP1, changes in kinetochore composition, etc.). The discussion would benefit from a deeper analysis of novel findings that underscore the broader significance of this study.

      We strongly agree with this point and we will re-frame the discussion to focus on the novel findings, as also raised by the other reviewers.

      Finally, minor concerns are:

      (1) Meiotic progression in SynSAC strains lacking Mad1, Mad2 or Mad3 is severely affected (Fig. 1D and Supp. Fig. 1), making it difficult to assess whether, as the authors state, the metaphase delays depend on the canonical SAC cascade. In addition, as a general note, graphs displaying meiotic time courses could be improved for clarity (e.g., thinner data lines, addition of axis gridlines and external tick marks, etc.).

      We will generate the data to include a checkpoint mutant +/- ABA for direct comparison. We will take steps to improve the clarity of presentation of the meiotic timecourse graphs, though our experience is that uncluttered graphs make it easier to compare trends.

      (2) Spore viability following SynSAC induction in meiosis was used as an indicator that this experimental approach does not disrupt kinetochore function and chromosome segregation. However, this is an indirect measure. Direct monitoring of genome distribution using GFP-tagged chromosomes would have provided more robust evidence. Notably, the SynSAC mad3Δ mutant shows a slight viability defect, which might reflect chromosome segregation defects that are more pronounced in the absence of a functional SAC.

      Spore viability is a much more sensitive way of analysing segregation defects that GFP-labelled chromosomes. This is because GFP labelling allows only a single chromosome to be followed. On the other hand, if any of the 16 chromosomes mis-segregate in a given meiosis this would result in one or more aneuploid spores in the tetrad, which are typically inviable. The fact that spore viability is not significantly different from wild type in this analysis indicates that there are no major chromosome segregation defects in these strains, and we therefore do not plan to do this experiment.

      (3) It is surprising that, although SAC activity is proposed to be weaker in metaphase I, the levels of CPC/SAC proteins seem to be higher at this stage of meiosis than in metaphase II or mitotic metaphase (Fig. 4A-B).

      We agree, this is surprising and we will point this out in the revised discussion. We speculate that the challenge in biorienting homologs which are held together by chiasmata, rather than back-to-back kinetochores results in a greater requirement for error correction in meiosis I. Interestingly, the data with the RASA mutant also point to increased PP1 activity in meiosis I, and we additionally observed increased levels of PP1 (Glc7 and Fin1) on meiotic kinetochores, consistent with the idea that cycles of error correction and silencing are elevated in meiosis I.

      (4) Although a more detailed exploration of kinetochore composition or phosphorylation changes is beyond the scope of the manuscript, some key observations could have been validated experimentally (e.g., enrichment of proteins at kinetochores, phosphorylation events that were identified as specific or enriched at a certain meiotic stage, etc.).

      We agree that this is beyond the scope of the current study but will form the start of future projects from our group, and hopefully others.

      (5) Several typographical errors should be corrected (e.g., "Knetochores" in Fig. 4 legend, "250uM ABA" in Supp. Fig. 1 legend, etc.)

      Thank you for pointing these out, they have been corrected.

      Reviewer #3 (Significance):

      Koch et al. describe a novel methodology, SynSAC, to synchronize budding yeast cells in metaphase I or metaphase II during meiosis, as well and in mitotic metaphase, thereby enabling differential analyses among these cell division stages. Their approach builds on prior strategies originally developed in fission yeast and human cells models to induce a synthetic spindle assembly checkpoint (SAC) arrest by conditionally forcing the heterodimerization of two SAC proteins upon addition of abscisic acid (ABA). The results from this manuscript are of special relevance for researchers studying meiosis and using Saccharomyces cerevisiae as a model. Moreover, the differential analysis of the composition and phosphorylation of kinetochores from meiotic metaphase I and metaphase II adds interest for the broader meiosis research community. Finally, regarding my expertise, I am a researcher specialized in the regulation of cell division.

      Description of the revisions that have already been incorporated in the transferred manuscript

      We have only corrected minor typos as detailed above.

      Description of analyses that authors prefer not to carry out

      The revisions we plan are detailed above. There are just two revisions we believe are either unnecessary or beyond the scope, both minor concerns of Reviewer #3. For clarity we have reproduced them, along with our justification below. In the latter case, the reviewer also acknowledged that further work in this direction is beyond the scope of the current study.

      (2) Spore viability following SynSAC induction in meiosis was used as an indicator that this experimental approach does not disrupt kinetochore function and chromosome segregation. However, this is an indirect measure. Direct monitoring of genome distribution using GFP-tagged chromosomes would have provided more robust evidence. Notably, the SynSAC mad3Δ mutant shows a slight viability defect, which might reflect chromosome segregation defects that are more pronounced in the absence of a functional SAC.

      Spore viability is a much more sensitive way of analysing segregation defects that GFP-labelled chromosomes. This is because GFP labelling allows only a single chromosome to be followed. On the other hand, if any of the 16 chromosomes mis-segregate in a given meiosis this would result in one or more aneuploid spores in the tetrad, which are typically inviable. The fact that spore viability is not significantly different from wild type in this analysis indicates that there are no major chromosome segregation defects in these strains, and we therefore do not plan to do this experiment.

      (4) Although a more detailed exploration of kinetochore composition or phosphorylation changes is beyond the scope of the manuscript, some key observations could have been validated experimentally (e.g., enrichment of proteins at kinetochores, phosphorylation events that were identified as specific or enriched at a certain meiotic stage, etc.).

      We agree that this is beyond the scope of the current study but will form the start of future projects from our group, and hopefully others.

      (1) Salah, S.M., and Nasmyth, K. (2000). Destruction of the securin Pds1p occurs at the onset of anaphase during both meiotic divisions in yeast. Chromosoma 109, 27–34.

      (2) Matos, J., Lipp, J.J., Bogdanova, A., Guillot, S., Okaz, E., Junqueira, M., Shevchenko, A., and Zachariae, W. (2008). Dbf4-dependent CDC7 kinase links DNA replication to the segregation of homologous chromosomes in meiosis I. Cell 135, 662–678.

      (3) Marston, A.L.A.L., Lee, B.H.B.H., and Amon, A. (2003). The Cdc14 phosphatase and the FEAR network control meiotic spindle disassembly and chromosome segregation. Developmental cell 4, 711–726. https://doi.org/10.1016/S1534-5807(03)00130-8.

      (4) Attner, M.A., and Amon, A. (2012). Control of the mitotic exit network during meiosis. Molecular Biology of the Cell 23, 3122–3132. https://doi.org/10.1091/mbc.E12-03-0235.

      (5) Pablo-Hernando, M.E., Arnaiz-Pita, Y., Nakanishi, H., Dawson, D., del Rey, F., Neiman, A.M., and de Aldana, C.R.V. (2007). Cdc15 Is Required for Spore Morphogenesis Independently of Cdc14 in Saccharomyces cerevisiae. Genetics 177, 281–293. https://doi.org/10.1534/genetics.107.076133.

      (6) El Jailani, S., Cladière, D., Nikalayevich, E., Touati, S.A., Chesnokova, V., Melmed, S., Buffin, E., and Wassmann, K. (2025). Eliminating separase inhibition reveals absence of robust cohesin protection in oocyte metaphase II. EMBO J 44, 5187–5214. https://doi.org/10.1038/s44318-025-00522-0.

      (7) Rosenberg, J.S., Cross, F.R., and Funabiki, H. (2011). KNL1/Spc105 Recruits PP1 to Silence the Spindle Assembly Checkpoint. Current Biology 21, 942–947. https://doi.org/10.1016/j.cub.2011.04.011.

      (8) Koch, L.B., Spanos, C., Kelly, V., Ly, T., and Marston, A.L. (2024). Rewiring of the phosphoproteome executes two meiotic divisions in budding yeast. EMBO J 43, 1351–1383. https://doi.org/10.1038/s44318-024-00059-8.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:  

      ZMAT3 is a p53 target gene that the Lal group and others have shown is important for p53mediated tumor suppression, and which plays a role in the control of RNA splicing. In this manuscript, Lal and colleagues perform quantitative proteomics of cells with ZMAT3 knockout and show that the enzyme hexokinase HKDC1 is the most upregulated protein. Mechanistically, the authors show that ZMAT3 does not appear to directly regulate the expression of HKDC1; rather, they show that the transcription factor c-JUN was strongly enriched in ZMAT3 pull-downs in IP-mass spec experiments, and they perform IP-western to demonstrate an interaction between c-JUN and ZMAT3. Importantly, the authors demonstrate, using ChIP-qPCR, that JUN is present at the HKDC1 gene (intron 1) in ZMAT3 WT cells and shows markedly enhanced binding in ZMAT3 KO cells. The data best fit a model whereby p53 transactivates ZMAT3, leading to decreased JUN binding to the HKDC1 promoter, and altered mitochondrial respiration.  

      Strengths:

      The authors use multiple orthogonal approaches to test the majority of their findings.  The authors offer a potentially new activity of ZMAT3 in tumor suppression by p53: the control of mitochondrial respiration.  

      Weaknesses:

      Some indication as to whether other c-JUN target genes are also regulated by ZMAT3 would improve the broad relevance of the authors' findings.  

      We thank the reviewer for the kind words and the thoughtful suggestion. As recommended, to identify additional c-JUN targets potentially regulated by ZMAT3, we intersected the genes upregulated upon ZMAT3 knockout (from our RNA-seq data) with the ChIP-Atlas dataset for human c-JUN and cross-referenced these with c-JUN peaks from three ENCODE cell lines. From this analysis, we selected for further analysis the top 4 candidate genes - LAMA2, VSNL1, SAMD3, and IL6R (Figure 5-figure supplement 2A-D). Like HKDC1, these genes were upregulated in ZMAT3-KO cells, and this upregulation was abolished upon siRNA-mediated JUN knockdown in ZMAT3-KO cells (Figure 5-figure supplement 2E). Moreover, by ChIP-qPCR we observed increased JUN binding to the JUN peak for these genes in ZMAT3-KO cells as compared to the ZMAT3-WT (Figure 5- figure supplement 2F). As described on page 11 of the revised manuscript, these results suggest that the ZMAT3/JUN axis negatively regulates HKDC1 expression and additional c-JUN target genes.   

      Reviewer #2 (Public review):

      Summary:

      The study elucidates the role of the recently discovered mediator of p53 tumor suppressive activity, ZMAT3. Specifically, the authors find that ZMAT3 negatively regulates HKDC1, a gene involved in the control of mitochondrial respiration and cell proliferation.  

      Strengths:

      Mechanistically, ZMAT3 suppresses HKDC1 transcription by sequestering JUN and preventing its binding to the HKDC1 promoter, resulting in reduced HKDC1 expression. Conversely, p53 mutation leads to ZMAT3 downregulation and HKDC1 overexpression, thereby promoting increased mitochondrial respiration and proliferation. This mechanism is novel; however, the authors should address several points.  

      Weaknesses:

      The authors conduct mechanistic experiments (e.g., transcript and protein quantification, luciferase assays) to demonstrate regulatory interactions between p53, ZMAT3, JUN, and HKDC1. These findings should be supported with functional assays, such as proliferation, apoptosis, or mitochondrial respiration analyses.  

      We thank the reviewer for appreciating our work and for this valuable suggestion. The reviewer rightly pointed out that supporting the regulatory interactions between p53, ZMAT3, JUN and HKDC1 with functional assays such as proliferation, apoptosis and mitochondrial respiration analyses would strengthen our mechanistic data. During the revision of our manuscript, we attempted to address this point by performing simultaneously knockdown of these proteins; however, we observed substantial toxicity under these conditions, making the functional assays technically unfeasible. This outcome was not unexpected as knockdown of JUN or HKDC1 individually results in growth defects.  We therefore focused our efforts on addressing the recommendation for authors.  

      Reviewer #3 (Public review):

      Summary:  

      In their manuscript, Kumar et al. investigate the mechanisms underlying the tumor suppressive function of the RNA binding protein ZMAT3, a previously described tumor suppressor in the p53 pathway. To this end, they use RNA-sequencing and proteomics to characterize changes in ZMAT3-deficient cells, leading them to identify the hexokinase HKDC1 as upregulated with ZMAT3 deficiency first in colorectal cancer cells, then in other cell types of both mouse and human origin. This increase in HKDC1 is associated with increased mitochondrial respiration. As ZMAT3 has been reported as an RNA-binding and DNA-binding protein, the authors investigated this via PAR-CLIP and ChIP-seq but did not observe ZMAT3 binding to HKDC1 pre-mRNA or DNA. Thus, to better understand how ZMAT3 regulates HKDC1, the authors used quantitative proteomics to identify ZMAT3interacting proteins. They identified the transcription factor JUN as a ZMAT3-interacting protein and showed that JUN promotes the increased HKDC1 RNA expression seen with ZMAT3 inactivation. They propose that ZMAT3 inhibits JUN-mediated transcriptional induction of HKDC1 as a mechanism of tumor suppression. This work uncovers novel aspects of the p53 tumor suppressor pathway.  

      Strengths:

      This novel work sheds light on one of the most well-established yet understudied p53 target genes, ZMAT3, and how it contributes to p53's tumor suppressive functions. Overall, this story establishes a p53-ZMAT3-HKDC1 tumor suppressive axis, which has been strongly substantiated using a variety of orthogonal approaches, in different cell lines and with different data sets.  

      Weaknesses:

      While the role of p53 and ZMAT3 in repressing HKDC1 is well substantiated, there is a gap in understanding how ZMAT3 acts to repress JUN-driven activation of the HKDC1 locus. How does ZMAT3 inhibit JUN binding to HKDC1? Can targeted ChIP experiments or RIP experiments be used to make a more definitive model? Can ZMAT3 mutants help to understand the mechanisms? Future work can further establish the mechanisms underlying how ZMAT3 represses JUN activity.  

      We thank the reviewer for the kind words and the invaluable suggestion. The reviewer has an excellent point regarding how ZMAT3 inhibits JUN binding to HKDC1 locus.Our new data included in the revised manuscript show that the ZMAT3-JUN interaction is lost in the presence of DNase or RNase, indicating that the interaction requires both DNA and RNA. This result suggests that ZMAT3 and JUN  form an RNA-dependent, chromatin- associated complex. Although not directly investigated in our study, this finding is consistent with emerging evidence that RBPs can function as chromatin-associated cofactors in transcription. For example, functional interplay between transcription factor YY1 and the RNA binding protein RBM25 co-regulates a broad set of genes, where RBM25 appears to engage promoters first and then recruit YY1, with RNA proposed to guide target recognition. We have discussed this possibility in the discussion section of revised manuscript (page 13). We agree that future work using ZMAT3 mutants and targeted ChIP or RIP assays will be valuable to delineate the precise mechanism by which ZMAT3 inhibits JUN binding to its target genes.   

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      ZMAT3 is a p53 target gene that the Lal group and others have shown is important for p53mediated tumor suppression, and which plays a role in the control of RNA splicing. In this manuscript, Lal and colleagues perform quantitative proteomics of cells with ZMAT3 knockout and show that the enzyme hexokinase HKDC1 is the most upregulated protein. HKDC1 is emerging as an important player in human cancer. Importantly, the authors show both acute (gene silencing) and chronic (CRISPR KO) approaches to silence ZMAT3, and they do this in several cell lines. Notably, they show that ZMAT3 silencing leads to impaired mitochondrial respiration, in a manner that is rescued by silencing of HKDC1. Mechanistically, the authors show that ZMAT3 does not appear to directly regulate the expression of HKDC1; rather, they show that the transcription factor c-JUN was strongly enriched in ZMAT3 pull-downs in IP-mass spec experiments, and they perform IP-western to demonstrate an interaction between c-JUN and ZMAT3. Importantly, the authors demonstrate, using ChIP-qPCR, that JUN is present at the HKDC1 gene (intron 1) in ZMAT3 WT cells, and shows markedly enhanced binding in ZMAT3 KO cells. The data best fit a model whereby p53 transactivates ZMAT3, leading to decreased JUN binding to the HKDC1 promoter (intron 1), and altered mitochondrial respiration. The findings are compelling, and the authors use multiple orthogonal approaches to test most findings. And the authors offer a potentially new activity of ZMAT3 in tumor suppression by p53: the control of mitochondrial respiration. As such, enthusiasm is high for this manuscript. 

      Addressing the following question would improve the manuscript. 

      It is not clear how many (other) c-JUN target genes might be impacted by ZMAT3; other important c-JUN targets in cancer include GLS1, WEE1, SREBP1, GLUT1, and CD36, so there could be a global impact on metabolism in ZMAT3 KO cells. Can the authors perform qPCR on these targets in ZMAT3 WT and KO cells and see if these target genes are differentially expressed? 

      We thank the reviewer for this thoughtful suggestion. As recommended, we examined the expression of key c-JUN target genes GLS1 (also known as GLS), WEE1, SREBP1, GLUT1, and CD36 in ZMAT3-WT and ZMAT3-KO cells. We first analyzed publicly available JUN ChIP-Seq data from three ENCODE cell lines, which revealed JUN binding peaks near or upstream of exon 1 for GLS1/GLS, SREBP1, and SLC2A1/GLUT1, but not for WEE1 or CD36 (Appendix 1, panels A-E). Based on these results, we performed RT-qPCR for GLS1/GLS, SREBP1 and SLC2A1 in ZMAT3-WT and ZMAT3-KO cells, with or without JUN knockdown. GLS mRNA was significantly reduced upon JUN knockdown in both ZMAT3-WT cells and ZMAT3-KO cells, but it was not upregulated upon loss of ZMAT3, indicating that GLS is a JUN target gene, but it is not regulated by ZMAT3. In contrast, SREBF1 or SLC2A1 expression remained unchanged upon ZMAT3 loss or JUN knockdown (Appendix 1 panels F-H). These data suggest that the ZMAT3/JUN axis does not regulate the expression of these genes.

      To identify additional c-JUN targets potentially regulated by ZMAT3, we intersected the genes upregulated upon ZMAT3 knockout (from our RNA-seq data) with the ChIP-Atlas dataset for human c-JUN and cross-referenced these with c-JUN peaks from three ENCODE cell lines. From this analysis, we selected for further analysis the top 4 candidate genes - LAMA2, VSNL1, SAMD3, and IL6R (Figure 5-figure supplement 2A-D). Like HKDC1, these genes were upregulated in ZMAT3-KO cells, and this upregulation was abolished upon siRNA-mediated JUN knockdown in ZMAT3-KO cells (Figure 5-figure supplement 2E). Moreover, by ChIP-qPCR we observed increased JUN binding to the JUN peak for these genes in ZMAT3-KO cells as compared to the ZMAT3-WT (Figure 5- figure supplement 2F). As described on page 11 of the revised manuscript, these results suggest that the ZMAT3/JUN axis negatively regulates HKDC1 expression and additional c-JUN target genes.   

      Minor concerns: 

      (1) Line 150: observed a modest. 

      (2) Line 159: Figure 2G appears to be inaccurately cited. 

      (3) Line 191: assays to measure. 

      We thank the reviewer for pointing these out. These minor concerns have been addressed in the text.  

      Reviewer #2 (Recommendations for the authors): 

      (1) Figure 1E: Can the authors clarify what the numbers on the left side of the chart represent? Do they refer to the scale?

      The numbers on the Y-axis represent the -log 10 (p- value) where higher values correspond to more significant changes. For visualization purposes, the significant changes are shown in red.  

      (2) Page 5, line 123: The sentence "As expected, ZMAT3 mRNA levels were decreased in the ZMAT3-KO cells" is redundant, as this information was already mentioned on page 4, line 103.  

      We thank the reviewer for noticing this redundancy. The repeated sentence has been removed in the revised manuscript.  

      (3) Page 5: The authors state: "Transcriptome-wide, upon loss of ZMAT3, 606 genes were significantly up-regulated (adj. p < 0.05 and 1.5-fold change) and 552 were down-regulated, with a median fold change of 1.76 and 0.55 for the up- and down-regulated genes, respectively." Later, on page 6, they write: "Comparison of the RNA-seq data from ZMAT3WT vs. ZMAT3-KO and CTRL siRNA vs. ZMAT3 siRNA-transfected HCT116 cells indicated that 1023 genes were commonly up-regulated, and 1042 were commonly down-regulated upon ZMAT3 loss (Figure S2C and D)." Why is the number of deregulated transcripts higher in the ZMAT3-WT vs. ZMAT3-KO comparison than in the CTRL siRNA vs. ZMAT3 siRNA comparison? Are the authors using less stringent criteria in the second analysis? This point should be clarified. 

      We thank the reviewer for highlighting this point. The reviewer is correct that less stringent criteria were used in the second analysis. On page 5, we applied stringent thresholds (adjusted p-value < 0.05 and 1.5-fold change) to identify high-confidence transcriptome-wide changes upon ZMAT3 loss. In contrast, for the comparison of both RNA-seq datasets (ZMAT3-WT vs. KO and siCTRL vs. siZMAT3), we included genes that were consistently up- or downregulated, without applying a fold change threshold, focusing instead on significantly altered genes (adjusted p < 0.05) in both datasets. This allowed us to capture broader and more reproducible transcriptomic changes that occur upon ZMAT3 depletion, including modest but significant changes upon transient ZMAT3 knockdown with siRNAs. We have now clarified this distinction on page 6 of the revised manuscript.

      (4) Figures 2B and 2E: The authors should provide quantification of HKDC1 protein levels normalized to a loading control. In addition, they should assess HKDC1 protein abundance upon ZMAT3 interference in SWI1222 and HCEC1CT cells, not just in HepG2 and HCT116 cells. 

      We thank the reviewer for this suggestion. We have now quantified all immunoblots presented throughout the manuscript, including those shown in Figures 2B and 2E, and all other figures containing protein analyses. Band intensities were quantified using ImageJ densitometry and normalized to GAPDH as the loading control. In addition, as suggested, we examined HKDC1 protein levels following ZMAT3 knockdown in two additional cell lines, SW1222 and HCEC-1CT. Consistent with our observations in HepG2 and HCT116 cells, ZMAT3 depletion led to increased HKDC1 protein levels in both SW1222 and HCEC-1CT cells. These new data are now included in Figure 2-figure supplement 1F and G. We have updated the Results section, figure legends, and figures to reflect these additions.

      (5) Figure 3A: It is unclear which gene was knocked out in the "KO cells." The authors should clearly specify this.

      We thank the reviewer for pointing this out. We have now updated Figure 3A.

      (6) Figure 3D: The result appears counterintuitive in comparison to Figure 3E. Why does HKDC1 knockdown reduce cell confluency more in ZMAT3 KO cells than in control (ZMAT3 wild-type) cells? The authors should explain this discrepancy more clearly.

      We thank the reviewer for this insightful comment. As shown in Figure 3D and 3E, knockdown of HKDC1 resulted in a greater decrease in proliferation in ZMAT3-KO cells than in ZMAT3-WT cells. This observation was indeed unexpected, given that HKDC1 acts downstream of ZMAT3. One possible explanation is that elevated HKDC1 expression in ZMAT3-KO cells increases their reliance on HKDC1 for sustaining proliferation, and that HKDC1 may also participate in additional pathways in ZMAT3-KO cells. Consequently, transient knockdown of HKDC1 in ZMAT3-KO cells would have a more pronounced effect on proliferation due to their increased dependency on HKDC1 activity. In contrast, ZMAT3WT cells which express lower levels of HKDC1 are less dependent on its function and therefore less sensitive to its depletion. We have now clarified this point on page 8 of the revised manuscript.  

      Reviewer #3 (Recommendations for the authors):  

      (1) Why do the authors start their analysis by knocking out the p53 response element in Zmat3? That should be clarified. In addition, since clones were picked after CRISPR KO of Zmat3, were experiments done to confirm that p53 signaling was not disrupted?

      We thank the reviewer for this thoughtful question. We began our study by targeting the p53 response element (p53RE) in the ZMAT3 locus because the basal expression of ZMAT3 is regulated by p53 (Muys, Bruna R. et al., Genes & Development, 2021). Deleting the p53RE therefore allowed us to markedly reduce ZMAT3 expression without disrupting the entire ZMAT3 locus. We have clarified this rationale on page 4 of the revised manuscript. To ensure that p53 signaling was not affected by this modification, we verified that canonical p53 targets such as p21 were equivalently induced in both ZMAT3WT and KO cells following Nutlin treatment and that p53 induction was unchanged(Figure 4F and Figure 1 – figure supplement 1A).

      (2) Throughout the text, many immunoblots are used to validate the knockouts and knockdowns used, but some clarification is needed. In Figure S1A, the Zmat3-WT sample seems to have significantly more p53 than the Zmat3 KO sample. Does Zmat3 KO compromise p53 levels in other experiments? It would be good to understand if Zmat3 affects p53 function by affecting its levels. Also, the p21 blot is overloaded.

      We thank the reviewer for this helpful observation. To determine whether ZMAT3 knockout affects p53 function by affecting its levels, we repeated the experiment three independent times. Western blots from these biological replicates, together with protein quantification, are now included in Appendix-2 and Figure 1-figure supplement 1A. These data show no significant differences in p53 or p21 induction between ZMAT3-WT and ZMAT3-KO cells following Nutlin treatment. In the revised manuscript, we have replaced the blot in Figure 1-figure supplement 1A with a more representative image from one of these replicate experiments.

      In Figure 2E, HKDC1 protein levels are not shown for the SW1222 and HCEC-1CT cell lines, 

      We thank the reviewer for this suggestion. HKDC1 protein levels in SW1222 and HCEC1-CT cells following ZMAT3 knockdown are now included as Figure 2- figure supplement 1F and 1G, together with the corresponding quantification.

      and Zmat3 does not appear as its characteristic two bands on the blot. What does this signify?

      We thank the reviewer for this observation. Endogenous ZMAT3 typically appears as two closely migrating bands on immunoblots. As shown in Figure 4D and Appendix 2A and 2B, these two bands are observed at the expected molecular weight following Nutlin treatment and are specific to ZMAT3, as they are markedly reduced in ZMAT3-KO cells. In contrast, only a single ZMAT3 band is visible in Figure 2E. This likely reflects limited resolution of the two bands in some blots rather than a biological difference.   

      (3) Why does HKDC1 knockdown only have an effect on metabolic phenotypes when ZMAT3 is gone? In Figure 3A, there does not seem to be a decrease in hexokinase activity in the siCTRL + siHKDC1 condition compared to siCTRL alone. Also, in Figure 3A, does phosphorylation activity of HKDC1 necessarily reflect glucose uptake, as stated? Additionally, in Figure 3C, there is no effect on mitochondrial respiration with siHKDC1, even though recent studies have shown a significant effect of HKDC1 on this.

      We thank the reviewer for raising these important questions. As noted, HKDC1 knockdown alone in wild-type cells (siCTRL + siHKDC1) does not significantly reduce hexokinase activity (Figure 3A). This likely reflects the low basal expression of HKDC1 in these cells. Thus, the metabolic phenotype may only become apparent when HKDC1 expression exceeds a functional threshold, as observed in ZMAT3-KO cells where HKDC1 is upregulated.

      Regarding the glucose uptake assay, HKDC1 itself is not phosphorylated; rather, it phosphorylates a non-catabolizable glucose analog, 2-deoxyglucose (2-DG) upon cellular uptake. According to the manufacturer’s protocol, intracellular 2-DG is phosphorylated by hexokinases to 2-deoxyglucose-6-phosphate (2-DG6P), which cannot be further metabolized and therefore accumulates. The accumulated 2-DG6P is quantified using a luminescence-based readout. This assay is widely used as a surrogate for glucose uptake because it reflects both glucose import and phosphorylation — the first step of glycolytic flux. As for the lack of change in mitochondrial respiration (Figure 3C), we acknowledge that some studies have reported mitochondrial roles for HKDC1 under basal conditions; however, such effects may be cell type-specific.

      (4) The emphasis on glycolysis signatures is confusing, as in the end, glycolysis does not seem to be affected by ZMAT3 status, but mitochondrial respiration is affected. Can the text be clarified to address this? It is also difficult to understand the role of oxygen consumption rate (OCR) in ZMAT3 phenotypes, as it does not fully track with proliferation. For example, ZMAT3 KD has the highest OCR, and the other conditions have similar OCRs but different proliferative rates in Figure 3D. Also, the colors used in Figure 3 to denote different genotypes change between B/C and D, which is confusing.

      We thank the reviewer for pointing out the inconsistency in the colors of the graph in Figure 2, which we have now corrected. Our data indicates that ZMAT3 regulates mitochondrial respiration without significantly affecting glycolysis. It is possible that mitochondria in ZMAT3-KO cells are oxidizing more substrates that are not produced by glycolysis. Additional work will be required to fully determine these mechanisms. We have clarified this on page 8 of the revised manuscript.      

      (5) The lack of ZMAT3 binding to RNAs in PAR-CLIP is not proof that it does not do so. A more targeted approach should be used, using individual RIP assays. The authors should also analyze the splicing of HKDC1, which could be affected by ZMAT3.

      As suggested, we performed ZMAT3 RNA IP experiments (RIP) using doxycycline-inducible HCT116-ZMAT3-FLAG cells. However, we did not observe significant enrichment of HKDC1 mRNA in the ZMAT3 IPs (Figure 5 – figure supplement 1A), consistent with previously published ZMAT3 RIP-seq data (Bersani et al, Oncotarget, 2016). These findings further support the notion that ZMAT3 does not directly bind to HKDC1 mRNA in these cells. We Accordingly, we have modified the text on page 10 of the revised manuscript.

      In addition, as suggested by the reviewer, we analyzed changes in splicing of HKDC1 pre-mRNA using rMATS in HCT116 cells by comparing our previously published RNA-seq data from siCTRL and siZMAT3-transfected HCT116 cells (Muys et al, Genes Dev, 2021). We focused on splicing events with an FDR < 0.05 and a delta PSI > |0.1| (representing at least a 10% change in splicing). The splicing analysis (data not shown) did not reveal any significant alterations in HKDC1 pre-mRNA splicing upon ZMAT3 knockdown. Corresponding text has been updated on page 10 of the revised manuscript.

      (6) The authors say that they examine JUN binding at the HKDC1 promoter several times, but they focus on intron 1 in Figure 5. They should revise the text accordingly, and they should also show JUN ChIP data traces for the whole HKDC1 locus in Figure 5C.

      We thank the reviewer for this helpful suggestion. As recommended, we have revised the text throughout the manuscript and replaced HKDC1 promoter with HKDC1 intron 1 DNA to accurately reflect our analysis, and Figure 5 now shows the JUN ChIP-seq signal across the entire HKDC1 locus.

      (7) In the ZMAT3 and JUN interaction assays, were these tested in the presence of DNAse or RNAse to determine if nucleic acids mediate the interaction?

      We thank the reviewer for this valuable suggestion. To test whether nucleic acids mediate the ZMAT3-JUN interaction, we performed ZMAT3 immunoprecipitation (IPs) in the presence or absence of DNase and RNase from doxycycline-inducible ZMAT3-FLAG expressing HCT116 cells. The ZMAT3-JUN interaction was lost upon treatment with either DNase or RNase, indicating that the interaction is mediated by nucleic acids. This data has been added in the revised manuscript (Figure 5-figure supplement 1D and on page 11).

    1. Reviewer #4 (Public review):

      This is an important paper that can do much to set an example for thoughtful and rigorous evaluation of a discipline-wide body of literature. The compiled website of publications in Drosophila immunity is by itself a valuable contribution to the field. There is much to praise in this work, especially including the extensive and careful evaluation of the published literature. However, there are also cautions.

      One notable concern is that the validation experiments are generally done at low sample sizes and low replication rates, and often lack statistical analysis. This is slippery ground for declaring a published study to be untrue. Since the conclusions reported here are nearly all negative, it is essential that the experiments be performed with adequate power to detect the originally described effects. At a minimum, they should be performed with the same sample size and replication structure as the originally reported studies.

      The first section of Results should be an overview of the general accuracy of the literature. Of all claims made in the 400 evaluated papers, what proportion fell into each category of "verified", "unchallenged", "challenged", "mixed", or "partially verified"? This summary overview would provide a valuable assessment of the field as a whole. A detailed dispute of individual highlighted claims could follow the summary overview.

      Section headings are phrased as declarative statements, "Gene X is not involved in process Y", which is more definitive phrasing than we typically use in scientific research. It implies proving a negative, which is difficult and rare, and the evidence provided in the present manuscript generally does not reach that threshold. A more common phrasing would be "We find no evidence that gene X contributes to process Y". A good model for this more qualified phrasing is the "We conclude that while Caspar might affect the Imd pathway in certain tissue-specific contexts, it is unlikely to act as a generic negative regulator of the Imd pathway," concluding the section on the role of Caspar. I am sure the authors feel that the softer, more qualified phrasing would undermine their article's goal of cleansing the literature of inaccuracies, but the hard declarative 'never' statements are difficult to justify unless every validation experiment is done with a high degree of rigor under a variety of experimental conditions. This caveat is acknowledged in the 3rd paragraph of the Discussion, but it is not reflected in the writing of the Results. The caveat should also appear in the Introduction.

      The article is clear that "Claims were assessed as verified, unchallenged, challenged, mixed, or partially verified," but the project is called "reproducibility project" in the 7th line of the abstract, and the website is "ReproSci". The fourth line of the abstract and the introduction call some published research "irreproducible". Most of the present manuscript does not describe reproduction or replication. It describes validation, or independent experimental tests for consistency. Published work is considered validated if subsequent studies using distinct approaches yielded consistent results. For work that the authors consider suspicious, or that has not been subsequently tested, the new experiments provided here do not necessarily recreate the published experiment. Instead, the published result is evaluated with experiments that use different tools or methods, again testing for consistency of results. This is an important form of validation, but it is not reproduction, and it should not be referred to as such. I strongly suggest that variations of the words "reproducible" or "replication" be removed from the manuscript and replaced with "validation". This will be more scientifically accurate and will have the additional benefit of reducing the emotional charge that can be associated with declaring published research to be irreproducible.

      The manuscript includes an explanatory passage in the Results section, "Our project focuses on assessing the strength of the claims themselves (inferential/indirect reproducibility) rather than testing whether the original methods produce repeatable results (results/direct reproducibility). Thus, our conclusions do not directly challenge the initial results leading to a claim, but rather the general applicability of the claim itself." Rather than first appearing in Results, this statement should appear prominently in the abstract and introduction because it is a core element of the premise of the study. This can be combined with the content of the present Disclaimer section into a single paragraph in the Introduction instead of appearing in two redundant passages. I would again encourage the authors to substitute the word validation for reproduction, which would eliminate the need for the invented distinction between indirect versus direct reproduction. It is notable that the authors have chosen to title the relevant Methods section "Experimental Validation" and not "Replication".

      Experimental data "from various laboratories" in the last paragraph of the Introduction and the first paragraph of the Results are ambiguous. Since these new experiments are part of the central core of the manuscript, the specific laboratories contributing them should be named in the two paragraphs. If experiments are being contributed by all authors on the manuscript, it would suffice to say "the authors' laboratories". The attribution to "various labs" appears to be contradicted by the Discussion paragraph 2, which states "the host laboratory has expertise in" antibacterial and antifungal defense, implying a single lab. The claim of expertise by the lead author's laboratory is unnecessary and can be deleted if the Lemaitre lab is the ultimate source of all validation experiments.

      The passage on the controversial role of Duox in the gut is balanced and scholarly, and stands out for its discussion of multiple alternative lines of evidence in the published literature and supplement. This passage may benefit from research by multiple groups following up on the original claims that are not available for other claims, but the tone of the Duox section can be a model for the other sections.

      Comments on other sections and supplements:

      I understand the desire to explain how original results may have been obtained when they are not substantiated by subsequent experiments. However, statements such as "The initial results may have been obtained due to residual impurities in preparations of recombinant GNBP1" and "Non-replicable results on the roles of Spirit, Sphinx and Spheroide in Toll pathway activation may be due to off-target effects common to first-generation RNAi tools" are speculation. No experimental data are presented to support these assertions, so these statements and others like them (currently at the end of most "insights" sections) should not appear in Results. I recognize that the authors are trying to soften their criticism of prior studies by providing explanations for how errors may have occurred innocently. If they wish to do so, the speculative hypotheses should appear in the Discussion.

      The statement in Results that "The initial claim concerning wntD may be explained by a genetic background effect independent of wntD" similarly appears to be a speculation based on the reading of the main text Results. However, the Discussion clarifies that "Here, we obtained the same results as the authors of the claim when using the same mutant lines, but the result does not stand when using an independent mutant of the same gene, indicating the result was likely due to genetic background." That additional explanation in the Discussion greatly increases reader confidence in the Result and should be explained with reference to S5 in the Results. Such complete explanations should be provided everywhere possible without requiring the reader to check the Supplement in each instance.

      In some cases, such as "The results of the initial papers are likely due to the use of ubiquitous overexpression of PGRP-LE, resulting in melanization due to overactivation of the Imd pathway and resulting tissue damage", the claim to explain the original finding would be easy to test. The authors should perform those tests where they can, if they wish to retain the statements in the manuscript. Similarly, the claim "The published data are most consistent with a scenario in which RNAi generated off-target knockdown of a protein related to retinophilin/undertaker, while Undertaker itself is unlikely to have a role in phagocytosis" would be stronger if the authors searched the Drosophila genome for a plausible homolog that might have been impacted by the RNAi construct, and then put forth an argument as to why the off-target gene is more likely to have generated the original phenotype than the nominally targeted gene. There is a brief mention in S19 that junctophilin is the authors' preferred off-target candidate, but no evidence or rationale is presented to support that assertion. If the original RNAi line is still available, it would be easy enough to test whether junctophilin is knocked down as an off-target, and ideally then to use an independent knockdown of junctophilin to recapitulate the original phenotype. Otherwise, the off-target knockdown hypothesis is idle speculation.

      A good model is the passage on extracellular DNA, which states, "experiments performed for ReproSci using the original DNAse IIlo hypomorph show that elevated Diptericin expression in the hypomorph is eliminated by outcrossing of chromosome II, and does not occur in an independent DNAse II null mutant, indicating that this effect is due to genetic background (Supplementary S11)." In this case, the authors have performed a clear experiment that explains the original finding, and inclusion of that explanation is warranted. Similar background replacement experiments in other validations are equally compelling.

      The statement "Analysis of several fly stocks expected to carry the PGRP-SDdS3 mutation used in the initial study revealed the presence of a wild-type copy PGRP-SD, suggesting that either the stock used in this study did not carry the expected mutation, or that the mutation was lost by contamination prior to sharing the stock with other labs" provides a documentable explanation of a potential error in the original two manuscripts, but the subsequent "analysis of several fly stocks" needs citations to published literature or explanation in the supplement. It is unclear from this passage how the wildtype allele in the purportedly mutant stocks could have led to the misattribution of function to PGRP-SD, so that should be explained more clearly in the manuscript.

      The originally claimed anorexia of the Gr28b mutation is explained as having been "likely obtained due to comparison to a wild-type line with unusually high feeding rates". This claim would be stronger if the wildtype line in question were named and data showing a high rate of feeding were presented in the supplement or cited from published literature. Otherwise, this appears to be speculation.

      In the section "The Toll immune pathway is not negatively regulated by wntD", FlyAtlas is cited as evidence that wntD is not expressed in adult flies. However, the FlyAtlas data is not adequately sensitive to make this claim conclusively. If the present authors wish to state that wntD is not expressed in adults, they should do a thorough test themselves and report it in the Supplement.

      Alternatively, the statement "data from FlyAtlas show that wntD is only expressed at the embryonic stage and not at the adult stage at which the experiments were performed by (Gordon et al., 2005a)" could be rephrased to something like "data from FlyAtlas show strong expression of wntD in the embryo but not the adult" and it should be followed by a direct statement that adult expression was also found to be near-undetectable by qPCR in supplement S5. That data is currently "not shown" in the supplement, but it should be shown because this is a central result that is being used to refute the original claim. This manuscript passage should also describe the expression data described in Gordon et al. (2005), for contrast, which was an experimental demonstration of expression in the embryo and a claim "RT-PCR was used to confirm expression of endogenous wntD RNA in adults (data not shown)."

      Inclusion of the section on croquemort is curious because it seems to be focused exclusively on clearance of apoptotic cells in the embryo, not on anything related to immunity. The subsection is titled "Croquemort is not a phagocytic engulfment receptor for apoptotic cells or bacteria", but the text passage contains no mention of phagocytosis of bacteria, and phagocytosis of bacteria is not tested in the S17 supplement. I would suggest deleting this passage entirely if there is not going to be any discussion of the immune-related phenotypes.

      The claim "Toll is not activated by overexpression of GNBP3 or Grass: Experiments performed for ReproSci find that contrary to previous reports, overexpression of GNBP3 (Gottar et al., 2006) or<br /> Grass (El Chamy et al., 2008) in the absence of immune challenge does not effectively activate Toll signaling (Supplementaries S6, S7)" is overly strongly stated unless the authors can directly repeat the original published studies with identical experimental conditions. In the absence of that, the claim in the present manuscript needs to be softened to "we find no evidence that..." or something similar. The definitive claim "does not" presumes that the current experiments are more accurate or correct than the published ones, but no explanation is provided as to why that should be the case. In the absence of a clear and compelling argument as to why the current experiment is more accurate, it appears that there is one study (the original) that obtained a certain result and a second study (the present one) that did not. This can be reported as an inconsistency, but the second experiment does not prove that the first was an error. The same comment applies to the refutation of the roles for Edin and IRC. Even though the current experiments are done in the context of a broader validation study, this does not automatically make them more correct. The present work should adhere to the same standards of reporting that we expect in any other piece of science.

      The statement "Furthermore, evidence from multiple papers suggests that this result, and other instances where mutations have been found to specifically eliminate Defensin expression, is likely due to segregating polymorphisms within Defensin that disrupt primer binding in some genetic backgrounds and lead to a false negative result (Supplementary S20)" should include citations to the multiple papers being referenced. This passage would benefit from a brief summary of the logic presented in S20 regarding the various means of quantifying Defensin expression.

      In S22 Results, the statement "For general characterization of the IrcMB11278 mutant, including developmental and motor defects and survival to septic injury, see additional information on the ReproSci website" is not acceptable. All necessary information associated with the paper needs to be included in the Supplement. There cannot be supporting data relegated to an independent website with no guaranteed stability or version control. The same comment applies to "Our results show that eiger flies do not have reduced feeding compared to appropriate controls (See ReproSci website)" in S25.

      Supplement S21 appears to show a difference between the wildtype and hemese mutants in parasitoid encapsulation, which would support the original finding. However, the validation experiment is performed at a small sample size and is not replicated, so there can be no statistical analysis. There is no reported quantification of lamellocytes or total hemocytes. The validation experiment does not support the conclusion that the original study should be refuted. The S21 evaluation of hemese must either be performed rigorously or removed from the Supplement and the main text.

      In S22, the second sentence of the passage "Due to the fact that IrcMB11278 flies always survived at least 24h prior to death after becoming stuck to the substrate by their wings, we do not attribute the increased mortality in Ecc15-fed IrcMB11278 flies primarily to pathogen ingestion, but rather to locomotor defects. The difference in survival between sucrose-fed and Ecc15-fed IrcMB11278 flies may be explained by the increased viscosity of the Ecc15-containing substrate compared to the sucrose-containing substrate" is quite strange. The first sentence is plausible and a reasonable interpretation of the observations. But to then conclude that the difference between the bacterial treatment versus the control is more plausibly due to substrate viscosity than direct action of the bacteria on the fly is surprising. If the authors wish to put forward that interpretation, they need to test substrate viscosity and demonstrate that fly mortality correlates with viscosity. Otherwise, they must conclude that the validation experiment is consistent with the original study.

      In S27, the visualization of eiger expression using a GFP reporter is very non-standard as a quantitative assay. The correct assay is qPCR, as is performed in other validation experiments, and which can easily be done on dissected fat body for a tissue-specific analysis. S27 Figure 1 should be replaced with a proper experiment and quantitative analysis. In S27 Figure 2, the authors should add a panel showing that eiger is successfully knocked down with each driver>construct combination. This is important because the data being reported show no effect of knockdown; it is therefore imperative to show that the knockdown is actually occurring. The same comment applies everywhere there is an RNAi to demonstrate a lack of effect.

      The Drosomycin expression data in S3 Figure 2A look extremely noisy and are presented without error bars or statistical analysis. The S4 claim that sphinx and spheroid are not regulators of the Toll pathway because quantitative expression levels of these genes do not correlate with Toll target expression levels is an extremely weak inference. The RNAi did not work in S4, so no conclusion should be inferred from those experiments. Although the original claims in dispute may be errors in both cases, the validation data used to refute the original claims must be rigorous and of an acceptable scientific standard.

      In S6 Figure 1, it is inappropriate to plot n=2 data points as a histogram with mean and standard errors. If there are fewer than four independent points, all points should be plotted as a dot plot. This comment applies to many qPCR figures throughout the supplement. In S7 Figure 1, "one representative experiment" out of two performed is shown. This strongly suggests that the two replicates are noisy, and a cynical reader might suspect that the authors are trying to hide the variance. This also applies to S5 Fig 3. Particularly in the context of a validation study, it is imperative to present all data clearly and objectively, especially when these are the specific data that are being used to refute the claim.

      Other comments:

      In S26, the authors suggest that much of the observed melanization arises from excessive tissue damage associated with abdominal injection contrasted to the lesser damage associated with thoracic injection. I believe there may be a methodological difference here. The Methods of S27 are not entirely clear, but it appears that the validation experiment was done with a pinprick, whereas the original Mabary and Schneider study was done with injection via a pulled capillary. My lab group (and I personally) have extensive experience with both techniques. In our hands, pinpricks to the abdomen do indeed cause substantial injury, and the physically less pliable thorax is more robust to pinpricks. However, capillary injections to the abdomen do virtually no tissue damage - very probably less than thoracic injections - and result in substantially higher survivals of infection even than thoracic injections. Thus, the present manuscript may infer substantial tissue damage in the original study because they are employing a different technique.

    1. Pérez told me stories of scientists who sacrificed their academic careers to build software, because building software counted for so little in their field: The creator of matplotlib, probably the most widely used tool for generating plots in scientific papers, was a postdoc in neuroscience but had to leave academia for industry. The same thing happened to the creator of NumPy, a now-ubiquitous tool for numerical computing. Pérez himself said, “I did get straight-out blunt comments from many, many colleagues, and from senior people and mentors who said: Stop doing this, you’re wasting your career, you’re wasting your talent.” Unabashedly, he said, they’d tell him to “go back to physics and mathematics and writing papers.”

      También he vivido la subvaloración asociada a publicar y sostener software libre en y desde contextos comunitarios en contraste con la publicación en circuitos académicos clásicos. Y si bien las universidades locales se están pensando esto en aras de visibilizar innovación, lo hacen muy lentamente, como es habitual, mientras los incentivos siguen estando alineados a las métricas convencionales

    2. As science becomes more about computation, the skills required to be a good scientist become increasingly attractive in industry. Universities lose their best people to start-ups, to Google and Microsoft. “I have seen many talented colleagues leave academia in frustration over the last decade,” he wrote, “and I can’t think of a single one who wasn’t happier years later.”

      Yo he escuchado esa sirena en el pasado, e incluso me propusieron trabajar con una de las big pharma y con una aseguradora, básicamente por mis conocimientos en programación (Pharo Smalltalk, específicamente). Sin embargo, "recaí a la academia, después de ser un académico en rehabilitación", como suelo decir y ahora volví de tiempo completo. Creo que una alternativa entre un camino y otro es ser académico/consultor, produciendo bienes comunes que uno trae del sector de las MiPyMes (micro, pequeña y medianas empresas) que hacen innovación local hacia la academía, en lo que uno esperaría que sea un ciclo virtuoso.

      Lo anterior es lo que he intentando con mutabiT de manera sostenida desde hace un par de décadas, gracias a las economías de los afectos (incluyendo mi mamá, mi hermana, Adriana y otros amigues) y si bien eso ha permitido hacer investigación de largo aliento desde las latitudes de la Mayoría Global sin mayores pérdidas de dinero, tampoco ha sido un esfuerzo lucrativo. Creo que, en caso de no poder continuar produciendo bienes comunes que vinculen los mundos académicos y productivos en esa escala sostenible y autónoma en los contextos locales, tendría que decidirme entre dejar alguno de los dos, como cuentan que han hecho estos académicos de otras latitudes.

    3. Stephen Wolfram who titled a book about his own work on cellular automata A New Kind of Science. In his blog post about computational essays, he writes, “At the core of computational essays is the idea of expressing computational thoughts using the Wolfram Language.”

      Esta idea de vincular sus productos a sus discursos hace ver todas las charlas de Stephen Wolfram, como charlas de mercadeo de sus productos, más que de sus ideas, con la consecuente necesidad de mostrar su productos como las únicas alternativas valiosas para explorar ideas que ocurren en muchos lados y de muchas formas.

    4. it might be too much to ask publishers to abandon PDFs, an open format, for a proprietary product. “Right now if you make a Mathematica notebook and you try to send that to a journal,” Gray says, “they’re gonna complain: Well, we don’t have Mathematica, this is an expensive product—give us something that’s more of a standard.”

      Hoy podrían enviarle una libreta computacional libre, incluso con un contenedor que reproduzca todo el entorno y los datos que hacen el artículo posible. Yo experimenté con algo así en 2016 durante mi pasantía doctoral para mi prototipo titulado "Panama Papers: a case for reproducible research, data activism and frictionless data" e incluso creé una versión web y una versión PDF, con su respectivo repositorio de código. Dado que fue un enfoque original cuando aún no conocía de los esfuerzos resonantes en el Norte Global, usé un entorno más ligero con Grafoscopio y la imagen de Pharo en lugar de contenedores.

      Hoy, lugares como NextJournal o Marimo están pensando en otras maneras de publicar para la web usando libretas computacionales interactivas y continúan con tradiciones del Norte Global, a la vez que ignoran lo que hemos hecho desde la mayoría Global, como es habitual. Sin embargo es bueno ver esas miradas en resonancia e incluso los adelantos que tenemos acá en publicaciones multiformato, de fuenté única (Perro Tuerto, del MIAU, también hablaba de esto)

    5. The Mathematica notebook is the more coherently designed, more polished product—in large part because every decision that went into building it emanated from the mind of a single, opinionated genius. “I see these Jupyter guys,” Wolfram said to me, “they are about on a par with what we had in the early 1990s.” They’ve taken shortcuts, he said. “We actually want to try and do it right.”

      Desde mediados/finales de los noventas no uso Mathematica, e incluso en ese momento era un gran sistema, altamente integrado y coherente. Sin embargo, en la medida en que me decanté por el software libre, empecé prontamente a buscar alternativas e inicié con TeXmacs, del cual traduje la mayor parte de su documentación al español, como una de mis primeras contribuciones a un proyecto de software libre (creo que aún la traducción es la que se está usando y por aquella época usábamos SVN para coordinar cambio e incluso enviábamos archivos compresos, pues el control de versiones no era muy popular).Por ejemplo el bonito y minimalista Yacas, con el que hiciera muchas de mis tareas en pregrado y colocara algunos talleres y corrigiría parciales cuando me convirtiese en profesor del departamento de Matemáticas

      TeXmacs, a diferencia de sistemas monolíticos como Mathematica, se conectaba ya desde ese entonces con una gran variedad de Sistemas de Álgebra Computacional (o CAS, por sus siglas en inglés) exponiéndonos a una diversidad de enfoques y paradigmas CAS, con sus sintaxis e idiosincracias particulares, en una riqueza que Mathematica nunca tendrá.

      TeXmacs también me expondría a ideas poderosas, como poder cambiar el software fácilmente a partir de pequeños scripts (en Scheme), que lo convirtieron en el primer software libre que modifiqué, y las poderosas S-expressions que permitían definir un documento y su interacción con CAS externos, si bien TeXmas ofrecía un lenguaje propio mas legible y permitía pasar de Scheme a este y viceversa.

      En general esa es la diferencia de los sistemas privativos con los libres: una monocultura versus una policultura, con las conveniencias de la primera respecto a los enfoques unificantes contra la diversidad de la segunda. Si miramos lo que ha ocurrido con Python y las libretas computacionales abiertas como Marimo y Jupyter, estos han ganado en la conciencia popular con respecto a Mathematica y han incorporado funcionalidad progresiva que Mathematica tenía, mientras que otra sigue estando aún presente en los sistemas privativos y no en los libres y viceversa. Yo no diría que las libretas computacionales libres están donde estaba Mathematica en los 90's, sino que han seguido rutas históricas diferentes, cada una con sus valores y riquezas.

    6. built their notebooks as simple web pages. The interface is missing Mathematica’s Steve Jobsian polish, and its sophistication. But by latching itself to the web, IPython got what is essentially free labor: Any time Google, Apple, or a random programmer open-sourced a new plotting tool, or published better code for rendering math, the improvement would get rolled into IPython. “It has paid off handsomely,” Pérez said.

      Algo similar es lo que quiero capitalizar con Cardumem y luego portar a Grafoscopio, pues, como lo ha mostrado la experiencia con este último, las interfaces en Spec, el toolkit gráfico de Pharo, si bien brindan algunas cosas que las interfaces web no tienen, adolecen del basto ecosistema de ésta última y mantienen los documentos y la computación aisladas dentro de la imagen.

      La web, por el contrario, es casi ubicua en términos de las tecnologías ya instaladas y así no se cuente con una conexión a internet en el equipo de cómputo, si este tiene una interfaz gráfica, muy seguramente contará con un naveador web. Y ahora que los sistemas hipermedia, hacen posible programar la web desde cualquier lenguaje (HOWL: Hypermedia On Whatever you Like), se puede aprovechar tanto lo que sabemos de los lenguajes/entornos que nos gustan (Pharo o Lua) como del amplio sistema de la web. Antes de 2023, que se popularizaron los sistemas hipermedia, teníamos que elegir entre lo uno y lo otro. Y yo deselegí activamente la web, debido al adefesio de JavaScript y lo engorroso del CSS. Hoy, las condiciones son bien distintas.

    7. In early 2001, Fernando Pérez found himself in much the same position Wolfram had 20 years earlier: He was a young graduate student in physics running up against the limits of his tools. He’d been using a hodgepodge of systems, Mathematica among them, feeling as though every task required switching from one to the next. He remembered having six or seven different programming-language books on his desk. What he wanted was a unified environment for scientific computing.

      Como he documentado en mi blog, antes de Grafoscopio, mis primeros intentos por crear flujos y herramientas de documentación interactiva a medida fueron en IPython, pero me cansé de lidiar con la complejidad incidental del stack con el que IPython y luego Jupyter estaban hechos, encontrando en Pharo Smalltalk una plataforma más coherente, simple, entendible y adaptable.

      Como he explicado en otros apartados, si bien Grafoscopio mezcla herramientas distintas de nuestros flujos de documentación y publicación, ahora Cardumem surge con la idea de crear una interfaz web y una experiencia unificada para las integraciones, con una sintaxis minimalista y sin los requerimientos conceptuales de la programación orientada a objetos.

    8. Grafoscopio incluía los automatismos que permitían exportar un documento/árbol a distintos formatos. Cardumem debería continuar dicha tradición, indicando qué versión y checksum del mismo fue utilizado para la exportación, junto con el algoritmo de selección, traversal y expotación que dieron lugar a un documento exportado en un formato particular. Dicha información debería ser parte del documento fuente, del mismo modo que lo era en Grafoscopio y, eventualmente, del exportado, cuando sea posible. (en Grafoscopio lo colocaba en los anexos.

    9. To write a paper in a Mathematica notebook is to reveal your results and methods at the same time; the published paper and the work that begot it. Which shouldn’t just make it easier for readers to understand what you did—it should make it easier for them to replicate it (or not). With millions of scientists worldwide producing incremental contributions, the only way to have those contributions add up to something significant is if others can reliably build on them. “That’s what having science presented as computational essays can achieve,” Wolfram said.

      La idea y el uso de las libretas computacionales es menos generales que la de (Inter) Personal Knowledge Management, como era de esperarse y ha evidenciado nuestras prácticas en la comunidad de Grafoscopio, donde sus libretas interactivas fueron usadas extensivamente y de acuerdo a las necesidades descubiertas con la comunidad en la creación y articulación de flujos documentales a medida. Algo similar se puede decir en las ciencias sociales y humanas, donde también se escribe, pero no simulaciones de sistemas complejos.

      En estos contextos comunitarios y de las ciencias amplias y estudios críticos, el hipertexto y la interactividad puede servir, pero más para explorar la memoria propia, imaginar y enactuar otras formas de comunicarla y construirla.

      El énfasis actual en Cardumem, en lugar de Grafoscopio, y los posibles vínculos entre ambos reconoce estas otras posibilidades de interacción y computación desde esa memoria interpersonal e interactiva.

    1. To g e t t h e m o s t o u t o f y o u r reading, follow the five steps of the reading process

      I often read for my own enjoyment and entertainment so when it comes to having to active read it is very different. I'm glad there is a reminder of the steps here to show that it does take a little more time but, active reading is simple once you get the hang of it and have had practice.

    2. To m o v e f r o m r e a d i n g t o w r i t i n g , y o u n e e d t o r e a d a c t i v e l y, i n a t h o u g h t -ful spirit, and with an alert, inquiring mind. Reading actively means learning how to analyze what you read.

      It is important to learn how to truly analyze a text and figure out what are the key points that you're supposed to be focusing on. Active reading is important when it comes to writing about a specific reading.

    3. o g e t t h e m o s t o u t o f y o u r reading, follow the five steps of the reading process.

      I am glad that the article is giving these notes. It will be something I will look forward to working on and applying to my day to day readings.

    4. To m o v e f r o m r e a d i n g t o w r i t i n g , y o u n e e d t o r e a d a c t i v e l y, i n a t h o u g h t -ful spirit, and with an alert, inquiring mind. Reading actively means learning how to analyze what you read.

      Active reading is a skill that if learned properly I believe really elevates what you get out of whatever it is that you are reading. Understanding the piece and remembering parts of it come so much easier if you active read.

    1. examen físico revela sensibilidad difusa y vigilancia sin hallazgos localizados, ausencia de una fuente de infección tratable quirúrgicamente en un estudio de imagen, y la presencia de más de 250 neutrófilos/mL en fluido obtenido mediante paracentesis.

      x

    1. Et quel est le degré de rejouabilité des animations Flash ?

      Il y avait un problème de rupture dans la première version de ce bout de texte. Je l'ai donc reformulé pour compléter la question qui se limitait d'abord à "rejouabilité des animations Flash?"

    2. mais aussi que la capture d’un site ne soit pas complète, en fonction des liens suivis ou non par le robot.

      Il semble y avoir une rupture dans la phrase. Je proposerais de reformuler ainsi pour fluidifier la lecture :

      "[...] site. Il est aussi possible que la capture d'un site soit incomplète, en fonction des liens suivis ou non par le robot."

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      Fungal survival and pathogenicity rely on the ability to undergo reversible morphological transitions, which are often linked to nutrient availability. In this study, the authors uncover a conserved connection between glycolytic activity and sulfur amino acid biosynthesis that drives morphogenesis in two fungal model systems. By disentangling this process from canonical cAMP signaling, the authors identify a new metabolic axis that integrates central carbon metabolism with developmental plasticity and virulence.

      Strengths:

      The study integrates different experimental approaches, including genetic, biochemical, transcriptomic, and morphological analyses, and convincingly demonstrates that perturbations in glycolysis alter sulfur metabolic pathways and thus impact pseudohyphal and hyphal differentiation. Overall, this work offers new and important insights into how metabolic fluxes are intertwined with fungal developmental programs and therefore opens new perspectives to investigate morphological transitioning in fungi.

      We thank the reviewer for finding this study to be of importance and for appreciating our multipronged approach to substantiate our finding that perturbations in glycolysis alter sulfur metabolism and thus impact pseudohyphal and hyphal differentiation in fungi.

      Weaknesses:

      A few aspects could be improved to strengthen the conclusions. Firstly, the striking transcriptomic changes observed upon 2DG treatment should be analyzed in S. cerevisiae adh1 and pfk1 deletion strains, for instance, through qPCR or western blot analyses of sulfur metabolism genes, to confirm that observed changes in 2DG conditions mirror those seen in genetic mutants. Secondly, differences between methionine and cysteine in their ability to rescue the mutant phenotype in both species are not mentioned, nor discussed in more detail. This is especially important as there seem to be differences between S. cerevisiae and C. albicans, which might point to subtle but specific metabolic adaptations.

      The authors are also encouraged to refine several figure elements for clarity and comparability (e.g., harmonized axes in bar plots), condense the discussion to emphasize the conceptual advances over a summary of the results, and shorten figure legends.

      We are grateful for this valuable and constructive feedback, and we agree with the reviewer on the necessity of performing RT-qPCR analysis of sulfur metabolism genes in ∆∆pfk1 and ∆∆adh1 strains of S. cerevisiae to validate our RNA-Seq results using 2DG. We have performed this experiment, and our results show that several genes involved in the de novo biosynthesis of sulfur-containing amino acids are downregulated in both the ∆∆pfk1 and ∆∆adh1 strains, corroborating the downregulation of sulfur metabolism genes in the 2DG treated samples. This new data is now included in the revised manuscript as Supplementary Figure 2C. 

      Furthermore, we acknowledge the reviewer’s point regarding the significance of comparing the differences in the ability of methionine and cysteine to rescue filamentation defects exhibited by the mutants, between S. cerevisiae and C. albicans. The observed differences between S. cerevisiae and C. albicans likely highlight species-specific metabolic adaptations within the sulfur assimilation pathway.  While both yeasts employ the transsulfuration pathway to interconvert these sulfur-containing amino acids, the precise regulatory points including the specific enzymes, their compartmentalization, and transcriptional control are not identical. For instance, differences in the feedback inhibition mechanisms or the expression levels of key transsulfuration enzymes between S. cerevisiae and C. albicans could explain the variations in the phenotypic rescue experiments (Chebaro et al., 2017; Lombardi et al., 2024; Rouillon et al., 2000; Shrivastava et al., 2021; Thomas and Surdin-Kerjan, 1997). Furthermore, the species-specific differences in amino acid transport systems (permeases) adds another layer of complexity. S. cerevisiae primarily uses multiple, low-affinity permeases for cysteine transport (Gap1, Bap2, Bap3, Tat1, Tat2, Agp1, Gnp1, Yct1), while relying on a limited set of high-affinity transporters (like Mup1) for methionine transport, with the added complexity that its methionine transporters can also transport cysteine (Düring-Olsen et al., 1999; Huang et al., 2017; Kosugi et al., 2001; Menant et al., 2006). In contrast, C. albicans utilizes a high-affinity transporters for the uptake of both amino acids, employing Cyn1 specifically for cysteine and Mup1 for methionine, indicating a greater reliance on dedicated transport mechanisms for these sulfur-containing molecules in the pathogenic yeast (Schrevens et al., 2018; Yadav and Bachhawat, 2011). A combination of the aforesaid factors could be the potential reason for the differences in the ability of cysteine and methionine to rescue filamentation in S. cerevisiae and C. albicans.

      Finally, we have enhanced the quantitative rigor and clarity of the data presentation in the revised manuscript by implementing Y-axis uniformity across all relevant bar graphs to facilitate a more robust and direct comparative analysis. We have also condensed the discussion to emphasize the conceptual advances and have shortened the figure legends as per the reviewer suggestions

      Reviewer #2 (Public review):

      Summary:

      This manuscript investigates the interplay between glycolysis and sulfur metabolism in regulating fungal morphogenesis and virulence. Using both Saccharomyces cerevisiae and Candida albicans, the authors demonstrate that glycolytic flux is essential for morphogenesis under nitrogen-limiting conditions, acting independently of the established cAMP-PKA pathway. Transcriptomic and genetic analyses reveal that glycolysis influences the de novo biosynthesis of sulfur-containing amino acids, specifically cysteine and methionine. Notably, supplementation with sulfur sources restores morphogenetic and virulence defects in glycolysis-deficient mutants, thereby linking core carbon metabolism with sulfur assimilation and fungal pathogenicity.

      Strengths:

      The work identifies a previously uncharacterized link between glycolysis and sulfur metabolism in fungi, bridging metabolic and morphogenetic regulation, which is an important conceptual advance and fungal pathogenicity. Demonstrating that adding cysteine supplementation rescues virulence defects in animal models connects basic metabolism to infection outcomes, which adds to biomedical importance.

      We would like to thank the reviewer for the positive comments on our work. We are pleased that they recognize the novel metabolic link between glycolysis and sulfur metabolism as a key conceptual advance in fungal morphogenesis. 

      Weaknesses:

      The proposed model that glycolytic flux modulates Met30 activity post-translationally remains speculative. While data support Met4 stabilization in met30 deletion strains, the mechanism of Met30 modulation by glycolysis is not demonstrated.

      We thank the reviewer for this valuable feedback. The activity of the SCF<sup>Met30</sup> E3 ubiquitin ligase, mediated by the F box protein Met30, is dynamically regulated through both proteolytic degradation and its dissociation from the SCF complex, to coordinate sulfur metabolism and cell cycle progression (Smothers et al., 2000; Yen et al., 2005). Our transcriptomic (RNA-seq analysis) and protein expression analysis (Fig. 3J) confirms that Met30 expression is not differentially regulated in the presence of 2DG, effectively eliminating changes in synthesis or SCF<sup>Met30</sup> proteasomal degradation as the dominant regulatory mechanism. This observation is consistent with the established paradigm wherein stress signals, such as cadmium (Cd<sup>2+</sup>) exposure, rapidly inactivates the SCF<sup>Met30</sup> E3 ubiquitin ligase via the dissociation of Met30 from the Skp1 subunit of the SCF complex (Lauinger et al., 2024; Yen et al., 2005). We therefore propose that active glycolytic flux modulates SCF<sup>Met30</sup> activity post-translationally, specifically by triggering Met30 detachment from the SCF complex. This mechanism would stabilize the primary substrate, the transcription factor Met4, thus promoting the biosynthesis of sulfur-containing amino acids. Mechanistic validation of this hypothesis, particularly the assessment of Met30 dissociation from the SCF<sup>Met30</sup> complex via immunoprecipitation (IP), is technically challenging. Since these experiments will involve isolation of cells from colonies undergoing pseudohyphal differentiation, on solid media (given that pseudohyphal differentiation does not occur in liquid media that is limiting for nitrogen (Gancedo, 2001; Gimeno et al., 1992)), current cell yields (OD<sub>600</sub>≈1 from ≈80-100 colonies) are significantly below the amount of cells that is needed to obtain the required amount of total protein concentration, for standard pull down assays (OD<Sub>600</sub>≈600-800 is required to achieve 1-2 mg/ml of total protein which is the standard requirement for pull down protocols in S. cerevisiae (Lauinger et al., 2024)).

      Given that the primary objective of our study is to establish the novel regulatory link between glycolysis and sulfur metabolism in the context of fungal morphogenesis, we would like to explore these crucial mechanistic details, in depth, in a subsequent study.

      Reviewer #3 (Public review):

      This study investigates the connection between glycolysis and the biosynthesis of sulfur-containing amino acids in controlling fungal morphogenesis, using Saccharomyces cerevisiae and C. albicans as model organisms. The authors identify a conserved metabolic axis that integrates glycolysis with cysteine/methionine biosynthetic pathways to influence morphological transitions. This work broadens the current understanding of fungal morphogenesis, which has largely focused on gene regulatory networks and cAMP-dependent signaling pathways, by emphasizing the contribution of metabolic control mechanisms. However, despite the novel conceptual framework, the study provides limited mechanistic characterization of how the sulfur metabolism and glycolysis blockade directly drive morphological outcomes. In particular, the rationale for selecting specific gene deletions, such as Met32 (and not Met4), or the Met30 deletion used to probe this pathway, is not clearly explained, making it difficult to assess whether these targets comprehensively represent the metabolic nodes proposed to be critical. Further supportive data and experimental validation would strengthen the claims on connections between glycolysis, sulfur amino acid metabolism, and virulence.

      Strengths:

      (1) The delineation of how glycolytic flux regulates fungal morphogenesis through a cAMP-independent mechanism is a significant advancement. The coupling of glycolysis with the de novo biosynthesis of sulfur-containing amino acids, a requirement for morphogenesis, introduces a novel and unexpected layer of regulation.

      (2) Demonstrating this mechanism in both S. cerevisiae and C. albicans strengthens the argument for its evolutionary conservation and biological importance.

      (3) The ability to rescue the morphogenesis defect through exogenous supplementation of sulfur-containing amino acids provides functional validation.

      (4) The findings from the murine Pfk1-deficient model underscore the clinical significance of metabolic pathways in fungal infections.

      We are grateful for this comprehensive and insightful summary of our work. We deeply appreciate the reviewer's recognition of the key conceptual breakthroughs regarding the metabolic regulation of fungal morphogenesis and the clinical relevance of our findings.

      Weaknesses:

      (1) While the link between glycolysis and sulfur amino acid biosynthesis is established via transcriptomic and proteomic analysis, the specific regulation connecting these pathways via Met30 remains to be elucidated. For example, what are the expression and protein levels of Met30 in the initial analysis from Figure 2? How specific is this effect on Met30 in anaerobic versus aerobic glycolysis, especially when the pentose phosphate pathway is involved in the growth of the cells when glycolysis is perturbed ?

      We are grateful for the insightful feedback provided by the reviewer. S. cerevisiae is a Crabtree positive organism that primarily uses anaerobic glycolysis to metabolize glucose, under glucose-replete conditions (Barford and Hall, 1979; De Deken, 1966) and our pseudohyphal differentiation assays are performed in glucose-rich conditions (Gimeno et al., 1992). Furthermore, perturbation of glycolysis is known to induce compensatory upregulation of the Pentose Phosphate Pathway (PPP) (Ralser et al., 2007) and we have also observed the upregulation of the gene that encodes for transketolase-1 (Tkl1), a key enzyme in the PPP, in our RNA-seq data. Importantly, our transcriptomic (RNA-seq analysis) and protein expression analysis (Fig. 3J) confirms that Met30 expression is not differentially regulated in the presence of 2DG, effectively eliminating changes in synthesis or SCF<sup>Met30</sup> proteasomal degradation as the dominant regulatory mechanism.  This aligns with the established paradigm wherein stress signals, such as cadmium (Cd<sup>2+</sup>) exposure, rapidly inactivates SCF<sup>Met30</sup> E3 ubiquitin ligase via Met30 dissociation from the Skp1 subunit of the complex (Lauinger et al., 2024; Yen et al., 2005). We therefore propose that active glycolytic flux modulates SCF<sup>Met30</sup> activity post-translationally, specifically by triggering Met30 detachment from the SCF complex. This mechanism would stabilize the primary substrate, the transcription factor Met4, thus promoting the biosynthesis of sulfur-containing amino acids. Further experiments are required to delineate the specific role of pentose phosphate pathway in the aforesaid proposed regulation of the Met30 activity under glycolysis perturbation and this will be explored in our subsequent study.

      (2) Including detailed metabolite profiling could have strengthened the metabolic connection and provided additional insights into intermediate flux changes, i.e., measuring levels of metabolites to check if cysteine or methionine levels are influenced intracellularly. Also, it is expected to see how Met30 deletion could affect cell growth. Data on Met30 deletion and its effect on growth are not included, especially given that a viable heterozygous Met30 strain has been established. Measuring the cysteine or methionine levels using metabolomic analysis would further strengthen the claims in every section.

      We are grateful to the reviewer for this constructive feedback. To address the potential impact of met30 deletion on cell growth, we have included new data (Suppl. Fig. 4A) demonstrating that the deletion of a single copy of met30 in diploid S. cerevisiae does not compromise overall cell growth under nitrogen-limiting conditions as the ∆met30 strain grows similar to the wild-type strain. 

      Our pseudohyphal/hyphal differentiation assays show that the defects induced by glycolytic perturbation is fully rescued by the exogenous supplementation of sulfur-containing amino acids, cysteine or methionine. Since these data conclusively demonstrate that the primary metabolic limitation caused by the perturbation of glycolysis, which leads to filamentation defects is sulfur metabolism, we posit that performing comprehensive metabolic profiling would primarily reconfirm the aforesaid results. We believe that our in vitro and in vivo sulfur add-back experiments sufficiently substantiate the novel regulatory metabolic link between glycolysis and sulfur metabolism.

      (3) In comparison with the previous bioRxiv (doi: https://doi.org/10.1101/2025.05.14.654021) of this article in May 2025 to the recent bioRxiv of this article (doi: https://doi.org/10.1101/2025.05.14.654021), there have been some changes, and Met30 deletion has been recently included, and the chemical perturbation of glycolysis has been added as new data. Although the changes incorporated in the recent version of the article improved the illustration of the hypothesis in Figure 6, which connects glycolysis to Sulfur metabolism, the gene expression and protein levels of all genes involved in the illustrated hypothesis are not consistently shown. For example, in some cases, the Met4 expression is not shown (Figure 4), and the Met30 expression is not shown during profiling (gene expression or protein levels) throughout the manuscript. Lack of consistency in profiling the same set of key genes makes understanding more complicated.

      We thank the reviewer for this feedback which helps us to clarify the scope of our transcriptomic analysis. Our decision to focus our RT-qPCR experiments on downstream targets, while excluding met4 and met30 from the RT-qPCR analysis, is based on their known regulatory mechanisms. Met4 activity is predominantly regulated by post-translational ubiquitination by the SCFMet30 complex followed by its degradation (Rouillon et al., 2000; Shrivastava et al., 2021; Smothers et al., 2000)  while Met30 activity is primarily regulated by its auto-degradation or its dissociation from the SCFMet30 complex (Lauinger et al., 2024; Smothers et al., 2000; Yen et al., 2005).  Consistent with this, our RNA-Seq results indicate that neither met4 nor met30 transcripts are differentially expressed, in response to 2DG addition. For all our RT-qPCR analysis in S. cerevisiae and C. albicans, we have consistently used the same set of sulfur metabolism genes and these include met32, met3, met5, met10 and met17. Our data on protein expression analysis of Met30 in S. cerevisiae (Fig. 3J) confirms that Met30 expression is not differentially regulated in the presence of 2DG, effectively eliminating changes in synthesis or SCFMet30 proteasomal degradation as the dominant regulatory mechanism.

      (4) The demonstrated link between glycolysis and sulfur amino acid biosynthesis, along with its implications for virulence in C. albicans, is important for understanding fungal adaptation, as mentioned in the article; however, the Met4 activation was not fully characterized, nor were the data presented when virulence was assessed in Figure 4. Why is Met4 not included in Figure 4D and I? Especially, according to Figure 6, Met4 activation is crucial and guides the differences between glycolysis-active and inactive conditions.

      We thank the reviewer for their input. As the Met4 transcription factor in C. albicans is primarily regulated post-translationally through its degradation and inactivation by the SCFMet30 E3 ubiquitin ligase complex (Shrivastava et al., 2021), we opted to monitor the transcriptional status of downstream targets of Met4 (i.e., genes directly regulated by Met4), as these are the genes that exhibit the most direct and functionally relevant transcriptional changes in response to the altered Met4 levels.

      (5) Similarly, the rationale behind selecting Met32 for characterizing sulfur metabolism is unclear. Deletion of Met32 resulted in a significant reduction in pseudohyphal differentiation; why is this attributed only to Met32? What happens if Met4 is deleted? It is not justified why Met32, rather than Met4, was chosen. Figure 6 clearly hypothesizes that Met4 activation is the key to the mechanism.

      We sincerely thank the reviewer for this insightful query regarding our selection of the met32 for our gene deletion experiments. The choice of ∆∆met32 strain was strategically motivated by its unique phenotypic properties within the de novo biosynthesis of sulfur-containing amino acids pathway. While deletions of most the genes that encode for proteins involved in the de novo biosynthesis of sulfurcontaining amino acids, result in auxotrophy for methionine or cysteine, ∆∆met32 strain does not exhibit this phenotype (Blaiseau et al., 1997). This key distinction is attributed to the functional redundancy provided by the paralogous gene, met31 (Blaiseau et al., 1997). Crucially, given that the deletion of the central transcriptional regulator, met4, results in cysteine/methionine auxotrophy, the use of the ∆∆met32 strain provides an essential, viable experimental model for investigating the role of sulfur metabolism during pseudohyphal differentiation in S. cerevisiae.

      (6) The comparative RT-qPCR in Figure 5 did not account for sulfur metabolism genes, whereas it was focused only on virulence and hyphal differentiation. Is there data to support the levels of sulfur metabolism genes?

      We thank the reviewer for this feedback. We wish to respectfully clarify that the data pertaining to expression of sulfur metabolism genes in the presence of 2DG or in the ∆∆pfk1 strain in C. albicans are already included and discussed within the manuscript. These results can be found in Figure 4, panels D and I, respectively.

      (7) To validate the proposed interlink between sulfur metabolism and virulence, it is recommended that the gene sets (illustrated in Figure 6) be consistently included across all comparative data included throughout the comparisons. Excluding sulfur metabolism genes in Figure 5 prevents the experiment from demonstrating the coordinated role of glycolysis perturbation → sulfur metabolism → virulence. The same is true for other comparisons, where the lack of data on Met30, Met4, etc., makes it hard.to connect the hypothesis. It is also recommended to check the gene expression of other genes related to the cAMP pathway and report them to confirm the cAMP-independent mechanism. For example, gap2 deletion was used to confirm the effects of cAMP supplementation, but the expression of this gene was not assessed in the RNA-seq analysis in Figure 2. It would be beneficial to show the expression of cAMP-related genes to completely confirm that they do not play a role in the claims in Figure 2.

      We thank the reviewer for this valuable feedback. The transcriptional analysis of the sulfur metabolism genes in the presence of 2DG and the ∆∆pfk1 strain is shown in Figures 4D and 4I.

      Our RNA-seq analysis (Author response image 1) confirms that there is no significant transcriptional change in the expression of cAMP-PKA pathway associated genes (Log2 fold change ≥ 1 for upregulated genes and Log2 fold change ≤ -1 for downregulated genes) in 2DG treated cells compared to the untreated control cells, reinforcing our conclusion that the glycolytic regulation of fungal morphogenesis is mediated through a cAMP-PKA pathway independent mechanism.

      Author response image 1.

      (8) Although the NAC supplementation study is included in the new version of the article compared to the previous version in BioRxiv (May 2025), the link to sulfur metabolism is not well characterized in Figure 5 and their related datasets. The main focus of the manuscript is to delineate the role of sulfur metabolism; hence, it is anticipated that Figure 5 will include sulfur-related metabolic genes and their links to pfk1 deletion, using RT-PCR measurements as shown for the virulence genes.

      We thank the reviewer for this question. The relevant data are indeed present within the current submission. We respectfully direct the reviewer's attention to Figure 4, panels D and I, where the data pertaining to expression of sulfur metabolism genes in the presence of 2DG or in the ∆∆pfk1 strain in C. albicans can be found.

      (9) The manuscript would benefit from more information added to the introduction section and literature supports for some of the findings reported earlier, including the role of (i) cAMP-PKA and MAPK pathways, (ii) what is known in the literature that reports about the treatment with 2DG (role of Snf1, HXT1, and HXT3), as well as how gpa2 is involved. Some sentences in the manuscripts are repetitive; it would be beneficial to add more relevant sections to the introduction and discussion to clarify the rationale for gene choices.

      We thank the reviewer for this valuable feedback. We have incorporated these changes in our revised manuscript.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      (1) Line 107: As morphological transitions are indeed a conserved phenomenon across fungal species, hosts & environmental niches, the authors could refer to a few more here (infection structures like appressoria; fruiting bodies, etc.).

      We thank the reviewer for this valuable feedback. We have incorporated these changes in our revised manuscript.

      Line 119/120: That's a bit misleading in my opinion. Gpr1 acts as a key sensor of external carbon, while Ras proteins control the cAMP pathway as intracellular sensory proteins. That should be stated more clearly. cAMP is the output and not the sensor.

      We appreciate the reviewer's detailed attention to this signaling network. We have revised the manuscript to precisely reflect this established signaling hierarchy for maximum clarity.

      (2) Line 180: ..differentiation

      We thank the reviewer for this valuable feedback. We have incorporated this change in our revised manuscript.

      (3) Figure 1 panels C & F. The authors should provide the same scale for all experiments. Otherwise, the interpretation can be difficult. The same applies to the different bar plots in Figure 4. Have the authors quantified pseudohyphal differentiation in the cAMP add-back assays? I agree that the chosen images look convincing, but they don't reflect quantitative analyses.

      We thank the reviewer for detailed and constructive feedback. We have changed the Y-axis and made it more uniform to improve the clarity of our data presentation in the revised manuscript.

      We have also incorporated the quantitative analysis of the cAMP add-back assays in S. cerevisiae, in Figure 2 Panel L.

      (4) Line 367/68: "cysteine or methionine was able to completely rescue". Here, the authors should phrase their wording more carefully. Figure 3C shows the complete rescue of the phenotype qualitatively, but Figure 3D clearly shows that there are differences between the supplementation of cysteine and methionine, with the latter not fully restoring the phenotype.

      We sincerely appreciate the reviewer's meticulous attention to the data interpretation. We fully agree that the initial phrasing in lines 367/368 requires adjustment, as Figure 3D establishes a quantitative difference in the efficiency of phenotypic rescue between cysteine and methionine supplementation. We have revised the text to articulate this difference.

      (5) Line 568: Here, apparently, the ability to rescue the differentiation phenotype is reversed compared to the experiment with S. cerevisiae. Cysteine only results in ~20% hyphal cells, while methionine restores to wild-type-like hyphal formation. Can the authors comment on where these differences might originate from? Is there a difference in the uptake of cysteine vs. methionine in the two species or consumption rates?

      We thank the reviewer for their detailed and constructive feedback. We believe this phenotypic difference can be due to the distinct metabolic prioritization of sulfur amino acids in C. albicans. Methionine is a known trigger for hyphal differentiation in C. albicans and serves as the immediate precursor for the universal methyl donor, S-adenosylmethionine (SAM) (Schrevens et al., 2018). (Kraidlova et al., 2016). The morphological transition to hyphae involves a complex regulatory cascade which requires high rates of methylation, and this requires a rapid and direct conversion of methionine into SAM (Kraidlova et al., 2016; Schrevens et al., 2018). Cysteine, however, must first be converted into methionine via the transsulfuration pathway to produce SAM, making it metabolically less efficient for these aforesaid processes.

      Reviewer #2 (Recommendations for the authors):

      The study's comprehensive experimental approach with integrating pharmacological inhibition, genetic manipulation, transcriptomics, and infection animal model, provides strong evidence for a conserved mechanism, though some aspects need further clarification.

      Major Comments:

      (1) While the data suggest that glycolysis affects Met30 activity post-translationally, the underlying mechanism remains speculative. The authors should perform co-immunoprecipitation or ubiquitination assays to confirm whether glycolytic perturbation alters Met30-SCF complex interactions or Met4 ubiquitination levels.

      We thank the reviewer for this valuable feedback. The activity of the SCF<sup>Met30</sup> E3 ubiquitin ligase, mediated by the F box protein Met30, is dynamically regulated through both proteolytic degradation and its dissociation from the SCF complex, to coordinate sulfur metabolism and cell cycle progression (Smothers et al., 2000; Yen et al., 2005). Our transcriptomic (RNA-seq analysis) and protein expression analysis (Fig. 3J) confirms that Met30 expression is not differentially regulated in the presence of 2DG, effectively eliminating changes in synthesis or SCF<sup>Met30</sup> proteasomal degradation as the dominant regulatory mechanism. This observation is consistent with the established paradigm wherein stress signals, such as cadmium (Cd<sup>2+</sup>) exposure, rapidly inactivates the SCF<sup>Met30</sup> E3 ubiquitin ligase via the dissociation of Met30 from the Skp1 subunit of the SCF complex (Lauinger et al., 2024; Yen et al., 2005). We therefore propose that active glycolytic flux modulates SCF<sup>Met30</sup> activity post-translationally, specifically by triggering Met30 detachment from the SCF complex. This mechanism would stabilize the primary substrate, the transcription factor Met4, thus promoting the biosynthesis of sulfur-containing amino acids. Mechanistic validation of this hypothesis, particularly the assessment of Met30 dissociation from the SCF<sup>Met30 </sup>complex via immunoprecipitation (IP), is technically challenging. Since these experiments will involve isolation of cells from colonies undergoing pseudohyphal differentiation, on solid media (given that pseudohyphal differentiation does not occur in liquid media that is limiting for nitrogen (Gancedo, 2001; Gimeno et al., 1992)), current cell yields (OD<sup>600</sup>≈1 from ≈80-100 colonies) are significantly below the amount of cells that is needed to obtain the required amount of total protein concentration, for standard pull down assays (OD600≈600-800 is required to achieve 1-2 mg/ml of total protein which is the standard requirement for pull down protocols in S. cerevisiae (Lauinger et al., 2024)).

      Given that the primary objective of our study is to establish the novel regulatory link between glycolysis and sulfur metabolism in the context of fungal morphogenesis, we would like to explore these crucial mechanistic details, in depth, in a subsequent study.

      (2) 2DG can exert pleiotropic effects unrelated to glycolytic inhibition (e.g., ER stress, autophagy induction). The authors are encouraged to perform complementary metabolic flux analyses, such as quantification of glycolytic intermediates or ATP levels, to confirm specific glycolytic inhibition.

      We appreciate the reviewer's concern regarding the potential pleiotropic effects of 2DG. While we acknowledge that 2DG may induce secondary cellular stress, we are confident that the observed phenotypes are robustly attributed to glycolytic inhibition based on our complementary genetic evidence. Specifically, the deletion strains ∆∆pfk1 and ∆∆adh1, which genetically perturb distinct steps in glycolysis, recapitulate the phenotypic results observed with 2DG treatment. Given this strong congruence between chemical inhibition and specific genetic deletions of key glycolytic enzymes, we are confident that our observed phenotypes are predominantly driven by the perturbation of the glycolytic pathway by 2DG.

      (3) The differential rescue effects (cysteine-only in inhibitor assays vs. both cysteine and methionine in genetic mutants) require further explanation. The authors should discuss potential differences in metabolic interconversion or amino acid transport that may account for this observation.

      We thank the reviewer for their valuable feedback. One explanation for the observed differential rescue effects of cysteine and methionine can be due to the distinct amino acid transport systems used by S. cerevisiae to transport these amino acids. S. cerevisiae primarily uses multiple, lowaffinity permeases (Gap1, Bap2, Bap3, Tat1, Tat2, Agp1, Gnp1, Yct1) for cysteine transport, while relying on a limited set of high-affinity transporters (like Mup1) for methionine transport, with the added complexity that its methionine transporters can also transport cysteine (Düring-Olsen et al., 1999; Huang et al., 2017; Kosugi et al., 2001; Menant et al., 2006). Hence, it is likely that cysteine uptake could be happening at a higher efficiency in S. cerevisiae compared to methionine uptake. Therefore, to achieve a comparable functional rescue by exogenous supplementation of methionine, it is necessary to use a higher concentration of methionine. When we performed our rescue experiments using higher concentrations of methionine, we did not see any rescue of pseudohyphal differentiation in the presence of 2DG and in fact we noticed that, at higher concentrations of methionine, the wild-type strain failed to undergo pseudohyphal differentiation even in the absence of 2DG. This is likely due to the fact that increasing the methionine concentration raises the overall nitrogen content of the medium, thereby making the medium less nitrogen-starved. This presents a major experimental constraint, as pseudohyphal differentiation is strictly dependent on nitrogen limitation, and the elevated nitrogen resulting from the higher methionine concentration can inhibit pseudohyphal differentiation.

      (4) NAC may influence host redox balance or immune responses. The discussion should consider whether the observed virulence rescue could partly result from host-directed effects.

      We thank the reviewer for this valuable feedback. We acknowledge the role of NAC in host directed immune response. It is important to note that, in the context of certain bacterial pathogens, NAC has been reported to augment cellular respiration, subsequently increasing Reactive Oxygen Species (ROS) generation, which contributes to pathogen clearance (Shee et al., 2022). Interestingly, in our study, NAC supplementation to the mice was given prior to the infection and maintained continuously throughout the duration of the experiment. This continuous supply of NAC likely contributes to the rescue of virulence defects exhibited by the ∆∆pfk1 strain (Fig. 5I and J). Essentially, NAC likely allows the mutant to fully activate its essential virulence strategies (including morphological switching), to cause a successful infection in the host. As per the reviewer suggestion, this has been included in the discussion section of the manuscript.

      Reviewer #3 (Recommendations for the authors):

      Most of the comments related to improving the manuscript have been provided in the public review. Here are some specifics for the authors to consider:

      (1) It is important to clarify the rationale for choosing specific gene deletions over other key genes (e.g., Met32 and Met30) and explain why Met4 was not included, given its proposed central role in Figure 6.

      We sincerely thank the reviewer for this insightful query regarding our selection of the met32 for our gene deletion experiments. The choice of ∆∆met32 strain was strategically motivated by its unique phenotypic properties within the de novo biosynthesis of sulfur-containing amino acids pathway. While deletions of most the genes that encode for proteins involved in the de novo biosynthesis of sulfurcontaining amino acids, result in auxotrophy for methionine or cysteine, ∆∆met32 strain does not exhibit this phenotype (Blaiseau et al., 1997). This key distinction is attributed to the functional redundancy provided by the paralogous gene, met31 (Blaiseau et al., 1997). Crucially, given that the deletion of the central transcriptional regulator, met4, results in cysteine/methionine auxotrophy, the use of the ∆∆met32 strain provides an essential, viable experimental model for investigating the role of sulfur metabolism during pseudohyphal differentiation in S. cerevisiae.

      (2) Comparison of consistent gene and protein expression data (Met30, Met4, Met32) across all relevant figures and analyses would strengthen the mechanistic connection in a better way. Some data that might help connect the sections is not included; please see the public review for more details.

      We thank the reviewer for this valuable input, which helps us to clarify the scope of our transcriptomic analysis. Our decision to focus our RT-qPCR experiments on downstream targets, while excluding Met4 and Met30 from the RT-qPCR analysis, is based on their known regulatory mechanisms. Met4 activity is predominantly regulated by post-translational ubiquitination by the SCFMet30 complex followed by its degradation (Rouillon et al., 2000; Shrivastava et al., 2021; Smothers et al., 2000)  while Met30 activity is primarily regulated by its auto-degradation or its dissociation from the SCFMet30 complex (Lauinger et al., 2024; Smothers et al., 2000; Yen et al., 2005).  Consistent with this, our RNA-Seq results indicate that neither met4 nor met30 transcripts are differentially expressed, in response to 2DG addition. For all our RT-qPCR analysis in S. cerevisiae and C. albicans, we have consistently used the same set of sulfur metabolism genes and these include met32, met3, met5, met10 and met17. Our data on protein expression analysis of Met30 in S, cerevisiae (Fig. 3J) confirms that Met30 expression is not differentially regulated in the presence of 2DG, effectively eliminating changes in synthesis or SCFMet30 proteasomal degradation as the dominant regulatory mechanism.

      (3) Suggested to include metabolomic profiling (cysteine, methionine, and intermediate metabolites) to substantiate the proposed metabolic flux between glycolysis and sulfur metabolism.

      We thank the reviewer for this valuable input. Our pseudohyphal/hyphal differentiation assays show that the defects induced by glycolytic perturbation is fully rescued by the exogenous supplementation of sulfur-containing amino acids, cysteine or methionine. Since these data conclusively demonstrate that the primary metabolic limitation caused by the perturbation of glycolysis, which leads to filamentation defects, is sulfur metabolism, we posit that performing comprehensive metabolic profiling would primarily reconfirm the aforesaid results. We believe that our in vitro and in vivo sulfur add-back experiments sufficiently substantiate the novel regulatory metabolic link between glycolysis and sulfur-metabolism.

      (4) Data on the effects of Met30 deletion on cell growth are currently not included, and relevant controls should be included to ensure observed phenotypes are not due to general growth defects.

      We are grateful to the reviewer for this constructive feedback. To address the potential impact of met30 deletion on cell growth, we have included new data (Suppl. Fig. 4A) demonstrating that the deletion of a single copy of met30 in diploid S. cerevisiae does not compromise overall growth under nitrogen-limiting conditions as the ∆met30 strain grows similar to the wild-type strain.

      (5) Expanding RT-qPCR and data from transcriptomic analyses to include sulfur metabolism genes and key cAMP pathway genes to confirm the proposed cAMP-independent mechanism during virulence characterization is necessary.

      We thank the reviewer for this valuable feedback. The transcriptional analysis of the sulfur metabolism genes in the presence of 2DG and the ∆∆pfk1 strain is shown in Figures 4D and 4I. 

      In order to confirm that glycolysis is critical for fungal morphogenesis in a cAMP-PKA pathway independent manner under nitrogen-limiting conditions in C. albicans, we performed cAMP add-back assays. Interestingly, corroborating our S. cerevisiae data, the exogenous addition of cAMP failed to rescue hyphal differentiation defect caused by the perturbation of glycolysis through 2DG addition or by the deletion of the pfk1 gene, under nitrogen-limiting condition in C. albicans. This data is now included in Suppl. Fig. 5B.

      (6) Enhancing the introduction and discussion by providing a clearer rationale for gene selection and more detailed references to established pathways (cAMP-PKA, MAPK, Snf1/HXT regulation, gpa2 involvement) is needed to reinstate the hypothesis.

      We thank the reviewer for this valuable feedback. We have incorporated these changes in our revised manuscript.

      (7) Reducing redundancy in the text and improving figure consistency, particularly by ensuring that the gene sets depicted in Figure 6 are represented across all datasets, would strengthen the interconnections among sections.

      We thank the reviewer for this valuable feedback.  We have incorporated these changes in our revised manuscript.

      References

      Barford JP, Hall RJ. 1979. An examination of the crabtree effect in Saccharomyces cerevisiae: The role of respiratory adaptation. J Gen Microbiol. https://doi.org/10.1099/00221287-114-2-267

      Blaiseau, P. L., & Thomas, D. (1998). Multiple transcriptional activation complexes tether the yeast activator Met4 to DNA. The EMBO journal, 17(21), 6327–6336. https://doi.org/10.1093/emboj/17.21.6327

      Chebaro, Y., Lorenz, M., Fa, A., Zheng, R., & Gustin, M. (2017). Adaptation of Candida albicans to Reactive Sulfur Species. Genetics, 206(1), 151–162. https://doi.org/10.1534/genetics.116.199679

      De Deken R. H. (1966). The Crabtree effect: a regulatory system in yeast. Journal of general microbiology, 44(2), 149–156. https://doi.org/10.1099/00221287-44-2-149

      Düring-Olsen, L., Regenberg, B., Gjermansen, C., Kielland-Brandt, M. C., & Hansen, J. (1999). Cysteine uptake by Saccharomyces cerevisiae is accomplished by multiple permeases. Current genetics, 35(6), 609–617. https://doi.org/10.1007/s002940050459

      Gancedo J. M. (2001). Control of pseudohyphae formation in Saccharomyces cerevisiae. FEMS microbiology reviews, 25(1), 107–123. https://doi.org/10.1111/j.1574-6976.2001.tb00573.x

      Gimeno, C. J., Ljungdahl, P. O., Styles, C. A., & Fink, G. R. (1992). Unipolar cell divisions in the yeast S. cerevisiae lead to filamentous growth: regulation by starvation and RAS. Cell, 68(6), 1077–1090. https://doi.org/10.1016/0092-8674(92)90079-r

      Huang, C. W., Walker, M. E., Fedrizzi, B., Gardner, R. C., & Jiranek, V. (2017). Yeast genes involved in regulating cysteine uptake affect production of hydrogen sulfide from cysteine during fermentation. FEMS yeast research, 17(5), 10.1093/femsyr/fox046. https://doi.org/10.1093/femsyr/fox046

      Kosugi, A., Koizumi, Y., Yanagida, F., & Udaka, S. (2001). MUP1, high affinity methionine permease, is involved in cysteine uptake by Saccharomyces cerevisiae. Bioscience, biotechnology, and biochemistry, 65(3), 728–731. https://doi.org/10.1271/bbb.65.728

      Kraidlova, L., Schrevens, S., Tournu, H., Van Zeebroeck, G., Sychrova, H., & Van Dijck, P. (2016). Characterization of the Candida albicans Amino Acid Permease Family: Gap2 Is the Only General Amino Acid Permease and Gap4 Is an S-Adenosylmethionine (SAM) Transporter Required for SAM-Induced Morphogenesis. mSphere, 1(6), e00284-16. https://doi.org/10.1128/mSphere.00284-16

      Lauinger, L., Andronicos, A., Flick, K., Yu, C., Durairaj, G., Huang, L., & Kaiser, P. (2024). Cadmium binding by the F-box domain induces p97-mediated SCF complex disassembly to activate stress response programs. Nature communications, 15(1), 3894. https://doi.org/10.1038/s41467-024-48184-6

      Lombardi, L., Salzberg, L. I., Cinnéide, E. Ó., O'Brien, C., Morio, F., Turner, S. A., Byrne, K. P., & Butler, G. (2024). Alternative sulphur metabolism in the fungal pathogen Candida parapsilosis. Nature communications, 15(1), 9190. https://doi.org/10.1038/s41467-024-53442-8

      Menant, A., Barbey, R., & Thomas, D. (2006). Substrate-mediated remodeling of methionine transport by multiple ubiquitin-dependent mechanisms in yeast cells. The EMBO journal, 25(19), 4436–4447. https://doi.org/10.1038/sj.emboj.7601330

      Ralser, M., Wamelink, M. M., Kowald, A., Gerisch, B., Heeren, G., Struys, E. A., Klipp, E., Jakobs, C., Breitenbach, M., Lehrach, H., & Krobitsch, S. (2007). Dynamic rerouting of the carbohydrate flux is key to counteracting oxidative stress. Journal of biology, 6(4), 10. https://doi.org/10.1186/jbiol61

      Rouillon, A., Barbey, R., Patton, E. E., Tyers, M., & Thomas, D. (2000). Feedback-regulated degradation of the transcriptional activator Met4 is triggered by the SCF(Met30 )complex. The EMBO journal, 19(2), 282–294. https://doi.org/10.1093/emboj/19.2.282

      Schrevens, S., Van Zeebroeck, G., Riedelberger, M., Tournu, H., Kuchler, K., & Van Dijck, P. (2018). Methionine is required for cAMP-PKA-mediated morphogenesis and virulence of Candida albicans. Molecular microbiology, 108(3), 258–275. https://doi.org/10.1111/mmi.13933

      Shee, S., Singh, S., Tripathi, A., Thakur, C., Kumar T, A., Das, M., Yadav, V., Kohli, S., Rajmani, R. S., Chandra, N., Chakrapani, H., Drlica, K., & Singh, A. (2022). Moxifloxacin-Mediated Killing of Mycobacterium tuberculosis Involves Respiratory Downshift, Reductive Stress, and Accumulation of Reactive Oxygen Species. Antimicrobial agents and chemotherapy, 66(9), e0059222. https://doi.org/10.1128/aac.00592-22

      Shrivastava, M., Feng, J., Coles, M., Clark, B., Islam, A., Dumeaux, V., & Whiteway, M. (2021). Modulation of the complex regulatory network for methionine biosynthesis in fungi. Genetics, 217(2), iyaa049. https://doi.org/10.1093/genetics/iyaa049

      Smothers, D. B., Kozubowski, L., Dixon, C., Goebl, M. G., & Mathias, N. (2000). The abundance of Met30p limits SCF(Met30p) complex activity and is regulated by methionine availability. Molecular and cellular biology, 20(21), 7845–7852. https://doi.org/10.1128/MCB.20.21.7845-7852.2000

      Thomas, D., & Surdin-Kerjan, Y. (1997). Metabolism of sulfur amino acids in Saccharomyces cerevisiae. Microbiology and molecular biology reviews : MMBR, 61(4), 503–532. https://doi.org/10.1128/mmbr.61.4.503532.1997

      Yadav, A. K., & Bachhawat, A. K. (2011). CgCYN1, a plasma membrane cystine-specific transporter of Candida glabrata with orthologues prevalent among pathogenic yeast and fungi. The Journal of biological chemistry, 286(22), 19714–19723. https://doi.org/10.1074/jbc.M111.240648

      Yen, J. L., Su, N. Y., & Kaiser, P. (2005). The yeast ubiquitin ligase SCFMet30 regulates heavy metal response. Molecular biology of the cell, 16(4), 1872–1882. https://doi.org/10.1091/mbc.e04-12-1130

    1. Author response:

      The following is the authors’ response to the previous reviews.

      Joint Public Review:

      In this work, the authors present DeepTX, a computational tool for studying transcriptional bursting using single-cell RNA sequencing (scRNA-seq) data and deep learning. The method aims to infer transcriptional burst dynamics-including key model parameters and the associated steady-state distributions-directly from noisy single-cell data. The authors apply DeepTX to datasets from DNA damage experiments, revealing distinct regulatory patterns: IdU treatment in mouse stem cells increases burst size, promoting differentiation, while 5FU alters burst frequency in human cancer cells, driving apoptosis or survival depending on dose. These findings underscore the role of burst regulation in mediating cell fate responses to DNA damage.

      The main strength of this study lies in its methodological contribution. DeepTX integrates a non-Markovian mechanistic model with deep learning to approximate steady-state mRNA distributions as mixtures of negative binomial distributions, enabling genome-scale parameter inference with reduced computational cost. The authors provide a clear discussion of the framework's assumptions, including reliance on steady-state data and the inherent unidentifiability of parameter sets, and they outline how the model could be extended to other regulatory processes.

      The revised manuscript addresses many of the original concerns, particularly regarding sample size requirements, distributional assumptions, and the biological interpretation of inferred parameters. However, the framework remains limited by the constraints of snapshot data and cannot yet resolve dynamic heterogeneity or causality. The manuscript would also benefit from a broader contextualisation of DeepTX within the landscape of existing tools linking mechanistic modelling and single-cell transcriptomics. Finally, the interpretation of pathway enrichment analyses still warrants clarification.

      Overall, this work represents a valuable contribution to the integration of mechanistic models with highdimensional single-cell data. It will be of interest to researchers in systems biology, bioinformatics, and computational modelling.

      Recommendations for the authors:

      We thank the authors for their thorough revision and for addressing many of the points raised during the initial review. The revised manuscript presents an improved and clearer account of the methodology and its implications. However, several aspects would benefit from further clarification and refinement to strengthen the presentation and avoid overstatement.

      (1) Contextualization within the existing literature

      The manuscript would benefit from placing DeepTX more clearly in the context of other computational tools developed to connect mechanistic modelling and single-cell RNA sequencing data. This is an active area of research with notable recent contributions, including Sukys and Grima (bioRxiv, 2024), Garrido-Rodriguez et al. (PLOS Comp Biol, 2021), and Maizels (2024). Positioning DeepTX in relation to these and other relevant efforts would help readers appreciate its specific advances and contributions.

      We sincerely thank you for this valuable suggestion. We agree that situating DeepTX within the broader landscape of computational approaches linking mechanistic modeling and single-cell RNA sequencing data will clarify its contributions and advances. In this revised version, we have explicitly discussed the comparison and relation of DeepTX in the context of this active area using an individual paragraph in the Discussion section.

      Specifically, we mentioned that the DeepTX research paradigm contributes to a growing line of area aiming to link mechanistic models of gene regulation with scRNA-seq data. Maizels provided a comprehensive review of computational strategies for incorporating dynamic mechanisms into single-cell transcriptomics (Maizels RJ, 2024). In this context, RNA velocity is one of the most important examples as it infers short-term transcriptional trends based on splicing kinetics and deterministic ODEs model. However, such approaches are limited by their deterministic assumptions and cannot fully capture the stochastic nature of gene regulation. DeepTX can be viewed as an extension of this framework to stochastic modelling, explicitly addressing transcriptional bursting kinetics under DNA damage. Similarly, DeepCycle, developed by Sukys and Grima (Sukys A & Grima R, 2025), investigates transcriptional burst kinetics during the cell cycle, employing a stochastic age-dependent model and a neural network to infer burst parameters while correcting for measurement noise. By contrast, MIGNON integrates genomic variation data and static transcriptomic measurements into a mechanistic pathway model (HiPathia) to infer pathway-level activity changes, rather than gene-level stochastic transcriptional dynamics (Garrido-Rodriguez M et al., 2021). In this sense, DeepTX and MIGNON are complementary, with DeepTX resolving burst kinetics at the single-gene level and MIGNON emphasizing pathway responses to genomic perturbations, which could inspire future extensions of DeepTX that incorporate sequence-level information.

      (2) Interpretation of GO analysis

      The interpretation of the GO enrichment results in Figure 4D should be revised. While the text currently associates the enriched terms with signal transduction and cell cycle G2/M phase transition, the most significant terms relate to mitotic cell cycle checkpoint signaling. This distinction should be made clear in the main text, and the conclusions drawn from the GO analysis should be aligned more closely with the statistical results.

      We sincerely appreciate you for the insightful comment. We have carefully re-examined the GO enrichment results shown in Figure 4D and agree that the most significantly enriched terms correspond to mitotic cell cycle checkpoint signaling and signal transduction in response to DNA damage, rather than general G2/M phase transition processes. Accordingly, we have revised the main text to highlight the biological significance of mitotic cell cycle checkpoint signaling.

      Specifically, we now emphasize two key points: DNA damage and mitotic checkpoint activation are closely interconnected. (1) The mitotic checkpoint serves as a crucial safeguard to ensure accurate chromosome segregation and maintain genomic stability under DNA damage conditions. Activation of the mitotic checkpoint can influence cell fate decisions and differentiation potential (Kim EM & Burke DJ, 2008; Lawrence KS et al., 2015). (2) Sustained activation of the spindle assembly checkpoint (SAC) has been reported to induce mitotic slippage and polyploidization, which in turn may enhance the differentiation potential of embryonic stem cells  (Mantel C et al., 2007). These revisions ensure that our interpretation is consistent with the statistical enrichment results and better reflect the underlying biological processes implicated by the data.

      (3) Justification for training on simulated data

      The decision to train the model on simulated data should be clearly justified. While the advantage of having access to ground-truth parameters is understood, the manuscript would benefit from a discussion of the limitations of this approach, particularly in terms of generalizability to real datasets. Moreover, it is worth noting that many annotated scRNA-seq datasets are publicly available and could, in principle, be used to complement the training strategy.

      We thank you for this insightful comment. We chose to train DeepTXsolver on simulated data because no experimental dataset currently provides genome-wide transcriptional burst kinetics with known ground truth, which is essential for supervised learning. Simulation enables us to (i) generate large, fully annotated datasets spanning the biologically relevant parameter space, (ii) expose the solver to diverse bursting regimes (e.g., low/high burst frequency, small/large burst size, unimodal/bimodal distributions), and (iii) quantitatively benchmark model accuracy, parameter identifiability, and robustness prior to deployment on real scRNA-seq data.

      We acknowledge, however, that simulation-based training has inherent limitations in terms of generalizability. Real biological systems may deviate from the idealized bursting model, exhibit more complex noise structures, or display parameter distributions that differ from those in simulations. Moreover, the lack of ground-truth parameters in experimental scRNA-seq datasets prevents an absolute evaluation of inference accuracy. In the future work, publicly available annotated scRNA-seq datasets could be used to complement this simulation-based training strategy and enhance generalizability. We have revised the manuscript to explicitly discuss both the rationale for using simulated data and the potential limitations of this approach.

      (4) Benchmarking against external methods

      The performance of DeepTX is primarily compared to a prior method from the same group. To strengthen the methodological claims, it would be preferable to include benchmarking against additional established tools from the broader literature. This would offer a more objective evaluation of the performance gains attributed to DeepTX.

      We thank you for this constructive suggestion. We fully agree that benchmarking DeepTX against additional established tools from the broader literatures would provide a more comprehensive and objective evaluation of DeepTX . In the revised manuscript, we have included comparative analyses with other widely used methods, including nnRNA (From Shahrezaei group (Tang W et al., 2023)), txABC (from our group (Luo S et al., 2023)), txBurst (from Sandberg group (Larsson AJM et al., 2019)), txInfer (from Junhao group (Gu J et al., 2025)) (Supplementary Figure S4). The comparative results indicate that our method demonstrates superior performance in both efficiency and accuracy.

      (5) Interpretation of Figures 4-6

      The revised figures are clear and informative; however, the associated interpretations in the main text remain too strong relative to the type of analysis performed. For instance, in Figure 4, it is suggested that changes in burst size are linked to DNA damage-induced signalling cascades that affect cell cycle progression and fate decisions. While this is a plausible hypothesis, GO and GSEA analyses are correlative by nature and not sufficient to support such a mechanistic claim on their own. These analyses should be presented as exploratory, and the strength of the conclusions drawn should be tempered accordingly. Similar caution should be applied to the interpretations of Figures 5 and 6.

      We thank you for this important comment. In the revised manuscript, we have carefully moderated the interpretation of the GO and GSEA results in Figures 4, 5, and 6. Specifically, we now present these analyses as exploratory and emphasize their correlative nature, avoiding causal claims that go beyond the scope of the data. The text has been rephrased to highlight the observed associations rather than implying direct causal relationships.

      For Figure 4, we emphasize that while it is tempting to hypothesize that enhanced burst size may contribute to DNA damage-related checkpoint activation and thereby influence cell cycle progression and differentiation, our current results only indicate an association between burst size enhancement and pathways involved in DNA damage response and checkpoint signaling.

      For Figure 5, we emphasize that although our GO analysis cannot establish causality, the results are consistent with an association between 5-FU-induced changes in burst kinetics and pathways related to oxidative stress and apoptosis. Based on this, we propose a model outlining a potential process through which DNA damage may ultimately lead to cellular apoptosis.

      For Figure 6, we emphasize that these enrichment results suggest that high-dose 5FU treatment may be associated with processes such as telomerase activation and mitochondrial function maintenance, both of which have been implicated in cell survival and apoptosis evasion in previous experimental studies. For example, prior work indicates that hTERT translocation can activate telomerase pathways to support telomere maintenance and reduce oxidative stress, which is thought to contribute to apoptosis resistance. While our enrichment analysis cannot establish causality, the observed transcriptional bursting changes are consistent with these reported survival-associated mechanisms.

      (6) Discussion section framing

      The initial paragraphs of the discussion section make broad biological claims about the role of transcriptional bursting in cellular decision-making. While transcriptional bursting is undoubtedly relevant, the manuscript would benefit from a more cautious framing. It would be more appropriate to foreground the methodological contributions of DeepTX, and to present the biological insights as hypotheses or observations that may guide future experimental investigation, rather than as established conclusions.

      We thank you for this insightful comment. We have revised the discussion to clarify and appropriately temper our claims regarding transcriptional bursting. First, we now explicitly recognize that transcriptional bursting is one of multiple contributors to cellular variability, rather than the sole or dominant factor driving cellular decision-making. Second, we have restructured the opening of the discussion to prioritize the methodological contributions of DeepTX, highlighting its strength as a framework for inferring genomewide burst kinetics from scRNA-seq data. Finally, the biological insights derived from our analysis are now presented as correlative observations and potential hypotheses, which may inform and guide future experimental investigations, rather than as definitive mechanistic conclusions.

      Small Comments

      (1) Presentation of discrete distributions: In several figures (e.g., Figure 2B and Supplementary Figures S4, S6, and S8), the comparisons between empirical mRNA distributions and DeepTX-inferred distributions are visually represented using connecting lines, which may give the impression that continuous distributions are being compared to discrete ones. Given the focus on transcriptional bursting, a process inherently tied to discrete stochastic events, this representation could be misleading. The figure captions and visual style should be revised to clarify that all distributions are discrete and to avoid potential confusion. In general, it is recommended to avoid connecting points in discrete distributions with lines, as this can suggest interpolation or comparison with continuous distributions. This applies to Figures 2A and 2B in particular.

      We thank you for this valuable suggestion. To prevent any potential misinterpretation of discrete distributions as continuous ones, we have revised the visual representation of the empirical and DeepTXinferred mRNA distributions in Figures 2B, and Supplementary Figures S4, S6, and S8. Specifically, we have replaced the line plots with step plots, which more accurately capture the discrete nature of transcriptional bursting. Additionally, we have updated the figure captions to clearly state that all distributions are discrete.

      (2) Transcription is always a multi-step process. While the manuscript aims to model additional complexity introduced by DNA damage, the current phrasing (e.g., on page 5) could be read as implying that transcription becomes multi-step only under damage conditions. This should be clarified.

      We thank you for this helpful observation. We agree that transcription is inherently a multi-step process under all conditions. To avoid any possible misunderstanding, we have revised the text to clarify this point.

      Specifically, we now explain that many previous studies have employed simplified two-state models to approximate transcriptional dynamics, however, the gene expression process is inherently a multi-step process, which particularly cannot be neglected under conditions of DNA damage. DNA damage can result in slowing or even stopping the RNA pol II movement and cause many macromolecules to be recruited for damage repair. This process will affect the spatially localized behavior of the promoter, causing the dwell time of promoter inactivation and activation that cannot be approximated by a simple two state. Our work adopts a multi-step model because it is more appropriate for capturing the additional complexity introduced by DNA damage.

      (3) The first sentence of the discussion section overstates the importance of transcriptional bursting. While it is a key source of variability, it is not the only nor always the dominant one. Furthermore, its role in DNA damage response remains an emerging hypothesis rather than a general principle. The claims in this section should be moderated accordingly.

      We thank you for this valuable feedback. In the revised discussion, we have moderated the statements in the opening paragraph to better reflect the current understanding. Specifically, we now acknowledge that transcriptional bursting represents one of multiple sources of variability and is not always the dominant contributor. In addition, we have reframed the role of transcriptional bursting in DNA damage response as an emerging hypothesis, rather than a general principle. To further address this concern, we replaced conclusion-like statements with more cautious, hypothesis-oriented phrasing, presenting our observations as potential directions for future experimental validation.

      References

      Maizels, R.J. 2024. A dynamical perspective: moving towards mechanism in single-cell transcriptomics. Philos Trans R Soc Lond B Biol Sci 379: 20230049. DOI: https://dx.doi.org/10.1098/rstb.2023.0049, PMID: 38432314

      Sukys, A., Grima, R. 2025. Cell-cycle dependence of bursty gene expression: insights from fitting mechanistic models to single-cell RNA-seq data. Nucleic Acids Research 53. DOI: https://dx.doi.org/10.1093/nar/gkaf295, PMID: 40240003

      Garrido-Rodriguez, M., Lopez-Lopez, D., Ortuno, F.M., Peña-Chilet, M., Muñoz, E., Calzado, M.A., Dopazo, J. 2021. A versatile workflow to integrate RNA-seq genomic and transcriptomic data into mechanistic models of signaling pathways. PLoS Computational Biology 17: e1008748. DOI: https://dx.doi.org/10.1371/journal.pcbi.1008748, PMID: 33571195

      Kim, E.M., Burke, D.J. 2008. DNA damage activates the SAC in an ATM/ATR-dependent manner, independently of the kinetochore. PLoS Genet 4: e1000015. DOI: https://dx.doi.org/10.1371/journal.pgen.1000015, PMID: 18454191

      Lawrence, K.S., Chau, T., Engebrecht, J. 2015. DNA damage response and spindle assembly checkpoint function throughout the cell cycle to ensure genomic integrity. PLoS Genet 11: e1005150.DOI: https://dx.doi.org/10.1371/journal.pgen.1005150, PMID: 25898113

      Mantel, C., Guo, Y., Lee, M.R., Kim, M.K., Han, M.K., Shibayama, H., Fukuda, S., Yoder, M.C., Pelus, L.M., Kim, K.S., Broxmeyer, H.E. 2007. Checkpoint-apoptosis uncoupling in human and mouse embryonic stem cells: a source of karyotpic instability. Blood 109: 4518-4527. DOI: https://dx.doi.org/10.1182/blood-2006-10-054247, PMID: 17289813

      Tang, W., Jørgensen, A.C.S., Marguerat, S., Thomas, P., Shahrezaei, V. 2023. Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics. Bioinformatics 39. DOI: https://dx.doi.org/10.1093/bioinformatics/btad395, PMID: 37354494

      Luo, S., Zhang, Z., Wang, Z., Yang, X., Chen, X., Zhou, T., Zhang, J. 2023. Inferring transcriptional bursting kinetics from single-cell snapshot data using a generalized telegraph model. Royal Society Open Science 10: 221057. DOI: https://dx.doi.org/10.1098/rsos.221057, PMID: 37035293

      Larsson, A.J.M., Johnsson, P., Hagemann-Jensen, M., Hartmanis, L., Faridani, O.R., Reinius, B., Segerstolpe, A., Rivera, C.M., Ren, B., Sandberg, R. 2019. Genomic encoding of transcriptional burst kinetics. Nature 565: 251-254. DOI: https://dx.doi.org/10.1038/s41586-018-0836-1, PMID: 30602787

      Gu, J., Laszik, N., Miles, C.E., Allard, J., Downing, T.L., Read, E.L. 2025. Scalable inference and identifiability of kinetic parameters for transcriptional bursting from single cell data. Bioinformatics. DOI: https://dx.doi.org/10.1093/bioinformatics/btaf581, PMID: 41131798.

    1. Reviewer #1 (Public review):

      Wang, Zhou et al. investigated coordination between the prefrontal cortex (PFC) and the hippocampus (Hp), during reward delivery, by analyzing beta oscillations. Beta oscillations are associated with various cognitive functions, but their role in coordinating brain networks during learning is still not thoroughly understood. The authors focused on the changes in power, peak frequencies, and coherence of beta oscillations in two regions when rats learn a spatial task over days. Inconsistent with the authors' hypothesis, beta oscillations in those two regions during reward delivery were not coupled in spectral or temporal aspects. They were, however, able to show reverse changes in beta oscillations in PFC and Hp as the animal's performance got better. The authors were also able to show a small subset of cell populations in PFC that are modulated by both beta oscillations in PFC and sharp wave ripples in Hp. A similarly modulated cell population was not observed in Hp. These results are valuable in pointing out distinct periods during a spatial task when two regions modulate their activity independently from each other.

      The authors included a detailed analysis of the data to support their conclusions. However, some clarifications would help their presentation, as well as help readers to have a clear understanding.

      (1) The crucial time point of the analysis is the goal entry. However, it needs a better explanation in the methods or in figures of what a goal entry in their behavioral task means.

      (2) Regarding Figure 2, the authors have mentioned in the methods that PFC tetrodes have targeted both hemispheres. It might be trivial, but a supplementary graph or a paragraph about differences or similarities between contralateral and ipsilateral tetrodes to Hp might help readers.

      (3) The authors have looked at changes in burst properties over days of training. For the coincidence of beta bursts between PFC and Hp, is there a change in the coincidence of bursts depending on the day or performance of the animal?

      (4) Regarding the changes in performance through days as well as variance of the beta burst frequency variance (Figures 3C and 4C); was there a change in the number of the beta bursts as animals learn the task, which might affect variance indirectly?

      (5) In the behavioral task, within a session, animals needed to alternate between two wells, but the central arm (1) was in the same location. Did the authors alternate the location of well number 1 between days to different arms? It is possible that having well number 1 in the same location through days might have an effect on beta bursts, as they would get more rewards in well number 1?

      (6) The animals did not increase their performance in the F maze as much as they increased it in the Y maze. It would be more helpful to see a comparison between mazes in Figure 5 in terms of beta burst timing. It seems like in Y maze, unrewarded trials have earlier beta bursts in Y maze compared to F maze. Also, is there a difference in beta burst frequencies of rewarded and unrewarded trials?

      (7) For individual cell analysis, the authors recorded from Hp and the behavioral task involved spatial learning. It would be helpful to readers if authors mention about place field properties of the cells they have recorded from. It is known that reward cells firing near reward locations have a higher rate to participate in a sharp wave ripple. Factoring in the place field properties of the cells into the analysis might give a clearer picture of the lack of modulation of HP cells by beta and sharp wave ripples.

    2. Reviewer #3 (Public review):

      Summary:

      This paper explored the role of beta rhythms in the context of spatial learning and mPFC-hippocampal dynamics. The authors characterized mPFC and hippocampal beta oscillations, examining how their coordination and their spectral profiles related to learning and prefrontal neuronal firing. Rats performed two tasks, a Y-maze and an F-maze, with the F-maze task being more cognitively demanding. Across learning, prefrontal beta oscillation power increased while beta frequency decreased. In contrast, hippocampal beta power and beta frequency decreased. This was particularly the case for the well-performed and well-learned Y-maze paradigm. The authors identified the timing of beta oscillations, revealing an interesting shift in beta burst timing relative to reward entry as learning progressed. They also discovered an interesting population of prefrontal neurons that were tuned to both prefrontal beta and hippocampal sharp-wave ripple events, revealing a spectrum of SWR-excited and SWR-inhibited neurons that were differentially phase locked to prefrontal beta rhythms.

      In sum, the authors set out to examine how beta rhythms and their coordination were related to learning and goal occupancy. The authors identified a set of learning and goal-related correlates at the level of LFP and spike-LFP interactions, but did not report on spike-behavioral correlates.

      Strengths:

      Pairing dual recordings of medial prefrontal cortex (mPFC) and CA1 with learning of spatial memory tasks is a strength of this paper. The authors also discovered an interesting population of prefrontal neurons modulated by both beta and CA1 sharp-wave ripple (SWR) events, showing a relationship between SWR-excited and SWR-inhibited neurons and beta oscillation phase.

      Weaknesses:

      The authors report on a task where rats were performing sub-optimally (F-maze), weakening claims. Likewise, it is questionable as to whether mPFC and hippocampus are dually required to perform a no-delay Y-maze task at day 5, where rats are performing near 100%. There would be little reason to suspect strong oscillatory coupling when task performance is poor and/or independent of mPFC-HPC communication (Jones and Wilson, 2005), potentially weakening conclusions about independent beta rhythms. Moreover, there is little detail provided about sample sizes and how data sampling is being performed (e.g., rats, sessions, or trials), raising generalizability concerns.

    3. Author response:

      Public Reviews:.

      Reviewer #1 (Public review):

      Wang, Zhou et al. investigated coordination between the prefrontal cortex (PFC) and the hippocampus (Hp), during reward delivery, by analyzing beta oscillations. Beta oscillations are associated with various cognitive functions, but their role in coordinating brain networks during learning is still not thoroughly understood. The authors focused on the changes in power, peak frequencies, and coherence of beta oscillations in two regions when rats learn a spatial task over days. Inconsistent with the authors' hypothesis, beta oscillations in those two regions during reward delivery were not coupled in spectral or temporal aspects. They were, however, able to show reverse changes in beta oscillations in PFC and Hp as the animal's performance got better. The authors were also able to show a small subset of cell populations in PFC that are modulated by both beta oscillations in PFC and sharp wave ripples in Hp. A similarly modulated cell population was not observed in Hp. These results are valuable in pointing out distinct periods during a spatial task when two regions modulate their activity independently from each other.

      The authors included a detailed analysis of the data to support their conclusions. However, some clarifications would help their presentation, as well as help readers to have a clear understanding.

      (1) The crucial time point of the analysis is the goal entry. However, it needs a better explanation in the methods or in figures of what a goal entry in their behavioral task means.

      We appreciate Reviewer 1 pointing out this shortcoming and will clarify the description in the revised manuscript. Each goal is located at the end of the arm, and is equipped with a reward delivery unit. The unit has an infrared sensor. The rat breaks the infrared beam when it enters the goal.

      (2) Regarding Figure 2, the authors have mentioned in the methods that PFC tetrodes have targeted both hemispheres. It might be trivial, but a supplementary graph or a paragraph about differences or similarities between contralateral and ipsilateral tetrodes to Hp might help readers.

      We will provide the requested analysis in the full revision. We saw both hemispheres had similar properties.

      (3) The authors have looked at changes in burst properties over days of training. For the coincidence of beta bursts between PFC and Hp, is there a change in the coincidence of bursts depending on the day or performance of the animal?

      We will provide the requested analysis in the full revision.

      (4) Regarding the changes in performance through days as well as variance of the beta burst frequency variance (Figures 3C and 4C); was there a change in the number of the beta bursts as animals learn the task, which might affect variance indirectly?

      The analysis we can do here is to control for differences in the number of bursts for each category (days/performance quintile) by resampling the data to match the burst count between categories.

      (5) In the behavioral task, within a session, animals needed to alternate between two wells, but the central arm (1) was in the same location. Did the authors alternate the location of well number 1 between days to different arms? It is possible that having well number 1 in the same location through days might have an effect on beta bursts, as they would get more rewards in well number 1?

      The central arm remained the same across days since we needed the animals to learn the alternation task. In our experience, the animal needs a few days to learn the alternation rule when we switch the central arm location. For this experiment, we were interested in the initial learning process, and we kept the central constant. Switching the central arm location is a great suggestion for a follow up experiment where we can understand the effects of reward contingency change has on beta bursts.

      (6) The animals did not increase their performance in the F maze as much as they increased it in the Y maze. It would be more helpful to see a comparison between mazes in Figure 5 in terms of beta burst timing. It seems like in Y maze, unrewarded trials have earlier beta bursts in Y maze compared to F maze. Also, is there a difference in beta burst frequencies of rewarded and unrewarded trials?

      We will add this analysis in the revised manuscript.

      (7) For individual cell analysis, the authors recorded from Hp and the behavioral task involved spatial learning. It would be helpful to readers if authors mention about place field properties of the cells they have recorded from. It is known that reward cells firing near reward locations have a higher rate to participate in a sharp wave ripple. Factoring in the place field propertiesd of the cells into the analysis might give a clearer picture of the lack of modulation of HP cells by beta and sharp wave ripples.

      This is a great suggestion, and we will address this in the full revision.

      Reviewer #2 (Public review):

      We thank Reviewer 2 for their helpful comments and will address these in full in the revision. These are great suggestions to provide greater detail on the spectral and behavioral data at the goal.

      (1) When presenting the power spectra for the representative example (Figure 1), it would be appropriate to display a broader frequency band-including delta, theta, and gamma (up to ~100 Hz), rather than only the beta band.

      We will show more examples of power spectra with a wider frequency range. We did examine the wider spectra and noticed power in the beta frequency band was more prominent than others.

      What was the rat's locomotor state (e.g., running speed) after entering the reward location, during which the LFPs were recorded?

      We will add the time aligned speed profile to the spectra and raw data examples. Because goal entry is defined as the time the animals break the infrared beam at the goal (response to Reviewer 1), the rat would have come to a stop.

      If the rats stopped at the goal but still consumed the reward (i.e., exhibited very low running speed), theta rhythms might still occasionally occur, and sharp-wave ripples (SWRs) could be observed during rest.

      We typically find low theta power in the hippocampus after the animal reaches the goal location and as it consumes reward. Reviewer 2 is correct about occasional theta power at the goal. We have observed this but mostly before the animal leaves the goal location. We did find SWRs during goal periods. One example is shown in Fig. 7A.

      Do beta bursts also occur during navigation prior to goal entry?

      We did not find consistent beta bursts in PFC or CA1 on approach to goal entry. We can provide the analyses in our full revision. In our initial exploratory analysis, we found beta bursts was most prominent after goal entry, which led us to focus on post-goal entry beta for this manuscript. However, beta oscillations in the hippocampus during locomotion or exploration has been reported (Ahmed & Mehta, 2012; Berke et al., 2008; França et al., 2014; França et al., 2021; Iwasaki et al., 2021; Lansink et al., 2016; Rangel et al., 2015).

      It would be beneficial to display these rhythmic activities continuously across both the navigation and goal entry phases. Additionally, given that the hippocampal theta rhythm is typically around 7-8 Hz, while a peak at approximately 15-16 Hz is visible in the power spectra in Figure 1C, the authors should clarify whether the 22 Hz beta activity represents a genuine oscillation rather than a harmonic of the theta rhythm.

      To ensure we fully address this concern, we can provide further spectral analysis in our revised manuscript to show theta power in CA1 is reduced after goal entry. We were initially concerned about the possibility that the 22Hz power in CA1 may be a harmonic rather than a standalone oscillation band. If these are harmonics of theta, we should expect to find coincident theta at the time of bursts in the beta frequency. In Fig. 1B, Fig. 2A, we show examples of the raw LFP traces from CA1. Here, the detected bursts are not accompanied by visible theta frequency activity. For PFC, we do not always see persistent theta frequency oscillations like CA1. In PFC, we found beta bursts were frequent and visually identifiable when examining the LFP. We provided examples of the PFC LFP (Fig. 1B, Fig. 1-1, and Fig. 2A). In these cases, we see clear beta frequency oscillations lasting several cycles and these are not accompanied by any oscillations in the theta frequency in the LFP trace.

      (2) The authors claim that beta activity is independent between CA1 and PFC, based on the low coherence between these regions. However, it is challenging to discern beta-specific coherence in CA1; instead, coherence appears elevated across a broader frequency band (Figure 2 and Figure 2-1D). An alternative explanation could be that the uncoupled beta between CA1 and PFC results from low local beta coherence within CA1 itself.

      This is a legitimate concern, and we used three methods to characterize coherence and coordination between the two regions. First, we calculated coherence for tetrode pairs for times when the animal was at goals (Fig. 2B), which provides a general estimation of coherence across frequencies but lack any temporal resolution. Second, we calculated burst aligned coherence (Fig. 2-1), which provides temporal resolution relative to the burst, but the multi-taper method is constrained by the time-frequency resolution trade off. Third, we quantified the timing between the burst peaks (Fig. 2D), which will describe timing differences but the peaks for the bursts may not be symmetric. Thus, each method has its own caveats, but we drew our conclusion from the combination of results from these three analyses, which pointed to similar conclusions.

      Reviewer 2 is correct in pointing out the uniformly high coherence within CA1 across the frequency range we examined. When we inspected the raw LFP across multiple tetrodes in CA1, they were similar to each other (Fig. 2A). This likely reflects the uniformity in the LFP across recording sites in CA1, which is what we saw with coherence values across the frequency range (Fig. 2B). We found CA1 coherence between tetrode pairs within CA1 across the range, were statistically higher, compared to tetrode pairs in PFC (Fig. 2B and C), thus our results are unlikely to be explained by low beta coherence within CA1 itself. The burst aligned coherence using a multi-taper method also supports this. The coherence values within CA1 at the time of CA1 bursts is ~0.8-0.9.

      (3) In Figure 2-1E-F, visual inspection of the box plots reveals minimal differences between PFC-Ind and PFC-Coin/CA1-Coin conditions, despite reported statistical significance. It may be necessary to verify whether the significance arises from a large sample size.

      We will include the sample sizes for each of the boxplots, these should be the same as the power comparison in Fig. 2-1 A-C. The LFP within a one second window centered around the bursts are usually very similar, and the multi-taper method will return high coherence values. The p-values from statistical comparisons between the boxes are corrected using the Benjamini-Hochberg method.

      (4) In Figure 3 and Figure 4, although differences in power and frequency appear to change significantly across days, these changes are not easily discernible by visual inspection. It is worth considering whether these variations are related to increased task familiarity over days, potentially accompanied by higher running speeds.

      We agree with Reviewer 2 that familiarity increases across days, and the animal is likely running faster. The analysis for Fig. 3 and 4 includes only data from periods when the animal was at the goal and was not moving. We used linear mixed effects models to quantify the relationship between power, frequency and day or behavioral quintile.

      (5) The stronger spiking modulation by local beta oscillations shown in Figure 6 could also be interpreted in the context of uncoupled beta between CA1 and PFC. In this analysis, only spikes occurring during beta bursts should be included, rather than all spikes within a trial. The authors should verify the dataset used and consider including a representative example illustrating beta modulation of single-unit spiking.

      We agree with Reviewer 2 that the stronger modulation to local beta is another piece of evidence indicating uncoupled beta between the two regions. We appreciate this suggestion and will add examples illustrating beta modulation for single units. We want to clarify the spikes were only from periods when the animal is at the goal location on each trial and does not include the running period between goals.

      (6) As observed in Figure 7D, CA1 beta bursts continue to occur even after 2.5 seconds following goal entry, when SWRs begin to emerge. Do these oscillations alternate over time, or do they coexist with some form of cross-frequency coupling?

      This is a very interesting and helpful suggestion. Although we found SWRs generally appear later than beta bursts, it is possible the two are related on a finer timescale pointing to coordination. Our cross-correlation analysis between PFC and CA1 beta bursts only showed the relationship on the timescale of seconds. We will show a higher time-resolution version of this analysis in the revision.

      Reviewer #3 (Public review):

      Summary:

      This paper explored the role of beta rhythms in the context of spatial learning and mPFC-hippocampal dynamics. The authors characterized mPFC and hippocampal beta oscillations, examining how their coordination and their spectral profiles related to learning and prefrontal neuronal firing. Rats performed two tasks, a Y-maze and an F-maze, with the F-maze task being more cognitively demanding. Across learning, prefrontal beta oscillation power increased while beta frequency decreased. In contrast, hippocampal beta power and beta frequency decreased. This was particularly the case for the well-performed and well-learned Y-maze paradigm. The authors identified the timing of beta oscillations, revealing an interesting shift in beta burst timing relative to reward entry as learning progressed. They also discovered an interesting population of prefrontal neurons that were tuned to both prefrontal beta and hippocampal sharp-wave ripple events, revealing a spectrum of SWR-excited and SWR-inhibited neurons that were differentially phase locked to prefrontal beta rhythms.

      In sum, the authors set out to examine how beta rhythms and their coordination were related to learning and goal occupancy. The authors identified a set of learning and goal-related correlates at the level of LFP and spike-LFP interactions, but did not report on spike-behavioral correlates.

      Strengths:

      Pairing dual recordings of medial prefrontal cortex (mPFC) and CA1 with learning of spatial memory tasks is a strength of this paper. The authors also discovered an interesting population of prefrontal neurons modulated by both beta and CA1 sharp-wave ripple (SWR) events, showing a relationship between SWR-excited and SWR-inhibited neurons and beta oscillation phase.

      Weaknesses:

      Moreover, there is little detail provided about sample sizes and how data sampling is being performed (e.g., rats, sessions, or trials), raising generalizability concerns.

      We appreciate Reviewer 3’s thoughtful suggestions for making our claims convincing. We will include information about sample sizes and address each detailed recommendation in the revised manuscript.

      The authors report on a task where rats were performing sub-optimally (F-maze), weakening claims.

      Our experiment was designed to allow us to examine within the same animal, a well-performed task (Y) and a less well-performed task (F). This contrast allows us to determine differences in neural correlates. We can further dissect the relevant differences to take advantage of this experiment design.

      Likewise, it is questionable as to whether mPFC and hippocampus are dually required to perform a no-delay Y-maze task at day 5, where rats are performing near 100%.

      We agree with Reviewer 3 that the mPFC and hippocampus may not be required when the animal reaches stable performance on day 5 (Deceuninck & Kloosterman, 2024). The data we collected spans the full range of early learning (day 1) to proficiency (day 5). We wanted to understand the dynamics of beta across these learning stages.

      Recent studies suggest mPFC and hippocampus are likely to be needed, in some capacity, for learning continuous spatial alternation tasks on a range of maze geometries. Lesions, inactivation or waking activity perturbation of hippocampus or hippocampus and mPFC on the W maze alternation task slowed learning (Jadhav et al., 2012; Kim & Frank, 2009; Maharjan et al., 2018). More recently, optogenetic silencing of mPFC after sharp wave ripples on the Y maze alternation affected performance when the center arm was switched (den Bakker et al., 2023). The Y and F mazes in our study both share the continuous alternation rule, where the animal needed to avoid visiting a previously visited location on the outbound choice relative to the center, and always return to the center location.

      Further, the performance characteristics on the outbound and inbound components of our Y task is similar to the W task. We have analyzed the “inbound” and “outbound” performance of the animals on the Y maze alternation task, and they are similar to the W maze alternation task. The “inbound” or reference location component is learned quickly whereas the ”outbound”, alternation component is learned slowly. We can add this analysis to the revised manuscript.

      There would be little reason to suspect strong oscillatory coupling when task performance is poor and/or independent of mPFC-HPC communication (Jones and Wilson, 2005) potentially weakening conclusions about independent beta rhythms.

      Although many studies have examined the oscillatory coupling properties at the theta frequency between mPFC-HPC (Hyman et al., 2005; Jones & Wilson, 2005; Siapas et al., 2005), our understanding of beta frequency coordination between the two regions is less established, especially at goal locations. Beta frequency coordination at goal locations may or may not follow similar properties to theta frequency coupling. In this manuscript we are reporting the properties of goal-location beta frequency activity in mPFC-HPC networks. We are not aware of prior work describing these properties at this stage of a spatial navigation task, especially their coordination in time.

      References

      Ahmed, O. J., & Mehta, M. R. (2012). Running speed alters the frequency of hippocampal gamma oscillations. J Neurosci, 32(21), 7373-7383. https://doi.org/10.1523/JNEUROSCI.5110-11.2012

      Berke, J. D., Hetrick, V., Breck, J., & Greene, R. W. (2008). Transient 23-30 Hz oscillations in mouse hippocampus during exploration of novel environments. Hippocampus, 18(5), 519-529. https://doi.org/10.1002/hipo.20435

      Deceuninck, L., & Kloosterman, F. (2024). Disruption of awake sharp-wave ripples does not affect memorization of locations in repeated-acquisition spatial memory tasks. Elife, 13. https://doi.org/10.7554/eLife.84004

      den Bakker, H., Van Dijck, M., Sun, J. J., & Kloosterman, F. (2023). Sharp-wave-ripple-associated activity in the medial prefrontal cortex supports spatial rule switching. Cell Rep, 42(8), 112959. https://doi.org/10.1016/j.celrep.2023.112959

      França, A. S., do Nascimento, G. C., Lopes-dos-Santos, V., Muratori, L., Ribeiro, S., Lobão-Soares, B., & Tort, A. B. (2014). Beta2 oscillations (23-30 Hz) in the mouse hippocampus during novel object recognition. Eur J Neurosci, 40(11), 3693-3703. https://doi.org/10.1111/ejn.12739

      França, A. S. C., Borgesius, N. Z., Souza, B. C., & Cohen, M. X. (2021). Beta2 Oscillations in Hippocampal-Cortical Circuits During Novelty Detection. Front Syst Neurosci, 15, 617388. https://doi.org/10.3389/fnsys.2021.617388

      Hyman, J. M., Zilli, E. A., Paley, A. M., & Hasselmo, M. E. (2005). Medial prefrontal cortex cells show dynamic modulation with the hippocampal theta rhythm dependent on behavior. Hippocampus, 15(6), 739-749. https://doi.org/10.1002/hipo.20106

      Iwasaki, S., Sasaki, T., & Ikegaya, Y. (2021). Hippocampal beta oscillations predict mouse object-location associative memory performance. Hippocampus, 31(5), 503-511. https://doi.org/10.1002/hipo.23311

      Jadhav, S. P., Kemere, C., German, P. W., & Frank, L. M. (2012). Awake hippocampal sharp-wave ripples support spatial memory. Science (New York, N.Y.), 336(6087), 1454-1458. https://doi.org/10.1126/science.1217230

      Jones, M. W., & Wilson, M. A. (2005). Theta Rhythms Coordinate Hippocampal–Prefrontal Interactions in a Spatial Memory Task. PLoS Biology, 3(12). https://doi.org/10.1371/journal.pbio.0030402

      Kim, S. M., & Frank, L. M. (2009). Hippocampal Lesions Impair Rapid Learning of a Continuous Spatial Alternation Task. PLoS ONE, 4(5). https://doi.org/10.1371/journal.pone.0005494

      Lansink, C. S., Meijer, G. T., Lankelma, J. V., Vinck, M. A., Jackson, J. C., & Pennartz, C. M. (2016). Reward Expectancy Strengthens CA1 Theta and Beta Band Synchronization and Hippocampal-Ventral Striatal Coupling. J Neurosci, 36(41), 10598-10610. https://doi.org/10.1523/JNEUROSCI.0682-16.2016

      Maharjan, D. M., Dai, Y. Y., Glantz, E. H., & Jadhav, S. P. (2018). Disruption of dorsal hippocampal - prefrontal interactions using chemogenetic inactivation impairs spatial learning. Neurobiol Learn Mem, 155, 351-360. https://doi.org/10.1016/j.nlm.2018.08.023

      Rangel, L. M., Chiba, A. A., & Quinn, L. K. (2015). Theta and beta oscillatory dynamics in the dentate gyrus reveal a shift in network processing state during cue encounters. Front Syst Neurosci, 9, 96. https://doi.org/10.3389/fnsys.2015.00096

      Siapas, A. G., Lubenov, E. V., & Wilson, M. A. (2005). Prefrontal Phase Locking to Hippocampal Theta Oscillations. Neuron, 46(1), 141-151. https://doi.org/10.1016/j.neuron.2005.02.028.

    1. Author response:

      The following is the authors’ response to the latest reviews:

      "One remaining question is the interpretation of matching variants with very low stable posterior probabilities (~0), which the authors have analyzed in detail but without fully conclusive findings. I agree with the authors that this event is relatively rare and the current sample size is limited but this might be something to keep in mind for future studies."

      Fine-mapping stabilityon matching variants with very low stable posterior probability

      We thank Reviewer 2 for encouraging us to think more about how low stable posterior probability matching variants can be interpreted. We describe a few plausible interpretations, even though – as Reviewer 2 and we have both acknowledged – our present experiments do not point to a clear and conclusive account.

      One explanation is that the locus captured by the variant might not be well-resolved, in the sense that many correlated variants exist around the locus. Thus, the variant itself is unlikely causal, but the set of variants in high LD with it may contain the true causal variant, or it's possible that the causal variant itself was not sequenced but lies in that locus. A comparison of LD patterns across ancestries at the locus would be helpful here.

      Another explanation rests on the following observation. For a variant to be matching between top and stable PICS and to also have very small stable PP, it has to have the largest PP after residualization on the ALL slice but also have positive PP with gene expression on many other slices. In other words, failing to control for potential confounders shrinks the PP. If one assumes that the matching variant is truly causal, then our observation points to an example of negative confounding (aka suppressor effect). This can occur when the confounders (PCs) are correlated with allele dosage at the causal variant in a different direction than their correlation with gene expression, so that the crude association between unresidualized gene expression and causal variant allele dosage is biased toward 0.

      Although our present study does not allow us to systematically confirm either interpretation – since we found that matching variants were depleted in causal variants in our simulations, violating the second argument, but we also found functional enrichment in analyses of GEUVADIS data though only 17 matching variants with low stable PP were reported – we believe a larger-scale study using larger cohort sizes (at least 1000 individuals per ancestry) and many more simulations (to increase yield of such cases) would be insightful.

      ———

      The following is the authors’ response to the original reviews:

      Reviewer #1:

      Major comments:

      (1) It would be interesting to see how much fine-mapping stability can improve the fine-mapping results in cross-population. One can simulate data using true genotype data and quantify the amount the fine-mapping methods improve utilizing the stability idea.

      We agree, and have performed simulation studies where we assume that causal variants are shared across populations. Specifically, by mirroring the simulation approach described in Wang et al. (2020), we generated 2,400 synthetic gene expression phenotypes across 22 autosomes, using GEUVADIS gene expression metadata (i.e., gene transcription start site) to ensure largely cis expression phenotypes were simulated. We additionally generated 1,440 synthetic gene expression phenotypes that incorporate environmental heterogeneity, to motivate our pursuit of fine-mapping stability in the first place (see Response to Reviewer 2, Comment 6). These are described in Results section “Simulation study”:

      We evaluated the performance of the PICS algorithm, specifically comparing the approach incorporating stability guidance against the residualization approach that is more commonly used — similar to our application to the real GEUVADIS data. We additionally investigated two ways of “combining” the residualization and stability guidance approaches: (1) running stability-guided PICS on residualized phenotypes; (2) prioritizing matching variants returned by both approaches. See Response to Reviewer 2, Comment 5.

      (2) I would be very interested to see how other fine-mapping methods (FINEMAP, SuSiE, and CAVIAR) perform via the stability idea.

      Thank you for this valuable comment. We ran SuSiE on the same set of simulated datasets. Specifically, we ran a version that uses residualized phenotypes (supposedly removing the effects of population structure), and also a version that incorporates stability. The second version is similar to how we incorporate stability in PICS. We investigated the performance of Stable SuSiE in a similar manner to our investigation of PICS. First we compared the performance relative to SuSiE that was run on residualized phenotypes. Motivated by our finding in PICS that prioritizing matching variants improves causal variant recovery, we did the same analysis for SuSiE. This analysis is described in Results section “Stability guidance improves causal variant recovery in SuSiE.”

      We reported overall matching frequencies and causal variant recovery rates of top and stable variants for SuSiE in Figures 2C&D.

      Frequencies with which Stable and Top SuSiE variants match, stratified by the simulation parameters, are summarized in Supplementary File 2C (reproduced for convenience in Response to Reviewer 2, Comment 3). Causal variant recovery rates split by the number of causal variants simulated, and stratified by both signal-to-noise ratio and the number of credible sets included, are reported in Figure 2—figure supplements 16-18. We reproduce Figure 2—figure supplement 18 (three causal variants scenario) below for convenience. Analogous recovery rates for matching versus non-matching top or stable variants are reported in Figure 2—figure supplements 19, 21 and 23.

      (3) I am a little bit concerned about the PICS's assumption about one causal variant. The authors mentioned this assumption as one of their method limitations. However, given the utility of existing fine-mapping methods (FINEMAP and SuSiE), it is worth exploring this domain.

      Thank you for raising this fair concern. We explored this domain, by considering simulations that include two and three causal variants (see Response to Reviewer 2, Comment 3). We looked at how well PICS recovers causal variants, and found that each potential set largely does not contain more than one causal variant (Figure 2—figure supplements 20 and 22). This can be explained by the fact that PICS potential sets are constructed from variants with a minimum linkage disequilibrium to a focal variant. On the other hand, in SuSiE, we observed multiple causal variants appearing in lower credible sets when applying stability guidance (Figure 2—figure supplements 21 and 23). A more extensive study involving more fine-mapping methods and metrics specific to violation of the one causal variant assumption could be pursued in future work.

      Reviewer #2:

      Aw et al. presents a new stability-guided fine-mapping method by extending the previously proposed PICS method. They applied their stability-based method to fine-map cis-eQTLs in the GEUVADIS dataset and compared it against what they call residualization-based method. They evaluated the performance of the proposed method using publicly available functional annotations and claimed the variants identified by their proposed stability-based method are more enriched for these functional annotations.

      While the reviewer acknowledges the contribution of the present work, there are a couple of major concerns as described below.

      Major:

      (1) It is critical to evaluate the proposed method in simulation settings, where we know which variants are truly causal. While I acknowledge their empirical approach using the functional annotations, a more unbiased, comprehensive evaluation in simulations would be necessary to assess its performance against the existing methods.

      Thank you for this point. We agree. We have performed a simulation study where we assume that causal variants are shared across populations (see response to Reviewer 1, Comment 1). Specifically, by mirroring the simulation approach described in Wang et al. (2020), we generated 2,400 synthetic gene expression phenotypes across 22 autosomes, using GEUVADIS gene expression metadata (i.e., gene transcription start site) to ensure cis expression phenotypes were simulated.

      (2) Also, simulations would be required to assess how the method is sensitive to different parameters, e.g., LD threshold, resampling number, or number of potential sets.

      Thank you for raising this point. The underlying PICS algorithm was not proposed by us, so we followed the default parameters set (LD threshold, r<sup>2</sup> \= 0.5; see Taylor et al., 2021 Bioinformatics) to focus on how stability considerations will impact the existing fine-mapping algorithm. We attempted to derive the asymptotic joint distribution of the p-values, but it was too difficult. Hence, we used 500 permutations because such a large number would allow large-sample asymptotics to kick in. However, following your critical suggestion we varied the number of potential sets in our analyses of simulated data. We briefly mention this in the Results.

      “In the Supplement, we also describe findings from investigations into the impact of including more potential sets on matching frequency and causal variant recovery…”

      A detailed write-up is provided in Supplementary File 1 Section S2 (p.2):

      “The number of credible or potential sets is a parameter in many fine-mapping algorithms. Focusing on stability-guided approaches, we consider how including more potential sets for stable fine-mapping algorithms affects both causal variant recovery and matching frequency in simulations…

      Causal variant recovery. We investigate both Stable PICS and Stable SuSiE. Focusing first on simulations with one causal variant, we observe a modest gain in causal variant recovery for both Stable PICS and Stable SuSiE, most noticeably when the number of sets was increased from 1 to 2 under the lowest signal-to-noise ratio setting…”

      We observed that increasing the number of potential sets helps with recovering causal variants for Stable PICS (Figure 2—figure supplements 13-15). This observation also accounts for the comparable power that Stable PICS has with SuSiE in simulations with low signal-to-noise ratio (SNR), when we increase the number of credible sets or potential sets (Figure 2—figure supplements 10-12).

      (3) Given the previous studies have identified multiple putative causal variants in both GWAS and eQTL, I think it's better to model multiple causal variants in any modern fine-mapping methods. At least, a simulation to assess its impact would be appreciated.

      We agree. In our simulations we considered up to three causal variants in cis, and evaluated how well the top three Potential Sets recovered all causal variants (Figure 2—figure supplements 13-15; Figure 2—figure supplement 15). We also reported the frequency of variant matches between Top and Stable PICS stratified by the number of causal variants simulated in Supplementary File 2B and 2C. Note Supplementary File 2C is for results from SuSiE fine-mapping; see Response to Reviewer 1, Comment 2.

      Supplementary File 2B. Frequencies with which Stable and Top PICS have matching variants for the same potential set. For each SNR/ “No. Causal Variants” scenario, the number of matching variants is reported in parentheses.

      Supplementary File 2C. Frequencies with which Stable and Top SuSiE have matching variants for the same credible set. For each SNR/ “No. Causal Variants” scenario, the number of matching variants is reported in parentheses.

      (4) Relatedly, I wonder what fraction of non-matching variants are due to the lack of multiple causal variant modeling.

      PICS handles multiple causal variants by including more potential sets to return, owing to the important caveat that causal variants in high LD cannot be statistically distinguished. For example, if one believes there are three causal variants that are not too tightly linked, one could make PICS return three potential sets rather than just one. To answer the question using our simulation study, we subsetted our results to just scenarios where the top and stable variants do not match. This mimics the exact scenario of having modeled multiple causal variants but still not yielding matching variants, so we can investigate whether these non-matching variants are in fact enriched in the true causal variants.

      Because we expect causal variants to appear in some potential set, we specifically considered whether these non-matching causal variants might match along different potential sets across the different methods. In other words, we compared the stable variant with the top variant from another potential set for the other approach (e.g., Stable PICS Potential Set 1 variant vs Top PICS Potential Set 2 variant). First, we computed the frequency with which such pairs of variants match. A high frequency would demonstrate that, even if the corresponding potential sets do not have a variant match, there could still be a match between non-corresponding potential sets across the two approaches, which shows that multiple causal variant modeling boosts identification of matching variants between both approaches — regardless of whether the matching variant is in fact causal.

      Low frequencies were observed. For example, when restricting to simulations where Top and Stable PICS Potential Set 1 variants did not match, about 2-3% of variants matched between the Potential Set 1 variant in Stable PICS and Potential Sets 2 and 3 variants in Top PICS; or between the Potential Set 1 variant in Top PICS and Potential Sets 2 and 3 variants in Stable PICS (Supplementary File 2D). When looking at non-matching Potential Set 2 or Potential Set 3 variants, we do see an increase in matching frequencies (between 10-20%) between Potential Set 2 variants and other potential set variants between the different approaches. However, these percentages are still small compared to the matching frequencies we observed between corresponding potential sets (e.g., for simulations with one causal variant this was 70-90% between Top and Stable PICS Potential Set 1, and for simulations with two and three causal variants this was 55-78% and 57-79% respectively).

      We next checked whether these “off-diagonal” matching variants corresponded to the true causal variants simulated. Here we find that the causal variant recovery rate is mostly less than the corresponding rate for diagonally matching variants, which together with the low matching frequency suggests that the enrichment of causal variants of “off-diagonal” matching variants is much weaker than in the diagonally matching approach. In other words, the fraction of non-matching (causal) variants due to the lack of multiple causal variant modeling is low.

      We discuss these findings in Supplementary File 1 Section S2 (bottom of p.2).

      (5) I wonder if you can combine the stability-based and the residualization-based approach, i.e., using the residualized phenotypes for the stability-based approach. Would that further improve the accuracy or not?

      This is a good idea, thank you for suggesting it. We pursued this combined approach on simulated gene expression phenotypes, but did not observe significant gains in causal variant recovery (Figure 2B; Figure 2—figure supplements 2, 13 and 15). We reported this Results “Searching for matching variants between Top PICS and Stable PICS improves causal variant Recovery.”

      “We thus explore ways to combine the residualization and stability-driven approaches, by considering (i) combining them into a single fine-mapping algorithm (we call the resulting procedure Combined PICS); and (ii) prioritizing matching variants between the two algorithms. Comparing the performance of Combined PICS against both Top and Stable PICS, however, we find no significant difference in its ability to recover causal variants (Figure 2B)...”

      However, we also confirmed in our simulations that prioritizing matching variants between the two approaches led to gains in causal variant recovery (Figure 2D; Figure 2—figure supplements 4, 19, 20 and 22). We reported this Results “Searching for matching variants between Top PICS and Stable PICS improves causal variant Recovery.”

      “On the other hand, matching variants between Top and Stable PICS are significantly more likely to be causal. Across all simulations, a matching variant in Potential Set 1 is 2.5X as likely to be causal than either a non-matching top or stable variant (Figure 2D) — a result that was qualitatively consistent even when we stratified simulations by SNR and number of causal variants simulated (Figure 2—figure supplements 19, 20 and 22)...”

      This finding is consistent with our analysis of real GEUVADIS gene expression data, where we reported larger functional significance of matching variants relative to non-matching variants returned by either Top of Stable PICS.

      (6) The authors state that confounding in cohorts with diverse ancestries poses potential difficulties in identifying the correct causal variants. However, I don't see that they directly address whether the stability approach is mitigating this. It is hard to say whether the stability approach is helping beyond what simpler post-hoc QC (e.g., thresholding) can do.

      Thank you for raising this fair point. Here is a model we have in mind. Gene expression phenotypes (Y) can be explained by both genotypic effects (G, as in genotypic allelic dosage) and the environment (E): Y = G + E. However, both G and E depend on ancestry (A), so that Y = G|A+E|A. Suppose that the causal variants are shared across ancestries, so that (G|A=a)=G for all ancestries a. Suppose however that environments are heterogeneous by ancestry: (E|A=a) = e(a) for some function e that depends non-trivially on a. This would violate the exchangeability of exogenous E in the full sample, but by performing fine-mapping on each ancestry stratum, the exchangeability of exogenous E is preserved. This provides theoretical justification for the stability approach.

      We next turned to simulations, where we investigated 1,440 simulated gene expression phenotypes capturing various ways in which ancestry induces heterogeneity in the exogenous E variable (simulation details in Lines 576-610 of Materials and Methods). We ran Stable PICS, as well as a version of PICS that did not residualize phenotypes or apply the stability principle. We observed that (i) causal variant recovery performance was not significantly different between the two approaches (Figure 2—figure supplements 24-32); but (ii) disagreement between the approaches can be considerable, especially when the signal-to-noise ratio is low (Supplementary File 2A). For example, in a set of simulations with three causal variants, with SNR = 0.11 and E heterogeneous by ancestry by letting E be drawn from N(2σ,σ<sup>2</sup>) for only GBR individuals (rest are N(0,σ<sup>2</sup>)), there was disagreement between Potential Set 1 and 2 variants in 25% of simulations — though recovery rates were similar (Probability of recovering at least one causal variant: 75% for Plain PICS and 80% for Stable PICS). These points suggest that confounding in cohorts can reduce power in methods not adjusting or accounting for ancestral heterogeneity, but can be remedied by approaches that do so. We report this analysis in Results “Simulations justify exploration of stability guidance”

      In the current version of our work, we have evaluated, using both simulations and empirical evidence, different ways to combine approaches to boost causal variant recovery. Our simulation study shows that prioritizing matching variants across multiple methods improves causal variant recovery. On GEUVADIS data, where we might not know which variants are causal, we already demonstrated that matching variants are enriched for functional annotations. Therefore, our analyses justify that the adverse consequence of confounding on reducing fine-mapping accuracy can be mitigated by prioritizing matching variants between algorithms including those that account for stability.

      (7) For non-matching variants, I wonder what the difference of posterior probabilities is between the stable and top variants in each method. If the difference is small, maybe it is due to noise rather than signal.

      We have reported differences in posterior probabilities returned by Stable and Top PICS for GEUVADIS data; see Figure 3—figure supplement 1. For completeness, we compute the differences in posterior probabilities and summarize these differences both as histograms and as numerical summary statistics.

      Potential Set 1

      - Number of non-matching variants = 9,921

      - Table of Summary Statistics of (Stable Posterior Probability – Top Posterior Probability)

      Author response table 1.

      - Histogram of (Stable Posterior Probability – Top Posterior Probability)

      Author response image 1.

      Potential Set 2

      - Number of non-matching variants = 14,454

      - Table of Summary Statistics of (Stable Posterior Probability – Top Posterior Probability)

      Author response table 2.

      - Histogram of (Stable Posterior Probability – Top Posterior Probability)

      Author response image 2.

      Potential Set 3

      - Number of non-matching variants = 16,814

      - Table of Summary Statistics of (Stable Posterior Probability – Top Posterior Probability)

      Author response table 3.

      - Histogram of (Stable Posterior Probability – Top Posterior Probability)

      Author response image 3.

      We also compared the difference in posterior probabilities between non-matching variants returned by Stable PICS and Top PICS for our 2,400 simulated gene expression phenotypes. Focusing on just Potential Set 1 variants, we find two equally likely scenarios, as demonstrated by two distinct clusters of points in a “posterior probability-posterior probability” plot. The first is, as pointed out, a small difference in posterior probability (points lying close to y=x). The second, however, reveals stable variants with very small posterior probability (of order 4 x 10<sup>–5</sup> to 0.05) but with a non-matching top variant taking on posterior probability well distributed along [0,1]. Moving down to Potential Sets 2 and 3, the distribution of pairs of posterior probabilities appears less clustered, indicating less tendency for posterior probability differences to be small ( Figure 2—figure supplement 8).

      Here are the histograms and numerical summary statistics.

      Potential Set 1

      - Number of non-matching variants = 663 (out of 2,400)

      - Table of Summary Statistics of (Stable Posterior Probability – Top Posterior Probability)

      Author response table 4.

      - Histogram of (Stable Posterior Probability – Top Posterior Probability)

      Author response image 4.

      Potential Set 2

      Number of non-matching variants = 1,429 (out of 2,400)

      - Table of Summary Statistics of (Stable Posterior Probability – Top Posterior Probability)

      Author response table 5.

      - Histogram of (Stable Posterior Probability – Top Posterior Probability)

      Author response image 5.

      Potential Set 3

      - Number of non-matching variants = 1,810 (out of 2,400)

      - Table of Summary Statistics of (Stable Posterior Probability – Top Posterior Probability)

      Author response table 6.

      - Histogram of (Stable Posterior Probability – Top Posterior Probability)

      Author response image 6.

      (8) It's a bit surprising that you observed matching variants with (stable) posterior probability ~ 0 (SFig. 1). What are the interpretations for these variants? Do you observe functional enrichment even for low posterior probability matching variants?

      Thank you for this question. We have performed a thorough analysis of matching variants with very low stable posterior probability, which we define as having a posterior probability < 0.01 (Supplementary File 1 Section S11). Here, we briefly summarize the analysis and key findings.

      Analysis

      First, such variants occur very rarely — only 8 across all three potential sets in simulations, and 17 across all three potential sets for GEUVADIS (the latter variants are listed in Supplementary 2E). We begin interpreting these variants by looking at allele frequency heterogeneity by ancestry, support size — defined as the number of variants with positive posterior probability in the ALL slice* — and the number of slices including the stable variant (i.e., the stable variant reported positive posterior probability for the slice).

      *Note that the stable variant posterior probability need not be at least 1/(Support Size). This is because the algorithm may have picked a SNP that has a lower posterior probability in the ALL slice (i.e., not the top variant) but happens to appear in the most number of other slices (i.e., a stable variant).

      For variants arising from simulations, because we know the true causal variants, we check if these variants are causal. For GEUVADIS fine-mapped variants, we rely on functional annotations to compare their relative enrichment against other matching variants that did not have very low stable posterior probability.

      Findings

      While we caution against generalizing from observations reported here, which are based on very small sample sizes, we noticed the following. In simulations, matching variants with very low stable posterior probability are largely depleted in causal variants, although factors such as the number of slices including the stable variant may still be useful. In GEUVADIS, however, these variants can still be functionally enriched. We reported three examples in Supplementary File 1 Section S11 (pp. 8-9 of Supplement), where the variants were enriched in either VEP or biologically interpretable functional annotations, and were also reported in earlier studies. We partially reproduce our report below for convenience.

      “However, we occasionally found variants that stand out for having large functional annotation scores. We list one below for each potential set.

      - Potential Set 1 reported the variant rs12224894 from fine-mapping ENSG00000255284.1 (accession code AP006621.3) in Chromosome 11. This variant stood out for lying in the promoter flanking region of multiple cell types and being relatively enriched for GC content with a 75bp flanking region. This variant has been reported as a cis eQTL for AP006632 (using whole blood gene expression, rather than lymphoblastoid cell line gene expression in this study) in a clinical trial study of patients with systemic lupus erythematosus (Davenport et al., 2018). Its nearest gene is GATD1, a ubiquitously expressed gene that codes for a protein and is predicted to regulate enzymatic and catabolic activity. This variant appeared in all 6 slices, with a moderate support size of 23.

      - Potential Set 2 reported the variant rs9912201 from fine-mapping ENSG00000108592.9 (mapped to FTSJ3) in Chromosome 17. Its FIRE score is 0.976, which is close to the maximum FIRE score reported across all Potential Set 2 matching variants. This variant has been reported as a SNP in high LD to a GWAS hit SNP rs7223966 in a pan-cancer study (Gong et al., 2018). This variant appeared in all 6 slices, with a moderate support size of 32.

      - Potential Set 3 reported the variant rs625750 from fine-mapping ENSG00000254614.1 (mapped to CAPN1-AS1, an RNA gene) in Chromosome 11. Its FIRE score is 0.971 and its B statistic is 0.405 (region under selection), which lie at the extreme quantiles of the distributions of these scores for Potential Set 3 matching variants with stable posterior probability at least 0.01. Its associated mutation has been predicted to affect transcription factor binding, as computed using several position weight matrices (Kheradpour and Kellis, 2014). This variant appeared in just 3 slices, possibly owing to the considerable allele frequency difference between ancestries (maximum AF difference = 0.22). However, it has a small support size of 4 and a moderately high Top PICS posterior probability of 0.64.

      To summarize, our analysis of GEUVADIS fine-mapped variants demonstrates that matching variants with very low stable posterior probability could still be functionally important, even for lower potential sets, conditional on supportive scores in interpretable features such as the number of slices containing the stable variant and the posterior probability support size…”

    1. The three dimensional framework can be visualized as a 3D coordinate system. The x-axis represents the Data Plane, with each point a unique hash of a dataset. The y-axis represents the Account Plane, with each point a collision-free public key. The z-axis represents the Time Plane, with each point a timestamped hash recorded on a blockchain or hash-registry.

      GraphTopologySenseMaking

      Generally graphs are visualised in an abstract relational space. You are describing here a 3 dimensional map (or rather a 2-dimensional map with a time dimension). the image above it does not seem to reflect such a space and the added benefits of working in a concrete mapable space are not immediately clear to me.

    1. Author response:

      The following is the authors’ response to the original reviews

      eLife Assessment

      This manuscript presents useful insights into the molecular basis underlying the positive cooperativity between the co-transported substrates (galactoside sugar and sodium ion) in the melibiose transporter MelB. Building on years of previous studies, this work improves on the resolution of previously published structures and reports the presence of a water molecule in the sugar binding site that would appear to be key for its recognition, introduces further structures bound to different substrates, and utilizes HDX-MS to further understand the positive cooperativity between sugar and the co-transported sodium cation. Although the experimental work is solid, the presentation of the data lacks clarity, and in particular, the HDX-MS data interpretation requires further explanation in both methodology and discussion, as well as a clearer description of the new insight that is obtained in relation to previous studies. The work will be of interest to biologists and biochemists working on cation-coupled symporters, which mediate the transport of a wide range of solutes across cell membranes.

      We express our gratitude to the associate editor, review editor, and reviewers for their favorable evaluation of this manuscript, as well as their constructive comments and encouragement. Their feedback has been integrated to fortify the evidence, refine the data analysis, and elevate the presentation of the results, thereby enhancing the overall quality and clarity of the manuscript.

      A brief summary of the modifications in this revision:

      (a) We performed four new experiments: 1) intact cell [<sup>3</sup>H]raffinose transport assay; 2) intact cell p-nitrophenol detection to demonstrate α-NPG transport; 3) ITC binding assay for the D59C mutant; and 4) molecular dynamics to simulate the water-1 in sugar-binding site and the dynamics of side chains in the Na<sup>+</sup>- and melibiose-binding pockets. All data consistently support the conclusion draw in this article.

      (b) We have added a new figure to show the apo state dynamics (the new Fig. 5a,b) and annotated the amino acid residue positions and marked positions in sugar- or Na<sup>+</sup>-binding pockets.

      (c) As suggested by reviewer-3, we have moved the individual mapping of ligand effects on HDX data to the main figure, combined with the residual plots, and marked the amino-acid residue positions.

      (d) We have added more deuterium uptake plots to cover all residues in the sugar- or Na<sup>+</sup>-binding pockets in the current figure 7 (previously figure 6).

      (e) We have added a new figure 8 showing the positions at the well-studied cytoplasmic gating salt-bridge network and other loops likely important for conformational changes, along with a membrane topology marked with the HDX data. We have added a new figure 9 from MD simulations.

      Reviewer #1:

      While the structure of the melibiose permease in both outward and inward-facing forms has been solved previously, there remain unanswered questions regarding its mechanism. Hariharan et al set out to address this with further crystallographic studies complemented with ITC and hydrogen-deuterium exchange (HDX) mass spectrometry.

      (1) They first report 4 different crystal structures of galactose derivatives to explore molecular recognition, showing that the galactose moiety itself is the main source of specificity. Interestingly, they observe a water-mediated hydrogen bonding interaction with the protein and suggest that this water molecule may be important in binding.

      We thank you for understanding what we've presented in this manuscript.

      (2) The results from the crystallography appear sensible, though the resolution of the data is low, with only the structure with NPG better than 3Å. However, it is a bit difficult to understand what novel information is being brought out here and what is known about the ligands. For instance, are these molecules transported by the protein or do they just bind? They measure the affinity by ITC, but draw very few conclusions about how the affinity correlates with the binding modes. Can the protein transport the trisaccharide raffinose?

      The four structures with bound sugars of different sizes were used to identify the binding motif on both the primary substrate (sugar) and the transporter (MelB<sub>St</sub>). Although the resolutions of the structures complexed with melibiose, raffinose, or a-MG are relatively low, the size and shape of the densities at each structure are consistent with the corresponding sugar molecules, which provide valuable data for confirming the pose of the bound sugar proposed previously. In this revision, we further refine the α-NPG-bound structure to 2.60 Å. The identified water-1 in this study further confirms the orientation of C4-OH. Notably, this transporter does not recognize or transport glucosides in which the orientation of the C4-OH at the glucopyranosyl ring is opposite. To verify the water in the sugar-binding site, we initiated a new collaborative study using MD simulations. Results showed that Wat-1 exhibited nearly full occupancy when melibiose was present, regardless of whether Na<sup>+</sup> was bound at the cation-binding site.

      As detailed in the Summary, we added two additional sets of transport assays and confirmed that raffinose and α-NPG are transportable substrates of MelB<sub>St</sub>. For α-NPG transport, we measured the end products of the process—enzyme hydrolysis and membrane diffusion of p-nitrophenol released from intracellular α-NPG.

      As a bonus, based on the WT-like downhill α-NPG transport activity by the D59C uniporter mutant that failed in active transport against a sugar concentration gradient, we further emphasized that the sugar translocation pathway is isolated from the cation-binding site. The new data strongly support the allosteric effects of cation binding on sugar-binding affinity. Thank you for this helpful suggestion.

      A meaningful analysis of ITC data heavily depends on the quality of the data. My laboratory has extensive experience with ITC and has gained rich, insightful mechanistic knowledge of MelB<sub>St</sub>. Because of the low affinity in raffinose and a-MG, unfortunately, no further information can be convincingly obtained. Therefore, we did not dissect the enthalpic and entropic contributions but focused on the Kd value and binding stoichiometry.

      (3) The HDX also appears to be well done; however, in the manuscript as written, it is difficult to understand how this relates to the overall mechanism of the protein and the conformational changes that the protein undergoes.

      We are sorry for not presenting our data clearly in the initial submission. In this revised manuscript, we have made numerous improvements, as described in the Summary. These enhancements in the HDX data analysis provided new mechanistic insights into the allosteric effects, leading us to conclude that protein dynamics and conformational transitions are coupled with sugar-binding affinity. Na<sup>+</sup> binding restricts protein conformational flexibility, thereby increasing sugar-binding affinity. The HDX study revealed that the major dynamic region includes a sugar-binding residue, Arg149, which also plays a gating role. Structurally, this dual-function residue undergoes significant displacement during the sugar-affinity-coupled conformational transition, thereby coupling the sugar binding and structural dynamics.

      Reviewer #2:

      This manuscript from Hariharan, Shi, Viner, and Guan presents x-ray crystallographic structures of membrane protein MelB and HDX-MS analysis of ligand-induced dynamics. This work improves on the resolution of previously published structures, introduces further sugar-bound structures, and utilises HDX to explore in further depth the previously observed positive cooperatively to cotransported cation Na<sup>+</sup>. The work presented here builds on years of previous study and adds substantial new details into how Na<sup>+</sup> binding facilitates melibiose binding and deepens the fundamental understanding of the molecular basis underlying the symport mechanism of cation-coupled transporters. However, the presentation of the data lacks clarity, and in particular, the HDX-MS data interpretation requires further explanation in both methodology and discussion.

      We appreciate this reviewer's time in reading our previous articles related to this manuscript.

      Comments on Crystallography and biochemical work:

      (1) It is not clear what Figure 2 is comparing. The text suggests this figure is a comparison of the lower resolution structure to the structure presented in this work; however, the figure legend does not mention which is which, and both images include a modelled water molecule that was not assigned due to poor resolution previously, as stated by the authors, in the previously generated structure. This figure should be more clearly explained.

      This figure is a stereo view of a density map created in cross-eye style. In this revision, we changed this figure to Fig. 3 and showed only the density for sugar and water-1. 

      (2) It is slightly unclear what the ITC measurements add to this current manuscript. The authors comment that raffinose exhibiting poor binding affinity despite having more sugar units is surprising, but it is not surprising to me. No additional interactions can be mapped to these units on their structure, and while it fits into the substrate binding cavity, the extra bulk of additional sugar units is likely to reduce affinity. In fact, from their listed ITC measurements, this appears to be the trend. Additionally, the D59C mutant utilised here in structural determination is deficient in sodium/cation binding. The reported allostery of sodium-sugar binding will likely influence the sugar binding motif as represented by these structures. This is clearly represented by the authors' own ITC work. The ITC included in this work was carried out on the WT protein in the presence of Na<sup>+</sup>. The authors could benefit from clarifying how this work fits with the structural work or carrying out ITC with the D59C mutant, or additionally, in the absence of sodium.

      Thank this reviewer for your helpful suggestions. We have performed the suggested ITC measurements with the D59C mutant. The purpose of the ITC experiments was to demonstrate that MelB<sub>St</sub> can bind raffinose and α-MG to support the crystal structures.

      Comments on HDX-MS work:

      While the use of HDX-MS to deepen the understanding of ligand allostery is an elegant use of the technique, this reviewer advises the authors to refer to the Masson et al. (2019) recommendations for the HDX-MS article (https://doi.org/10.1038/s41592-019-0459-y) on how to best present this data. For example:

      All authors value this reviewer's comments and suggestions, which have been included in this revision.

      (1) The Methodology includes a lipid removal step. Based on other included methods, I assumed that the HDX-MS was being carried out in detergent-solubilised protein samples. I therefore do not see the need for a lipid removal step that is usually included for bilayer reconstituted samples. I note that this methodology is the same as previously used for MelB. It should be clarified why this step was included, if it was in fact used, aka, further details on the sample preparation should be included.

      Yes, a lipid/detergent removal step was included in this study and previous ones, and this information was clearly described in the Methods.

      (2) A summary of HDX conditions and results should be given as recommended, including the mean peptide length and average redundancy per state alongside other included information such as reaction temperature, sequence coverage, etc., as prepared for previous publications from the authors, i.e., Hariharan et al., 2024.

      We have updated the Table S2 and addressed the reviewer’ request for the details of HDX experiments.

      (3) Uptake plots per peptide for the HDX-MS data should be included as supporting information outside of the few examples given in Figure 6.

      We have prepared and presented deuterium uptake time-course plots for any peptides with ΔD > threshold in Fig. S5a-c.

      (4) A reference should be given to the hybrid significance testing method utilised. Additionally, as stated by Hageman and Weis (2019) (doi:10.1021/acs.analchem.9b01325), the use of P < 0.05 greatly increases the likelihood of false positive ΔD identifications. While the authors include multiple levels of significance, what they refer to as high and lower significant results, this reviewer understands that working with dynamic transporters can lead to increased data variation; a statement of why certain statistical criteria were chosen should be included, and possibly accompanied by volcano plots. The legend of Figure 6 should include what P value is meant by * and ** rather than statistically significant and highly statistically significant.

      We appreciate this comment and have cited the suggested article on the hybrid significance method. We fully acknowledge that using a cutoff of P < 0.05 can increase the likelihood of false-positive identifications. By applying multiple levels of statistical testing, we determined that P < 0.05 is an appropriate threshold for this study. The threshold values were presented in the residual plots and explained in the text. For the previous Fig. 6 (renamed Fig. S4b in the current version), we have reported the P value. *, < 0.05; **, < 0.01. (The text for 0.01 was not visible in the previous version. Sorry for the confusion.)

      (5) Line 316 states a significant difference in seen in dynamics, how is significance measured here? There is no S.D. given in Table S4. Can the authors further comment on the potential involvement in solvent accessibility and buried helices that might influence the overall dynamics outside of their role in sugar vs sodium binding? An expected low rate of exchange suggests that dynamics are likely influenced by solvent accessibility or peptide hydrophobicity. The increased dynamics at peptides covering the Na binding site on overall more dynamic helices suggests that there is no difference between the dynamics of each site.

      The current Table S3 (combined from previous Tables S3 and S4 as suggested) was prepared to provide an overall view of the dynamic regions with SD values provided. For other questions, if we understand correctly, this reviewer asked us to comment on the effects of solvent accessibility or hydrophobic regions on the overall dynamics outside the binding residues of the peptides that cover them. Since HDX rates are influenced by two linked factors: solvent accessibility and hydrogen-bonding interactions that reflect structural dynamics, poor solvent accessibility in buried regions should result in low deuterium uptakes. The peptides in our dataset that include the Na<sup>+</sup>-binding site showed lower HDX, likely due to limited solvent accessibility and lower structural stability. It is unclear what this reviewer meant by "increased dynamics at peptides covering the Na binding site on overall more dynamic helices." We did not observe increased dynamics in peptides covering the Na<sup>+</sup>-binding site; instead, all Na<sup>+</sup>-binding residues and nearby sugar-binding residues have lower degrees of deuteriation.

      (6) Previously stated HDX-MS results of MelB (Hariharan et al., 2024) state that the transmembrane helices are less dynamic than polypeptide termini and loops with similar distributions across all transmembrane bundles. The previous data was obtained in the presence of sodium. Does this remove the difference in dynamics in the sugar-binding helices and the cation-binding helices? Including this comparison would support the statement that the sodium-bound MelB is more stable than the Apo state, along with the lack of deprotection observed in the differential analysis.

      Thanks for this suggestion. The previous datasets were collected in the presence of Na<sup>+</sup>. In the current study, we also have two Na<sup>+</sup>-containing datasets. Both showed similar results: the multiple overlapping peptides covering the sugar-binding residues on helices I and V have higher HDX rates than those peptides covering the Na<sup>+</sup>-binding residues, even when Na<sup>+</sup> was present.

      (7) Have the authors considered carrying out an HDX-MS comparison between the WT and the D59C mutant? This may provide some further information on the WT structure (particularly a comparison with sugar-bound). This could be tied into a nice discussion of their structural data.

      Thank you for this suggestion. Comparing HDX-MS between the WT and the D59C mutant is certainly interesting, especially with the increasing amount of structural, biochemical, and biophysical data now available for this mutant. However, due to limited resources, we might consider it later.

      (8) Have the authors considered utilising Li<sup>+</sup> to infer how cation selectivity impacts the allostery? Do they expect similar stabilisation of a higher-affinity sugar binding state with all cations?

      We have shown that Li<sup>+</sup> also works positively with melibiose. Li<sup>+</sup> binds to MelB<sub>St</sub> with a higher affinity than Na<sup>+</sup> and modifies MelB<sub>St</sub> differently. It is important to study this thoroughly and separately. To answer the second question, H<sup>+</sup> is a weak coupling cation with little effect on melibiose binding. Since its pKa is around 6.5, only a small population of MelB<sub>St</sub> is protonated at pH 7.5. The order of sugar-binding cooperativity is highest with Na<sup>+</sup>, then Li<sup>+</sup>, and finally H<sup>+</sup>.

      (9) MD of MelB suggests all transmembrane helices are reorientated during substrate translocation, yet substrate and cotransporter ligand binding only significantly impacts a small number of helices. Can the authors comment on the ensemble of states expected from each HDX experiment? The data presented here instead shows overall stabilisation of the transporter. This data can be compared to that of HDX on MFS sugar cation symporter XylE, where substrate binding induces a transition to the OF state. There is no discussion of how this HDX data compares to previous MFS sugar transporter HDX. The manuscript could benefit from this comparison rather than a comparison to LacY. It is unlikely that there are universal mechanisms that can be inferred even from these model proteins. Highlighting differences between these transport systems provides broader insights into this protein class. Doi: 10.1021/jacs.2c06148 and 10.1038/s41467-018-06704-1.

      The sugar translocation free-energy landscape simulations showed that both helix bundles move relative to the membrane plane. This analysis aimed to clarify a hypothesis in the field—that the MFS transporter can use an asymmetric mode to perform the conformational transition between inward- and outward-facing states. In the case of MelB<sub>St</sub>, we clearly demonstrated that both domains move and each helix bundle moves as a unit. So only a small number of helices and loops showed labeling changes. Thanks for the suggestion about comparing with XylE. We have included that in the discussion.

      (10) Additionally, the recent publication of SMFS data (by the authors: doi:10.1016/j.str.2022.11.011) states the following: "In the presence of either melibiose or a coupling Na<sup>+</sup>-cation, however, MelB increasingly populates the mechanically less stable state which shows a destabilized middle-loop C3." And "In the presence of both substrate and co-substrate, this mechanically less stable state of MelB is predominant.". It would benefit the authors to comment on these data in contrast to the HDX obtained here. Additionally, is the C3 loop covered, and does it show the destabilization suggested by these studies? HDX can provide a plethora of results that are missing from the current analysis on ligand allostery. The authors instead chose to reference CD and thermal denaturation methods as comparisons.

      Thank this reviewer for reading the single-molecule force spectroscopy (SMFS) study on MelB<sub>St</sub>.  The C3 loop mentioned in this SMFS article is partially covered in the dataset Mel or Mel plus Na<sup>+</sup> vs. apo, and there is more coverage in the Na<sup>+</sup> vs. apo dataset. In either condition, no deprotection was detected. The labeling time point might not be long enough to detect it.

      Reviewer #3:

      Summary:

      The melibiose permease from Salmonella enterica serovar Typhimurium (MelB<sub>St</sub>) is a member of the Major Facilitator Superfamily (MFS). It catalyzes the symport of a galactopyranoside with Na<sup>+</sup>, H<sup>+</sup>, or Li<sup>+</sup>, and serves as a prototype model system for investigating cation-coupled transport mechanisms. In cation-coupled symporters, a coupling cation typically moves down its electrochemical gradient to drive the uphill transport of a primary substrate; however, the precise role and molecular contribution of the cation in substrate binding and translocation remain unclear. In a prior study, the authors showed that the binding affinity for melibiose is increased in the presence of Na<sup>+</sup> by about 8-fold, but the molecular basis for the cooperative mechanism remains unclear. The objective of this study was to better understand the allosteric coupling between the Na<sup>+</sup> and melibiose binding sites. To verify the sugar-recognition specific determinants, the authors solved the outward-facing crystal structures of a uniport mutant D59C with four sugar ligands containing different numbers of monosaccharide units (α-NPG, melibiose, raffinose, or α-MG). The structure with α-NPG bound has improved resolution (2.7 Å) compared to a previously published structure and to those with other sugars. These structures show that the specificity is clearly directed toward the galactosyl moiety. However, the increased affinity for α-NPG involves its hydrophobic phenyl group, positioned at 4 Å-distance from the phenyl group of Tyr26, which forms a strong stacking interaction. Moreover, a water molecule bound to OH-4 in the structure with α-NPG was proposed to contribute to the sugar recognition and appears on the pathway between the two specificity-determining pockets. Next, the authors analyzed by hydrogen-to-deuterium exchange coupled to mass spectrometry (HDX-MS) the changes in structural dynamics of the transporter induced by melibiose, Na<sup>+</sup>, or both. The data support the conclusion that the binding of the coupling cation at a remote location stabilizes the sugar-binding residues to switch to a higher-affinity state. Therefore, the coupling cation in this symporter was proposed to be an allosteric activator.

      Strengths:

      (1) The manuscript is generally well written.

      (2) This study builds on the authors' accumulated knowledge of the melibiose permease and integrates structural and HDX-MS analyses to better understand the communication between the sodium ion and sugar binding sites. A high sequence coverage was obtained for the HDX-MS data (86-87%), which is high for a membrane protein.

      Thank this reviewer for your positive comments.

      Weaknesses:

      (1) I am not sure that the resolution of the structure (2.7 Å) is sufficiently high to unambiguously establish the presence of a water molecule bound to OH-4 of the α-NPG sugar. In Figure 2, the density for water 1 is not obvious to me, although it is indeed plausible that water mediates the interaction between OH4/OH6 and the residues Q372 and T373.

      A water molecule can be modeled at a resolution ranging from 2.4 to 3.2 Å, and the quality of the model depends on the map quality and water location. In this revision, we refined the resolution to 2.6 Å using the same dataset and also performed all-atom MD simulations. All results support the occupancy of water-1 in the sugar-bound MelB<sub>St</sub>.

      (2) Site-directed mutagenesis could help strengthen the conclusions of the authors. Would the mutation(s) of Q372 and/or T373 support the water hypothesis by decreasing the affinity for sugars? Mutations of Thr121, Arg 295, combined with functional and/or HDX-MS analyses, may also help support some of the claims of the authors regarding the allosteric communication between the two substrate-binding sites.

      The authors thank this reviewer for the thoughtful suggestions. MelB<sub>St</sub> has been subjected to Cys-scanning mutagenesis (https://doi.org/10.1016/j.jbc.2021.101090). Placing a Cys residue at Gln372 significantly decreased the transport initial rate, accumulation, and melibiose fermentation, with minimal effect on protein expression, as shown in Figure 2 of this JBC article, which could support its role in the binding pocket. The T373C mutant retained most of the WT's activities. Our previous studies showed that Thr121 is only responsible for Na<sup>+</sup> binding in MelB<sub>St</sub>, and mutations decreased protein stability; now, HDX reveals that this is the rigid position. Additionally, our previous studies indicated that Arg295 is another conformationally important residue. In this version, we have added more HDX analysis to explore the relationship between the two substrate-binding sites with conformational dynamics, especially focusing on the gating salt-bridge network including Arg295, which has provided meaningful new insights.

      (3) The main conclusion of the authors is that the binding of the coupling cation stabilizes those dynamic sidechains in the sugar-binding pocket, leading to a high-affinity state. This is visible when comparing panels c and a from Figure S5. However, there is both increased protection (blue, near the sugar) and decreased protection in other areas (red). The latter was less commented, could the increased flexibility in these red regions facilitate the transition between inward- and outward-facing conformations? The HDX changes induced by the different ligands were compared to the apo form (see Figure S5). It might be worth it for data presentation to also analyze the deuterium uptake difference by comparing the conditions sodium ion+melibiose vs melibiose alone. It would make the effect of Na<sup>+</sup> on the structural dynamics of the melibiose-bound transporter more visible. Similarly, the deuterium uptake difference between sodium ion+melibiose vs sodium ion alone could be analyzed too, in order to plot the effect of melibiose on the Na<sup>+</sup>-bound transporter.

      Thanks for this important question. We have added more discussion of the deprotected data and prepared a new Fig. 8b to highlight the melibiose-binding-induced flexibility in several loops, especially the gating area on both sides of the membrane. We also proposed that these changes might facilitate the formation of the transition-competent state. The overall effects induced by substrate binding are relatively small, and the datasets for apo and Na were collected separately, so comparing melibiose&Na<sup>+</sup> versus Na<sup>+</sup> might not be as precise. In fact, the Na<sup>+</sup> effects on the sugar-binding site can be clearly seen in the deuterium uptake plots shown in Figures 7-8, by comparing the first and last panels.

      (4) For non-specialists, it would be beneficial to better introduce and explain the choice of using D59C for the structural analyses.

      Asp59 is the only site that responds to the binding of all coupling cations: Na<sup>+</sup>, Li<sup>+</sup>, or H<sup>+</sup>. Notably, this thermostable mutant D59C selectively abolishes all cation binding and associated cotransport activities, but it maintains intact sugar binding and exhibits conformational transition as the WT, as demonstrated by electroneutral transport reactions including α-NPG transport showed in this articles, and melibiose exchange and fermentation showed previously. Therefore, the structural data derived from this mutant are significant and offer important mechanistic insights into sugar transport, which supports the conclusion that the Na<sup>+</sup> functions as allosteric activator.

      (5) In Figure 5a, deuterium changes are plotted as a function of peptide ID number. It is hardly informative without making it clearer which regions it corresponds to. Only one peptide is indicated (213-226). I would recommend indicating more of them in areas where deuterium changes are substantial.

      We appreciate this comment and have modified the plots by marking the residue position as well as labeled several peptides of significant HDX in the Fig 5b. We also provided a deuteriation map based on peptide coverage (Fig. 5a).

      (6) From prior work of the authors, melibiose binding also substantially increases the affinity of the sodium ion. Can the authors interpret this observation based on the HDX data?

      This is an intriguing mechanistic question. In this HDX study, we found that the cation-binding pocket and nearby sugar-binding residues are conformationally rigid, while some sugar-binding residues farther from the cation-binding pocket are flexible. We concluded that conformational dynamics regulate sugar-binding affinity, but the increase in Na-binding affinity caused by melibiose is not related to protein dynamics. Our previous interpretation based on structural data remains our preferred explanation; therefore, the bound melibiose physically prevents the release of Na<sup>+</sup> or Li<sup>+</sup> from the cation-binding pocket. We also proposed the mechanism of intracellular NA<sup>+</sup> release in the 2024 JBC paper (https://doi.org/10.1016/j.jbc.2024.107427); after sugar release, the rotamer change of Asp55 will help NA<sup>+</sup> exit the cation pocket into the empty sugar pocket, and the negative membrane potential inside the cell will further facilitate movement from MelB<sub>St</sub> to the cytosol.

      Recommendations for the authors:

      Reviewing Editor Comments:

      (1) It would help the reader if the previous work were introduced more clearly, and if the results of the experiments reported in this manuscript were put into the context of the previous work. Lines 283-296 discuss observations that are similar to previous reported structures as well as novel interpretations. It would help the reader to be clearer about what the new observations are.

      Thank you for the important comment. We have revised accordingly by adding related citations and words “as showed previously” when we stated our previous observations.

      (2) The affinity by ITC is measured for various ligands, but very few conclusions are drawn about how the affinity correlates with the binding modes. Are the other ligands that are investigated in this study transported by the protein, or do they just bind? Can the protein transport the trisaccharide raffinose? The authors comment that raffinose exhibiting poor binding affinity despite having more sugar units is surprising, but this is not surprising to me. No additional interactions can be mapped to these units on their structure, and while it fits into the substrate binding cavity, the extra bulk of additional sugar units is likely to reduce affinity. In fact, from their listed ITC measurements, this appears to be the trend.

      Additionally, the D59C mutant utilized here in structural determination is deficient in sodium/cation binding. The reported allostery of sodium-sugar binding will likely influence the sugar binding motif as represented by these structures. This is clearly represented by the authors' own ITC work. The ITC included in this work was carried out on the WT protein in the presence of Na<sup>+</sup>. The authors could benefit from clarifying how this work fits with the structural work or carrying out ITC with the D59C mutant, or additionally, in the absence of sodium. For non-specialists, please better introduce and explain the choice of using D59C for the structural analyses.

      Thank you for the meaningful comments. We have comprehensively addressed all the concerns and suggestions as listed in the summary of this revision. Notably, the D59C mutant does not catalyze any electrogenic melibiose transport involved in a cation transduction but catalyze downhill transport location of the galactosides, as shown by the downhill α-NPG transport assay in Fig. 1a. The intact downhill transport results from D59C mutant further supports the allosteric coupling between the cation- and sugar-binding sites.

      The binding isotherm and poor affinity of the ITC measurements do not support to further analyze the binding mode since none showed sigmoidal curve, so the enthalpy change cannot be accurately determined. But authors thank this comment.

      (3) It is not clear what Figure 2 is comparing. The text suggests this figure is a comparison of the lower resolution structure to the structure presented in this work; however, the figure legend does not mention which is which, and both images include a modelled water molecule that was not assigned due to poor resolution previously, as stated by the authors, in the previously generated structure. This figure should be more clearly explained.

      We have addressed these concerns in the response to the Public Reviews at reviewer-2 #1.

      (4) I am not sure that the resolution of the structure (2.7 Å) is sufficiently high to unambiguously establish the presence of a water molecule bound to OH-4 of the α-NPG sugar. In Figure 2, the density for water 1 is not obvious to me, although it is indeed plausible that water mediates the interaction between OH4/OH6 and the residues Q372 and T373. Please change line 278 to state "this OH-4 water molecule is likely part of sugar binding".

      We have addressed these concerns in the response to the Public Reviews at reviewer-3 #1.

      (5) Line 290-296: The Thr121 is not represented in any figures, while the Lys377 is. Their relative positioning between sugar water and sodium is not made clear by any figure.

      Thanks for this comment. This information has been clearly presented in the Figs. 7-8. Lys377 is closer to the cation site and related far from the sugar-binding site.

      (6) Methodology includes a lipid removal step. Based on other included methods, I assumed that the HDX-MS was being carried out in detergent-solubilized protein samples. I therefore do not see the need for a lipid removal step that is usually included for bilayer reconstituted samples. I note that this methodology is the same as previously used for MelB. It should be clarified why this step was included, if it was in fact used, aka, further details on the sample preparation should be included.

      (7) A summary of HDX conditions and results should be given as recommended, including the mean peptide length and average redundancy per state alongside other included information such as reaction temperature, sequence coverage, etc., as prepared for previous publications from the authors, i.e., Hariharan et al., 2024.

      We have addressed these concerns in the response to the Public Reviews at reviewer-2 #4.

      (8) Uptake plots per peptide for the HDX-MS data should be included as supporting information outside of the few examples given in Figure 6.

      We have addressed these concerns in the response to the Public Reviews at reviewer-2 #4.

      (9) A reference should be given to the hybrid significance testing method utilised. Additionally, as stated by Hageman and Weis (2019) (doi:10.1021/acs.analchem.9b01325), the use of P < 0.05 greatly increases the likelihood of false positive ΔD identifications. While the authors include multiple levels of significance, what they refer to as high and lower significant results, and this reviewer understands that working with dynamic transporters can lead to increased data variation, a statement of why certain statistical criteria were chosen should be included, and possibly accompanied by volcano plots. The legend of Figure 6 should include what P value is meant by * and ** rather than statistically significant and highly statistically significant.

      We have addressed these concerns in the response to the Public Reviews at reviewer-2 #4.

      (10) The table (S3) and figure (S4) showing uncovered residues is an unclear interpretation of the data; this would be better given as a peptide sequence coverage heat map. This would also be more informative for the redundancy in covered regions, too. In this way, S3 and S4 can be combined.

      We have addressed these concerns in the response to the Public Reviews at reviewer-2 #4.

      (11) Residual plots in Figure 5 could be improved by a topological map to indicate how peptide number resembles the protein amino acid sequence.

      Thanks for the request, due to the figure 6 is big so that we add a transmembrane topology plot colored with the HDX results in Fig. 8c.

      (12) The presentation of data in S5 could be clarified. Does the number of results given in the brackets indicate overlapping peptides? What are the lengths of each of these peptides? Classical HDX data presentation utilizes blue for protection and red for deprotection. The use of yellow ribbons to show protection in non-sugar binding residues takes some interpretation and could be clarified by also depicting in a different blue. I also don't see the need to include ribbon and cartoon representation when also using colors to depict protection and deprotection. The authors should change or clarify this choice.

      We have moved this figure into the current Fig. 6b as suggested by Reviewer-3. To address your questions listed in the figure legend, the number of results shown in brackets indeed indicates overlapping peptides. What are the lengths of each of these peptides? The sequences of each peptide are shown in Figures 7-8 and are also included in Supplemental Figure S5. Regarding the use of color, both blue and green were used to distinguish peptides protecting the substrate-binding site from other regions. The ribbon and cartoon representations are provided for clarity, as the cartoon style hides many helices.

      (13) In Table S5, the difference between valid points and protection is unclear. And what is indicated by numbers in brackets or slashes? Additionally, it should be highlighted again here that single-residue information is inferred from peptide-level data. By value, are the authors referring to peptide-level differential data?

      Please review our responses in the Public Reviews at reviewer-2 #5.

      (14) Line 316 states a significant difference in seen in dynamics, how is significance measured here? There is no S.D. given in Table S4. Can the authors further comment on the potential involvement in solvent accessibility and buried helices that might influence the overall dynamics outside of their role in sugar vs sodium binding? An expected low rate of exchange suggests that dynamics are likely influenced by solvent accessibility or peptide hydrophobicity? The increased dynamics at peptides covering the Na binding site on overall more dynamic helices suggests that there isn't a difference between the dynamics of each site.

      Please review our responses in the Public Reviews at reviewer-2 #5.

      (15) Previously stated HDX-MS results of MelB (Hariharan et al., 2024) state that the transmembrane helices are less dynamic than polypeptide termini and loops with similar distributions across all transmembrane bundles. The previous data was obtained in the presence of sodium. Does this remove the difference in dynamics in the sugar-binding helices and the cation-binding helices? Including this comparison would support the statement that the sodium-bound MelB is more stable than the Apo state, along with the lack of deprotection observed in the differential analysis.

      Please review our responses in the Public Reviews.

      (16) MD of MelB suggests all transmembrane helices are reorientated during substrate translocation, yet substrate and cotransporter ligand binding only significantly impacts a small number of helices. Can the authors comment on the ensemble of states expected from each HDX experiment? The data presented here instead shows overall stabilisation of the transporter. This data can be compared to that of HDX on MFS sugar cation symporter XylE, where substrate binding induces a transition to the OF state. There is no discussion of how this HDX data compares to previous MFS sugar transporter HDX. The manuscript could benefit from this comparison rather than a comparison to LacY. It is unlikely that there are universal mechanisms that can be inferred even from these model proteins. Highlighting differences instead between these transport systems provides broader insights into this protein class. Doi: 10.1021/jacs.2c06148 and 10.1038/s41467-018-06704-1.

      Please review our responses in the Public Reviews.

      (17) Additionally, the recent publication of SMFS data (by the authors: doi:10.1016/j.str.2022.11.011) states the following: "In the presence of either melibiose or a coupling Na<sup>+</sup>-cation, however, MelB increasingly populates the mechanically less stable state which shows a destabilized middle-loop C3." And "In the presence of both substrate and co-substrate this mechanically less stable state of MelB is predominant.". It would benefit the authors to comment on these data in contrast to the HDX obtained here. Additionally, is the C3 loop covered, and does it show the destabilization suggested by these studies? HDX can provide a plethora of results that are missing from the current analysis on ligand allostery. The authors instead chose to reference CD and thermal denaturation methods as comparisons.

      Please review our responses in the Public Reviews.

      (18) The main conclusion of the authors is that the binding of the coupling cation stabilizes those dynamic sidechains in the sugar-binding pocket, leading to a high-affinity state. This is visible when comparing panels c and a from Figure S5. However, there is both increased protection (blue, near the sugar) and decreased protection in other areas (red). The latter was less commented, could the increased flexibility in these red regions facilitate the transition between inward- and outward-facing conformations? The HDX changes induced by the different ligands were compared to the apo form (see Figure S5). It might be worth it for data presentation more visible to also analyze the deuterium uptake difference by comparing the conditions sodium ion+melibiose vs melibiose alone. You would make the effect of Na<sup>+</sup> on the structural dynamics of the melibiose-bound transporter. Similarly, the deuterium uptake difference between sodium ion+melibiose vs sodium ion alone could be analyzed too, in order to plot the effect of melibiose on the Na<sup>+</sup>-bound transporter.

      Please review our responses in the Public Reviews.

      (19) In Figure 5a, deuterium changes are plotted as a function of peptide ID number. It is hardly informative without making it clearer which regions it corresponds to. Only one peptide is indicated (213-226); I would recommend indicating more of them, in areas where deuterium changes are substantial.

      Please review our responses in the Public Reviews.

      (20) Figure 6, please indicate in the legend what the black and blue lines are (I assume black is for the apo?)

      We are sorry that we did not make it clear. Yes, the black was used for apo state and blue was used for all bound states

      (21) From prior work of the authors, melibiose binding also substantially increases the affinity of the sodium ion. Can the authors interpret this observation based on the HDX data?

      Please review our responses in the Public Reviews.

      Addressing the following three points would strengthen the manuscript, but also involve a significant amount of additional experimental work. If the authors decide not to carry out the experiments described below, they can still improve the assessment by focusing on points (1-21) described above.

      (22) Have the authors considered carrying out an HDX-MS comparison between the WT and the D59C mutant? This may provide some further information on the WT structure (particularly a comparison with sugar-bound). This could be tied into a nice discussion of their structural data.

      Please review our responses in the Public Reviews.

      (23) Have the authors considered utilising Li<sup>+</sup> to infer how cation selectivity impacts the allostery? Do they expect similar stabilisation of a higher-affinity sugar binding state with all cations?

      Please review our responses in the Public Reviews.

      (24) Site-directed mutagenesis could help strengthen the conclusions. Would the mutation(s) of Q372 and/or T373 support the water hypothesis by decreasing the affinity for sugars? Mutations of Thr 121 and Arg 295, combined with functional and/or HDX-MS analyses, may also help support some of the authors' claims regarding allosteric communication between the two substrate-binding sites.

      Please review our responses in the Public Reviews.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity

      __Summary

      Köver et al. examine the genetic and environmental underpinnings of multicellular-like phenotypes (MLPs) in fission yeast, studying 57 natural isolates of Schizosaccharomyces pombe. They uncover that a noteworthy subset of these isolates can develop MLPs, with the extent of these phenotypes varying according to growth media. Among these, two strains demonstrate pronounced MLP across a range of conditions. By genetically manipulating one strain with an MLP phenotype (distinct from the previously mentioned two strains), they provide evidence that genes such as MBX2 and SRB11 play a direct role in MLP formation, strengthening their genetic mapping findings. The study also reveals that while some key genes and their phenotypic effects are strikingly similar between budding and fission yeast, other aspects of MLP formation are not conserved, which is an intriguing finding.

      Overall, the manuscript is well-written, dense yet logically structured, and the figures are well presented. The combination of phenotypic, genetic, and bioinformatics analyses, particularly from wet lab experiments, is commendable. The study addresses a significant gap in our understanding, primarily explored in budding yeast, by providing comprehensive data on MLP diversity in fission yeast and the interplay of genetic and environmental factors.

      In summary, I enjoyed reading the manuscript and have only a few minor suggestions to strengthen the paper:

      Minor revisions:

      1. Although this may seem like a minor revision, but it is a crucial point. Please make sure that all raw data used to generate figures, run stats, sequence data, and scripts used to run data analysis are made publicly available. Provide relevant accession numbers and links to public data repositories. It is important that others can download the various types of data that went into the major conclusions of this paper in order to replicate your analysis or expand upon the scope of this work. I am not sure if the journal has a policy regarding this, but it should be followed to allow for transparency and reproducibility of the research.__

      Reply: We very much agree with the reviewer that sharing raw data and scripts is an essential part of open science. All code and data are deposited to Github (https://github.com/BKover99/S.-Pombe-MLPs) and Figshare (https://figshare.com/articles/software/S_-Pombe-MLPs/25750980), which have now been updated to reflect our revisions. Additionally, the sequenced genomes have been deposited to ENA (PRJEB69522). Where external data was used, it was properly referenced and specifically included in Supplementary Table 3.

      Two out of 57 strains exhibit strong and consistent MLP across multiple environments. Providing more information on these strains (JB914 and JB953), such as their natural habitats and distinct appearances of their MLP phenotypes under varying conditions, would provide valuable insights.

      First, a brief discussion highlighting what differentiates these two strains from the rest would be helpful for readers (e.g. insight into their unique genetic and environmental background that might be linked to the MLP phenotype).

      Additionally, culture tube and microscopy images of these strains, similar to those presented for JB759 in Figure 2A, can be included in the supplementary materials. My reasoning is that these images could help illustrate variation or lack thereof in aggregative group size across different media.

      Reply: We thank the reviewer for highlighting this issue. Our further investigation into these strains has added additional interesting insights. JB914 and JB953 were isolated from molasses in Jamaica and the exudate of Eucalyptus in Australia, respectively, though it remains unclear whether these environments are related or even selective for the ability of these strains to form MLPs. We note that the environment from which a strain is isolated is an incomplete way of assessing its ecology. Indeed, recent research suggests that the primary habitat of S. pombe is honeybee honey and suggests that bees, which may be attracted to a number of sugary substances, may be a vector by which fission yeast are transported (1). Therefore, isolation from a particular nectar or food production environment might not reflect significant ecological differences. We now refer to the location of strain isolation in the manuscript text (lines 208-209).

      However, there is more to learn from the genetic backgrounds of these two strains. We found that JB914 possesses the same variant in srb11 causally related to MLPs as JB759, the MLP-forming parental strain for our QTL analysis. To understand whether the appearance of this variant in these two strains derived from a single mutation event or was a case of convergent evolution, we analysed homology between the genomes of JB759 and JB914, focusing specifically on that variant. We found an approximately 20kb region of homology between JB759 and JB914 surrounding the srb11 truncation variant, in contrast to the majority of the genome, which does not share homology between those two strains (New Supplementary Figure 9A, B)). This result suggests that, while the two strains are largely unrelated, that specific region shares a recent common ancestor and is likely a result of interbreeding across strains.

      Importantly, this analysis further emphasizes the point that the srb11 variant segregates with the MLP-forming phenotype. We conclude this because none of the other strains similar to JB759 (either across the whole genome, or specifically in the region surrounding srb11) exhibit MLPs (New Supplementary Figure 9C). This thereby further complements our QTL analysis on the significance of this variant. We have added this analysis to the manuscript text (lines 337-349).

      Furthermore, we searched other strains which exhibited MLPs in our experiments (e.g. JB953) for frame shifts, insertions or deletions in any other genes in the CKM module or in the genes that were identified in our deletion library screen as adhesive, and did not identify any severe mutations falling into coding regions (other than the srb11 truncation in JB914 and JB759). This indicates that MLPs in these other strains may be caused by differences in regulatory regions surrounding these genes, or variants in other genes that were not identified in our screen. We have added this analysis to our manuscript (lines 424-425) and Supplementary Table 13.

      We agree that microscopy and culture tube images of JB914 and JB953 may give insight into the nature of the MLPs exhibited by those strains. We have included such images of cultures grown in YES, EMM and EMM-Phosphate media in our revision (Lines 207-208, Supplementary Figures 4 and 5). These images are consistent with our adhesion assay screen and show that JB914 and JB953 are adhesive at the microscopic level in the relevant conditions (EMM or EMM-Phosphate).

      The phenotypic outcome of overexpressing MXB2 is striking, as shown in Supplementary Figure 4C. Incorporating at least one of the culture tube images depicting large flocs into the main text, perhaps adjacent to Figure 3 panel D, would improve the visual appeal and highlight this key finding (at the moment those images are only shown in the supplementary materials).

      Reply: We thank the reviewer for this suggestion. In response to Reviewer 2's suggestion to overexpress mbx2 in YES, we created new mbx2 overexpression strains that could overexpress mbx2 in YES, which was not possible in our previous strain in which mbx2 overexpression was triggered by removal of thymine from the media. We have replaced our original data from Figure 3D with data from the new mbx2 overexpression experiment, including flask images.

      I know that the authors discuss the knowledge gap in the intro and results, but the abstract does not mention this critical gap. Please stress this critical gap (i.e., MLPs understudied in fission yeast) with a brief sentence in the abstract. Similarly, please consider writing a brief concluding sentence summarizing the paper's most significant finding referring to the knowledge gap would provide a clearer takeaway message for the reader - the abstract ends abruptly without any conclusion.

      Reply: We agree and have now emphasized the critical gap in our abstract:

      "As MLP formation remains understudied in fission yeast compared to budding yeast, we aimed to narrow this gap." at lines 18-19.

      Additionally, we added the following final sentence to give the reader a clearer takeaway message:

      "Our findings provide a comprehensive genetic survey of MLP formation in fission yeast, and a functional description of a causal mutation that drives MLP formation in nature." at lines 31-32.

      1. The observation that strains with adhesive phenotypes have a lower growth rate compared to non-adhesive strains is a noteworthy point (lines 532-535). This represents yet another example of this classical trade-off. This point could be emphasized in the Discussion or alongside the relevant result, with a brief speculative explanation for this phenomenon.

      Reply: We agree that the nature of the trade-off between MLP formation is an interesting discussion point that could arise from our work. Understanding this trade-off is made more complicated by the fact that growth is always condition-dependent, and measuring growth in strains exhibiting MLPs is non-trivial, as adhesion to labware and thick clumps of cells separated by regions of cell-free media can add variability. Nonetheless, there has been some previous work on this problem. In S. cerevisiae, it was shown that larger group size correlates with slower growth rate (3), and that flocculating cells grow more slowly (4). In S. cerevisiae, cAMP, a signalling molecule heavily involved in regulating growth in response to nutrient availability, also regulates filamentation (5). However, the relationship between flocculation and slow growth is not consistent in the literature. In some settings overexpressing the flocculins FLO8, FLO5, and FLO10 results in slower growth (6), while in others it does not (7). In addition, ethanol production has been shown to improve for biofilms (7).

      Furthermore, in S. cerevisiae, MLP-forming cells grow better in low sucrose concentrations (8) and under various stress conditions (4). Flocculating cells have also shown faster fermentation in media containing common industrial bioproduction inhibitors, despite slower fermentation than non-flocculating cells in non-inhibitory media (9). However, any consequence of this possible advantage on growth has not been characterised.

      In S. pombe, there is less work on this topic; however, it has been shown that deletions of rpl3201 and rpl3202, which code for ribosomal proteins, cause flocculation and slow growth (10). In that case, it is not clear if there is any causal relationship between slow growth and flocculation or if they are both parallel consequences of the ribosomal pathway disruption. We have added some of these points to the portion of the discussion that discusses this tradeoff (Lines 477-499).

      To get a better understanding of this tradeoff in our system, we took several approaches. First, we added a supporting analysis (New Supplementary Figure 12B), using published growth data based on measurements on agar plates for the S. pombe gene deletion library (11). There, the authors defined a set of deletion strains that grow more slowly on EMM than the wild-type lab strain. We found that our MLP hit strains were significantly enriched in this "EMM-slow" category. This information is now included in the manuscript (Lines 409-413, New Supplementary Figure 12B).

      It is, however, possible that for the assays from that work, the appearance of slow growth on solid agar in adhesive cells could be partially artifactual. Indeed, we have observed that adhesive cells tend to stick to flasks and, when grown on agar plates, cells in the same colony can stick to one another rather than to inoculation loops or pin pads. Both of these dynamics can reduce initial inoculation densities. This is less of a concern for our adhesion assay and Figures 2E, 5B, and 5F, because our before-wash intensity was done with a 7x7 pinned square about 10x10 mm2. Nonetheless, as we wanted to make a point about srb10 and srb11 mutants growing faster than other deletion mutants that exhibit MLP-formation, we also conducted growth assays in liquid media (New Figure 5F).

      We observed that srb10Δ and srb11Δ strains (which exhibit MLPs in EMM) show growth curves similar to wild-type cells in minimal (EMM) and rich media (YES). On the other hand, other strains that grow similarly to wild type cells in YES, such as tlg2Δ and rpa12Δ, grow much more slowly in EMM when they clump together. There are also some strains, mus7Δ and kgd2Δ, that grow more slowly in both YES and EMM but are only adhesive in EMM.

      The text mentions two lab strains, JB22 and JB50, displaying strong adhesion under phosphate starvation (lines 525-526), yet the data point for JB22 in Figure 2C is not labeled.

      Reply: We agree that highlighting JB22 on the figure is crucial, given that it was mentioned in the main text. JB22 is now highlighted in green on Fig 2C.

      1. Although I generally avoid commenting on formatting, I found the manuscript to be dense. As mentioned above, I truly enjoyed reading it! But I couldn't help but think of ways to make the manuscript more concise for readers. The Results section spans nine pages (excluding figure captions), and the Discussion is five pages long. The main text contains 6 figures with approximately 27 panels and 32 plots and Venn diagrams, while the supplementary material has 11 figures with 22 panels and about 59 plots. Altogether, the manuscript comprises 17 figures, 49 panels, and roughly 91 plots and Venn diagrams! While I will not request any changes, I encourage the authors to consider streamlining the text/data where possible to focus on the core theme of the study.

      We thank the reviewer for these suggestions and have reorganised some of our figures and text to appear less dense. We have also added several figures and panels in response to reviewer comments. While we endeavor to make our points clear and concise in the main figures, we believe that it is important to retain key supplementary figures so that an interested reader can evaluate the data in more detail:

      A summary of our major changes to the figures is below, and we also provide a manuscript with changes tracked for the reviewers' convenience:

      Fig 2:

      Added Panel E in response to reviewer comments. Fig 3:

      Removed axes for pfl3 and pfl7 from Fig 3C, as the point was made by the other genes displayed (mbx2, pfl8 and gsf2) Replaced Fig 3D with similar data from an improved experiment in response to reviewer comments. Added New Fig 3F from Original Supp Fig 5 Fig 5:

      Moved Original Fig 5A to New Supp Fig 10A. Added New Fig 5F in response to reviewer comments. Original Supp Fig 4 / New Supp Fig 6:

      Removed mbx2 overexpression images from Original Fig 4C, to be replaced by new overexpression data and images in New Fig 3D. Added flask images for srb10 and srb11 deletion mutants from Original Supp Fig 5A to New Supp Fig 6C. Added microscope image for srb11 deletion mutant from Ooriginal Supp Fig 5A to New Supp Fig 6C. Added adhesion assay results from Original Supp Fig 5C to New Supp Fig 6C. Added New Supp Fig 6D in response to review Original Supp Fig 5

      Removed this figure. Original Supp Fig 5A and 5B were moved to New Supp Fig 6. Original Supp Fig 5B was removed to make the manuscript more concise. Original Supp Figs 6, 7 and 8 were combined into New Supp Fig 8.

      Original Supp Fig 6A and 6B are now New Supp Fig 8A and 8B. Original Supp Fig 7 is now New Supp Fig 8C. Original Supp Fig 8A is now New Supp Fig 8D and 8E. Original Supp Fig 8B is now New Supp Fig 8F Original Supp Fig 9/New Supp Fig 10

      Added Original Fig 5A as new Supp Fig 10A. Original Supp Fig 11/New Supp Fig 12

      Removed Original Fig 11B and the relevant text to make the manuscript more concise. Added New Supp Fig 12B in response to reviewer comments. New Supplementary Figures added in response to reviewer comments:

      New Supp Fig 4: Microscopy images of natural isolates. New Supp Fig 5: Flask images of natural isolates New Supp Fig 7: Microscopy and flask images of mbx2 overexpression strains. New Supp Fig 9: Genomic comparisons between JB759 and the MLP-forming wild isolate, JB914. Removed some less relevant points from our discussion, to reduce the length.

      Added new Supplementary Tables:

      Supplementary Table 13: Variants in candidate genes. Added in response to reviewer comments Supplementary Table 14: List of plasmids used in the study.

      **Referees cross-commenting**

      There are many useful recommendations from all the other reviewers that will help improve the final product. Once those points are revised, I think this will be a nice paper of interest to folks interested in natural variation in MLPs and its genetic background.

      Significance

      My expertise: evolutionary genetics, evolution of multicellularity, yeast genetics, experimental evolution

      Overall, the manuscript is well-written, dense yet logically structured, and the figures are well presented. The combination of phenotypic, genetic, and bioinformatics analyses, particularly from wet lab experiments, is commendable. The study addresses a significant gap in our understanding, primarily explored in budding yeast, by providing comprehensive data on MLP diversity in fission yeast and the interplay of genetic and environmental factors.

      In summary, I enjoyed reading the manuscript and have only a few minor suggestions to strengthen the paper.

      Reviewer #2

      Evidence, reproducibility and clarity

      REVIEWER COMMENTS

      Yeast species, including fission yeast and budding yeast, could form multicellular-like phenotypes (MLP). In this work, Kӧvér and colleagues found most proteins involved in MLP formation are not functionally conserved between S. pombe and budding yeast by bioinformatic analysis. The authors analyzed 57 natural S. pombe isolates and found MLP formation to widely vary across different nutrient and drug conditions. The authors demonstrate that MLP formation correlated with expression levels of the transcription factor gene mbx2 and several flocculins. The authors also show that Cdk8 kinase module and srub11 deletions also resulted in MLP formation. The experimental design is logic, the manuscript is well-written and organized. I have a few concerns that should be addressed before the publication.

      Major points:

      1) Line 61-62, how did the authors grow yeast cells in the liquid medium? Shaking or static? If shaking, the nutrient should be even distributed in the medium.

      If static culture, most single yeast cells could precipitate on the bottom, how do you address the advantage of flocculation for increasing the sedimentation? In addition, under static culture, the bottom will have less air than the up medium, how to balance the air and nutrients?

      Reply: In line 61-62 we stated that "Similarly, flocculation could increase sedimentation in liquid media, thereby assisting the search for more nutrient-rich or less stressful environments (4)".

      Our intent was to speculate on the advantages of multicellular-like growth, and cited a review article which has mentioned sedimentation. After further consideration, we decided that this is a minor point and is rather speculative, and removed it altogether from the manuscript.

      In response to the Reviewer's question about how cells were grown in liquid medium, throughout the paper we used shaking cultures for our flocculation assays and for pre-cultures. We have made this more clear in the text where it was ambiguous (e.g. line 189, throughout the methods section, and in the legend of Fig. 2A).

      2) Line 555, it will be interesting to test whether overexpression of mbx2 could cause flocculation in YES medium. In Figure 3D, the authors use two control strains, but only one mbx2 OE strain, mbx2 OE should be tested in both strains. In addition, did the authors transform empty plasmid into the control strains, please indicate in the figure.

      In this experiment, mbx2 was overexpressed using a thiamine-repressible nmt1 promoter, which is a standard construct in fission yeast studies. Assaying MLP formation was not feasible in YES with this strain, because YES is a rich media made up of yeast extract which contains thiamine. Thus, we could not remove thiamine from the media to trigger mbx2 overexpression.

      In order to test the influence of mbx2 overexpression in YES, we constructed strains in which mbx2 was integrated into the genome and expression was driven by the rpl2102 promoter, which has been shown to provide constitutive moderate expression levels (12). We observed strong flocculation in both EMM and YES (Fig 3D, New Supplementary Figure 7) . We did not see strong flocculation in a control in which GFP was expressed under the rpl2102 promoter. The flocculation phenotype was so strong that our original adhesion assay protocol required modification for this experiment, including resuspension in 10 mM EDTA before repinning (Methods). We observed strong adhesion for the mbx2 overexpression strains (Fig 3D), but not for control strains in YES. We could not check adhesion in EMM for those strains because cells pinned on EMM did not survive resuspension in EDTA.

      We performed these experiments in two backgrounds, 968 h90 (JB50), which is one of the parental strains of the segregant library analysed in Figure 3 and 972 h- (JB22), which is an appropriate background for the gene deletion collection.

      We have replaced the data from the original Figure 3D with the new adhesion assay and added New Supplementary Figure 7 to the manuscript (Lines 236-244).

      This result also helped us to further refine our model for the pathway. We can now say that the repression of MLPs in rich media must act via Mbx2, as overexpression of mbx2 is sufficient to abolish it, and is likely to act transcriptionally (if it acted on the protein level, the mild overexpression would likely not have led to the phenotype) (Figure 6, Lines 554-556 in the discussion)

      3) Line 600-601, the authors may do the backcross of srb11Δ::Kan to exclude the possibility caused by other mutations.

      Reply: We thank the reviewer for noticing our concern about suppressor mutations arising in the srb11Δ strain obtained from our deletion library. This initial concern arose following the observation that while qualitatively the srb11Δ::Kan and srb11Δ(CRISPR) strains were both strongly adhesive, there was a minor quantitative difference in their adhesion.

      As we obtained this strain from an h+ deletion library strain backcrossed with a prototrophic h- strain (JB22) in order to restore auxotrophies (13), the chances for a suppressor mutation to arise are very low. We have therefore removed that language from our text. We now suspect that a more likely explanation for this small difference could be the strain background, as our CRISPR engineered strain was made in a JB50 background which has the h90 mating type, while the deletion library strains are h- without auxotrophic markers.

      We would like to emphasize, however, that despite this quantitative difference in the adhesion phenotype between the two srb11Δ strains, they both have a large increase in the adhesion phenotype relative to the respective wild-type strains. To address this point, we have removed the unnecessary statistical comparison of these two deletion strains and focused on their qualitatively high levels of adhesion in the text (lines 267-269) and in our Revised Supplementary Figure 6D.

      Minor points:

      1) Line 506, what are the growth conditions of cells in Figure 2A? Did the authors use the liquid or solid medium? Please mention in the Methods or figure legends.

      Reply: We have updated the manuscript to include the relevant details in the text (line 189), figure caption for Fig. 2A and in the methods section (lines 829-831).

      2) Line 533-535, please explain why the strains exhibiting strong adhesion have a decreased growth rate. Is there any related research? Please add some references.

      Reply: Please see reply to Reviewer 1, comment 5.

      **Referees cross-commenting**

      I agree with most of the comments from other reviewers. This publication may indeed be of interest to a minor area. But the results and the interpretations of the data are interesting and warranted, the findings are scientifically important.

      Significance

      The authors did many large-scale screens and bioinformatic analyses. The experiments in the manuscript are generally logical and sound. This study is useful for deciphering the mechanism of multicellular-like phenotype formation in the fission yeast, with some implications for some other organisms.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: Using a variety of targeted and genome wide analyses, the authors investigate the basis for "multicellular-like phenotypes" in S. pombe. Authors developed several methodologies to detect and quantify "multicellular-like phenotypes" (flocculation, aggregation...) and defined genes involved in these processes in laboratory and wild S. pombe.

      SECTION A - Evidence, reproducibility and clarity

      This is a very solid manuscript that is well-written and supported by convincing data. While one can imagine many additional experiments, the manuscript stands on its own and presents a quite exhaustive analysis of the area. I commend the author for their rigorous work and clear presentation. They are only a few minor points that warrant comments or corrections: - Supplementary Figure 1 is a typical example of the "necessity" to have statistics and P-values everywhere. The data are convincing but what is the evidence that the Filtering assay and the Plate-reader assay values should be linearly related? Lets imagine that Plate-reader assay value is proportional to the square of the Filtering assay value. What would be the Pearson R and P-value in this case? What is most appropriate? Why would one use a linear correlation? What is the "real" significance?

      Reply: We thank the reviewer for pointing out that the data in Supplementary Figure 1 does not appear to be linear and, therefore, reporting the Pearson correlation coefficient may not be the best way to represent the relationship between the two assays. The nonlinear nature of this data could indicate that

      The filtering assay saturates before the plate reader assay, and is less able to distinguish between strains that flocculate strongly and The filtering assay may be more sensitive for strains that show lower levels of flocculation. In general, we observed fewer strains with intermediate phenotypes for both assays, making it difficult to ascertain the true relationship between them; however, we believe that the key result is that the strains with the highest level of flocculation have the highest values in both assays. To capture this aspect of the data, we now report the Spearman correlation which is non-parametric and indicates how similar the ranking of each strain is based on both assays. With the alternative hypothesis being that the correlation is > 0, we report a Spearman correlation coefficient of 0.24 and a P-value of 0.04 (lines 823-826)

      • Minor points: * They are several "personal communications" in the manuscript (page 11, page 18, page 23). It should be checked whether this is accepted in the journal that publishes this manuscript.

      Reply: We thank the reviewer for highlighting this issue. We had three instances of "personal communications" in our original submission.

      The first instance was an acknowledgement for advice on our DNA extraction protocol from Dan Jeffares. We now include this in the Acknowledgements section instead.

      The second communication with Angad Garg described that they observed flocculation while growing cells in phosphate starvation conditions, which was not reported in their publication (14). Though we appreciate their willingness to share unpublished data with us, we have removed this observation from our manuscript and instead rely only on our own observations and arguments based on their published RNA-seq data to make our point.

      The third personal communication with Olivia Hillson supplements a minor hypothesis, namely that deletion of SPNCRNA.781 might cause MLP formation by affecting the promoter of hsr1, for which we had access to unpublished ChIP-seq data, showing its binding to flocculins. Recently published work from a different group (15) also suggests this link between hsr1 and flocculation and is now discussed in our manuscript instead of the result based on unpublished data obtained from personal communication at Lines 397-398.

      * Page 4 check "a few regulators"

      Reply: For clarity, this has now been changed to "several regulatory proteins" at Line 108. The specific proteins we are referring to are highlighted in Figure 1C.

      * Page 19, line 567: "remaining 8 strains" may be confusing as Material and Methods states "remaining 10 strains".

      Reply: Two of the 10 strains were found to be redundant after sequencing as explained in the Methods (Lines 930-934). Therefore, we only added 8 new strains to the analysis. We thank the reviewer for highlighting this as a potential source of misunderstanding, and clarified this point in the text (Lines 247-250 and in the methods).

      **Referees cross-commenting**

      I concur with most comments. Overall, the reviewers agree that this is a solid piece of work that could benefit from minor modifications and should be published. I reiterate that, for me, despite its quality, this publication will only be of interest to specialists.

      Reviewer #3 (Significance (Required)):

      A limited number of studies have investigated "multicellular-like phenotypes" in S. pombe. This manuscript brings therefore new and solid information. Yet, despite an impressive amount of work, our conceptual advance in understanding this process and its phylogenetic conservation remains limited. This is probably best illustrated in the figure 6 that summarize the study and contains 3 question marks and an additional unknown mechanism. (Most of the solid arrows in this figure correspond to interactions within the Mediator complex that were well known before this study.) In addition, while only few studies have been published in this area, the authors' findings are often only bringing additional support to already published observations. Overall, while this manuscript will be of interest to a restricted group of aficionados, it will most likely not attract the attention of a wide readership.

      __ Reviewer #4 (Evidence, reproducibility and clarity (Required)):__

      In this manuscript, the authors explore how multicellular-like phenotypes (MLPs) arise in the fission yeast S. pombe. Although yeasts are characterized as unicellular fungi, diverse species show MLPs, including filamentous growth on agar plates and flocculation in liquid media. MLPs may provide certain advantages in nutritionally poor conditions and protection against external challenges, upon which natural selection can then act. Previous work on MLPs has mostly been carried out in the budding yeasts S. cerevisiae and C. albicans, and little was known about these behaviors in S. pombe. The authors thus set out to investigate both genetic and environmental regulators of MLP formation.

      First, their analysis of published data revealed a limited number of shared regulators of MLP between S. pombe, S. cerevisiae, and C. albicans, although the cell adhesion proteins themselves are largely not conserved. Next, the authors screened a set of non-clonal natural isolates using two high-throughput assays that they developed and found that MLPs vary in strains and depending on nutrient conditions. Focusing on a natural isolate that showed both adhesion on agar plates and flocculation in liquid medium, they then analyzed a segregant library generated from this and a laboratory strain using their assays. Using QTL analysis, they uncovered a frameshift in the srb11 gene, which encodes a subunit of the Mediator complex, as the likely causal inducer of MLP. This was confirmed by additional analyses of strains lacking srb11 or other members of Mediator. Furthermore, the authors showed that loss of srb11 function resulted in the upregulation of the Mbx2 transcription factor, which was both necessary and sufficient for MLP formation in this background. Finally, screening of two additional yeast strain collections (gene and long intergenic non-coding RNA deletion) identified both known and novel regulators representing different pathways that may be involved in MLP formation.

      Altogether, this study provides new perspectives into our understanding of the diverse inputs that regulate multicellular-like phenotypes in yeast.

      Major comments:

      • The methods for screening for adhesion and flocculation are well described, with representative figures that show plates and flasks. However, there are few microscopy images of cells, and it would be interesting and helpful for the reader to have an idea of how cells look when they exhibit MLPs. For instance, are there any differences in cell shape or size when strains present different degrees of adhesion or flocculation? In addition, the authors mention that mutants with strong adhesion generally had lower colony density and are likely to be slower growing. Although their analyses suggest otherwise (page 22), this has a potential for introducing error in their observations, and including images of the adhesion/flocculation phenotypes may provide further support for their conclusions. I suggest that the authors present microscopy images 1) similar to what is shown for JB759 in Figure 2A and 2) of cells growing on agar in the adhesion assay. This could be included for the different Mediator subunit deletions that they tested, where there appear to be varying phenotypes. It could also be informative for a subset of the 31 high-confidence candidates that they identified in their screen.

      Reply: We thank the reviewer for highlighting the need for further microscopic characterisation of MLP forming strains. We therefore now include images of JB914, JB953 (New Supplementary Figures 4, Figure 2E) in liquid media in EMM, EMM-Phosphate, and YES; an srb11 deletion strain (Figure 3F), and mbx2 overexpression strains (New Supplementary Figure 7).

      • Upon identifying a frameshift in srb11 that is responsible for the MLP, the authors assessed whether deletion of other Mediator subunits would result in the same phenotype. They found that srb10 and srb11 deletions both flocculate and show adhesion, while other mutants had milder phenotypes. However, the authors also found that a new deletion of srb11 that they generated had a stronger adhesion phenotype than the srb11 deletion from the prototrophic deletion library, which was attributed this the accumulation of suppressor mutations in the strains of the deletion collection. As the authors make clear distinctions between the phenotypes of different Mediator mutants, I suggest generating and analyzing "clean" deletions of the 6 other subunits that they tested. This would strengthen their conclusion and help to rule out accumulated suppressors as the cause of the differences in the observed phenotypes.

      Reply: We thank the reviewer for noticing our concern about suppressor mutations in the manuscript. As we describe above in response to a similar question from reviewer 2, as the prototrophic deletion library from which we extracted the Mediator deletion strains had been backcrossed during its construction (13), we no longer suspect that small difference between the srb11Δ::Kan strain from the deletion library and the newly created srb11Δ (CRISPR) strains is due to suppressor mutations. Rather, we think they may be a result of the difference in genetic background and possibly mating type between the two strains. We also want to emphasize that this difference is small compared to the difference between the adhesion ratios of the srb11Δ strains and their respective control strains.

      Nevertheless, we made clean, independent Mediator mutants for 5 out of 6 Mediator genes tested (med10Δ, med13Δ, med19Δ, med27Δ, and srb10Δ) as well as an additional mutant that we didn't have in our library, med12Δ (Figure R9). When running the assay on these new strains we got an overall lower dynamic range, possibly due to variations in the water flow rate relative to the first assay. However, we saw a strong phenotype for both library and our own srb10Δ and CRISPR srb11Δ strains. We did not see a significant increase in adhesion for the other Mediator deletion mutants in EMM relative to wild type with the exception of for med10Δ in both the library strain and for our clean mutant, for which we did not observe a phenotype in our previous experiment. We included the experiment for the newly created mutants as New Supplementary Figure S6E and described them in lines 276-281 in our revised manuscript.

      Minor comments:

      • One point that recurs in the manuscript is the idea that mutations that give rise to strong MLPs also generally lead to slower growth, representing a potential trade-off. This idea could be reinforced with measurements of growth rate or generation time by optical density or cell number, for instance, rather than comparisons of colony density. Also, it would be interesting to mention if the slow growth phenotype is only observed in MLP-inducing conditions or also in rich medium.

      Reply: As described above in response to item 5 from Reviewer 1, we have conducted growth assays in liquid media for srb10Δ, srb11Δ, and other mutants from our adhesion screen (tlg2Δ, rpa12Δ, mus7Δ and kgd2Δ) that showed a similar phenotype to those genes in both minimal (EMM) and rich (YES) media. We observe that in rich media, srb10Δ and srb11Δ cells grow similarly to control strains, and they exhibit a lower decrease in growth rate than the other similarly adhesive strains. Both mus7Δ and kgd2Δ cells grow more slowly, even in rich media.

      We have also added data on the tradeoff between growth and adhesion based on growth on solid media from (11) for all mutants identified in our screen (New Supp Fig 12B)).

      Thus, the relationship between slow growth and clumpiness depends on the mutation, and specifically, mutations of the Mediator, including those to srb11 and srb10, seem to decrease the impact of any tradeoff between growth and adhesion.

      • The authors show that the MLPs of the srb10 and srb11 deletions occur through mbx2 upregulation. Do the varying strengths of the phenotypes of the strains lacking different Mediator subunits correlate with mbx2 levels in these backgrounds?

      Reply: There is some evidence from previous work that the relationship between the strength of the MLPs and the expression of mbx2 may not be perfectly proportional. In (16), med12Δ had a higher (though qualitatively comparable) level of mbx2 upregulation than srb10Δ (New Supp Fig 8E), even though that paper reported a milder phenotype for med12Δ than for srb10Δ cells. We did not observe a significant increase in adhesion in our med12Δ strain (New Supp Fig 6D). This suggests that in the case of these mutants, it is not simply the level of mbx2 that controls MLP formation, but that there are likely additional regulatory mechanisms. We have added some discussion on this context in the manuscript (lines 545-547).

      **Referees cross-commenting**

      I agree overall with the comments and suggestions from the other reviewers. The revision would require only minor modifications. The paper is interesting both for the combination of methodologies used and its findings, and I believe that it would benefit a growing community of researchers.

      Reviewer #4 (Significance (Required)):

      This study employed a variety of methods that allowed the authors to uncover previously unknown regulators of MLPs. Taking advantage of the diversity of natural fission yeast isolates as well as the constructed gene and non-coding RNA deletion collections, the authors identified novel genetic determinants that give rise to MLPs, opening new avenues into this exciting area of research. The overall conclusions of the work are solid and supported by the reported results and analyses. This study will be appreciated by a broad audience of readers who are interested in understanding how organisms respond to environmental challenges as well as how MLPs may result in emergent properties that play key roles in these responses. Some of the limitations of the work are described above, with recommendations for addressing these points.

      Keywords for my field of expertise: fission yeast, cell cycle, transcription, replication.

      References for Response to Reviews

      1. Brysch-Herzberg M, Jia GS, Seidel M, Assali I, Du LL. Insights into the ecology of Schizosaccharomyces species in natural and artificial habitats. Antonie Van Leeuwenhoek. 2022 May 1;115(5):661-95.
      2. Jeffares DC, Rallis C, Rieux A, Speed D, Převorovský M, Mourier T, et al. The genomic and phenotypic diversity of Schizosaccharomyces pombe. Nat Genet. 2015 Mar;47(3):235-41.
      3. Ratcliff WC, Denison RF, Borrello M, Travisano M. Experimental evolution of multicellularity. Proc Natl Acad Sci. 2012 Jan 31;109(5):1595-600.
      4. Smukalla S, Caldara M, Pochet N, Beauvais A, Guadagnini S, Yan C, et al. FLO1 is a variable green beard gene that drives biofilm-like cooperation in budding yeast. Cell. 2008 Nov 14;135(4):726-37.
      5. Lorenz MC, Heitman J. Yeast pseudohyphal growth is regulated by GPA2, a G protein alpha homolog. EMBO J. 1997 Dec 1;16(23):7008-18.
      6. Ignacia DGL, Bennis NX, Wheeler C, Tu LCL, Keijzer J, Cardoso CC, et al. Functional analysis of Saccharomyces cerevisiae FLO genes through optogenetic control. FEMS Yeast Res. 2025 Sept 24;25:foaf057.
      7. Wang Z, Xu W, Gao Y, Zha M, Zhang D, Peng X, et al. Engineering Saccharomyces cerevisiae for improved biofilm formation and ethanol production in continuous fermentation. Biotechnol Biofuels Bioprod. 2023 July 31;16(1):119.
      8. Koschwanez JH, Foster KR, Murray AW. Improved use of a public good selects for the evolution of undifferentiated multicellularity. eLife. 2013 Apr 2;2:e00367.
      9. Westman JO, Mapelli V, Taherzadeh MJ, Franzén CJ. Flocculation Causes Inhibitor Tolerance in Saccharomyces cerevisiae for Second-Generation Bioethanol Production. Appl Environ Microbiol. 2014 Nov;80(22):6908-18.
      10. Li R, Li X, Sun L, Chen F, Liu Z, Gu Y, et al. Reduction of Ribosome Level Triggers Flocculation of Fission Yeast Cells. Eukaryot Cell. 2013 Mar;12(3):450-9.
      11. Rodríguez-López M, Bordin N, Lees J, Scholes H, Hassan S, Saintain Q, et al. Broad functional profiling of fission yeast proteins using phenomics and machine learning. Marston AL, James DE, editors. eLife. 2023 Oct 3;12:RP88229.
      12. Hebra T, Smrčková H, Elkatmis B, Převorovský M, Pluskal T. POMBOX: A Fission Yeast Cloning Toolkit for Molecular and Synthetic Biology. ACS Synth Biol. 2024 Feb 16;13(2):558-67.
      13. Malecki M, Bähler J. Identifying genes required for respiratory growth of fission yeast. Wellcome Open Res. 2016 Nov 15;1:12.
      14. Garg A, Sanchez AM, Miele M, Schwer B, Shuman S. Cellular responses to long-term phosphate starvation of fission yeast: Maf1 determines fate choice between quiescence and death associated with aberrant tRNA biogenesis. Nucleic Acids Res. 2023 Feb 16;51(7):3094-115.
      15. Ohsawa S, Schwaiger M, Iesmantavicius V, Hashimoto R, Moriyama H, Matoba H, et al. Nitrogen signaling factor triggers a respiration-like gene expression program in fission yeast. EMBO J. 2024 Oct 15;43(20):4604-24.
      16. Linder T, Rasmussen NN, Samuelsen CO, Chatzidaki E, Baraznenok V, Beve J, et al. Two conserved modules of Schizosaccharomyces pombe Mediator regulate distinct cellular pathways. Nucleic Acids Res. 2008 May;36(8):2489-504.
    1. Synthèse du Webinaire : Aménagements d'Examens pour les Élèves à Besoins Éducatifs Particuliers

      Résumé Exécutif

      Ce document de synthèse résume les points clés du webinaire organisé par la FCPE nationale le 20 novembre 2025, animé par Guillaume Laffitte, conseiller technique académique pour l'École inclusive, et Laurence Noël, chef de la division des examens et concours (DEC) de l'académie de Montpellier.

      L'objectif central était de clarifier les droits, les procédures et les délais concernant les aménagements d'examens.

      Les aménagements ne sont pas des faveurs, mais un droit fondamental pour garantir l'égalité des chances et permettre une évaluation juste et adaptée aux besoins de chaque élève.

      Le concept central est la cohérence du "parcours de l'élève" : les aménagements aux examens doivent être l'aboutissement logique des aides pédagogiques mises en place durant toute la scolarité.

      Deux acteurs principaux collaborent : le Pôle École Inclusive, qui se concentre sur l'accompagnement pédagogique en amont, et la Division des Examens et Concours (DEC), qui gère le cadre réglementaire et logistique des épreuves.

      La procédure de demande se divise en deux voies : une procédure simplifiée pour les élèves bénéficiant déjà d'un PAP, PAI ou PPS, et une procédure complète pour les autres cas ou les demandes nouvelles.

      L'anticipation est cruciale : les démarches doivent être entamées dès la classe de quatrième pour le brevet et en seconde pour le baccalauréat.

      Enfin, des outils pédagogiques innovants comme les "matrices pédagogiques" sont encouragés pour renforcer l'autonomie des élèves, illustrant une évolution vers une "école pour tous" où les adaptations bénéfiques pour certains le sont pour l'ensemble des élèves.

      --------------------------------------------------------------------------------

      1. Principes Fondamentaux et Philosophie

      Le webinaire établit d'emblée que les démarches d'aménagement d'examen sont essentielles pour garantir l'égalité des chances. Elles constituent un parcours souvent lourd et mal compris pour les familles.

      Un Droit, Pas une Faveur : Il est rappelé que les aménagements sont un "droit indispensable pour que chaque élève soit évalué dans des conditions le plus juste et adaptée à leurs besoins".

      De l'École Inclusive à l'École pour Tous : Guillaume Laffitte propose de dépasser le terme "école inclusive" pour viser une "école pour tous", qui répond aux besoins de chacun sans étiqueter les élèves. La diversité est présentée comme normale et bénéfique.

      Le Parcours de l'Élève : L'idée centrale est que l'examen n'est pas une simple étape, mais l'aboutissement de toute la scolarité.

      Il doit exister une cohérence systématique entre les aménagements pédagogiques fournis en classe tout au long du parcours et ceux accordés lors des épreuves. Cette continuité renforce l'autonomie de l'élève.

      "Il faut vraiment qu'on puisse corréler systématiquement [...] le parcours de l'élève jusqu'aux épreuves pour le candidat, parce qu'il faut une cohérence et c'est comme ça qu'on peut renforcer finalement les élèves face à leur autonomie en situation d'apprentissage." - Guillaume Laffitte

      2. Les Acteurs Clés et Leurs Rôles

      La gestion des aménagements repose sur la collaboration de deux services principaux au sein du rectorat, ici illustrés par l'Académie de Montpellier.

      Le Pôle Académique École Inclusive

      Dirigé par Guillaume Laffitte, ce pôle se concentre sur l'accompagnement pédagogique de l'élève tout au long de sa scolarité.

      Coordination : Il pilote l'organisation de l'école inclusive au niveau académique, en s'appuyant sur les orientations nationales.

      Collaboration : Il travaille en lien étroit avec tous les services de l'académie, notamment la Division des Examens et Concours (DEC).

      Création de Ressources : Il produit des guides pour les familles et les équipes, comme le "guide académique pour les aménagements des examens, mais du parcours de l'élève jusqu'aux aménagements des examens".

      Priorités Académiques : L'une des priorités est l'utilisation des matrices pédagogiques comme réponse pédagogique cohérente.

      La Division des Examens et Concours (DEC)

      Dirigée par Laurence Noël, la DEC est le service administratif et logistique qui organise l'ensemble des épreuves et gère l'application réglementaire des aménagements.

      Chaque rectorat possède une DEC (à Paris, il s'agit du SIEC).

      Missions principales :

      Organisation Globale : Organisation de tous les examens (DNB, CAP, Baccalauréats, BTS, etc.) et des concours de recrutement de l'Éducation Nationale.

      Volet Sujets : Élaboration et adaptation des sujets d'examen (ex: dictée aménagée, sujets agrandis, sujets en braille).

      Volet Organisationnel : Gestion des inscriptions, élaboration des calendriers (en tenant compte des tiers temps qui allongent la durée des épreuves), répartition des candidats dans les centres, et communication des aménagements aux chefs de centre.

      Volet Logistique : Fourniture de matériel spécifique comme les copies spéciales (mais pas les ordinateurs ou le mobilier ergonomique).

      Volet Administratif :

      Notification : C'est la DEC qui envoie la décision officielle d'aménagement (la "notification") aux familles via l'application Cyclades.  

      Recours : Elle traite les recours des familles en cas de désaccord avec une décision.   

      Fraudes : Elle gère les commissions de discipline, y compris celles liées à un mauvais usage des aménagements (ex: aide humaine qui donne les réponses, ordinateur non vidé de son contenu).

      3. Le Cadre des Aménagements d'Examens

      Types d'Aménagements Possibles

      Les aménagements peuvent porter sur divers aspects de l'épreuve pour répondre aux besoins spécifiques du candidat.

      Catégorie

      Exemples d'aménagements

      Temps

      - Temps majoré (ex: tiers temps) pour les épreuves écrites, orales ou pratiques.<br>- Temps compensatoire pour permettre des soins ou des pauses.<br>- Temps pour se lever et faire quelques pas.

      Espace

      - Composition en rez-de-chaussée.<br>- Placement spécifique dans la salle (près d'une fenêtre).<br>- Composition dans une salle isolée.

      Aides Techniques

      - Utilisation d'un ordinateur (personnel ou fourni par le centre).<br>- Matériel spécifique (tables ou chaises ergonomiques, non fournies par la DEC).<br>- Sujets adaptés : en braille, agrandis, sur support numérique.

      Aides Humaines

      - Secrétaire : Tâches d'exécution pure (lecteur, scripteur sous la dictée).<br>- Assistant : Marge d'autonomie (reformulation ou séquençage des consignes, recentrage de l'attention).<br>- AESH : Missions précises définies dans le cadre d'un PPS.

      Adaptations & Dispenses

      - Adaptation de l'épreuve : Dictée aménagée pour le DNB.<br>\

      • Dispense d'épreuve : Très réglementée et spécifique à chaque examen (ex: dispense de langue vivante, non applicable à tous les diplômes).<br>\

      • Étalement : Possibilité de passer les épreuves sur plusieurs sessions consécutives.<br>\

      • Conservation des notes : Les notes obtenues peuvent être conservées durant cinq ans.

      Correction

      - Anonymat respecté : Le correcteur n'a pas connaissance du handicap.<br>\

      • Non-pénalisation de l'orthographe : Si validé, un sigle sur la copie anonyme l'indique au correcteur.

      Les "Matrices Pédagogiques" : Un Outil d'Avenir

      Fortement mises en avant par Guillaume Laffitte, les matrices sont des outils méthodologiques qui aident l'élève à séquencer une tâche et à organiser sa pensée.

      Principe : Elles ne sont pas une antisèche, mais une fiche qui guide l'élève dans les étapes d'une tâche (ex: comment utiliser son brouillon, construire un fil conducteur, organiser son temps).

      Cohérence : Elles permettent à l'élève d'utiliser le jour de l'examen un outil qu'il maîtrise déjà pour l'avoir utilisé en classe.

      Autonomie : Elles visent à rendre l'élève plus autonome et à renforcer son estime de soi.

      Statut : L'utilisation de matrices est un aménagement réglementaire autorisé pour les examens.

      _"Ce qui réussit à l'élève qui a le plus de besoins, il n'y a pas de raison que ce ne soit pas utile à tous.

      C'est ce qu'on appelle la conception universelle des apprentissages."_ - Guillaume Laffitte

      Distinction Cruciale : Dispense d'Enseignement et Aménagement d'Examen

      Il est essentiel de ne pas confondre ces deux notions :

      Dispense d'enseignement : Décision très rare, prise uniquement par le recteur à la demande des parents, pour un élève en situation de handicap.

      Elle a un impact majeur sur le parcours et l'orientation future de l'élève et doit être évaluée en cohérence avec les examens à venir.

      Dispense d'épreuve d'examen : Fait partie des aménagements possibles mais est strictement encadrée par la réglementation de chaque diplôme.

      La DEC ne peut valider une dispense que si le règlement de l'examen le permet.

      4. Procédures de Demande d'Aménagement

      La procédure a été simplifiée en 2020 pour garantir la continuité entre le parcours scolaire et les examens. Elle s'articule en deux voies principales.

      Pour Qui ?

      Tout candidat présentant un handicap (reconnu par la MDPH), un trouble de santé invalidant (dans le cadre d'un PAP ou PAI) ou une limitation temporaire d'activité (ex: bras cassé avant l'épreuve) peut demander un aménagement, quel que soit son statut (scolarisé, candidat individuel, etc.).

      Procédure Simplifiée

      Conditions : Réservée aux élèves scolarisés en établissement public ou privé sous contrat, disposant d'un PAP, PAI ou PPS valide, et dont les aménagements demandés pour l'examen sont identiques à ceux déjà mis en place durant leur scolarité.

      Processus : La demande ne nécessite pas l'avis d'un médecin. Le chef d'établissement signe le formulaire, qui est ensuite transmis à la DEC.

      Procédure Complète

      Conditions : S'applique à tous les autres candidats (individuels, hors contrat), à ceux qui n'ont pas de plan formalisé (PAP, PAI, PPS), ou à ceux qui demandent des aménagements différents ou nouveaux par rapport à leur scolarité.

      Elle est également requise en cas d'aggravation de l'état de santé ou pour une majoration de temps au-delà du tiers temps (mi-temps).

      Processus : Le dossier est examiné par l'équipe pédagogique et doit obligatoirement recevoir l'avis d'un médecin de l'Éducation Nationale avant d'être transmis à la DEC.

      Calendrier et Délais Clés

      L'anticipation est le maître-mot. L'interlocuteur principal pour les familles est le chef d'établissement.

      Examen

      Moment pour Entamer la Procédure

      DNB / CFG

      En classe de quatrième

      Baccalauréats (général, techno, pro)

      Fin du second trimestre de la classe de seconde

      Autres examens (CAP, BTS, etc.)

      Au cours de l'année de l'examen

      La demande formelle et la transmission des pièces se font généralement au moment de l'inscription à l'examen. Le respect des délais est impératif pour permettre à la DEC d'organiser la logistique (ex: la production d'un sujet en braille demande un mois).

      Le Processus de Traitement et de Notification

      1. Instruction : Les services de la DEC étudient le dossier et vérifient sa conformité réglementaire.

      2. Décision : Le recteur prend la décision finale.

      3. Notification : La DEC informe officiellement la famille de la décision via l'application Cyclades. Les notifications sont envoyées entre février et mai.

      4. Conservation : La notification est à conserver précieusement, à présenter à chaque épreuve avec la convocation, et peut servir de pièce justificative pour de futures demandes.

      5. Données Chiffrées et Tendances (Académie de Montpellier)

      Les statistiques de l'Académie de Montpellier illustrent une forte augmentation des demandes d'aménagement.

      Indicateur

      Données 2020

      Données 2025 (prévisionnel)

      % de candidats avec aménagement

      10 %

      13,51 %

      Nombre total de dossiers

      ~10 000

      14 000

      Nombre total de mesures d'aménagement

      15 000

      76 000

      Moyenne de mesures par candidat

      ~1,5

      ~5,5

      Taux de notifications positives

      N/A

      99,67 %

      Mesures les plus courantes :

      • Tiers temps

      • Dictée aménagée (DNB)

      • Autorisation de la calculatrice

      • Aide humaine pour le séquençage ou la reformulation des consignes

      6. Points de Vigilance et Conseils Pratiques

      Confiance et Autonomie : Les deux intervenants insistent sur la nécessité de faire confiance aux capacités des enfants, de viser leur autonomie et de s'assurer que les aménagements demandés correspondent réellement à leurs besoins et à leurs habitudes de travail.

      Utilisation de l'ordinateur : Si un ordinateur personnel est autorisé, il doit être entièrement vide de tout dossier et présenté au chef de centre pour vérification avant chaque épreuve.

      Il faut bien distinguer la demande de "sujet sur support numérique" de la "composition sur ordinateur".

      Enregistrement régulier : En cas de composition sur ordinateur, il est vital d'enregistrer le travail très régulièrement sur le disque dur ET sur une clé USB pour éviter toute perte en cas de problème technique.

      Contacter le Centre d'Examen : Pour des aménagements lourds ou spécifiques (notamment liés à l'espace, comme un fauteuil roulant), il est conseillé de prendre contact en amont avec le chef du centre d'examen.

      Recours : Si un aménagement accordé n'est pas respecté le jour de l'épreuve, la famille doit adresser un recours écrit au recteur.

      La DEC mènera alors une enquête.

    1. Briefing sur la Plateforme Avenir : Un Outil d'Accompagnement à l'Orientation de la 5ème à la Terminale

      Résumé Exécutif

      La plateforme Avenir, développée par l'Office national d'information sur les enseignements et les professions (Onisep), constitue le pivot numérique du « parcours Avenir ».

      Conçue pour accompagner les élèves de la classe de 5ème jusqu'à la Terminale, elle vise à rendre la démarche d'orientation plus lisible, interactive et personnalisée.

      L'objectif central est de permettre à chaque élève de construire progressivement son projet scolaire et professionnel en fonction de ses compétences et centres d'intérêt, tout en atténuant la pression liée aux choix d'orientation. Intégrée aux établissements scolaires, la plateforme favorise la coéducation en impliquant les équipes éducatives et les familles dans un cadre sécurisé et structuré sur le long terme.

      --------------------------------------------------------------------------------

      1. L'Onisep : Missions et Valeurs de l'Opérateur d'État

      L'Onisep est un opérateur public placé sous la double tutelle du ministère de l'Éducation nationale et du ministère de l'Enseignement supérieur et de la Recherche. Sa mission repose sur deux piliers : informer et accompagner.

      1.1 Un maillage territorial dense

      L'Onisep s'appuie sur des services centraux en région parisienne et sur 17 directions territoriales (incluant la Corse et l'Outre-mer). Ce réseau permet :

      • L'alimentation continue de bases de données documentaires sur les formations, les métiers et les établissements.

      • Un déploiement de la plateforme au plus près des usagers via des présentations en établissement et lors de salons.

      1.2 Principes fondamentaux

      L'action de l'Onisep et de la plateforme Avenir est guidée par des valeurs d'inclusion et d'équité :

      Égalité d'accès : Un socle commun d'information pour tous.

      Lutte contre les stéréotypes : Déconstruction des préjugés sur les métiers et promotion de l'égalité garçons-filles.

      Inclusion : Ressources spécifiques pour les élèves en situation de handicap.

      Développement durable : Sensibilisation aux enjeux écologiques dans les parcours professionnels.

      --------------------------------------------------------------------------------

      2. Structure et Fonctionnalités de la Plateforme Avenir

      La plateforme a été conçue de manière intuitive avec le concours d'élèves, d'enseignants, de psychologues de l'Éducation nationale et de parents. Elle s'articule autour de quatre onglets principaux.

      2.1 L'Agenda de l'orientation

      Il ne s'agit pas d'un cahier de textes scolaire, mais d'un calendrier dédié exclusivement à l'orientation. L'élève y trouve :

      Activités en médiation : Séances programmées par les enseignants.

      Événements suggérés : Forums des métiers, journées portes ouvertes ou salons régionaux.

      Dates institutionnelles : Repères clés (notamment pour Parcoursup en Terminale).

      2.2 Les Objectifs annuels

      Les objectifs sont adaptés à chaque niveau scolaire pour garantir une progression cohérente.

      | Niveau | Exemples d'Objectifs Incontournables | | --- | --- | | Collège (3ème) | Lister ses goûts et points forts ; identifier les voies après la 3ème ; préparer et réaliser un stage de découverte. | | Lycée (Terminale) | Préparer l'accès à l'enseignement supérieur ; finaliser son projet pour Parcoursup. |

      2.3 Les Outils d'exploration

      La plateforme propose des modules interactifs pour aider l'élève à se découvrir :

      « Je découvre des métiers » : Outil ludique explorant des thématiques modernes (ex: impact de l'intelligence artificielle sur les métiers).

      Fiches métiers et vidéos : Contenus contextualisés permettant de « liker » des professions pour les enregistrer dans son profil.

      2.4 L'Espace « Me faire accompagner »

      Ce volet rappelle l'importance de l'accompagnement humain. Il facilite la mise en relation avec :

      • Le psychologue de l'Éducation nationale (PsyEN) de l'établissement.

      • Le service « Mon orientation en ligne », un support gratuit accessible par chat, mail ou téléphone.

      --------------------------------------------------------------------------------

      3. Dispositifs Spécifiques pour le Lycée : Le Module « Mon Projet Sup »

      Pour les lycéens, la plateforme intègre l'outil Mon Projet Sup, dont la mission principale est de lutter contre l'autocensure.

      Personnalisation : L'outil part des centres d'intérêt, des enseignements de spécialité choisis et des préférences géographiques de l'élève.

      Suggestions intelligentes : Il propose des formations ambitieuses ou des « plans B » réalistes auxquels l'élève n'aurait pas forcément pensé.

      Lien avec Parcoursup : Bien qu'indépendant du système d'affectation, il permet de préparer ses vœux et d'explorer la carte des formations de manière fluide.

      --------------------------------------------------------------------------------

      4. Le Portfolio : Une Mémoire du Cheminement

      Le portfolio est l'espace où l'élève consigne toutes ses traces d'apprentissage et de réflexion de la 5ème à la Terminale.

      Continuité : Les données suivent l'élève même s'il change d'établissement, d'académie ou de région.

      Conservation : Le contenu est conservé jusqu'à trois ans après la Terminale pour faciliter d'éventuelles réorientations ou reprises d'études.

      Contenus stockés : Projets d'études, CV, lettres de motivation, comptes rendus d'entretiens avec les PsyEN et documents personnels (ex: interviews de professionnels, brochures d'entreprises).

      --------------------------------------------------------------------------------

      5. Gouvernance des Données et Rôle des Acteurs

      5.1 Accès et Connexion

      L'accès à Avenir se fait via les identifiants nationaux sécurisés :

      EduConnect pour la majorité des élèves de l'Éducation nationale.

      EduAgri pour l'enseignement agricole.

      Note : Les parents n'ont pas de compte propre mais sont invités à explorer la plateforme « côte à côte » avec leur enfant.

      5.2 Confidentialité et Droits

      RGPD : La plateforme respecte strictement les normes de protection des données personnelles.

      Visibilité restreinte : Les enseignants voient le tableau de bord des objectifs (auto-évaluation de l'élève) et le portfolio pour conseiller l'élève, mais n'ont pas accès à l'espace de stockage privé ni aux comptes rendus confidentiels des psychologues.

      Droit à l'erreur : L'élève est acteur de son profil ; il peut modifier ou supprimer ses centres d'intérêt et métiers favoris à tout moment.

      5.3 Déploiement dans les établissements

      Le déploiement est progressif.

      Le « plan Avenir » prévoit la formation des enseignants, en priorité les professeurs principaux de 3ème.

      La mise en œuvre dépend du projet de chaque établissement (utilisation durant les heures dédiées à l'orientation, vie de classe ou demi-journées thématiques).

      La plateforme est un outil pédagogique à la main des équipes, respectant leur liberté pédagogique.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity

      The manuscript by Wu and Griffin describes a mechanism where CHD4 and BRG1, two chromatin remodelling enzymes, have antagonistic functions to regulate extracellular matrix (ECM) plasmin activity and sterile inflammatory phenotype in the endothelial cells of the developing liver. As a follow up from a previous study, the authors investigate the phenotype of embryonic-lethal endothelial-specific CHD4-knockout, leading to liver phenotype and embryo death, and the rescue of this phenotype when subsequently BRG1 is knocked-out also in the endothelium. First, the authors show that the increase in plasmin activator uPAR (which leads to ECM degradation) in CHD4-KO embryos can be rescued by BRG1-KO, and that both CHD4 and BRG1 interact with the uPAR promoter. However, the authors demonstrate that reducing plasminogen by genetic knockout is unable to rescue the CHD4-KO embryos alone, suggesting an additional mechanism. By RNAseq analysis, the authors identify sterile inflammation as another potential contributor to the lethal phenotype of CHD4-KO embryos through increased expression of ICAM-1 in endothelial cells, also showing binding of both chromatin remodellers to ICAM-1 promoter. Finally, the authors use nonsteroidal anti-inflammatory drug carprofen, alone or in combination with plasminogen genetic knockout, and demonstrate CHD4-KO lethal embryonic phenotype rescue with the combination of plasminogen reduction and inflammation reduction, highlighting the synergistic role of both ECM degradation and sterile inflammation in this genetic KO.

      The findings of the manuscript are interesting, experiments well controlled and paper well written. While the work is of potential specialist interest to the field of liver development, there are several issues which authors should address before this paper can be published:

      Major issues:

      1. The authors still see embryonic lethality of some embryos with endothelial BRG1-KO or combined endothelial CHD4/BRG1-KO - could the authors please show or at least comment in the discussion why those animals are dying?

      We observed no dead Brg1-ECko or Brg1/Chd4-ECdko embryos by E14.5. However, at E17.5, there was an 18.8% lethality rate for Brg1-ECko mutants and a 12.5% rate for Brg1/Chd4-ECdko mutants (Fig. 1B). The reasons behind the incomplete rescue of Brg1/Chd4-ECdko embryos and the cause of death in Brg1-ECko mutants remain unknown, as we have mentioned in the revised discussion (see lines 311-316).

      1. In the qRT-PCR results Fig.2c, what is each dot?

      Each dot represents transcripts acquired from a separate embryo. We have modified the figure legend for clarification.

      1. In the same figure, I would expect that in CHD4-KO there is no CHD4 transcript, and in BRG1-KO there is no BRG1 transcript, rather than the reduction shown, which seems quite noisy (though significant) - is it this a result of normalisation? Or is indeed only a certain amount of the transcript reduced?

      The VE-Cadherin Cre mouse line utilized in this study is reported to have progressive Cre expression and activity from E8.5 to E13.5 and only to reach full penetrance across all vasculature at E14.51. The liver sinusoidal ECs (LSECs) analyzed in Fig. 2C were isolated at E12.5, before Cre activity reached its full penetrance. This is likely the primary cause of the variability in gene excision seen in this panel.

      1. In the same figure, is the statistical testing performed before or after normalisation? This can introduce errors if done after normalisation.

      Normalization was performed before statistical analysis to combine relative transcript counts from embryos harvested in multiple litters. This is now clarified in our methods (see lines 486-489).

      1. In some cases, the authors show immunofluorescence images but do not specify how many biological replicates this represents (e.g. Fig.1d, 4c-d). This should be added.

      We have updated the legends for Figs. 1E, 4C-D, and 6E-F, as suggested.

      1. I also encourage the authors to present a supplementary figure with at least one other biological replicate shown for imaging data (optional).

      We appreciated this suggestion but opted not to add additional supplemental figures, which might have been confusing to readers.

      1. The plasminogen reduction by genetic modulation results in drastic changes to the embryos' appearance - is this a whole embryo KO or endothelial-specific KO? Can authors at least comment on the differences?

      The plasminogen-deficient embryos used in this study were global knockouts; this is now clarified on line 177. The Chd4-ECko embryos with varying degrees of plasminogen deficiency that are shown in Fig. 2F were dissected at E17.5, which is ~3 days after the typical time of death for Chd4-ECko embryos. This explains why the dead and partially resorbed mutants in Fig. 2F look so different from their control (Plg-/-) littermate and from the E14.5 Chd4-ECko embryos shown in Fig. 1C.

      1. In Fig.2b, do I understand correctly only 1 sample was analysed with different areas plotted on the graph? If so, this experiment should be repeated on another set of embryos to be robust, and data plotted as a mean of each embryo (rather than areas).

      Each dot represents the mean value obtained after quantifying 4 fluorescent areas within a liver section from a single embryo. The N number indicates the number of embryos used from each genotype. We have updated the figure legend accordingly.

      1. Also in some graphs, authors specify that it was more than n>x embryos, but then - what are the dots on the graph representing? Each embryo? This should be specified (e.g. Fig.2b-c, but please check this in all the figure legends).

      Thank you for this question. We have worked to clarify the legends for all our graphs. Overall, for graphs related to embryos, each dot represents data from a single embryo. Since the sample sizes vary across genotypes, we used the smallest sample size taken from the mutant groups when listing our minimum N.

      1. "we found Plaur was the only gene that was induced in CHD4-ECko LSECs at E12.5 (Figure S3D)." - I am not sure this is correct, as gene Plau is also increased in 2/3 samples?

      Although Plau transcripts were also increased in Chd4-ECko LSECs compared to control samples, our statistical analysis showed a p-value of 0.0564, which was deemed non-significant according to our cutoff criteria of p

      1. I find the title and the running title somewhat misleading and too broad; the authors should specify more detail in the title about the content of the paper - the current statement of the title is somewhat true but shown only for one genetic model and not confirmed for all types of "lethal embryonic liver degeneration".

      We have updated the title to incorporate this suggestion. The revised title is ‘Plasmin activity and sterile inflammation synergize to promote lethal embryonic liver degeneration in endothelial chromatin remodeler mutants.’ The revised running title is ‘Plasmin and inflammation in endothelial mutant livers.’

      Minor issues:

      1. If an animal licence was used, its number should be specified in the ethics or methods section

      We have added this information to the methods (see line 383).

      1. In fig.3g it is very hard to see each of the samples, could authors try to improve this graph for clarity using colours-or split Y axis - or both?

      We have revised Fig. 3G to include a split y-axis, as suggested.

      1. "This indicates that ECs can play a pro-inflammatory role in embryonic livers and highlights the need for tight regulation to ensure normal liver growth." This sentence for me is misleading, EC are producing inflammatory signals only during the CHD4-KO according to the author's data, and authors do not show such data in normal homeostasis condition. Actually, the pro-inflammatory role here seems detrimental, and ECs should not exhibit it for correct development. The authors should rephrase this to be clearer.

      The detrimental inflammation observed when Chd4 was deleted in ECs indicates that endothelial CHD4 normally suppresses inflammation during liver development (Fig. 3F-G, and 4A-B). When endothelial CHD4 functions properly, there is no excessive cytokine activation and inflammation. We have modified the sentence to help clarify this information (see lines 295-297).

      Significance

      General assessment: The study is well controlled and well written. The findings are interesting. The limitation of the findings is only 1 combination genetic model being studied, and it is unclear if the synergistic effect of sterile inflammation and ECM degradation is broadly applicable to other models, where embryo dies because of liver failure.

      Advance: The study makes an incremental advance, following up findings from a previous study. However, it is conceptually interesting.

      Audience: The audience for this manuscript would be a liver development specialist. However, broader concepts could also be applicable to liver disease.

      Expertise: I research in the field of liver regeneration and disease.

      __Reviewer #2 __

      Evidence, reproducibility and clarity

      In essence, Wu et Al find that Chd4 mutant mice exhibit embryonic liver degeneration due to uPA-mediated plasmin hyperactivity and an ICAM-1-driven hyperinflammation and that additional mutation of BRG1 opposes this liver degeneration, possibly via ICAM-1.

      Generally, this is an excellent manuscript with a very logical sequence of experiments, although it has shortcomings such as validating their findings in an independent system, ideally human, and further establishing the translational relevance. Establishing translational relevance through mechanistic experiments that identify specific inflammatory tissue pathways, such as by blocking ICAM-1 and TNF-alpha, could also define developmental aberrations as a model for broader (patho)physiology and thereby enhance the impact on the field.

      Major

      1. The embryonic and postnatal survival data of Chd4-ECko and Brg1/Chd4-Ecdko mice should be included in Fig. 1

      We revised Fig. 1 to add representative photos and lethality rates for control and mutant embryos at E17.5 (see new Fig. 1B). All Chd4-ECko embryos we dissected at E17.5 were dead, which was consistent with our previous report2. Although Brg1/Chd4-ECdko embryos were largely rescued at E17.5, these mutants still die soon after birth due to lung development issues, as we previously reported3.

      1. What is the impact of Chd4-ECko and Brg1/Chd4-ECdko on the multicellular microenvironment? At a minimum, IF or spatial transcriptomics for hepatocyte and biliary markers, pericytes, and other mesenchymal cells would be recommended. Can there be a distinction made on what type of endothelial cell is affected? (sinusoidal lineage, vs. venous vs. lymphatic)

      To assess whether the multicellular microenvironment of Chd4-ECko livers was altered, we performed immunostaining for various cellular markers from E12.5 to E14.5. These markers included LYVE-1 for liver sinusoids; PROX1 and E-cadherin (ECAD) for hepatocytes; CD41 for platelets and megakaryocytes; CD45 for leukocytes; CD68 and F4/80 for macrophages; MPO for neutrophils; TER119 for erythroid cells; and a-smooth muscle actin (SMA) for pericytes and smooth muscle cells (see Fig. 4D and__ Fig. R1*__). Across all the images we examined, no obvious cell-type-specific differences were observed between control and mutant livers.

      Biliary epithelial cells, which begin to differentiate at approximately E15.54, were also assessed using cytokeratin 19 (CK19) immunostaining; however, no CK19-positive cells were detected in control livers at E14.5 (see Fig. R2*). Note that although LYVE-1 is also expressed by lymphatic endothelial cells, lymphatic vessels are not yet established in the liver at E14.52. Therefore, LYVE-1 staining is appropriate for identifying liver sinusoidal ECs at this stage of development. Our data indicate that the affected vasculature in Chd4-ECko livers is predominantly localized to the liver periphery (see Fig. 1D), which LYVE-1 staining shows to be mostly populated by sinusoidal vessels (Fig. R1B and R1F).

      *Please see uploaded Response to Reviewers PDF for Figures R1 and R2

      1. The experiments showing how endothelial Chd4 loss leads to a hyperinflammatory endothelial-and potentially hepatoblast-state are important. However, the relevance of immune cell infiltration in the hematopoietic-developing liver remains unclear. Which immune cells are presumably recruited to inflame the microenvironment then? Bone-marrow-derived? This aspect would benefit from experimental clarification, for example, using migration and/or direct co-culture versus indirect cell co-culture-ideally with or without ICAM-1 blockade-in vitro assays to determine if direct crosstalk with the CD45+ immune cell compartment explains the hyperinflammatory endothelia phenotype.

      In mice, the first hematopoietic cells emerge in the yolk sac at E7.55. Subsequently, embryonic hematopoiesis takes place in the aorta-gonad-mesonephros (AGM) region and the placenta, before immature hematopoietic cells migrate to the fetal liver. After E11.0, the fetal liver becomes the main hematopoietic organ, supporting the expansion and differentiation of hematopoietic stem and progenitor cells into all mature blood cell lineages5-8. Around E16.5, hematopoietic cells migrate to the bone marrow9, so the bone marrow is not a relevant source of infiltrating immune cells in our E12.5-14.5 Chd4-ECko mutants. We therefore examined immune cell populations, including leukocytes, macrophages, and neutrophils, in Chd4-ECko livers. No enrichment of specific immune cell types was observed in Chd4-ECko livers compared with controls at E13.5-14.5 (Fig. R1). Since immune cells develop within fetal livers at this stage, these findings suggest that they are locally activated rather than recruited to Chd4-ECko livers. Moreover, because fetal livers contain a heterogeneous mixture of immature and mature hematopoietic and immune cells, appropriate in vitro cell models to assess immune cell activation in this context are currently lacking. We have added comments to the introduction to address some of these points (see lines 66-68).

      1. Related to the previous comment: Can the authors validate their findings in an independent, ideally human, cell-based system?

      To explore this, we analyzed PLAUR and ICAM1 transcripts following CHD4 and/or BRG1 knockdown in primary human umbilical vein endothelial cells (HUVECs) for 48 hours. No antagonistic regulation of either gene was detected in HUVECs (Fig. R3*). Moreover, while Icam1 transcription was antagonistically regulated by CHD4 and BRG1 in the mouse MS1 EC line (see Fig. 5A), transcriptional regulation of Plaur by these remodelers was observed only in isolated LSECs and not in cultured MS1 cells. Together, these findings demonstrate that BRG1 and CHD4 play context-specific roles when regulating Icam1 and Plaur transcription in different EC types. Furthermore, in vitro versus in vivo EC environments may additionally influence BRG1 and CHD4 activity.

      *Please see uploaded Response to Reviewers PDF for Figure R3

      1. Identifying the specific hematopoietic/immune subset could further increase the paper's impact, as it would more definitively clarify the mechanism in the developing endothelial niche.

      Please see our response to question # 3.

      1. Also, can the authors show experimentally whether, conversely, Chd4 overexpression can limit an endothelial-type of inflammatory liver injury?

      We agree that exploring this suggestion would provide useful insights. However, we currently lack a genetic or inducible endothelial-specific Chd4 overexpression model, which makes it challenging to link our embryonic findings to the context of adult liver injury. For now, our study demonstrates that hepatic ECs regulate sterile inflammation to support embryonic liver development. Future development of appropriate genetic tools will allow us to determine if the role of endothelial CHD4 that is demonstrated in the current study is recapitulated in adult inflammatory liver injury models.

      Minor

      1. A separate figure panel for Chd4fl/fl; Vav-Cre+ appears reasonable, instead of being shown as a table.

      Thank you. Please see our new Fig. S1, which includes representative images (and lethality rates) of control and Chd4fl/fl;Vav-Cre+ embryos at E18.5.

      Significance:

      Generally, this is an excellent manuscript with a strong developmental biology focus, and its translational relevance is not immediately apparent; however, establishing such a link could significantly increase its impact. For example, the significance of these findings in ischemia-reperfusion injury, SOS/VOD, and sepsis could offer therapeutic avenues to stabilize endothelial function.

      The advance is the elegant discovery of a multifactorial endothelial-stabilizing mechanism in development, although its applicability to scenarios beyond developmental mutation remains unknown.

      The strengths are the clear and transparent experimental interrogation. Rightfully, the authors acknowledge that there would be a benefit in finalizing inflammatory blockade, genetic or antibody-mediated, to pin down the mechanistic circuit.

      The reviewer's expertise is: childhood liver diseases, developmental liver organoid generation, stem cells (iPSCs), cell reprogramming

      Reviewer #3

      Evidence, reproducibility and clarity:

      1. Wu et al. report antagonistic roles for chromatin remodelers Chd4 and Brg1, in endothelial cells, during liver development. There is a major flaw in the study which makes it difficult to interpret the conclusions. The genotypes of the mouse models used are flawed. The comparison should be made between two single knockouts (Chd4 single, Brg1 single), double mutants (Chd4/Brg1) and proper controls. For both "single KO", one allele of the other gene is also deleted - Chd4 -Ecko has one allele of Brg1 deleted and vice versa. Also, the proper control should be Chd4 fl/flBrg1fl/fl without the Cre. Since 3 alleles (not just two that belong to the same gene) are deleted in a single knockout, it is impossible to assign the effect to one gene.

      We acknowledge the fact that the single Brg1 and Chd4 EC knockouts in this study each carry a heterozygous deletion allele for the other remodeler (exact genotypes are shown in Fig. 1A). The mating strategy that yielded these mutants was chosen for three reasons. First, we have found that genetic background influences the embryonic phenotypes of these chromatin remodeler mutants3. Moreover, embryonic development at the stages analyzed in this study occurs quickly and requires precise timing for comparative analysis between genotypes. Therefore, it is most rigorous to study littermates when comparing single- and double-mutant embryos for BRG1 and CHD4. To achieve this, we used Brg1fl/fl;Chd4fl/fl females rather than Brg1fl/+;Chd4fl/+ females for timed matings. Although the former females cannot produce single knockout embryos without a compound heterozygous allele of the other remodeler, these females allowed us to generate single- and double-knockouts at a rate of 1/8 embryos. If we had used Brg1fl/+;Chd4fl/+ females for timed matings, we would have been able to generate “clean” single mutants with wildtype alleles of the other remodeler, but the single- and double-knockout generation rate would have been 1/32 embryos. This would have been an impractical mutant generation rate for this study. Second, our prior research demonstrates that heterozygous deletion of Chd4 or Brg1 does not produce the liver phenotypes seen with the respective homozygous deletions2,3. Third, the complete lethality of Chd4-ECko (Brg1fl/+;Chd4fl/fl;VE-cadherin-Cre+) mutants in this study demonstrates that deleting one allele of Brg1 cannot rescue Chd4-related lethality.

      As for controls in this study, we saw no evidence of phenotypes or of any gene deletion in our Cre- embryos (either in this study or in previous ones analyzing similar phenotypes2,3). Therefore, we used Cre- embryos for controls because they were generated at a 1/2 rate by our timed matings, which boosted our output for analyses.

      Specific points

      1. Fig 2c Plaur transcript - no statistical comparison between 2nd and 4th column, Chd4 Ecko vs double mutant. If there is not statistical difference, does not explain the rescue in double mutants

      Thank you for the suggestion. We have included a comparison between Chd4-ECko and Brg1/Chd4-ECdko in our revised Fig 2C. The Kruskal-Wallis test showed a significant difference between the Chd4-Ecko and Brg1/Chd4-ECdkogroups (p=0.016). This indicates that Plaur induction in Chd4-Ecko LSECs is rescued in Brg1/Chd4-ECdko LSECs.

      1. Fig 2e. Comparison should be made between Plg-/- Chd4 fl/fl and Plg-/- Chd4 fl/fl Cre, not other genotypes

      This experiment aims to determine whether different levels of plasminogen (Plg) reduction can rescue the lethality caused by Chd4 deletion. To do this, we set up the mating strategy shown in Fig. 2E to produce appropriate littermate controls and to compare lethality among Plg+/+;Chd4-ECko, Plg+/-;Chd4-ECko, and Plg-/-;Chd4-ECko embryos. This comparison would not have been possible with embryos generated only from mice on a Plg-/- background.

      1. Fig. 4. How does Chd4 or Brg1 activity in endothelial cells lead to Icam1 activation in epithelial cells?

      Since cytokines like IFNg, TNFa, and IL1b can induce ICAM-1 expression in hepatocytes10, we speculate that ICAM-1 expression in hepatoblasts (ECAD+ cells in Fig. 4D) was induced by the elevated TNFa and IL1b produced in Chd4-ECko livers (Fig. 3G).

      1. Mice used in Figure 5 are Cdf4 fl/+ and Cdf4 fl/fl, no Brg1 deletion. The authors improperly compare these to Chd4-Ecko which have one allele of Brg1 deleted. The rescue needs to be done in the same genotype Chd4-Ecko.

      Please note that data from Fig. 5 were generated from cultured ECs (MS1 cells).

      Significance

      Wu et al. report antagonistic roles for chromatin remodelers Chd4 and Brg1, in endothelial cells, during liver development. There is a major flaw in the study which makes it difficult to interpret the conclusions. Genotypes that were chosen for the study make the data not interpretable

      Please see our response to your Question #1


      In summary, we have included the following changes to this revised manuscript:

      • New Figure 1B: Representative images and lethality rates for control, Chd4-ECko, Brg1-ECko, and Brg1/Chd4-ECdko embryos at E17.5.
      • New Figure 2C: qRT-PCR analysis of Chd4, Brg1, and Plaur gene transcripts in E12.5 control and mutant LSECs.
      • Regraphing of Figure 3G: qRT-PCR analysis of Tnf, Il6, and Il1b gene transcripts in E14.5 control and mutant livers.
      • New Figure S1: Representative images and lethality rates for control, Chd4fl/+;Vav-Cre+, and Chd4fl/fl;Vav-Cre+embryos at E18.5. References for this revision:

      Alva JA, Zovein AC, Monvoisin A, Murphy T, Salazar A, Harvey NL, Carmeliet P, Iruela-Arispe ML. VE-Cadherin-Cre-recombinase Transgenic Mouse: A Tool for Lineage Analysis and Gene Deletion in Endothelial Cells. Dev Dyn. 2006;235:759-767. doi: 10.1002/dvdy.20643 Crosswhite PL, Podsiadlowska JJ, Curtis CD, Gao S, Xia L, Srinivasan RS, Griffin CT. CHD4-regulated plasmin activation impacts lymphovenous hemostasis and hepatic vascular integrity. J Clin Invest. 2016;126:2254-2266. doi: 10.1172/JCI84652 Wu ML, Wheeler K, Silasi R, Lupu F, Griffin CT. Endothelial Chromatin-Remodeling Enzymes Regulate the Production of Critical ECM Components During Murine Lung Development. Arterioscler Thromb Vasc Biol. 2024;44:1784-1798. doi: 10.1161/ATVBAHA.124.320881 Shiojiri N, Inujima S, Ishikawa K, Terada K, Mori M. Cell lineage analysis during liver development using the spfash-heterozygous mouse. Lab Invest. 2001;81:17-25. doi: 10.1038/labinvest.3780208 Soares-da-Silva F, Peixoto M, Cumano A, Pinto-do OP. Crosstalk Between the Hepatic and Hematopoietic Systems During Embryonic Development. Front Cell Dev Biol. 2020;8:612. doi: 10.3389/fcell.2020.00612 Ema H, Nakauchi H. Expansion of hematopoietic stem cells in the developing liver of a mouse embryo. Blood. 2000;95:2284-2288. Kieusseian A, Brunet de la Grange P, Burlen-Defranoux O, Godin I, Cumano A. Immature hematopoietic stem cells undergo maturation in the fetal liver. Development. 2012;139:3521-3530. doi: 10.1242/dev.079210 Freitas-Lopes MA, Mafra K, David BA, Carvalho-Gontijo R, Menezes GB. Differential Location and Distribution of Hepatic Immune Cells. Cells. 2017;6. doi: 10.3390/cells6040048 Christensen JL, Wright DE, Wagers AJ, Weissman IL. Circulation and chemotaxis of fetal hematopoietic stem cells. PLoS Biol. 2004;2:E75. doi: 10.1371/journal.pbio.0020075 Satoh S, Nussler AK, Liu ZZ, Thomson AW. Proinflammatory cytokines and endotoxin stimulate ICAM-1 gene expression and secretion by normal human hepatocytes. Immunology. 1994;82:571-576.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Wu and Griffin describes a mechanism where CHD4 and BRG1, two chromatin remodelling enzymes, have antagonistic functions to regulate extracellular matrix (ECM) plasmin activity and sterile inflammatory phenotype in the endothelial cells of the developing liver. As a follow up from a previous study, the authors investigate the phenotype of embryonic-lethal endothelial-specific CHD4-knockout, leading to liver phenotype and embryo death, and the rescue of this phenotype when subsequently BRG1 is knocked-out also in the endothelium. First, the authors show that the increase in plasmin activator uPAR (which leads to ECM degradation) in CHD4-KO embryos can be rescued by BRG1-KO, and that both CHD4 and BRG1 interact with the uPAR promoter. However, the authors demonstrate that reducing plasminogen by genetic knockout is unable to rescue the CHD4-KO embryos alone, suggesting an additional mechanism. By RNAseq analysis, the authors identify sterile inflammation as another potential contributor to the lethal phenotype of CHD4-KO embryos through increased expression of ICAM-1 in endothelial cells, also showing binding of both chromatin remodellers to ICAM-1 promoter. Finally, the authors use nonsteroidal anti-inflammatory drug carprofen, alone or in combination with plasminogen genetic knockout, and demonstrate CHD4-KO lethal embryonic phenotype rescue with the combination of plasminogen reduction and inflammation reduction, highlighting the synergistic role of both ECM degradation and sterile inflammation in this genetic KO.

      The findings of the manuscript are interesting, experiments well controlled and paper well written. While the work is of potential specialist interest to the field of liver development, there are several issues which authors should address before this paper can be published:

      Major issues:

      • The authors still see embryonic lethality of some embryos with endothelial BRG1-KO or combined endothelial CHD4/BRG1-KO - could the authors please show or at least comment in the discussion why those animals are dying?
      • In the qRT-PCR results Fig.2c, what is each dot?
      • In the same figure, I would expect that in CHD4-KO there is no CHD4 transcript, and in BRG1-KO there is no BRG1 transcript, rather than the reduction shown, which seems quite noisy (though significant) - is it this a result of normalisation? Or is indeed only a certain amount of the transcript reduced?
      • In the same figure, is the statistical testing performed before or after normalisation? This can introduce errors if done after normalisation.
      • In some cases, the authors show immunofluorescence images but do not specify how many biological replicates this represents (e.g. Fig.1d, 4c-d). This should be added.
      • I also encourage the authors to present a supplementary figure with at least one other biological replicate shown for imaging data (optional).
      • The plasminogen reduction by genetic modulation results in drastic changes to the embryos' appearance - is this a whole embryo KO or endothelial-specific KO? Can authors at least comment on the differences?
      • In Fig.2b, do I understand correctly only 1 sample was analysed with different areas plotted on the graph? If so, this experiment should be repeated on another set of embryos to be robust, and data plotted as a mean of each embryo (rather than areas).
      • Also in some graphs, authors specify that it was more than n>x embryos, but then - what are the dots on the graph representing? Each embryo? This should be specified (e.g. Fig.2b-c, but please check this in all the figure legends).
      • "we found Plaur was the only gene that was induced in CHD4-ECko LSECs at E12.5 (Figure S3D)." - I am not sure this is correct, as gene Plau is also increased in 2/3 samples?
      • I find the title and the running title somewhat misleading and too broad; the authors should specify more detail in the title about the content of the paper - the current statement of the title is somewhat true but shown only for one genetic model and not confirmed for all types of "lethal embryonic liver degeneration".

      Minor issues:

      • If an animal licence was used, its number should be specified in the ethics or methods section
      • In fig.3g it is very hard to see each of the samples, could authors try to improve this graph for clarity using colours-or split Y axis - or both?
      • "This indicates that ECs can play a pro-inflammatory role in embryonic livers and highlights the need for tight regulation to ensure normal liver growth." This sentence for me is misleading, EC are producing inflammatory signals only during the CHD4-KO according to the author's data, and authors do not show such data in normal homeostasis condition. Actually, the pro-inflammatory role here seems detrimental, and ECs should not exhibit it for correct development. The authors should rephrase this to be clearer.

      Significance

      General assessment: The study is well controlled and well written. The findings are interesting. The limitation of the findings is only 1 combination genetic model being studied, and it is unclear if the synergistic effect of sterile inflammation and ECM degradation is broadly applicable to other models, where embryo dies because of liver failure.

      Advance: The study makes an incremental advance, following up findings from a previous study. However, it is conceptually interesting.

      Audience: The audience for this manuscript would be a liver development specialist. However, broader concepts could also be applicable to liver disease.

      Expertise: I research in the field of liver regeneration and disease.

    1. suggested that migrations from Africa

      Could it be because of remains from people with nomadic lifestyles being found? I know that before humans settled down into civilization, they often tended to be hunter-gatherers and stayed near wherever food existed. Which made it very difficult to pinpoint where humans originated before we had all these tools to detect DNA differences and markers. I am interested about the biological side of how you can tell the differences between certain genetic markers on chromosomes, honestly. How were scientists able to find the genetic markers on a Y chromosome anyways? What does it look like from a model and actual picture? Those are the real questions here, and I wished that the paragraph intro shared more about this topic. Though, I do understand that this is a history class and not a science one.

    1. Le phénomène du bavardage scolaire : Analyse et perspectives

      Synthèse Exécutive

      Ce document présente une analyse approfondie du phénomène de bavardage scolaire, un enjeu souvent sous-estimé qui affecte de manière significative le climat de classe et la réussite des élèves.

      Basée sur une étude combinant une revue de la littérature scientifique et une enquête de terrain menée auprès d'élèves de 4ème, cette synthèse met en lumière la complexité du bavardage, les perceptions divergentes qu'il suscite et l'efficacité limitée des interventions basées uniquement sur la prise de conscience individuelle.

      L'analyse théorique révèle que le bavardage a évolué, passant d'un "chahut traditionnel" structuré à un désordre plus "anomique" et généralisé.

      Les travaux de chercheurs comme Florence Ehnuel et Alain Courneloup soulignent un décalage fondamental entre les perceptions des différents acteurs : les élèves le banalisent souvent comme une interaction sociale normale, les parents le perçoivent avec une faible gravité, tandis que les enseignants le vivent comme une source de déprofessionnalisation et d'impuissance.

      Les causes identifiées sont multiples, incluant l'ennui, le besoin d'interaction sociale, la pression des pairs et un cadre scolaire parfois perçu comme trop rigide.

      L'enquête de terrain, réalisée via un questionnaire suivi d'entretiens individuels, confirme ces constats. Une majorité d'élèves bavards ne se sentent pas personnellement dérangés par le bruit et estiment que leurs propres conversations ne nuisent pas à leurs camarades, se croyant capables de parler et d'écouter simultanément.

      L'outil de questionnement a permis une prise de conscience modérée chez environ la moitié des participants, mais n'a entraîné un changement de comportement durable que pour une minorité.

      La crainte de sanctions demeure le levier externe le plus efficace, tandis que la motivation interne reste fragile.

      En conclusion, la lutte contre le bavardage scolaire ne peut se résumer à des sanctions disciplinaires.

      Elle exige une approche globale qui intègre la compréhension des perceptions des élèves, la mise en place de cadres clairs et co-construits, et l'adoption de stratégies pédagogiques actives pour réduire l'ennui.

      Si la prise de conscience est une étape nécessaire, elle s'avère insuffisante sans un accompagnement structuré et des règles appliquées avec constance.

      --------------------------------------------------------------------------------

      I. Cadre Théorique du Bavardage Scolaire

      Définition et Évolution du Phénomène

      Le bavardage scolaire est défini comme toute prise de parole non autorisée par l'enseignant durant un temps de cours. Il constitue un phénomène social complexe qui perturbe la transmission des savoirs et le climat d'apprentissage.

      L'analyse sociologique de Jacques Testanière (1967) offre une perspective historique sur l'évolution du désordre en classe. Il distingue :

      • Le chahut traditionnel : Une "anomalie normale" et collective, souvent ritualisée, qui visait à tester l'autorité de l'enseignant tout en renforçant la cohésion du groupe d'élèves.

      • Le chahut anomique : Une forme de désordre plus généralisée, individualiste et sans règles, qui exprime une mauvaise intégration de l'élève au système pédagogique. Le bavardage contemporain s'apparente davantage à cette seconde forme, caractérisée par une multitude de conversations parallèles plutôt qu'une confrontation unifiée.

      Perceptions Divergentes des Acteurs

      L'une des difficultés majeures dans la gestion du bavardage réside dans le profond décalage de perception entre les différents acteurs de la communauté éducative, comme le démontre l'ouvrage de Florence Ehnuel, « Le bavardage : Parlons-en enfin ! ».

      | Acteur | Perception du Bavardage | | --- | --- | | Élèves | Considéré comme une interaction sociale normale et un "non-acte". Beaucoup estiment pouvoir écouter et parler en même temps. Il est souvent justifié par l'ennui, le besoin d'échanger avec les pairs ou le désintérêt pour la matière. | | Enseignants | Vécu comme une nuisance majeure, un manque de respect, et une source de fatigue et de culpabilité. Les réactions varient de la tolérance à la sanction systématique, en passant par un sentiment d'impuissance. | | Parents | Souvent perçu comme un problème mineur, non comparable à l'insolence ou aux mauvais résultats. Certains y voient même un signe de "vitalité" ou d'"aisance relationnelle". | | Didacticiens | Interprété comme une forme de résistance à la norme scolaire, une pratique sociale d'échange, une échappatoire face aux difficultés d'apprentissage, ou un symptôme du décalage entre la culture scolaire et la culture jeune. |

      Causes et Motivations du Bavardage

      La littérature identifie plusieurs facteurs expliquant la prévalence du bavardage :

      Facteurs Pédagogiques : L'ennui provoqué par un cours jugé trop lent ou inintéressant est une cause majeure. Comme le souligne Alain Courneloup, "un élève qui s'ennuie est un élève qui va trouver à s'occuper".

      Facteurs Sociaux : Le besoin d'interaction avec les pairs est fondamental à l'adolescence. Le groupe agit comme un "médiateur" entre l'individu et les adultes. Répondre à un camarade est souvent perçu comme une obligation sociale pour ne pas le "vexer" ou trahir une amitié.

      Facteurs Sociétaux : La "génération du zapping" est habituée à un environnement bruyant et à la multi-activité. Le silence peut être perçu comme angoissant par certains élèves.

      Facteurs Institutionnels : L'absence de règles claires ou le manque de constance dans l'application des sanctions par les enseignants peut créer un cadre propice au développement du bavardage.

      Conséquences et Enjeux

      Les méfaits du bavardage sont souvent sous-estimés. Il ne s'agit pas d'un simple désagrément sonore.

      Sur les apprentissages : Le bavardage est une "forme d'absentéisme" intellectuel.

      Même si l'élève est physiquement présent, son attention est détournée, ce qui nuit à la concentration, à la compréhension et à la mémorisation.

      Sur le climat de classe : Le bruit constant génère de la fatigue et de la tension pour l'enseignant et pour les élèves qui souhaitent travailler.

      Il ralentit le rythme du cours et peut créer un sentiment d'impunité.

      Sur le parcours de l'élève : À long terme, le bavardage persistant, lorsqu'il est le symptôme d'un désintérêt plus profond, peut être un indicateur de risque de décrochage scolaire.

      François Dubet, dans « La galère », décrit comment le désengagement scolaire peut mener à des trajectoires de marginalisation.

      --------------------------------------------------------------------------------

      II. Enquête de Terrain sur la Prise de Conscience des Élèves

      Objectif et Méthodologie de l'Étude

      L'enquête visait à déterminer si un outil de questionnement pouvait amener des élèves de 4ème à prendre conscience de l'ampleur et des conséquences de leur propre bavardage, et si cette prise de conscience pouvait induire un changement de comportement. L'expérimentation s'est déroulée en trois phases :

      1. Phase 1 : Administration d'un questionnaire en ligne (Google Forms) à 52 élèves pour évaluer leurs pratiques et perceptions.

      2. Phase 2 : Une période de plusieurs semaines pour observer d'éventuels changements.

      3. Phase 3 : Entretiens individuels avec un échantillon de 8 élèves pour mesurer l'impact de l'intervention.

      Principaux Résultats du Questionnaire (N=52, dont 35 "bavards")

      L'analyse s'est concentrée sur les 35 élèves s'identifiant comme discutant en cours "de temps en temps", "assez" ou "tout le temps".

      Auto-perception des élèves bavards :

      Un paradoxe central : Une grande majorité des élèves bavards (65,7%) déclarent ne pas être dérangés par le bruit en classe.   

      La rationalisation du multitâche : Plus de la moitié (54,3%) estiment que leurs propres discussions ne gênent "pas du tout" leurs camarades. La raison principale invoquée (68,4%) est leur conviction de pouvoir "parler à [leur] voisin et écouter le professeur en même temps". 

      Les motivations sociales avant tout : La raison principale du bavardage est d'avoir "des choses importantes à dire à leurs amis" (45,7%), devant les difficultés de concentration (40%) et le désintérêt pour la matière (34,3%).

      Conscience de l'Impact :

      Un effet modéré : Le questionnaire a permis à 51,4% des élèves de prendre "un peu" conscience des conséquences de leurs conversations.

      Seuls 5 élèves (14,3%) ont jugé cette prise de conscience "nécessaire" ou "essentielle". 

      Lien avec les résultats scolaires contesté : Les avis sont partagés quant à l'impact du bavardage sur les notes. 37,5% pensent que leurs discussions n'ont "pas d'impact" sur leurs résultats.

      Volonté de Changement :

      Une faible envie d'arrêter : Plus de la moitié des élèves bavards n'ont pas l'intention de mettre fin à leurs discussions, considérant que ce n'est "pas si bavard que ça" ou que c'est "plus fort que moi".  

      Le poids des sanctions : La "sanction de la part du professeur" est identifiée comme la pression extérieure la plus efficace pour les inciter à diminuer leurs bavardages.   

      Des résolutions fragiles : Malgré tout, 16 élèves sur 35 ont décidé de "prendre une résolution" pour se modérer.

      Résultats des Entretiens Individuels (N=8)

      Les entretiens menés quelques semaines après le questionnaire ont permis de nuancer les résolutions prises.

      Un Impact Limité sur le Comportement Réel : Seuls 3 des 8 élèves interrogés ont déclaré avoir effectivement diminué leur niveau de bavardage. Pour les autres, la situation était "pareille" voire "accentuée".

      La Persistance des Habitudes : Le changement de comportement s'est avéré difficile.

      Le placement en classe (proximité avec un ami) reste un facteur déterminant.

      Plusieurs élèves reconnaissent que malgré leur bonne volonté, l'habitude reprend le dessus.

      Un Acte Anormal mais Inévitable : La majorité des élèves interrogés conviennent qu'il n'est "pas normal" de discuter en classe.

      Cependant, cette reconnaissance intellectuelle ne se traduit que rarement par une auto-discipline efficace, illustrant le fossé entre la conscience d'une règle et sa mise en application.

      --------------------------------------------------------------------------------

      III. Synthèse et Recommandations Stratégiques

      Synthèse des Constats

      1. Le fossé perceptuel comme obstacle majeur : Le principal frein au changement est que les élèves bavards ne perçoivent majoritairement pas leur comportement comme une nuisance, ni pour eux-mêmes ni pour les autres.

      La croyance erronée en leur capacité à effectuer plusieurs tâches à la fois est une rationalisation puissante.

      2. L'insuffisance de la prise de conscience seule : L'enquête démontre qu'une intervention visant à provoquer une prise de conscience interne, bien qu'utile, est insuffisante pour modifier durablement les comportements.

      La volonté de changer est souvent volatile et rapidement supplantée par les habitudes et la dynamique sociale de la classe.

      3. L'importance persistante du cadre externe : Les facteurs externes, notamment la clarté des règles et la constance dans l'application des sanctions, restent des leviers d'action déterminants pour la majorité des élèves.

      Pistes de Réflexion et Stratégies d'Intervention

      En s'appuyant sur les apports de la littérature et les résultats de l'enquête, plusieurs stratégies peuvent être envisagées pour une gestion plus efficace du bavardage.

      Co-construire les règles de vie (Courneloup) : Impliquer les élèves dans l'élaboration des règles de communication en classe.

      Cet exercice de citoyenneté permet de rendre les règles plus explicites et de favoriser l'adhésion en montrant qu'elles servent l'intérêt collectif.

      Établir un cadre clair et constant (Ehnuel) : Dès le début de l'année, l'enseignant doit définir clairement ses attentes en matière de silence et de prise de parole.

      La constance est cruciale : les élèves identifient rapidement les enseignants dont les avertissements ne sont pas suivis d'effets.

      Adopter une pédagogie active (Courneloup) : Pour contrer l'ennui, il est essentiel de varier les modalités de travail. Alterner les exposés magistraux avec des exercices, des travaux de groupe structurés, et des mises en commun permet de canaliser l'énergie des élèves et de réduire les temps morts propices au bavardage.

      Utiliser la communication non verbale (Courneloup) : Un regard appuyé, un doigt sur la bouche ou un déplacement silencieux vers un groupe d'élèves est souvent plus efficace et moins perturbateur pour le reste de la classe qu'une réprimande verbale à voix haute.

      Privilégier le dialogue individuel (Ehnuel) : En cas de bavardage récurrent d'un élève, une discussion en aparté à la fin du cours peut être bénéfique.

      Elle permet de comprendre les raisons du comportement (difficultés, anxiété, etc.) et de responsabiliser l'élève sans l'humilier publiquement.

    1. Analyse du Microlycée de Sénart : Une Approche Pédagogique Alternative pour les Décrocheurs Scolaires

      Synthèse

      Le microlycée de Sénart est un établissement public qui incarne une approche pédagogique radicalement différente, conçue pour rescolariser les jeunes de 17 à 26 ans ayant quitté le système traditionnel.

      Face au phénomène national de 100 000 décrocheurs annuels, cette structure offre une "seconde chance" à 90 élèves, en s'attaquant aux causes profondes de leur déscolarisation : phobie scolaire, harcèlement, problèmes psychologiques ou mauvaise orientation.

      La méthode du microlycée repose sur trois piliers fondamentaux : la flexibilité, la confiance et la co-construction.

      Le cadre scolaire est volontairement assoupli : les retards sont tolérés, il n'y a pas de sanctions, et certaines règles des lycées classiques sont levées, comme l'interdiction du téléphone en cours ou de la cigarette (par dérogation).

      Les classes à effectifs réduits (neuf élèves) permettent une relation enseignant-élève intime et familière, caractérisée par le tutoiement et un suivi proactif, comme les appels quotidiens aux absents pour les encourager.

      L'évaluation est entièrement réinventée pour ne plus être une source de jugement destructeur.

      Les notes sur 20 sont remplacées par des pourcentages de réussite que les élèves peuvent discuter, voire négocier, avec leurs professeurs.

      Ce système de "co-construction" vise à faire de l'évaluation un outil d'apprentissage, renforçant l'autonomie et la confiance de l'élève.

      De même, le conseil de classe est transformé en un format de "speed dating" où chaque élève échange directement avec ses professeurs, devenant ainsi un acteur central de son parcours.

      Les parcours d'élèves comme Romain, Lola et Léo témoignent de l'efficacité de cette approche. Ils illustrent la capacité de l'établissement à reconstruire des jeunes brisés par le système traditionnel, en leur redonnant le goût d'apprendre et en leur permettant de se réconcilier avec l'école et avec eux-mêmes.

      Le dispositif inclut également un soutien crucial aux familles, via des groupes de parole, qui partagent leur désarroi et leur soulagement.

      Malgré un taux d'abandon de 20% en cours d'année, le microlycée parvient à mener 7 élèves sur 10 jusqu'au baccalauréat, prouvant la pertinence de son modèle atypique.

      --------------------------------------------------------------------------------

      1. Contexte et Mission du Microlycée de Sénart

      Le microlycée de Sénart, ouvert en septembre 2000, est l'une des 61 structures publiques en France dédiées à la rescolarisation des élèves décrocheurs.

      Chaque année, 100 000 jeunes quittent le lycée sans diplôme, soit 9% des élèves.

      L'établissement accueille 90 de ces jeunes, âgés de 17 à 26 ans, de la seconde à la terminale.

      Les raisons du décrochage sont multiples et complexes :

      • Phobie scolaire

      • Refus du système éducatif traditionnel

      • Problèmes psychologiques

      • Harcèlement scolaire

      • Mauvaise orientation

      La mission principale de l'établissement est de "réapprendre à aimer l'école" à ces jeunes en leur offrant un cadre bienveillant et des méthodes alternatives.

      2. Une Approche Pédagogique Fondée sur la Confiance et la Flexibilité

      Le programme de l'Éducation Nationale est suivi à la lettre, mais les méthodes d'enseignement et le cadre de vie scolaire sont radicalement différents de ceux d'un lycée traditionnel.

      2.1. Un Cadre Souple et Non-Punitif

      L'objectif est de dédramatiser l'école en supprimant les sources de stress et de conflit.

      Absence de Sanctions : Les retards sont autorisés et il n'y a pas de sanctions disciplinaires. Comme le souligne une enseignante, "on a d'autres moyens aussi de faire en sorte que ces jeunes puissent raccrocher".

      Tolérance et Flexibilité : Les élèves peuvent utiliser leur téléphone portable pour écouter de la musique en cours. La cigarette est autorisée dans des zones dédiées, sur dérogation de l'inspection académique.

      Suivi Proactif : Une professeure, Christine, appelle chaque jour les dizaines d'élèves absents, non pas pour les réprimander, mais pour les encourager à revenir. "Je t'appelle pour t'encourager à revenir à l'école [...] courage".

      2.2. La Relation Enseignant-Élève

      Le rapport entre les professeurs et les élèves est au cœur du dispositif.

      Effectifs Réduits : Les classes ne comptent que neuf élèves, favorisant une interaction directe et personnalisée.

      Proximité et Familiarité : Le tutoiement est la norme et les élèves sont appelés par leur prénom. Un élève explique : "Ça apporte plutôt je me trouve plus à l'aise en fait avec les profs [...] on commence à prendre confiance".

      Enseignants Volontaires et Formés : Les 14 professeurs sont tous volontaires et suivent des formations spécifiques pour encadrer ces élèves. Une professeure de français, Emmanuel, témoigne de la valeur de son travail : "Ce qu'il m'apporte de plus fondamental, c'est un sens profond à ce que je fais [...] j'ai l'impression de pouvoir accompagner ces jeunes jusqu'à une reprise de confiance en eux".

      3. L'Évaluation Réinventée : De la Sanction à la Co-construction

      L'un des aspects les plus innovants du microlycée est sa redéfinition complète du système de notation, souvent vécu comme un "jugement de la personne" dans le système classique.

      Pas de Note sur 20 : Les copies ne sont pas notées sur 20 mais reçoivent un pourcentage de réussite.

      La Co-construction : L'évaluation n'est pas un verdict final mais le début d'un dialogue. L'élève peut discuter son résultat avec le professeur.

      Emmanuel Catinois, professeure de français, précise : "C'est ce qu'on appelle la co-construction, c'est on construit ensemble l'évaluation. L'idée c'est que l'évaluation, ça ne doit pas être un coup prêt, une note qui sanctionne, mais une note sur laquelle on peut s'appuyer, une note qui aide".

      Le Droit à l'Erreur : Les élèves ont la possibilité de retravailler une partie d'un devoir pour améliorer leur score. Romain, un élève, apprécie cette méthode : "Je peux le refaire, je peux me reprendre et elle va accepter [...] au moins on peut justifier au lieu de perdre des points bêtement".

      4. Parcours d'Élèves : Portraits de la Reconstruction

      Les profils des élèves sont variés, issus de tous les milieux sociaux, et illustrent les défis auxquels le microlycée répond.

      | Élève | Âge | Parcours Avant le Microlycée | Situation au Microlycée | | --- | --- | --- | --- | | Romain | 17 | Ancien "premier de la classe", il tombe en dépression après le divorce de ses parents. Déscolarisé pendant deux ans. | Est devenu le meilleur élève de sa classe. Se réconcilie avec l'école grâce à la nouvelle approche de l'évaluation et des conseils de classe. | | Lola | \- | Déscolarisée pendant deux ans après avoir été harcelée au collège en raison de son homosexualité. A fait une tentative de suicide et a été hospitalisée six mois en psychiatrie. | A réappris à "aimer les lycées, à aimer les cours". Malgré d'importantes lacunes scolaires, l'équipe pédagogique parie sur elle en la faisant passer en classe supérieure. | | Léo | 19 | Décrit comme "je-m'en-foutiste", il a été renvoyé de plusieurs lycées. A décroché pendant deux ans, vivant la nuit et faisant des petits boulots. | Participe activement en cours, a pris le rôle d' "intendant café" et s'est réconcilié avec le français, écrivant désormais des chansons. |

      5. Le Rôle Crucial des Familles

      Le microlycée reconnaît que le décrochage scolaire est une épreuve pour toute la famille et intègre les parents dans son dispositif.

      Groupe de Parole : Un professeur anime des réunions régulières pour les parents, leur permettant de partager leur "désarroi" et de se soutenir mutuellement.

      Témoignages Émouvants : Les parents expriment un immense soulagement.

      Claude, père de Lola : "On est complètement paumé [...] Aujourd'hui [...] regardez c'est le sourire, Lola elle a un soir magnifique".    ◦ Laurence, mère de Romain : "Vous nous avez sauvé [...] le Romain de l'année dernière [...] à se poser des grosses questions est-ce qu'il va pas franchir une autre étape. Et puis aujourd'hui où je retrouve un môme de 17 ans vraiment bien dans ses baskets [...] c'est le jour et la nuit, on respire enfin".

      Forte Demande : L'établissement est perçu comme une "bouée de sauvetage". Chaque année, 40 familles postulent mais 20 candidatures doivent être refusées faute de place.

      6. Le Conseil de Classe : Un Modèle de Transparence et d'Implication

      Le traditionnel conseil de classe à huis clos est remplacé par un format innovant, conçu pour rendre l'élève acteur de son parcours.

      Format "Speed Dating" : Chaque élève rencontre individuellement chaque professeur pendant trois minutes pour discuter de ses résultats et de ses appréciations.

      Transparence Totale : "Il ne faut pas qu'il y ait des choses qui se disent sans la présence des élèves, rien n'est secret".

      Implication de l'Élève : Les élèves valident ou non les commentaires des professeurs. Romain explique : "On a l'interaction avec le prof et il nous met son commentaire devant nous et on valide ou pas [...] Ça m'apporte que je vois l'avis du prof en face de moi et qu'il fasse pas derrière mon dos".

      Renforcement de la Confiance : Cette pratique est jugée essentielle pour associer des élèves "adultes" (plus de 17 ans) à la construction de leur scolarité, ce qui "permet la confiance et le raccrochage".

      7. Résultats, Défis et Perspectives

      Le modèle du microlycée de Sénart, bien qu'exigeant, affiche des résultats probants.

      Taux de Réussite : 7 élèves sur 10 qui poursuivent leur scolarité au microlycée décrochent leur baccalauréat.

      Taux d'Abandon : Le parcours reste difficile, et 20% des élèves abandonnent en cours d'année.

      Défis Pédagogiques : L'équipe doit gérer des écarts de niveau considérables, comme celui de Lola qui a un niveau de 5ème en langues. L'établissement fait alors "le pari de la seconde chance" en adaptant ses décisions pour ne pas décourager les élèves malgré leurs lacunes.

      Transformation Personnelle : Au-delà du succès scolaire, l'établissement permet aux élèves de se reconstruire, de reprendre confiance et de développer de nouveaux projets, à l'image de Léo qui compose des chansons : "Apprendre, me cultiver et revenir avec des phrases de prof de français".

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The manuscript by Peter et al., reports on the neuronal activity and connectivity of iPSC-derived human cortical neurons from Down syndrome (DS) that is caused by caused by trisomy of the human chromosome 21 (TS21).

      Major points:

      Although the manuscript is potentially interesting, the results appear somehow preliminary and need to be corroborated by control experiments and quantifications of effects to fully sustain the conclusions.

      (1) The authors have not assessed the percentage of WT and TS21 cells that acquire a neuronal or glia identity in their cultures. Indeed, the origin of alterations in network activity and connectivity observed in TS21 neurons could simply derive from reduced number of neurons arising from TS21 iPSC. Alternatively, the same alteration in network activity and connectivity could derive from a multitude of other factors including deficits in neuronal development, neurite extension, or intrinsic electrophysiological properties. In the current version of the manuscript, none of these has been investigated.

      (2) Electrophysiological properties of TS21 and WT neurons at day 53/54 in vitro indicate an extremely immature stage of development (i.e. RMP between -36 and -27 mV with most of the cells firing a single action potential after current injection) in the utilized culture conditions: This is far from ideal for in vitro neuronal-network studies. Finally, reduced activity of HCN1 channels should be confirmed by specific recordings isolating or blocking the related current.

      Main points highlighting the preliminary character of the study.

      1) In Figure 1 immunofluorescence images of the neuronal differentiation markers (Tbr1, Ctip2 and Tuj1) are showed. However, no quantification of the percentage of cells expressing these markers for WT and TS21 neurons is reported. On the other hand, simple inspection of the representative images clearly seams to indicate a difference between the two genotypes, with TS21 cultures showing lower number of cells expressing neuronal markers. This quantification should be corroborated by a similar staining for an astrocyte marker (GFAP, but not S100b since is triplicated in DS). This is an extremely important point since it is obvious that any change in the percentage of neurons (or the neuron/astrocyte ratio) in the cultures will strongly affect the resulting network activity (shown in Figure 2) and the connectivity (showed in Figure 4). Possibly, the quantification should be done at the same time points of the calcium imaging experiments.

      2) In Figure 2 the authors show some calcium imaging traces of WT and TS21 cultures at different time points. However, they again do not show any quantification of neuronal activity. A power spectra analysis is shown in Supplementary Figure 2, but only for WT cultures, while in Supplementary Figure 3 a comparison between WT and Ts21 power spectra is done, but only at the 50 day time point, while difference in synchrony are assessed at 60 days. At minimum, the author should include in main Figure 2 the quantification of the mean calcium event rate and mean event amplitude at the different time points and the power spectra analysis for both WT and TS21 cultures at the same timepoints.

      Of note, the synchronized neuronal activity is present in WT cultures at day 60, but totally lost at subsequent time-points (70 and 80 days). The results of this later time points are different from previous data from the same lab (Kirwan et al., 2015). How might these data be explained? It would be important to rule out any potential issues with the health of the culture that could explain the loss of neuronal activity.It would be beneficial to check cell viability at the different time points to exclude possible confounding factors ? A propidium staining or a MTT assay would strongly improve the soundness of the calcium data.

      3) In Figure 3 there is no quantification of the number and/or density of transplanted neurons for WT and TS21, but only representative images. As above, inspection of the representative images seems to show a decrease in cells labeled by the Tbr1 neuronal marker for TS21 cells. Moreover, the in vivo calcium imaging of transplanted WT and TS21 cells lacks most of the quantification normally done in calcium imaging experiments. Are the event rate and event amplitude different between WT and TS21 neurons ? The measure of neuronal synchrony by mean pixel correlation is not well explained, but it looks somehow simplistic. Neuronal synchrony can be more precisely measured by cross-correlation analysis or spike time tiling coefficients on the traces from single-neuron ROI rather than on all pixels in the field of view, as apparently was done here.

      4) The results on reduced neuronal connectivity in Figure 3 look very striking. However, these results should be accompanied by control experiments to verify the number of neuronal cells and neurite extension in WT and Ts21 cultures. These two parameters could indeed strongly influence the results. As the cultures appear to grow in clusters, bright-field images and TuJ1 staining of the cultures will also greatly help to understand the degree of morphological interconnection between the clusters.

      5) The authors performed RNA-seq experiments on day 50 cultures. Why the authors do not show the complete differential gene expression analysis, but only a small subset of genes? A comprehensive volcano plot and the complete list of identified genes with logFC and FDR values would be helpful. If possible, comparison of the present data (particularly on KCN and HCN expression changes) with published and publicly available expression datasets of other human or human Down syndrome iPSC-derived neurons or human Down syndrome brains will greatly increase the soundness of the present findings. In addition, the gene ontology (GO) results are mentioned in the text, but are not presented. Showing the complete GO analysis for both up and downregulated genes will help the reader to better understand the RNA-seq results. Notably, the results shown in Supplementary Figure on GRIN2A and GRIN2B expression (with values of 300-700 counts versus 2000-4000 counts, respectively) clearly indicate that in both WT and TS21 cultures the NMDA developmental switch has not occurred yet at the 50 days timepoint.

      6) The measure of hyperpolarization-activated currents shown in Figure 5 lack proper control experiments. First, the hyperpolarizing current in TS21 cells do not reach a steady-state as the controls. The two curves are therefore hard to compare. To exclude possible difference in kinetic activation, the authors should have prolonged the current injection period (1-2 seconds). Second, to ultimately prove that such currents are mediated by HCN channels in WT cells the authors should perform some control experiments with a specific HCN blocker. A good example of a suitable protocol, with also current blockers to exclude all other possible current contributions, is the one reported in Matt et al Cell. Mol. Life Sci. 68, 125-137 (2011).

      7) The manuscript lacks information on the statistical analysis used. Also, the numerosity of samples is not clear. Were the dots shown in some graph technical replicates from a single neuronal induction or were all independent neuronal inductions or a mix of the two ? Please clarify.

      8) The method section lacks important information to guarantee reproducibility. Just a few examples: - Only electrophysiology methods for slice are reported, but not for in vitro culture. - Details on Laminin coating is lacking. What concentration was used ? Was poly-ornithine or poly-lysine used before Laminin coating ? - How long cells were switched to BrainPhys medium before calcium imaging ?

      Minor point/typos etc.

      Introduction

      • Page 4 line 6: in the line "Trisomy 21 in humans commonly results in a range in developmental and morphological changes in the forebrain ..." "in" could be replaced by "of".
      • Page 5 line 2: please remove "an" before the word "another".
      • Page 5 line 2: please replace "ecitatory" with "excitatory"

      Results

      • Page 10 line 25: The concept of "pixel-wise" appears for the first time in this section and could be better introduced to facilitate the understanding of the experiment.
      • In the "results" section, page 11 line 1 and 4, references are made to "Figure 4D" and "4F," but these figures do not appear to be present in the figure section. Upon reviewing the rest of the section, the data seem to refer to "Figure 3D" and "3E."

      Discussion

      • Page 15 line 20: please replace "synchronised" with "synchronized".
      • Page 16 line 11: please replace "T21" with "TS21".

      Methods

      • Page 19 line 12: "Pens/Strep" has to be replaced by Pen/Strep.
      • Page 20 line 20: "Tocris Biocience" has to be replaced by "Tocris Bioscience".
      • Page 21 line 2: "Addegene" has to be replaced by "Addgene".

      Figures

      • Figure 3: the schematic experimental design (Fig. 3A) could be enlarged to match the width of the images/graphs below.
      • Figure 5: the reviewer suggests resizing/repositioning the graphs in Fig. 1A so that they match the width of those below.
      • Figure S1D: In all the figures of the paper, the respective controls for the TS21 1 and TS21 2 lines are labelled as "WT1/WT2," while in these graphs, they are called "Ctrl1" and "Ctrl2." To ensure consistency throughout the paper, it is suggested to change the names in these graphs.
      • Figure S4L: The graph is not very clear, especially regarding the significance reported at -50 pA, please modify the graphical visualization and/or add a legend in the caption.

      Significance

      Nature and significance of the advance for the field. The results presented in the manuscript are potentially interesting and useful, but not completely novel (currents deregulation has already been highlighted in mouse models of Down Syndrome).

      Work in the context of the existing literature. This work follows the line of evidence that characterizes Down Syndrome in human neurons (Huo, H.-Q. et al. Stem Cell Rep. 10, 1251-1266 (2018); Briggs, J. A. et al. Etiology. Stem Cells 31, 467-478 (2013)), both in vitro and in xenotransplanted mice, by corrborating some important findings already found in animal models (Stern, S., Segal, M. & Moses, E. EBioMedicine 2, 1048-1062 (2015); Cramer, N. P., Xu, X., F. Haydar, T. & Galdzicki, Z. Physiol. Rep. 3, e12655 (2015); Stern, S., Keren, R., Kim, Y. & Moses, E. http://biorxiv.org/lookup/doi/10.1101/467522 (2018) doi:10.1101/467522.

      Audience. Scientists in the field of pre-clinical biomedical research, especially those working on neurodevelopmental disorders and iPSC-based non-animal models.

      Field of expertise. In vitro electrophysiology, Neurodevelopmental disorders, Down Syndrome, ips cells.

    1. Synthèse : Enfants Violents à l'École - Entre Aide et Répression

      Résumé Exécutif

      Ce document de synthèse analyse les tensions et les débats entourant la gestion de la violence chez les jeunes enfants au sein du système scolaire et de la société française.

      Il ressort que l'école se trouve démunie face à des comportements extrêmes, conduisant à la création de structures expérimentales comme "r'école" pour éviter la déscolarisation.

      Parallèlement, une tendance croissante à la médicalisation des troubles du comportement, incarnée par le diagnostic d'hyperactivité et la prescription de Ritaline, suscite une vive controverse.

      Des experts dénoncent l'influence du lobbying pharmaceutique et une simplification qui ignore les causes profondes de la souffrance de l'enfant.

      Cette approche s'inscrit dans un contexte de "psychose médiatique" qui exagère le phénomène de la violence infantile, contredit par la réalité judiciaire qui atteste de la rareté des cas criminels chez les très jeunes.

      L'analyse des cas individuels révèle que la violence est souvent le symptôme d'une souffrance psychique profonde, liée à des contextes familiaux difficiles (ruptures, violence parentale) et socio-économiques précaires.

      Face à des réponses répressives ou médicamenteuses, des initiatives de prévention de proximité, comme l'association "Mission Possible", démontrent qu'un accompagnement axé sur l'écoute et le soutien aux familles est non seulement plus humain, mais aussi considérablement moins coûteux et plus efficace à long terme pour la société.

      --------------------------------------------------------------------------------

      1. Le Défi de l'École Face aux Comportements Extrêmes

      Le système éducatif est confronté à une difficulté croissante pour gérer les comportements violents et ingérables de certains très jeunes élèves.

      Les enseignants et les directions d'école expriment un sentiment d'impuissance et de manque de formation, menant à des situations d'échec et à l'exclusion des enfants concernés comme ultime recours.

      Le Cas d'Ethan Paul et Hamadi (6 ans) : Ces deux élèves de CP ont été exclus de leur école pour "comportement violent et ingérable".

      Ethan Paul aurait tenté d'étrangler un camarade, conduisant des parents à porter plainte. Sa maîtresse reconnaît son échec :

      "Je n'ai pas réussi à ce qu'ils puissent être intégrés en classe et faire les apprentissages de la place de façon satisfaisante." Elle décrit un enfant en "très très très très grande souffrance".

      La Structure "r'école" : Mise en place en janvier 2009 par le rectorat de Paris, cette structure unique en France accueille pour trois mois des enfants exclus.

      Objectif : Éviter la déscolarisation en offrant une "surveillance éducative renforcée" et en réapprenant aux enfants les règles de la vie en groupe.  

      Fonctionnement : Les enfants sont pris en charge par un personnel mixte (enseignante, auxiliaire de vie scolaire), mais seule l'enseignante est spécifiquement formée. Les crises de violence y sont fréquentes et difficiles à gérer pour le personnel.   

      Limites : Bien que présentée comme le "seul espoir", cette solution est coûteuse et soulève le risque de créer des filières alternatives pour enfants "difficiles", comme le souligne Jean-Louis Barateau, initiateur du projet : "Ça pourrait être dangereux si on multipliait des r'écoles au point d'avoir finalement des écoles alternatives."

      2. La Médicalisation de la Violence Infantile : Une Solution Controversée

      Face à l'inquiétude grandissante, une approche tendant à considérer les troubles du comportement comme des pathologies médicales à traiter a émergé, non sans susciter de vives critiques.

      2.1. Le Rapport de l'INSERM et la Récupération Politique

      Le Rapport (2006) : Consacré aux "troubles des conduites", ce rapport d'experts visait à dépister les "facteurs de risque" et de "vulnérabilité" chez l'enfant.

      La Controverse : Le rapport a servi de caution scientifique à un projet de loi sur la délinquance des mineurs, porté par Nicolas Sarkozy, alors ministre de l'Intérieur.

      Celui-ci affirmait : "Plutôt on n'intervient plus mieux on a de chances d'éviter le drame d'un enfant qui évolue vers la délinquance."

      La Réponse de la Société Civile : Des experts ayant participé au rapport ont précisé n'avoir "jamais écrit un rapport sur la prévention de la délinquance".

      En réaction, la pétition "Pas de zéro de conduite pour les enfants de 3 ans" a recueilli près de 200 000 signatures, dénonçant le risque de transformer des comportements normaux (morsures, colères) en symptômes d'un trouble mental à rééduquer.

      2.2. L'Hyperactivité et la Ritaline : Remède ou Simplification ?

      Le diagnostic de "l'hyperactivité avec déficit de l'attention" (TDAH) et son traitement par la Ritaline, une amphétamine, sont au cœur du débat.

      La Défense du Traitement : La pédopsychiatre Marie-France Le Heuzey justifie son usage pour "améliorer la qualité de vie et d'améliorer le quotidien de ces enfants", soulignant la souffrance liée au rejet social et familial.

      Pour elle, si le médicament permet à l'enfant de ne plus être puni et aux parents de moins se disputer, "on a aussi fait du bien largement à l'enfant".

      La Critique de la Sur-médicalisation :

      Rareté de la pathologie : Selon le Pr Bernard Golse (Hôpital Necker), l'hyperactivité "vraie" est très rare (1 à 2 cas pour 1000), loin des 5 à 10% avancés par certains. Il dénonce "l'effet direct du lobbying pharmaceutique qui veut élargir coûte que coûte la prescription de médicaments".   

      Création de la demande : Philippe Pignarre, ancien cadre de l'industrie pharmaceutique, explique la stratégie marketing : "L'industrie pharmaceutique travaille à créer à la fois l'offre et la demande... On va la créer la demande en disant aux gens... ce que vous saviez pas, c'est qu'il a un trouble mental et qu'on peut soigner ce trouble mental."   

      Traitement des symptômes, pas des causes : La Ritaline, surnommée "pilule de l'obéissance", agit sur les symptômes mais ne traite pas les causes sous-jacentes de la souffrance.

      2.3. L'Expérience Vécue : Le Cas d'Aymeric

      Aymeric, 16 ans, a été traité à la Ritaline pendant des années.

      Son témoignage illustre l'ambivalence du traitement : "C'était bien mais c'est pas bien. Pourquoi c'était bien ? Parce que ça me calmait d'un côté. Mais... c'était bien pour eux, mais pour moi c'était pas bien parce que là je mangeais plus... j'étais tout le temps fatigué."

      3. Psychose Médiatique et Réalité Judiciaire

      La perception publique de la violence infantile est fortement influencée par un traitement médiatique qui tend à se focaliser sur des faits divers extrêmes, créant une forme de psychose collective.

      L'Affaire d'Uckange (2009) : Un Emballement Révélateur :

      ◦ Un garçon de 5 ans est accusé d'avoir poignardé sa sœur de 8 ans, prétendument sous l'influence de jeux vidéo.

      L'affaire est largement médiatisée, et la thèse de l'enfant coupable est acceptée par les médias, la police et la justice.   

      ◦ Quelques jours plus tard, la mère avoue être l'auteure du coup de couteau. L'affaire démontre la rapidité avec laquelle "les médias ont véhiculé un peu trop vite le scénario de l'enfant criminel".

      La Perspective des Magistrats : La juge pour enfants Marie-Pierre Hourcade affirme que les affaires de violence au pénal impliquant de très jeunes enfants sont très rares.

      Le critère de la responsabilité pénale est le discernement, qui apparaît vers 7-8 ans. "En aucune façon le parquet ne nous saisit pour des situations de violences commises par des très jeunes enfants."

      4. Derrière la Violence : Souffrance Psychique et Contexte Familial

      L'analyse approfondie des cas révèle que les comportements violents sont presque toujours l'expression d'une souffrance profonde, souvent enracinée dans des histoires familiales et sociales complexes.

      L'Expression de la Souffrance : Le Cas de Sami (13 ans) :

      ◦ Retiré à 8 ans d'un contexte familial violent, Sami a été ballotté de foyer en foyer. Sa violence est une manifestation de sa tristesse face aux ruptures affectives répétées.  

      ◦ Le Dr Roger Teboul, psychiatre, explique : "Bien souvent, quand vous parlez de la violence, vous parlez de la tristesse... Le seul truc qui permet de tenir, c'est d'être en colère."

      L'objectif de son service est de permettre à ces jeunes d'exprimer leur tristesse pour ne plus avoir à l'agir par la violence.

      L'Impact du Contexte Socio-économique : Le Cas de Florian (7 ans) :

      ◦ Florian vit dans un quartier précaire d'Amiens. Sa mère élève seule 5 enfants avec le RMI. L'État s'est largement désengagé du quartier.  

      ◦ Cet environnement de précarité rend l'éducation extrêmement difficile. La mère de Florian exprime sa peur que son fils devienne délinquant si elle n'est pas soutenue.

      5. Stratégies d'Intervention : La Prévention comme Alternative

      Face aux approches répressives ou médicales, les initiatives de prévention axées sur l'accompagnement et le soutien des familles démontrent leur pertinence humaine et économique.

      "Mission Possible" : Un Modèle de Prévention de Proximité :

      ◦ Créée par le juge des enfants Claude Baud, cette association à Amiens accueille librement des familles sans obligation judiciaire.  

      ◦ Elle offre un soutien aux parents, souvent démunis et en rupture avec les services sociaux, sans les juger ni les culpabiliser.   

      ◦ Elle apprend aux enfants les règles de vie en société par le dialogue et un cadre clair, en cherchant à comprendre le sens de leurs comportements plutôt qu'à les étiqueter.

      L'Analyse d'un Juge : Coût et Efficacité : Claude Baud souligne les avantages de la prévention :

      Moins Stigmatisant : "Un parcours judiciaire pour un enfant est dix fois plus stigmatisant qu'un parcours de prévention."   

      Moins Coûteux : Il établit une comparaison financière éloquente :      

      Prévention (Mission Possible) : 8 € par jour et par enfant.     

      Placement Éducatif : 200 à 400 € par jour.    

      Détention en section mineurs : 700 à 1000 € par jour.

      La conclusion est claire : investir dans des moyens humains, de l'écoute et du personnel bien formé pour aider les familles et les enfants en souffrance est infiniment moins coûteux que de devoir gérer, quelques années plus tard, les conséquences de cette souffrance non traitée.

    1. Cher Tony Gheeraert,

      Merci pour cet article, que j'ai vraiment apprécié. Parce que c'est une chronique, elle pourrait être publiée telle quelle. j'ai fait un certain nombre de commentaires, que vous être libre d'utiliser ou non, qui ne s'insèrent pas toujours dans la logique de l'article (reprendre la théorie marxiste de la création de valeurs et essayer de l'appliquer à la situation présente de l'IA) et que vous êtes bien évidemment libre de reprendre ou non.

      Reprenons les différents critères d'évaluation :

      Pertinence de la réflexion

      Cet article propose une relecture marxiste de l'économie de l'IA générative, de la bulle spéculative qu'elle forme, etc. Même pour un non-marxiste, cette relecture est intéressante: dans mon expérience de chercheur non-marxiste, les écrits marxistes (orthodoxes ou hétérodoxes) m'ont toujours forcé à regarder des situations avec un autre regard et, donc, à varier mes analyses.

      Deux exemple: le livre de Gavin Mueller qui essaye de réconcilier luddisme et marxisme (Mueller Gavin, Breaking things at work: the Luddites are right about why you hate your job, London New York, Verso, 2021.) J'avoue que je me contrefiche un peu de la compatibilité du luddisme avec le marxisme (sauf par intérêt historique). Mais cela a poussé Mueller à analyser la nature du mouvement du logiciel libre comme forme de luddisme technophile, idée que je repends depuis à mon tompte.

      Second exemple, très loin de ce qui nous préoccupe ici: ce sont notamment des analyses marxistes hétérodoxes qui ont poussé les historiens et historiennes à regarder de plus près les politiques financière, budgétaires des dictatures fascistes et nazies, dès la fin des années 1940.

      Le seul problème de cette démarche de relecture marxiste est qu'elle peut être souvent frustrante pour le lecteur. Est-ce qu'il y a des moyens d'éviter cela, je n'en suis pas sûr.

      Je reste cela dit peu convaincu par l'entrée en matière de l'article. Le texte de gemini est vraiment mauvais et caricatural (et en outre, j'aime bien avoir les prompts quand quelqu'un cite une IA).

      Subjectivité et démarche

      Je renvoie à la section précédente sur ce point, car la démarche, une relecture marxiste, y est bien commentée.

      Contribution au champ disciplinaire

      Il est indéniable que cet artcile participe à quelque chose d'important, démystifier l'IA générative. Ce que je regrette un peu, c'est le fait que l'auteur ne fait pas complètement le tri (de manière explicite, s'entend) entre certains mythes de l'IA et la réalité économique de l'IA. Je reconnais que c'est assez difficile, dans la mesure où certains éléments relèvent des deux, la question de la bulle spéculative en premier lieu. Mais on a parfois l'impression que lauteur succombe lui-même un peu au mythe de l'IA (ce que je ne juge pas, moi-même y succombant parfois aussi). Mais je ne peux qu'être d'accord avec la logique d'ensemble de l'article et notamment la section "Par-delà « le libre »".

      Avis général sur la publication

      Comme c'est une chronique, je dirais que l'article peut être publié sans modification. À l'auteur, ou à l'éditeurice, de voir s'il souhaite prendre en compte certaines de mes remarques.

      Bien à vous.

    2. En simplifiant, on retrouve : une phase de suraccumulation de capital, suivie d’une surproduction (relative) de capacités productives, menant in fine à une dévalorisation violente (un « krach »), suivie d’une restructuration qui rétablit temporairement les conditions de profit

      Modèle de la crise des dotcoms en 2000-2001, d'aileurs. Mais cette crise a débouché sur le fameux web2 d'O'Reilly en 2005, plus participatif, etc mais surtout fondé sur un nouveau modèle économique prédateur, le capitalisme de surveillance décrit par Zuboff.

      Je ne suis pas fan des prédictions (la damnation des historiens), même si je pense qu'il y aura éclatement de la bulle: toutefois, il n'y a pas de certitude qu'elle éclate effectivement. Il y a d'autres trajectoires de sortie des bulles spéculatives.

    3. Or, les premières études sur le sujet montrent qu’un tel entraînement récursif dégrade fortement la qualité des modèles au fil des générations,

      J'ai un doute ici. Il faudrait peut-être éclaircir les choses: il y a tout un champs de recherche sur les données synthétiques et leur efficacité / inefficacité pour entraîner un modèle de langue. PleIAs par exemple a sorti un jeu de données synthétiques pour l'entraînement de petits modèles, jeux de donées extrapolé à partir de plus de 50 000 articles de wikipedia (si j'ai bien compris). Dans votre exemple, il me semble que vous pensez plutôt à la possibilité que les grandes composantes des données d'entraînement (wikipedia pour les articles qui sont semble-t-il de plus en plus générés par des bots IA, commoncrawl qui intègre une grande partie du web, donc de nombreuses pages désormais générées par IA, etc) contiennent de plus en plus de données engendrées automatiquement, et surtout du domaine du 'AI Slop'. Je pense que ce sont deux cas bien différent, et l'usage du terme 'synthétique' est trop ambigu ici.

    4. Mais ce faisant, il ne crée pas de valeur nouvelle : il la transforme et la transporte.

      Je pense qu'il y a des arguments qui contredisent cette phrase. Alors, n'étant pas un spécialiste de Marx, je suis susceptble de n'avoir compris ce que vous impliquez ici. Mais je vois aussi dans cette affirmation une référence implicite à l'article 'stochastic parrots (bender et al.). Or il a été contesté depuis.

      Surtout, pour être le plus rigoureux possible, il faudrait séparer la référence au modèle de langue et la référene au chatbot: il y a beaucoup plus qu'un modèle de langue derrière un chatbot, même si c'en est le coeur, avec de très nombreuses opérations avant et après la requête au chatbot elle-même. Quid de ces opérations en termes de 'travail mort'?

      La question de la différence entre chatbot et modèle au sens strict, me semble importante, car l'on pourrait argumenter que la création de valeur se situe non pas dans les réponses des chatbots, mais bien dans l'interaction entre l'utilisateur et le chatbot.

    5. Gemini, au demeurant, ne fait ici que relayer le discours public des plateformes,

      En dehors du fantastique "que sauver de Marx?", le texte de gemini laisse un peu le lecteur sur sa faim, car d'une très grande platitude. Il serait intéressant de donner au lecteur pour le paragraphe 18, des citations peut-être plus concrètes du discours public des plateformes (il y a beaucoup de textes de ce type chez Sam Altman, par exemple). Il y a certes la référence au blog de microsoft, mais pas tellement plus.

    6. Gemini m’a menti. IA, valeur-travail et contradictions du capitalisme

      Cher auteur -- les remarques ci-dessous sont faites au fur et à mesure de ma lecture de votre article. Il peut donc y avoir des interrogations auxquelles vous répondez plus tard dans l'article.

    1. Synthèse sur la Maltraitance Infantile : Thèmes, Intervenants et Cas d'Étude

      Synthèse Exécutive

      Ce document de synthèse analyse les thèmes centraux, les dynamiques et les conséquences de la maltraitance infantile, en se basant sur une série d'études de cas et d'interventions d'experts.

      L'analyse révèle que la maltraitance est un phénomène polymorphe, incluant la violence physique extrême, le syndrome du bébé secoué, les abus sexuels et les négligences graves.

      Une conclusion alarmante émerge : dans la majorité des cas (neuf sur dix), les sévices sont infligés au sein même de la cellule familiale, transformant le lieu de sécurité supposé en principal foyer de danger.

      Le silence des victimes, la complicité passive ou active de certains membres de la famille et l'aveuglement de l'entourage constituent des obstacles majeurs à la protection des enfants.

      La chaîne d'intervention, bien que complexe, est clairement définie : elle commence par une alerte (via le 119 ou un signalement médical), se poursuit par une enquête policière (Brigade des Mineurs), aboutit à une réponse judiciaire (Procureur, Juge des enfants) et se conclut par une prise en charge spécialisée (placement, suivi psychologique).

      Les séquelles de la maltraitance sont profondes et durables, affectant les victimes sur les plans physique, psychologique et comportemental.

      Néanmoins, les témoignages de résilience, illustrés par des parcours de reconstruction personnelle et la recréation de liens affectifs, soulignent que la guérison, bien que longue et ardue, reste possible grâce à un soutien adéquat et continu.

      --------------------------------------------------------------------------------

      1. Les Multiples Visages de la Maltraitance

      La maltraitance infantile se manifeste sous diverses formes, souvent cumulatives, allant des sévices physiques aux abus psychologiques et sexuels.

      Violences Physiques et Sévices Graves

      La violence physique est la forme la plus visible de la maltraitance. Les statistiques présentées sont alarmantes : chaque semaine en France, trois enfants meurent des suites de mauvais traitements.

      Le cas de Gaël : Adolescent de 15 ans, il a été victime de violences extrêmes de la part de son père pendant 21 mois, à l'âge de 8 ans. Ses témoignages décrivent un calvaire :

      ◦ Brûlures de cigarettes.  

      ◦ Coups portés avec divers objets : manche à balai, fourchette à poulet, bouteille de verre, assiette, pare-chocs de voiture.  

      ◦ Tentative de noyade dans la baignoire.  

      ◦ Humiliations extrêmes, comme être forcé à manger des excréments de chien.

      Son père a été condamné à 14 ans de prison ferme.

      Gaël utilise aujourd'hui la boxe à haut niveau pour "dégager la haine" et se reconstruire.

      Le cas de Dylan : Enfant de 4 ans décédé en 2003, son corps présentait d'innombrables traces de coups, de morsures et de brûlures de cigarettes, infligées par son beau-père.

      Il était devenu son "souffre-douleur" depuis plusieurs mois.

      Le Syndrome du Bébé Secoué

      Une forme de violence spécifique aux nourrissons est mise en évidence : le syndrome du bébé secoué.

      Mécanisme : Le Dr Philippe Meyer explique qu'il ne s'agit pas d'un simple jeu, mais de "mouvements très répétitifs" et "extrêmement violents".

      La tête du bébé, très lourde et mal soutenue par les muscles du cou, subit des accélérations et décélérations qui provoquent des hémorragies cérébrales (hématome sous-dural).

      Prévalence : L'hôpital Necker reçoit plus de 50 bébés par an présentant ces symptômes.

      Conséquences : Les séquelles peuvent être irréversibles, et un bébé secoué sur dix en meurt.

      Cas étudiés :

      Louis (6 jours) : Arrivé pour un hématome sous-dural, son cas est d'autant plus suspect que son frère est décédé cinq ans plus tôt dans des circonstances similaires, conduisant les médecins à faire un signalement au procureur.  

      Willy (3,5 mois) : Admis pour le même symptôme, son père avoue lui avoir porté un coup lors d'une dispute.

      Il reconnaît son geste : "J'ai craqué [...] j'ai fait ces gestes là j'ai regretté".

      Abus Sexuels

      Les abus sexuels, souvent perpétrés par des proches, sont une autre facette de la maltraitance.

      Le cas d'Elena (7 ans) : La fillette se plaint d'avoir été touchée par Yvon, l'ami de sa grand-mère.

      L'enquête de la Brigade des Mineurs révèle que l'agresseur présumé a déjà des antécédents pour "agression sexuelle sur mineur" en 1998. Confronté, il avoue les faits.

      Le cas d'Estelle : Violée de 2 à 12 ans par son grand-père maternel, elle n'a osé en parler que dix ans plus tard.

      Son parcours illustre la difficulté de la révélation et le poids de la culpabilité et de la honte, qui se sont traduits par des conduites à risque (tentatives de suicide, drogue) à l'adolescence.

      Négligences Graves et Violences Psychologiques

      La maltraitance ne se limite pas aux actes de commission.

      Le placement d'un enfant de 8 ans : La Brigade des Mineurs intervient pour retirer un enfant de sa famille suite à des "graves négligences". L'enfant n'est pas scolarisé et les services sociaux n'ont plus accès à la famille.

      Le cas de Marie : Placée à 15 ans, elle a fui une famille où, au-delà des violences physiques, régnait une "violence psychologique permanente". Elle témoigne : "chaque fois je faisais quelque chose ma mère me disait que ça allait pas tout le temps tout le temps". Cette emprise psychologique l'a conduite à des pensées suicidaires.

      2. L'Environnement Familial : Principal Foyer de Danger

      Le documentaire souligne de manière récurrente que le danger provient le plus souvent de l'entourage immédiat de l'enfant.

      La Responsabilité des Auteurs et le Silence Complice

      Les auteurs des violences sont les parents, beaux-parents ou des proches. Le silence d'un des parents peut être assimilé à une forme de complicité.

      Le cas d'Adeline, mère de Dylan : Elle est jugée pour ne pas avoir dénoncé les violences infligées par son compagnon à son fils.

      Elle a retiré Dylan de l'école pour cacher ses blessures. Son procès en appel aboutit à une peine alourdie à 20 ans de réclusion criminelle.

      Pour l'avocat du père de Dylan, son comportement n'était pas un simple silence mais une "dissimulation" active des faits, court-circuitant toute aide possible.

      L'Aveuglement et la Culpabilité de l'Entourage

      L'entourage élargi peine souvent à percevoir ou à admettre la réalité de la maltraitance, ce qui engendre une profonde culpabilité a posteriori.

      L'entourage de Gaël : La mère de Gaël, Carole, a lutté seule pendant deux ans pour récupérer son fils, séquestré par son ex-mari.

      Les grands-parents expriment leur regret : "Carole disait toujours mon enfant est en danger et nous autour d'elle, on le croyait pas [...] on ne peut pas imaginer qu'on s'est rendu compte de rien." Ils avouent même avoir pensé qu'elle "amplifiait la chose".

      L'indifférence du voisinage : Gaël raconte avoir dormi en slip sur le toit du garage, visible par des centaines de personnes, y compris les parents et enfants de l'école voisine. "Personne a jugé bon de signaler qu'il y avait un souci, c'est inadmissible."

      3. La Chaîne d'Intervention : Du Signalement à la Protection

      Le processus de prise en charge d'un enfant en danger implique une succession d'acteurs institutionnels.

      | Étape | Acteurs Clés | Actions et Observations | | --- | --- | --- | | L'Alerte | Ligne 119, entourage, écoles, médecins | Le service du 119 reçoit plus de 4000 appels par jour. L'alerte est le point de départ crucial qui déclenche l'intervention. | | Le Diagnostic Médical | Médecins hospitaliers (pédiatres, réanimateurs) | Ils sont en première ligne pour détecter les signes physiques (hématomes, fractures). Leur rôle est de soigner mais aussi de signaler les suspicions aux autorités judiciaires, comme dans le cas de Louis. | | L'Enquête Policière | Brigade de Protection des Mineurs | Les policiers mènent des auditions et des interrogatoires pour établir les faits. Leur travail consiste à démêler le vrai du faux face aux dénégations initiales des parents (cas du bébé secoué) ou à obtenir les aveux (cas d'Elena). | | La Réponse Judiciaire | Procureur de la République, Juge des enfants | Le procureur décide des suites à donner (mise en examen, contrôle judiciaire, procès). Le juge des enfants prend les mesures de protection nécessaires (enquête sociale, placement) et évalue la sécurité de l'enfant dans son milieu familial (cas d'Elena et de Marie). | | Le Placement et le Soin | Foyers, pouponnières, éducateurs spécialisés, pédopsychiatres | Lorsque le danger est avéré, les enfants sont retirés de leur famille et placés dans des structures spécialisées. Le placement est souvent un traumatisme, comme le montre l'intervention forcée pour l'enfant de 8 ans. Le soin vise à "réparer" les traumatismes (cas de Roxane et Charlotte à la pouponnière). |

      4. Les Séquelles et le Chemin de la Reconstruction

      Les conséquences de la maltraitance sont profondes et nécessitent un travail de reconstruction de longue haleine.

      Traumatismes Physiques et Psychologiques

      Séquelles physiques : Gaël conserve de multiples cicatrices de ses blessures.

      Séquelles psychologiques : Me Brun Meyrin, l'avocate de Gaël, souligne : "Il a surtout des séquelles morales dont on se demande bien comment elles pourraient ne pas avoir de conséquences dans son futur."

      La psychologue Martine Nisse explique que la communication paradoxale dans les familles maltraitantes ("c'est pour ton bien que je te frappe") rend les enfants "difficiles à comprendre".

      Comportements post-traumatiques : Les enfants placés en pouponnière manifestent des troubles du comportement :

      Roxane, exposée à la violence, développe de l'agressivité et des difficultés relationnelles ; Charlotte, bébé secoué, a appris à "éviter la relation" en se protégeant du contact physique.

      La Thérapie comme Voie de Guérison

      Le suivi psychologique est essentiel pour surmonter le traumatisme.

      Le cas d'Estelle : Après quatre ans de thérapie, elle a pu mettre des mots sur l'inceste subi et déconstruire le sentiment de culpabilité. "Là j'ai compris vraiment que j'y étais pour rien [...] la honte elle reste mais elle s'estompe."

      L'importance de la parole : L'éducateur de Marie souligne que "le fait qu'il y ait une intervention du commissariat [...] n'a pas réglé les problèmes". Il a fallu deux ans et demi pour qu'elle arrive progressivement à "prendre en main sa vie".

      La Résilience et la Reconstruction des Liens

      Malgré la gravité des faits, des parcours de résilience sont possibles.

      Gaël : La boxe lui sert d'exutoire et il retisse un lien fort avec sa mère, Carole. Il parvient à formuler : "Grâce à ma mère, je suis là."

      Marie : Bien qu'inquiète de sa majorité, elle demande à la juge de continuer à la protéger, montrant sa volonté de se construire un avenir stable.

      Cindy, mère de Roxane : En désintoxication et séparée de son conjoint violent, elle s'engage dans un processus pour recréer un lien avec ses enfants et espère pouvoir un jour les récupérer.

      5. Citations Clés

      Gaël, victime de violences paternelles : "C'est pour pouvoir me défendre, c'est pour pouvoir dégager la haine que j'ai sur lui."

      Père de Willy, auteur de secouement : "J'ai craqué [...] je pardonne pas parce qu'on fait pas ça à un bébé mais je sais que ça peut arriver à n'importe qui."

      Grand-père de Gaël, sur sa culpabilité : "Carole disait toujours mon enfant est en danger et nous autour d'elle, on le croyait pas [...] je m'imaginais jamais ce qui se passe."

      Maître Bejo, avocat du père de Dylan : "On n'est pas dans le silence, on est dans un comportement actif de dissimulation des faits et c'est ce comportement actif qui a court-circuité toutes les velléités d'intervention."

      Dr Renier, pionnier sur le syndrome du bébé secoué : "Ce qui fait la différence entre un bien-traitant pour un bébé et un non bien-traitant [...] c'est la maîtrise et la maîtrise elle est indispensable en toutes circonstances."

      Françoise Achard, médecin scolaire, aux enseignants : "On sait que tout le monde peut être maltraitant, c'est-à-dire que ces parents qui avaient l'air bien sympathiques, et ben ça veut pas dire pour autant qu'ils soient pas maltraitants dans l'intimité de leur maison."

      Martine Nisse, psychologue : "Je crois que les principaux sévices c'est la famille, c'est le principal danger pour l'enfant."

      Carole, mère de Gaël : "On essaie de récupérer mais on récupérera jamais ces années, c'est des années qui vont nous manquer toujours."

    1. Analyse de l'Avis du CESE sur les Temps de Vie de l'Enfant

      Résumé Exécutif

      Cet avis du Conseil économique, social et environnemental (CESE), intitulé « Satisfaire les besoins fondamentaux des enfants et garantir leurs droits », dresse un constat critique de la situation des enfants en France, dont les temps de vie sont davantage structurés par les contraintes des adultes que par leurs propres besoins fondamentaux.

      Fruit d'une saisine gouvernementale faisant suite à une Convention citoyenne, le rapport souligne un décalage majeur entre les droits constitutionnels et internationaux de l'enfant et leur application effective, particulièrement pour les plus vulnérables.

      Les principales conclusions révèlent des inégalités sociales, territoriales et économiques profondes qui entravent le développement, la santé et le bien-être des enfants.

      L'avis pointe du doigt des rythmes scolaires inadaptés, une sédentarité croissante, un manque de sommeil chronique, une surexposition aux écrans, et une déconnexion préoccupante de la nature.

      La pression sur les familles, notamment monoparentales, et le manque de coordination entre les acteurs éducatifs aggravent ces constats.

      Pour y remédier, le CESE formule 19 préconisations interdépendantes visant une transformation systémique. Celles-ci incluent des mesures politiques fortes comme l'instauration d'une « clause impact enfance » dans chaque projet de loi, une réforme ambitieuse des rythmes scolaires sur la base des besoins physiologiques, et la création d'un Service Public de la Continuité Éducative (SPCE) pour assurer une meilleure coordination des acteurs.

      L'avis appelle également à renforcer le soutien à la parentalité, à garantir l'accès de tous les enfants aux loisirs, à la culture et aux activités de plein air, et à allouer des financements publics pérennes pour faire de l'enfance un véritable investissement d'avenir.

      Introduction et Contexte

      En réponse à une saisine du Premier ministre de mai 2025, le CESE a élaboré cet avis suite aux travaux d'une Convention citoyenne dédiée aux temps de vie des enfants. Cent trente-trois citoyens et un panel de vingt enfants et adolescents ont été invités à répondre à la question :

      « Comment mieux structurer les différents temps de la vie quotidienne des enfants afin qu’ils soient plus favorables à leurs apprentissages, à leur développement et à leur santé ? ».

      Le constat principal de la Convention citoyenne, repris par le CESE, est que les enfants subissent les rythmes effrénés d'une société qui construit leurs temps autour des contraintes des adultes plutôt qu'en réponse à leurs besoins biologiques et de développement.

      Le rapport du CESE, s'appuyant sur les 20 propositions citoyennes, formule 19 préconisations qui constituent une position commune de la société civile organisée.

      Cet avis s'inscrit dans la continuité de travaux antérieurs du CESE sur l'éducation, la protection de l'enfance et la santé mentale, et vise à proposer des réponses globales et articulées.

      Partie 1 : Droits et Besoins Fondamentaux de l'Enfant : Un Constat Alarmant

      A. L'Écart entre Droits Reconnus et Réalité Vécue

      La France a consacré les droits de l'enfant dans sa Constitution et a ratifié la Convention Internationale des Droits de l'Enfant (CIDE) en 1990, s'engageant sur quatre principes fondamentaux : le droit à la vie, l'intérêt supérieur de l'enfant, la non-discrimination et le respect de son opinion.

      Cependant, l'avis du CESE met en lumière une ineffectivité préoccupante de ces droits pour une part significative des enfants.

      Pauvreté et Précarité : En 2023, 21,9 % des enfants de moins de 18 ans vivent sous le seuil de pauvreté monétaire.

      À la rentrée 2025, au moins 2 159 enfants se sont retrouvés sans solution d'hébergement.

      Ces réalités percutent violemment la capacité de la société à répondre à leurs besoins fondamentaux.

      Critiques Internationales : Le Comité des droits de l'enfant de l'ONU a enjoint la France en 2023 à prendre des mesures urgentes concernant la violence, la protection de l'enfance, la détention d'enfants étrangers, la pauvreté et l'inclusion des enfants en situation de handicap.

      L'« Infantisme » : Le rapport dénonce la persistance de l'« infantisme », un concept désignant les préjugés et la discrimination fondée sur l'âge, qui considère les enfants comme des êtres inférieurs et moins dignes de respect.

      Cette culture conduit à ignorer leur parole et leur capacité à être des acteurs sociaux. Pour le combattre, le CESE réaffirme la nécessité d'un débat de société et la création d'un Code de l'enfance.

      Clause « Impact Enfance » : S'inspirant de la « clause impact jeunesse », le CESE préconise (Préconisation #1) d'intégrer un volet enfance dans chaque étude d'impact des projets de loi afin de s'assurer que toute politique publique soit fondée sur le respect des droits de l'enfant.

      B. Le Rôle de la Famille et les Obstacles Socio-économiques

      La famille est le premier lieu de développement de l'enfant, mais elle fait face à de nombreux obstacles.

      Soutien à la Parentalité : Face à la diversité des modèles familiaux (nucléaire, monoparentale, recomposée...), un soutien renforcé à la parentalité est jugé nécessaire pour aider les parents à répondre aux besoins de leurs enfants (Préconisation #7).

      Inégalités de Genre : Les femmes continuent d'assumer l'essentiel des responsabilités familiales et de la charge mentale, ce qui impacte leur santé et leur carrière.

      Le rapport souligne la nécessité d'une répartition équitable des tâches.

      Conciliation Vie Professionnelle/Familiale : Les contraintes professionnelles empiètent sur le temps familial.

      Le CESE préconise (Préconisation #2) la transposition complète de la directive européenne sur l'équilibre vie professionnelle-vie personnelle, en créant un droit à des « formules souples de travail » (aménagement du temps, télétravail) négocié dans les branches et la fonction publique.

      Enfants Séparés de leur Famille :

      Parents séparés : Il est crucial de soutenir les dispositifs comme les Espaces de rencontre pour préserver la relation parent-enfant tout en prenant en compte le point de vue de l'enfant (Préconisation #3).   

      Aide Sociale à l'Enfance (ASE) : L'avis dénonce une crise systémique de la protection de l'enfance, où les droits des enfants confiés, notamment l'accès aux loisirs et à la culture, sont négligés.

      Il est préconisé (Préconisation #4) que le Projet Pour l'Enfant (PPE) soit co-construit avec les parents et l'enfant, et qu'il intègre l'ensemble de ses besoins.

      Partie 2 : Les Enjeux des Temps et des Espaces de Vie

      L'avis analyse en profondeur la manière dont les temps et les espaces de l'enfant sont organisés, révélant de multiples fractures et inadéquations.

      A. Les Temps de Vie : Entre Contraintes et Qualité

      La vie de l'enfant est rythmée par trois grands temps : familial, scolaire, et les "tiers temps" (périscolaire, extrascolaire).

      Qualité des Temps : Le rapport insiste sur la nécessité d'un équilibre entre temps contraints et temps libre, temps individuel et collectif, activité et repos.

      La qualité des interactions avec les adultes et un environnement sécurisant sont déterminants.

      Le CESE préconise (Préconisation #6) d'intégrer des temps libres de qualité dans toutes les activités d'apprentissage.

      Le Temps Scolaire : La France se distingue par des journées scolaires longues et un temps d'instruction élevé, sans que cela se traduise par de meilleurs résultats.

      Le rythme de la semaine de quatre jours est jugé contraire aux besoins des enfants. Le CESE estime que le statu quo n'est plus tenable et appelle (Préconisation #8) à une évolution des rythmes scolaires :

      Premier degré : Réorganiser la journée et la semaine scolaire après concertation.   

      Second degré : Adapter les amplitudes horaires aux besoins physiologiques des jeunes (ex: commencer plus tard).   

      Calendrier scolaire : Organiser le calendrier hexagonal autour de deux zones de vacances, avec une alternance de 7 semaines de cours et 2 semaines de vacances.

      Les Tiers Temps et le Droit aux Loisirs : Les activités périscolaires et extrascolaires, portées par les associations et les collectivités, sont essentielles mais menacées par le désengagement de l'État et la marchandisation.

      L'accès à ces activités, ainsi qu'aux vacances, est fortement marqué par les inégalités sociales.

      Un enfant sur deux ne part pas en vacances. Le CESE réaffirme (Préconisation #9) que chaque enfant a droit aux vacances et aux loisirs, et appelle à renforcer le financement des accueils collectifs de mineurs et l'information sur les aides existantes.

      B. Les Espaces de Vie : De l'« Enfant d'Intérieur » à la Reconnexion au Dehors

      L'environnement physique joue un rôle crucial dans le développement de l'enfant.

      L'« Enfant d'Intérieur » : Le rapport alerte sur le phénomène des « enfants d'intérieur », qui passent de moins en moins de temps à l'extérieur et en contact avec la nature, en raison de la peur du risque, de l'urbanisation centrée sur la voiture et de l'attrait des écrans.

      Repenser l'Aménagement : Il est impératif de repenser l'aménagement des territoires « à hauteur d'enfant », en créant des espaces publics (rues, places) sécurisés, propices au jeu, à la socialisation et aux mobilités douces.

      Le CESE préconise (Préconisation #11) d'associer les enfants à l'élaboration des projets d'urbanisme.

      Le Bâti et le Cadre de Vie : Les bâtiments accueillant des enfants (écoles, centres de loisirs) sont souvent inadaptés, notamment face aux enjeux climatiques (vagues de chaleur).

      Leur rénovation écologique et leur accessibilité sont des priorités. Toute rénovation doit faire l'objet d'une concertation incluant les enfants et les jeunes (Préconisation #12).

      Partie 3 : Leviers d'Action pour la Santé et le Bien-être

      L'avis identifie quatre domaines d'action prioritaires pour améliorer la santé physique et mentale des enfants.

      Reconnecter à la Nature : Le contact avec la nature est fondamental pour la santé.

      Le CESE appelle à valoriser et accompagner l'éducation au dehors (Préconisation #10) et à garantir que chaque enfant bénéficie d'un accès à des espaces naturels, de sorties régulières et d'au moins un séjour en classe de découverte par cycle scolaire (Préconisation #13).

      Lutter contre le Manque de Sommeil : Le déficit de sommeil touche plus de 30 % des enfants et 70 % des adolescents, avec des conséquences graves sur l'apprentissage et la santé.

      Le CESE demande une campagne nationale de sensibilisation (Préconisation #14) et la garantie de temps de repos et de sieste dans toutes les structures, notamment en maternelle (Préconisation #15).

      Favoriser l'Activité Physique : Face à une sédentarité alarmante, il est crucial de faciliter l'accès au sport pour tous. Le CESE préconise (Préconisation #16) une tarification sociale et l'élargissement du dispositif Pass'Sport, récemment restreint.

      Mieux Réguler les Écrans : L'omniprésence des écrans a des effets néfastes documentés (sommeil, sédentarité, exposition à des contenus inappropriés). L'avis souligne la nécessité d'une meilleure régulation et d'un accompagnement à la parentalité numérique.

      Partie 4 : Gouvernance, Coordination et Financement

      Pour que ces changements soient effectifs, une transformation de la gouvernance des politiques de l'enfance est indispensable.

      Coordination des Acteurs : L'action publique est jugée trop fragmentée. Le CESE préconise (Préconisation #17) de réhabiliter le Projet Éducatif Territorial (PEDT) et d'en faire le volet éducation des Conventions Territoriales Globales (CTG) pour assurer une coordination efficace au niveau local.

      Un Service Public de la Continuité Éducative (SPCE) : Pour garantir une offre éducative cohérente sur tous les temps de l'enfant, l'avis propose la création d'un SPCE (Préconisation #18).

      Ce service, confié aux collectivités locales, serait chargé de diagnostiquer les besoins et de planifier les actions en associant tous les acteurs.

      Formation et Financement : La revalorisation des métiers éducatifs et le développement d'une culture commune des droits de l'enfant sont essentiels.

      Enfin, le CESE alerte sur l'insuffisance des budgets alloués aux politiques de l'enfance et appelle (Préconisation #19) à un effort budgétaire conséquent et pérenne de l'État et de la Sécurité sociale, considérant ces dépenses comme un investissement fondamental pour l'avenir.

      Synthèse des 19 Préconisations du CESE

      | Numéro | Thème Principal | Résumé de la Préconisation | | --- | --- | --- | | #1 | Droits de l'enfant | Mettre en œuvre une « clause impact enfance » dans chaque étude d'impact de projet de loi ou de texte réglementaire pour garantir que les politiques publiques respectent les droits de l'enfant. | | #2 | Parentalité & Travail | Créer un droit aux « formules souples de travail » (aménagement du temps, télétravail) pour les parents, par la négociation dans les branches et la fonction publique. | | #3 | Séparation parentale | Développer et soutenir financièrement les Espaces de rencontre pour aider les parents séparés à assumer leurs responsabilités parentales en prenant en compte le point de vue de l'enfant. | | #4 | Protection de l'enfance (ASE) | Rendre le Projet pour l'enfant (PPE) systématiquement co-construit avec les parents et l'enfant, et y intégrer tous les besoins, y compris les loisirs et la culture. Simplifier la gestion des actes usuels. | | #5 | Accès à la culture | Soutenir financièrement et développer tous les dispositifs culturels et artistiques pour les enfants (scolaires, ACM), via des contrats multipartites (État, collectivités, réseau culturel). | | #6 | Qualité des temps | Intégrer des temps libres de qualité dans les activités d'apprentissage, ce qui implique de former les adultes et personnels encadrants. | | #7 | Soutien à la parentalité | Mieux faire connaître, rendre accessibles et valoriser financièrement les lieux et actions d'aide aux parents (maisons des familles, groupes de parole, LAEP, PMI...). | | #8 | Rythmes scolaires | Faire évoluer les rythmes scolaires : réorganiser la journée et la semaine au primaire ; adapter les horaires aux besoins physiologiques au secondaire ; organiser un calendrier national à 2 zones (7 semaines de cours / 2 de vacances). | | #9 | Droit aux vacances et loisirs | Mobiliser les pouvoirs publics pour rendre effectif le droit aux vacances. Renforcer l'information sur les aides et financer davantage les accueils collectifs de mineurs (ACM). | | #10 | Éducation à la nature | Valoriser et accompagner l'éducation au dehors et en lien avec la nature (formation des acteurs, verdissement des espaces, aires éducatives, terrains d'aventure...). | | #11 | Aménagement du territoire | Aménager les territoires « à hauteur d'enfant » dans une démarche participative, en repensant les espaces publics comme lieux de sociabilité, de mixité et de jeu. | | #12 | Bâti et cadre de vie | Rendre obligatoire la concertation avec les enfants et les jeunes pour tout projet d'aménagement ou de rénovation de bâtiments (écoles, centres de loisirs, gymnases...). | | #13 | Lien à la nature | Garantir que chaque enfant bénéficie d'un accès à des espaces naturels, de sorties régulières, et d'au moins un séjour en classe de découverte par cycle de scolarité. | | #14 | Sommeil | Organiser une campagne nationale d'information et de sensibilisation sur le rôle fondamental du sommeil et les facteurs qui lui nuisent. | | #15 | Temps de repos | Prévoir des temps de repos, de calme et de sieste (préservée en maternelle) dans toutes les structures accueillant des enfants, et repenser les locaux pour créer une atmosphère paisible. | | #16 | Activité physique et sportive | Soutenir une tarification sociale pour l'accès au sport. Étendre et revaloriser le Pass'Sport, en y incluant les associations sportives scolaires. | | #17 | Coordination locale | Réhabiliter le Projet Éducatif Territorial (PEDT) et en faire le volet "éducation" des Conventions Territoriales Globales (CTG) pour une coordination globale des acteurs. | | #18 | Gouvernance | Créer un Service Public de la Continuité Éducative (SPCE), confié aux collectivités, pour diagnostiquer les besoins et planifier les actions éducatives sur le territoire. | | #19 | Financement | Assurer un effort budgétaire conséquent et pérenne de l'État et de la Sécurité sociale pour financer les politiques publiques en faveur de l'enfance. |

    1. Synthèse de l'Étude sur la Protection des Mineurs en Ligne

      Synthèse Exécutive

      Cette étude, menée par l'Arcom en septembre 2025, révèle que les plateformes numériques sont devenues un pilier central et inévitable de la vie des adolescents de 11 à 17 ans, avec des implications majeures en matière d'exposition aux risques et d'efficacité des mesures de protection.

      L'accès à ces services est quasi universel, de plus en plus précoce, et se fait souvent en contournant les restrictions d'âge conçues pour protéger les plus jeunes.

      Les principaux points à retenir sont les suivants :

      Usage quasi universel et intensif : 99 % des 11-17 ans utilisent au moins une plateforme en ligne, et 83 % fréquentent quotidiennement une très grande plateforme (VLOP).

      En moyenne, les adolescents utilisent 3,6 plateformes différentes chaque jour, motivés principalement par le besoin de lien social, de divertissement et d'accès à l'information.

      Contournement systématique des restrictions d'âge : L'âge moyen de la première utilisation des réseaux sociaux est de 12,3 ans, bien en deçà du seuil légal de 13 ans.

      Une part significative (62 %) des adolescents reconnaît avoir menti sur son âge lors de l'inscription, principalement pour accéder à des services pour lesquels ils n'avaient pas l'âge requis (65 %).

      Cette tendance à une inscription précoce s'accentue chez les plus jeunes générations.

      Faiblesse des mécanismes de vérification : Les systèmes de vérification d'âge des plateformes s'avèrent largement inefficaces.

      Seulement 18 % des mineurs déclarent avoir déjà dû prouver leur âge ou avoir vu leur compte bloqué.

      Les observations techniques montrent que le contournement des blocages à l'inscription est souvent simple, notamment sur des plateformes majeures comme Instagram, Snapchat et Facebook.

      Encadrement parental ambivalent et contourné : Bien que 94 % des foyers instaurent des règles sur l'usage du numérique, près de la moitié des adolescents (45 %) admettent les contourner régulièrement.

      Il existe une perception partagée des risques entre parents et enfants, mais les parents se montrent nettement plus inquiets et moins convaincus des bénéfices des plateformes.

      Perception dichotomique : Les adolescents et leurs parents entretiennent un rapport ambivalent aux plateformes, les considérant à la fois comme des outils d'intégration sociale et de divertissement indispensables, mais aussi comme des sources d'inquiétude et d'exposition à des risques graves.

      1. Contexte et Méthodologie de l'Étude

      Objectifs de l'Étude

      L'étude menée pour l'Arcom vise à dresser un état des lieux complet de la protection des mineurs dans l'univers numérique. Elle s'articule autour de trois axes principaux d'investigation :

      1. L'Exposition : Mesurer le degré de conscience des mineurs face aux risques en ligne et leur exposition réelle.

      2. La Protection : Analyser les moyens de prévention mis en place par les mineurs et leur entourage, ainsi que leurs réactions post-exposition.

      3. Les Attentes : Recueillir les attentes des mineurs, des parents et des professionnels pour une meilleure protection.

      L'objectif est de comprendre les compétences que les adolescents mobilisent pour naviguer en ligne, dans un contexte oscillant entre la conscience des dangers et la prise de risques.

      Approche Méthodologique

      Pour garantir une vision exhaustive, l'étude a été réalisée en quatre volets complémentaires entre novembre 2024 et avril 2025, en partenariat avec Ipsos BVA et OpinionWay.

      | Volet | Type d'étude | Période | Participants et Méthodes | | --- | --- | --- | --- | | 1 | Entretiens préparatoires | Nov - Déc 2024 | Entretiens avec des experts, des représentants de plateformes. | | 2 | Étude qualitative | Fév - Mars 2025 | Entretiens avec des experts (associations, psychologue, pédiatre), 16 entretiens individuels et 4 triades avec des mineurs (11-17 ans). | | 3 | Étude sémiologique et observations | Avril 2025 | Analyse des outils et CGU des plateformes ; simulation de parcours utilisateurs avec 8 profils fictifs ; focus sur les thèmes de la maigreur et du masculinisme. | | 4 | Étude quantitative | Avril 2025 | Questionnaire en ligne auprès de 2 000 mineurs (11-17 ans) et de leurs parents. |

      Le périmètre de l'étude couvre les réseaux sociaux (Snapchat, TikTok, Facebook, Instagram, etc.), les plateformes de partage de vidéos (YouTube, Twitch, etc.) et les messageries instantanées (WhatsApp, Discord, etc.).

      2. L'Usage Incontournable des Plateformes par les Mineurs

      Omniprésence et Intensité d'Usage

      Les plateformes en ligne sont omniprésentes dans la vie des 11-17 ans. L'étude révèle des chiffres qui témoignent d'une adoption quasi totale et d'un usage quotidien intensif.

      99 % des 11-17 ans utilisent au moins une plateforme en ligne.

      83 % utilisent au moins une Très Grande Plateforme en Ligne (VLOP) chaque jour.

      • En moyenne, les adolescents utilisent 3,6 plateformes différentes quotidiennement.

      La ventilation par catégorie de services montre une forte pénétration de tous les types de plateformes.

      | Catégorie de Service | Taux d'Utilisation (11-17 ans) | | --- | --- | | Plateformes de vidéos en ligne | 98 % | | Messageries instantanées | 91 % | | Réseaux sociaux | 88 % | | Jeux en ligne | 87 % | | Sites de rencontres | 15 % |

      YouTube, Snapchat, TikTok et WhatsApp sont les plateformes les plus utilisées au quotidien par plus de la moitié des 11-17 ans. L'usage quotidien des VLOP augmente de manière significative avec l'âge, passant de 62 % chez les 11 ans à 96 % chez les 17 ans.

      Motivations Principales des Adolescents

      Trois motivations majeures expliquent pourquoi les plateformes sont devenues incontournables pour les adolescents.

      1. Le besoin d'appartenance et de lien social : Les plateformes sont perçues comme un vecteur essentiel d'intégration sociale et de communication avec les pairs.

      2. La recherche de divertissement et d'évasion : Les contenus ludiques et humoristiques sont massivement plébiscités pour se détendre et s'évader du quotidien.

      3. L'accès à l'information : Les plateformes servent également de canal d'information pour se tenir au courant de l'actualité et des sujets d'intérêt.

      3. Le Contournement Systématique des Restrictions d'Âge

      Malgré les dispositifs de restriction, l'accès des mineurs aux plateformes est de plus en plus précoce, grâce à des stratégies de contournement généralisées et à une faible application des règles par les services en ligne.

      Précocité de l'Accès

      L'âge de la première utilisation des plateformes se situe bien en dessous des seuils réglementaires.

      • Âge moyen déclaré de la 1ère utilisation :

      11,2 ans pour les plateformes vidéos.    ◦ 12,3 ans pour les réseaux sociaux.

      L'étude met en évidence une tendance à un accès toujours plus précoce : 22 % des jeunes de 11 ans actuels déclarent avoir utilisé les réseaux sociaux pour la première fois à 10 ans ou moins, contre seulement 4 % des jeunes de 17 ans.

      Déclaration d'Âge et Manquements à la Vérification

      Le contournement de l'âge minimum requis est une pratique massive et assumée par les adolescents.

      62 % des adolescents reconnaissent ne pas avoir mis leur vraie date de naissance sur au moins une de leurs inscriptions.

      17 % l'ont fait sur toutes leurs inscriptions.

      La principale raison invoquée est l'impossibilité de s'inscrire autrement :

      65 % n'avaient pas l'âge minimum requis.

      31 % ne voulaient pas donner leurs données personnelles.

      12 % voulaient passer pour plus âgés.

      "Tout le monde peut y aller, parce que quand tu t'inscris, tu as juste à mettre une fausse date de naissance, ils ne la vérifient pas." - Garçon, 15 ans.

      Face à cette pratique, les mesures de contrôle des plateformes apparaissent très limitées :

      • Seulement 18 % des 11-17 ans ont déjà dû prouver leur âge ou ont vu leur compte bloqué.

      • Facebook est la plateforme où les contrôles sont les plus fréquents (12 % des utilisateurs concernés), suivie par TikTok (10 %) et Instagram (7 %).

      Failles Techniques et Contournement à l'Inscription

      Les observations de parcours utilisateurs confirment la facilité avec laquelle les restrictions peuvent être contournées.

      • L'interdiction d'inscription pour les moins de 13 ans n'est pas clairement explicitée lors du processus.

      • Sur Instagram, Snapchat et Facebook, il est possible de contourner un premier refus en modifiant simplement sa date de naissance lors d'une nouvelle tentative.

      • Le contournement est plus complexe sur d'autres plateformes comme TikTok, YouTube ou X, nécessitant des manipulations comme la réinitialisation de l'application ou la création d'une nouvelle adresse mail.

      4. Perceptions Ambivalentes et Encadrement Familial

      Une Perception Dichotomique des Risques et Bénéfices

      Les adolescents et leurs parents partagent une vision ambivalente des plateformes, oscillant entre l'attrait des bénéfices et l'inquiétude face aux risques. Cependant, les parents se montrent systématiquement plus préoccupés et moins convaincus des avantages.

      | Perception des plateformes (% d'accord) | Mineurs | Parents | | --- | --- | --- | | Permettent d’avoir une vie sociale riche | 80 % | 37 % | | Permettent d’accéder à des contenus éducatifs | 76 % | 56 % | | Exposent les mineurs à des risques graves | 77 % | 89 % | | Inquiètent quant à leur impact sur moi / votre enfant | 83 % | 86 % |

      L'Encadrement Parental : Règles et Contournement

      L'encadrement familial est une réalité dans la quasi-totalité des foyers, mais son efficacité est relative.

      94 % des familles ont instauré au moins une règle concernant l'usage du numérique, avec une moyenne de 3,5 règles par foyer.

      • Les règles les plus fréquentes sont l'interdiction du téléphone pendant les repas (63 %) et au coucher (55 %).

      Malgré ce cadre, 45 % des mineurs admettent contourner ces règles régulièrement (8 % "souvent" et 37 % "de temps en temps"). Les adolescents reconnaissent la finalité protectrice de ces règles mais développent des stratégies pour s'y soustraire.

      Utilisation des Comptes Supervisés

      Une majorité de jeunes déclarent utiliser des dispositifs de protection intégrés aux plateformes, mais une part non négligeable ignore leur statut.

      71 % des 11-17 ans déclarent utiliser au moins un compte paramétré pour un adolescent ou supervisé par un adulte.

      • Le taux d'utilisation de ces comptes varie selon les plateformes : 63 % sur Snapchat, 60 % sur TikTok, 58 % sur Instagram et 49 % sur YouTube.

      • Cependant, une part importante des jeunes (par exemple, 26 % sur Instagram) ne savent pas si leur compte est un compte "adulte" ou un compte "ado/supervisé", ce qui questionne la clarté et l'efficacité de ces dispositifs.

    1. 3.7. Recommandations

      Note de synthèse : Usages des réseaux sociaux et santé des adolescents - Analyse et recommandations de l'Anses

      1. Introduction : Contexte et portée de l'expertise de l'Anses

      Face à l'expansion massive des réseaux sociaux numériques et aux préoccupations croissantes concernant leur impact sur la santé, l'Agence nationale de sécurité sanitaire de l’alimentation, de l’environnement et du travail (Anses) s'est autosaisie en septembre 2019 pour évaluer les risques sanitaires encourus par les adolescents.

      Cette démarche répond à un besoin d'expertise scientifique indépendante sur un phénomène socio-culturel majeur qui reconfigure en profondeur les modes de vie et de socialisation des plus jeunes.

      L'expertise, menée par un groupe de travail pluridisciplinaire, s'appuie sur une analyse rigoureuse de plus d'un millier d'études scientifiques internationales.

      Elle se concentre sur la population des 11-17 ans, une période charnière du développement marquée par une vulnérabilité particulière.

      L'objectif est de caractériser les usages, d'identifier les risques avérés et de formuler des recommandations pour protéger la santé de cette population.

      La conclusion centrale de l'Agence est sans équivoque : l'usage des réseaux sociaux numériques a des effets négatifs documentés sur la santé physique et mentale des adolescents.

      Ces effets ne sont pas le fruit du hasard mais découlent en grande partie de la conception même des plateformes. Ils nécessitent une réponse coordonnée et systémique impliquant les pouvoirs publics, les plateformes elles-mêmes, ainsi que les acteurs du monde éducatif et de la santé.

      2. Le Modèle Économique des Plateformes : Un Facteur de Risque Systémique

      Pour évaluer les risques sanitaires des réseaux sociaux, il est indispensable de comprendre leur fonctionnement. Leur conception n'est pas neutre mais répond à des impératifs économiques précis qui constituent le cœur du problème.

      Le modèle économique dominant des grandes plateformes repose sur la monétisation de l'attention et des données des utilisateurs.

      En offrant un accès gratuit à leurs services, ces entreprises transforment l'usager en une source de profit, principalement par la vente d'espaces publicitaires ciblés et l'exploitation de ses données comportementales.

      Ce modèle induit une course à l'engagement maximal. Pour y parvenir, les plateformes intègrent délibérément des mécanismes de captation de l'attention conçus pour influencer le comportement de l'utilisateur, maximiser le temps passé sur le service et, in fine, induire une perte de contrôle. Parmi ces techniques figurent notamment les interfaces persuasives ou trompeuses, qui exploitent des biais cognitifs pour inciter les utilisateurs à réaliser des actions qu'ils ne feraient pas autrement, et le défilement infini, qui élimine les points d'arrêt naturels pour favoriser une consultation prolongée et passive.

      Les adolescents sont particulièrement vulnérables à ces stratégies.

      Leurs capacités de régulation émotionnelle et comportementale étant encore en développement, ils peinent davantage à maîtriser leur temps de connexion.

      De plus, la conception de ces plateformes entre en résonance directe avec leurs aspirations fondamentales : besoin d'interactions sociales avec les pairs, recherche de sensations et construction de l'identité.

      Les réseaux sociaux exploitent ainsi une vulnérabilité psychologique et développementale inhérente à cette période de la vie. Ces mécanismes de conception sont à l'origine de risques sanitaires spécifiques et documentés.

      3. Principaux Risques Sanitaires Identifiés et Populations Vulnérables

      L'expertise de l'Anses établit des corrélations claires et préoccupantes entre l'usage des réseaux sociaux et la détérioration de la santé des jeunes.

      Ces risques ne sont pas des externalités malheureuses, mais des conséquences directes des stratégies de captation de l'attention et d'exploitation des vulnérabilités développementales décrites précédemment. Un constat transversal émerge de l'analyse : les filles constituent une population particulièrement à risque.

      Cette vulnérabilité accrue n'est pas monolithique ; elle résulte d'une confluence de facteurs : un temps d'usage quantitativement supérieur, une orientation vers des plateformes hautement visuelles qui exacerbent la pression sur l'apparence, et une plus grande exposition aux dynamiques de cyberviolence genrée.

      D'autres populations, comme les jeunes LGBTQIA+ ou ceux présentant des troubles préexistants (anxiodépressifs, TDAH), sont également surexposées.

      3.1. Dégradation de la Santé Mentale et de l'Image de Soi

      L'expertise de l'Anses établit que l'usage des réseaux sociaux constitue un facteur contributif aux troubles anxiodépressifs.

      Cette relation est médiée par plusieurs mécanismes psychologiques délétères, tels que la comparaison sociale ascendante, qui génère un sentiment d'insatisfaction ; le FoMO (Fear of Missing Out), qui nourrit une connexion anxiogène ; et le cyberharcèlement.

      L'expertise met en lumière un cercle vicieux : un mal-être initial peut conduire un adolescent à se réfugier dans les réseaux sociaux dans une stratégie d'« escapisme », ce qui renforce paradoxalement ses difficultés psychologiques.

      L'impact sur l'image corporelle est particulièrement prononcé. L'exposition continue à des corps idéalisés, souvent modifiés par des filtres et des retouches, favorise l'insatisfaction corporelle.

      L'expertise identifie cette exposition comme étant corrélée à l’intériorisation des idéaux corporels, l’auto-objectification et la comparaison sociale ascendante, autant de facteurs intermédiaires des troubles des conduites alimentaires.

      L'effet est amplifié par les algorithmes de personnalisation qui créent un effet « silo », enfermant les jeunes vulnérables dans des boucles de contenus délétères (valorisation de la maigreur, automutilation, suicide), banalisant ces comportements et augmentant le risque d'imitation.

      3.2. Altération du Sommeil : Un Médiateur Clé des Troubles de Santé

      L'Anses identifie la perturbation du sommeil comme un effet sanitaire majeur et un médiateur central entre l'usage des réseaux sociaux et la dégradation de la santé mentale.

      L'impact négatif sur le sommeil s'opère via trois mécanismes principaux :

      Réduction de la durée du sommeil : l'augmentation du temps d'écran retarde systématiquement l'heure du coucher.

      Altération de la qualité du sommeil : les contenus et interactions en ligne provoquent une stimulation cognitive et émotionnelle qui entrave l'endormissement et fragmente le sommeil.

      Perturbation du rythme circadien : l'exposition à la lumière bleue des écrans en soirée inhibe la sécrétion de mélatonine, l'hormone de l'endormissement.

      Or, une perturbation chronique du sommeil est elle-même un facteur de risque majeur pour le développement de troubles de santé mentale et de maladies chroniques.

      3.3. Exposition aux Conduites à Risques et aux Cyberviolences

      Les réseaux sociaux agissent comme de puissants vecteurs de conduites à risques.

      Ils contribuent à la normalisation de la consommation de substances psychoactives (alcool, tabac, cannabis) et assurent la propagation virale de défis dangereux (challenges), dont l'attrait repose sur la quête de reconnaissance sociale.

      Le cyberharcèlement est une autre menace centrale. Il prolonge les dynamiques de harcèlement hors ligne, mais son impact est amplifié par des facteurs spécifiques au numérique : l'anonymat (réel ou perçu), la persistance des contenus et l'ampleur de leur diffusion.

      L'expertise souligne que le fait d’appartenir à une communauté LGBTQIA+ est associé à une probabilité plus élevée d’être cybervictime.

      Les conséquences documentées par l'Anses sont graves :

      • Augmentation des symptômes dépressifs

      • Risque accru d'idées suicidaires et de tentatives de suicide

      • Comportements d'automutilation

      • Augmentation de l'usage problématique des réseaux sociaux

      Enfin, l'expertise alerte sur les cyberviolences à caractère sexuel, comme le sexting non consenti ou la coercition numérique. Ces pratiques constituent une nouvelle expression du sexisme, particulièrement risquée pour les filles.

      Ce tableau de risques multifactoriels, systémiquement liés à la conception des plateformes, appelle une réponse stratégique et coordonnée, que l'Anses articule en quatre axes d'intervention.

      4. Axes d'Intervention Stratégiques : Les Recommandations de l'Anses

      L'Anses préconise une approche systémique et coordonnée qui ne fait pas reposer la charge uniquement sur les individus. Les recommandations visent à la fois les plateformes, les pouvoirs publics, les acteurs de l'éducation et la communauté scientifique. L'Agence insiste sur la nécessité d'impliquer les adolescents dans l'élaboration de ces mesures pour garantir leur pertinence et faciliter leur adhésion.

      4.1. Axe 1 : Réguler et Sécuriser l'Environnement Numérique

      Cet axe vise directement les plateformes et les pouvoirs publics, considérant que la responsabilité première incombe aux concepteurs des services. Les recommandations phares incluent :

      Instaurer un cahier des charges technique pour les réseaux sociaux accessibles aux mineurs, afin de garantir un design protecteur.

      Appliquer des mécanismes fiables de vérification de l'âge et du consentement parental.

      Encadrer légalement les interfaces persuasives ou trompeuses et les algorithmes de personnalisation, en s'appuyant sur les dispositions du Digital Services Act (DSA) européen pour interdire les techniques d'influence trompeuse et la diffusion de contenus délétères.

      Imposer un paramétrage par défaut protecteur pour les comptes des mineurs (limitation des notifications, suppression des indicateurs d'activité en ligne).

      Mettre en place des procédures de signalement simples et efficaces pour les contenus problématiques.

      4.2. Axe 2 : Développer une Éducation aux Médias Numériques

      L'éducation est un levier complémentaire indispensable. Pour les parents et adolescents, il s'agit de co-construire des repères de bonnes pratiques et d'alerter sur les pressions sociales spécifiques (stéréotypes de genre, harcèlement).

      Pour le milieu scolaire, l'Anses préconise de renforcer les programmes d'éducation au numérique, de développer l'esprit critique et les compétences socio-émotionnelles, et de promouvoir des espaces de parole entre pairs.

      4.3. Axe 3 : Renforcer la Prévention des Effets sur la Santé

      Une approche de santé publique globale est nécessaire. L'Anses préconise de :

      • Mener des campagnes de sensibilisation sur l'hygiène de vie (sommeil, sédentarité) et l'hygiène numérique (risques liés à l'image de soi, aux images intimes).

      Renforcer la prévention en santé mentale, par la formation des professionnels et l'augmentation des moyens du système de santé et du personnel médical scolaire.

      Lutter activement contre les cyberviolences et toutes les formes de discrimination.

      Développer des alternatives attractives à la socialisation en ligne (infrastructures sportives, culturelles, associatives).

      4.4. Axe 4 : Soutenir la Recherche Scientifique

      Pour combler les lacunes de la recherche, l'Anses recommande de garantir l'accès des chercheurs aux données des plateformes, comme le prévoit le Digital Services Act (DSA) européen, et d'améliorer la méthodologie des études pour mieux objectiver les usages et les effets sanitaires.

      Enfin, l'Agence appelle la communauté scientifique à étudier la pertinence de qualifier l'« usage problématique » des réseaux sociaux comme une addiction comportementale, au même titre que les jeux d’argent et de hasard.

      5. Conclusion Générale

      L'expertise de l'Anses dresse un constat sévère : les effets négatifs documentés des réseaux sociaux sur la santé des adolescents sont étroitement liés aux caractéristiques de conception et au modèle économique des plateformes.

      Le problème n'est donc pas réductible à une simple question de responsabilité individuelle.

      Les stratégies de captation de l'attention sont systémiques et exploitent des vulnérabilités psychologiques propres à l'adolescence.

      Ces constats invalident l'approche de l'autorégulation et démontrent l'urgence d'adopter un cadre de gouvernance robuste pour les réseaux sociaux, à la hauteur des enjeux de santé publique.

      Si l'éducation au numérique et l'accompagnement parental sont des piliers nécessaires, ils demeurent insuffisants face à un problème d'une telle ampleur structurelle.

      L'expertise de l'Anses fournit la base factuelle pour une politique publique plus musclée, engageant la responsabilité des plateformes pour imposer des modifications profondes de leurs services.

      Une vigilance continue s'impose face aux évolutions technologiques rapides, notamment l'intégration de l'intelligence artificielle, qui pourrait démultiplier les risques identifiés.

    2. Synthèse du rapport de l'Anses sur les usages des réseaux sociaux et la santé des adolescents

      Résumé Exécutif

      Ce document synthétise l'avis et le rapport d'expertise collective de l'Agence nationale de sécurité sanitaire de l'alimentation, de l'environnement et du travail (Anses), publiés en décembre 2025, concernant les effets de l'usage des réseaux sociaux numériques sur la santé des adolescents de 11 à 17 ans.

      S'appuyant sur l'analyse de plus d'un millier d'études scientifiques, l'expertise établit un lien clair entre l'utilisation des réseaux sociaux et une augmentation des risques pour la santé mentale et le bien-être des jeunes.

      Les conclusions principales indiquent que le modèle économique des plateformes, fondé sur une "économie de l'attention", induit des conceptions (interfaces persuasives, défilement infini, algorithmes de personnalisation) qui exploitent les vulnérabilités propres à l'adolescence.

      Ces mécanismes favorisent un usage excessif et une perte de contrôle, entraînant des conséquences sanitaires multifactorielles.

      Les principaux effets négatifs identifiés sont :

      Perturbation du sommeil : Réduction de la durée et de la qualité du sommeil, agissant comme un médiateur clé pour d'autres troubles de santé mentale.

      Troubles anxiodépressifs : L'usage des réseaux sociaux est un facteur contributif, notamment via la comparaison sociale, le cyberharcèlement et la "peur de manquer" (FoMO).

      Image corporelle et troubles des conduites alimentaires : L'exposition à des contenus idéalisés renforce l'insatisfaction corporelle, particulièrement chez les filles.

      Conduites à risques : Les plateformes agissent comme des vecteurs pour la normalisation de la consommation de substances, la participation à des défis dangereux et l'exposition aux cyberviolences.

      L'expertise souligne que les filles constituent une population particulièrement à risque, étant plus impactées sur l'ensemble des effets sanitaires étudiés.

      Face à ce constat, l'Anses formule des recommandations structurées autour de quatre axes :

      • une régulation stricte des plateformes pour protéger les mineurs,
      • le renforcement de l'éducation aux médias,
      • des campagnes de prévention en santé publique, et
      • un soutien accru à la recherche pour combler les lacunes de connaissances.

      L'Agence conclut que si l'accompagnement parental et l'éducation sont nécessaires, ils ne peuvent se substituer à un cadre de gouvernance contraignant pour les plateformes, dont la responsabilité dans les impacts sanitaires observés est centrale.

      --------------------------------------------------------------------------------

      1. Contexte et Organisation de l'Expertise

      1.1. Origine et Objectifs

      Face à l'expansion massive des technologies numériques et aux interrogations sur leurs effets sanitaires, l'Anses s'est autosaisie le 12 septembre 2019 pour évaluer les risques liés à leurs usages.

      L'expertise a été spécifiquement focalisée sur les risques pour la santé des adolescents (11-17 ans) liés à l'utilisation des réseaux sociaux numériques, en raison de la vulnérabilité particulière de cette période de la vie.

      Les objectifs de l'expertise étaient de :

      • Caractériser le fonctionnement et les usages des réseaux sociaux.

      • Analyser les spécificités de la population adolescente.

      • Décrire les effets sur la santé de certaines pratiques.

      • Analyser les risques sanitaires globaux.

      • Formuler des recommandations pour protéger la santé des adolescents.

      1.2. Méthodologie

      L'expertise a été menée par le groupe de travail "Effets de l’usage des outils numériques sur la santé des adolescents", créé en septembre 2020, et adoptée par le Comité d’experts spécialisé (CES) "Agents physiques et nouvelles technologies". La démarche s'est appuyée sur :

      Une revue exhaustive de la littérature scientifique académique (plus d'un millier d'articles analysés via les bases de données Scopus et Pubmed entre 2011 et 2021, complétée par des études antérieures et postérieures).

      L'analyse de la littérature grise (rapports institutionnels et associatifs).

      Une analyse du cadre législatif menée par l'Institut de recherche juridique de la Sorbonne.

      1.3. Limites de la Littérature Scientifique

      Le groupe de travail a identifié plusieurs limites aux études disponibles :

      Décalage temporel : De nombreuses études portent sur des réseaux sociaux moins populaires aujourd'hui (ex: Facebook) et peu sur des plateformes plus récentes comme TikTok.

      Mesure de l'utilisation : La plupart des études reposent sur le temps d'utilisation déclaré, une mesure sujette aux biais de mémoire et de désirabilité sociale. Un temps élevé n'est pas suffisant pour qualifier un usage de "préoccupant".

      Hétérogénéité des contextes : Les études proviennent de divers pays, mais les mécanismes d'action des plateformes étant similaires, les résultats ont été jugés transposables.

      Causalité : La majorité des études sont transversales, montrant des liens statistiques mais ne permettant pas d'établir de lien de cause à effet. Les études longitudinales, bien que moins nombreuses, apportent des éléments sur la temporalité des effets.

      2. Le Fonctionnement des Réseaux Sociaux Numériques

      2.1. Définition et Modèle Économique

      En l'absence de définition consensuelle, l'expertise s'est adossée à une conception large, similaire à celle de la loi du 7 juillet 2023 : une plateforme permettant aux utilisateurs de se connecter, communiquer et partager des contenus.

      Le modèle économique des plateformes majeures s'apparente à celui d'une régie publicitaire. La gratuité apparente du service est compensée par la monétisation des données personnelles et de l'attention des utilisateurs.

      Ce modèle incite les plateformes à maximiser le temps passé et l'engagement des utilisateurs.

      2.2. Stratégies de Captation de l'Attention

      Pour maintenir l'engagement, les plateformes déploient des stratégies de conception spécifiques :

      Algorithmes de personnalisation : Ils proposent des contenus visant à retenir l'utilisateur, créant parfois un "effet silo" qui renforce l'exposition à des contenus potentiellement délétères.

      Interfaces trompeuses (ou dark patterns) : Ce sont des mécanismes persuasifs qui exploitent des biais psychologiques pour inciter les utilisateurs à des actions qu'ils ne feraient pas autrement.

      Fonctionnalités incitatives : Le défilement infini, les notifications et les contenus éphémères sont conçus pour inciter à un usage prolongé et induire une perte de contrôle.

      Ces stratégies exploitent les vulnérabilités de l'adolescence : besoin d'interactions sociales, recherche de sensations et capacités de régulation émotionnelle encore limitées.

      3. Usages des Réseaux Sociaux par les Adolescents

      L'expertise distingue l'utilisation (interaction technique), l'usage (intégration sociale et culturelle) et la pratique (routines et savoir-faire). L'analyse se concentre sur les usages, qui sont des phénomènes complexes.

      3.1. État des Lieux

      | Donnée Clé | Valeur | Source / Année | | --- | --- | --- | | Adolescents (12-17 ans) utilisant un smartphone quotidiennement pour aller sur Internet | Près de 90 % | \- | | Adolescents (12-17 ans) passant entre 2 et 5h/jour sur leur smartphone | 42 % | Credoc, 2025 | | Adolescents (12-17 ans) passant plus de 5h/jour sur leur smartphone | 9 % | Credoc, 2025 | | Utilisation quotidienne des réseaux sociaux chez les 12-17 ans (2023) | 53 % | CREDOC, Baromètre du numérique | | Utilisation quotidienne des réseaux sociaux chez les 12-17 ans (2024) | 58 % | CREDOC, Baromètre du numérique |

      Les usages varient selon l'âge, le genre et le milieu social. Les filles consacrent plus de temps aux réseaux sociaux que les garçons, qui privilégient les jeux vidéo.

      3.2. Rôle dans la Socialisation

      Les réseaux sociaux répondent aux aspirations des adolescents (interactions, recherche d'informations auprès des pairs, prise de risques) et participent à l'exploration de leur identité. Ils prolongent et transforment les processus de socialisation, s'inscrivant dans la continuité des dynamiques familiales, scolaires et amicales.

      La sphère familiale peut jouer un rôle de régulation et de ressource, mais les usages configurent aussi un territoire informationnel propre à l'adolescent.

      4. Principaux Effets sur la Santé des Adolescents

      L'expertise révèle des conséquences négatives significatives, avec une prévalence plus marquée chez les filles pour la majorité des effets sanitaires étudiés.

      4.1. Usage Problématique et Addiction

      Le terme "addiction aux réseaux sociaux" n'est pas reconnu dans les classifications internationales (DSM-5R, ICD-11) et fait l'objet de débats. Le rapport opte pour la notion d'"usage problématique", la plus fréquente dans la littérature.

      Les outils de mesure sont hétérogènes mais s'accordent sur deux dimensions caractéristiques d'une addiction :

      • Les répercussions négatives sur la santé et les activités quotidiennes.

      • L'impossibilité de maîtriser le temps passé sur les plateformes (perte de contrôle).

      4.2. Perturbation du Sommeil

      Un impact négatif clair est démontré. Les mécanismes sont :

      Réduction de la durée du sommeil par un retard de l'heure du coucher.

      Stimulation de l'éveil (physiologique, cognitif, émotionnel) qui entrave l'endormissement.

      Exposition à la lumière bleue des écrans le soir, qui inhibe la sécrétion de mélatonine.

      Une perturbation chronique du sommeil est un facteur de risque pour des maladies physiques et mentales, et un médiateur clé entre l'usage des réseaux sociaux et les symptômes anxiodépressifs.

      4.3. Image du Corps et Troubles des Conduites Alimentaires (TCA)

      Certaines pratiques, notamment sur les réseaux "hautement visuels", sont corrélées à :

      L'intériorisation d'idéaux corporels irréalistes.

      La comparaison sociale ascendante (se comparer à des personnes perçues comme plus désirables).

      L'auto-objectification (se percevoir comme un objet à regarder).

      Ces facteurs renforcent l'insatisfaction corporelle et la surveillance de son apparence, particulièrement chez les filles, et constituent des facteurs intermédiaires des TCA.

      Les algorithmes peuvent amplifier l'exposition à des contenus valorisant la maigreur ou la musculature, exacerbant les comportements délétères.

      4.4. Troubles Anxiodépressifs et Idées Suicidaires

      L'usage des réseaux sociaux est identifié comme un facteur contributif aux troubles anxiodépressifs, sans être une cause unique. La relation est complexe et médiée par :

      • L'altération du sommeil.

      • Le cyberharcèlement.

      • La comparaison sociale.

      • Le FoMO (Fear Of Missing Out), qui peut entraîner une perte de contrôle.

      Une spirale délétère est souvent observée : un mal-être initial peut conduire à un usage compulsif des réseaux ("escapisme"), qui à son tour détériore la santé mentale.

      Les algorithmes peuvent enfermer les jeunes en détresse dans des "silos" de contenus négatifs (automutilation, suicide), banalisant ces comportements (effet Werther).

      4.5. Conduites à Risques et Cyberviolences

      Consommation de substances : Les réseaux sociaux participent à la normalisation de la consommation d'alcool, de tabac et de cannabis en exposant les jeunes à des contenus valorisants et en renforçant les normes sociales perçues.

      Défis (challenges) : La recherche de reconnaissance par les pairs peut inciter à la participation à des défis dangereux.

      Cyberharcèlement : Il s'agit souvent d'une extension du harcèlement scolaire, amplifiée par l'anonymat, la persistance des contenus et l'ampleur de leur diffusion.

      La cybervictimation est associée à une augmentation des symptômes dépressifs, des idées suicidaires et de l'automutilation.

      Sexting non consenti : La diffusion d'images intimes sans consentement est une forme de cyberviolence sexuelle aux conséquences graves, en particulier pour les filles.

      4.6. Résultats Scolaires

      L'expertise met en évidence une association négative faible entre le temps passé sur les réseaux sociaux et les résultats scolaires.

      Cependant, les limites méthodologiques des études empêchent de conclure à un lien causal direct. Le multitâche numérique et la perturbation du sommeil sont des facteurs explicatifs probables.

      5. Autres Impacts Soulignés par le Comité d'Experts

      Le CES a rappelé la pertinence d'autres enjeux sanitaires et sociétaux :

      Sédentarité et inactivité physique : Bien que l'usage nomade des smartphones ne soit pas directement synonyme de sédentarité, les longues durées d'utilisation y contribuent probablement.

      Lumière bleue : Les adolescents sont plus sensibles à la lumière bleue des écrans, ce qui augmente le risque de perturbation des rythmes circadiens et, à long terme, de troubles métaboliques ou de santé mentale.

      Impacts environnementaux : Le numérique représente près de 4 % des émissions mondiales de gaz à effet de serre, un chiffre en hausse, notamment à cause du streaming vidéo encouragé par les réseaux sociaux.

      Enjeux démocratiques : Les algorithmes peuvent polariser les opinions, diffuser de la désinformation et manipuler l'information, soulevant des questions majeures pour la construction citoyenne des adolescents.

      6. Recommandations de l'Anses

      Face à ces constats, l'Agence formule des recommandations structurées selon quatre axes d'action complémentaires.

      6.1. Réguler et Sécuriser l'Environnement Numérique

      Imposer un cahier des charges aux plateformes pour qu'elles soient accessibles aux mineurs, incluant des mécanismes fiables de vérification de l'âge.

      Encadrer légalement les interfaces persuasives et les algorithmes de personnalisation pour interdire les techniques d'influence trompeuse et limiter l'amplification de contenus préjudiciables.

      • Instaurer un paramétrage par défaut protecteur pour les comptes des mineurs (limitation des notifications, etc.).

      • Mettre en place des procédures simples et efficaces de signalement et de modération des contenus délétères.

      • Étendre aux réseaux sociaux l'encadrement des publicités prévu pour la télévision.

      6.2. Éduquer aux Médias Numériques

      Fournir des repères de bonnes pratiques aux parents et adolescents, coconstruits avec eux.

      Renforcer l'éducation au numérique à l'école, en formant du personnel dédié et en développant l'esprit critique et les compétences socio-émotionnelles des élèves.

      • Promouvoir des espaces de parole entre pairs pour réfléchir collectivement aux pratiques numériques.

      6.3. Prévenir les Effets sur la Santé

      • Mener des campagnes de santé publique sur l'hygiène de vie (sommeil, activité physique) et l'hygiène numérique (risques liés à l'image de soi, au consentement).

      Renforcer la prévention en santé mentale en formant les professionnels au contact des adolescents et en dotant les systèmes scolaire et de santé de moyens suffisants.

      Intensifier la lutte contre les cyberviolences et toutes les formes de discrimination.

      • Développer des alternatives de socialisation hors ligne (infrastructures sportives, culturelles) adaptées aux adolescents.

      6.4. Soutenir la Recherche

      Garantir l'accès aux données des plateformes pour les chercheurs, comme le prévoit le Digital Services Act (DSA) européen.

      Améliorer la méthodologie des études scientifiques en diversifiant les approches et en développant des outils de mesure plus fiables.

      • Financer la recherche sur des thèmes clés comme le cyberharcèlement, les interfaces trompeuses, les populations vulnérables et l'efficacité des actions de prévention.

      • Étudier la pertinence de qualifier l'usage problématique des réseaux sociaux comme une addiction comportementale.

    1. Synthèse de l'Avis du Conseil d'État sur la Proposition de Loi "Protéger les Mineurs en Ligne"

      1. Contexte et Objectifs de la Proposition de Loi

      Cette proposition de loi a été élaborée en réponse à des constats alarmants concernant les risques auxquels les réseaux sociaux exposent les mineurs.

      Faisant directement suite aux recommandations du rapport de la commission d’enquête sur TikTok, le texte met en lumière les dangers d'addiction et les effets psychologiques néfastes de certaines plateformes sur la santé mentale des jeunes.

      L'objectif principal du législateur est donc de renforcer de manière significative le cadre de protection des mineurs dans l'environnement numérique, en instaurant des mesures contraignantes et préventives.

      Les deux mesures phares de la proposition initiale sont les suivantes :

      Interdiction d'accès pour les moins de 15 ans : Le texte visait à imposer une obligation directe aux fournisseurs de services de réseaux sociaux de refuser l'inscription des mineurs de moins de 15 ans.

      Pour ce faire, les plateformes auraient dû mettre en œuvre des dispositifs de contrôle d'âge robustes, sous peine de sanctions financières et d'injonctions judiciaires.

      Couvre-feu numérique pour les 15-18 ans : Pour cette tranche d'âge, la proposition prévoyait une obligation de désactivation automatique de l'accès aux comptes entre 22 heures et 8 heures du matin, en s'appuyant sur les mêmes solutions techniques de vérification de l'âge.

      En complément de ce dispositif central, le texte comprend plusieurs autres mesures structurantes :

      | Mesure | Objectif Stratégique | | --- | --- | | Lutte contre la publicité pro-suicide | Compléter la liste des contenus illicites pour inclure la propagande en faveur de moyens de se donner la mort. | | Renforcement des peines | Augmenter la durée de suspension des comptes d'accès aux plateformes en cas d'infraction. | | Messages sanitaires | Imposer des informations préventives sur les publicités pour les réseaux sociaux et sur les emballages de smartphones. | | Formation scolaire | Étendre la formation sur l'usage du numérique à la sensibilisation aux enjeux de santé mentale. | | Interdiction des téléphones dans les lycées | Généraliser l'interdiction déjà en vigueur dans les collèges pour favoriser la concentration et prévenir le harcèlement. | | Création d'un délit de négligence parentale | Sanctionner les parents en cas d'usage excessif, inadapté ou non surveillé des outils numériques par leur enfant. |

      L'analyse juridique approfondie du Conseil d'État révèle cependant que, si l'intention est louable, les mécanismes proposés soulèvent des difficultés majeures de compatibilité avec le droit européen et les libertés fondamentales.

      2. Analyse Critique du Conseil d'État : Compatibilité avec le Droit Européen

      La conformité au droit de l'Union européenne est une condition essentielle de la validité de toute loi nationale.

      Le Conseil d'État souligne que le Règlement sur les Services Numériques (DSA) harmonise pleinement les règles pour les plateformes opérant dans l'UE, limitant drastiquement la capacité des États membres à leur imposer des obligations supplémentaires.

      L'avis du Conseil se révèle être une véritable leçon d'ingénierie juridique, démontrant comment atteindre un objectif de politique nationale dans le cadre contraignant d'un droit européen harmonisé.

      Le Conseil d'État met en évidence une incompatibilité juridique frontale : en imposant une obligation directe aux plateformes de refuser l'inscription des mineurs, la proposition de loi initiale violerait le principe d'harmonisation maximale du DSA, rendant la mesure juridiquement fragile et susceptible d'être invalidée.

      Pour surmonter cet obstacle majeur, le Conseil d'État propose une reformulation décisive, qui constitue le pivot de sa stratégie. Au lieu d'obliger les plateformes, la loi doit directement interdire l'accès au mineur : `

      « Il est interdit au mineur de quinze ans d’accéder à un service de réseau social en ligne »`.

      Cet acte de prohibition qualifie automatiquement un tel accès de "contenu illicite" au sens de la définition large du DSA.

      Cette reclassification est la clé de voûte de la stratégie du Conseil : elle permet de mobiliser les puissants mécanismes de régulation du DSA (injonctions de l'Arcom, signalements, sanctions) contre les plateformes sans créer une nouvelle obligation nationale, interdite par le droit européen.

      Le cadre de l'UE devient ainsi le principal outil d'application d'une politique nationale française.

      Pour renforcer l'effectivité de cette interdiction, le Conseil suggère d'ouvrir un second flanc de mise en conformité. Il préconise de prévoir la nullité de plein droit des contrats passés par un mineur en violation de cette interdiction.

      Une telle nullité priverait de base légale tout traitement de ses données personnelles, exposant les plateformes à des contrôles et sanctions de la part de la CNIL au titre du RGPD, ce qui augmente considérablement la pression en faveur du respect de la loi.

      Enfin, le Conseil recommande que la Commission européenne élabore des lignes directrices pour s'assurer que les plateformes gèrent correctement la restitution des contenus et des données aux mineurs dont les comptes sont résiliés, afin de ne pas porter atteinte à leurs droits de propriété intellectuelle.

      Cette refonte juridique est présentée comme une condition sine qua non à la viabilité du texte.

      3. Analyse Critique du Conseil d'État : Équilibre avec les Droits et Libertés Fondamentaux

      Au-delà de la conformité européenne, le Conseil d'État analyse la conciliation entre l'objectif de protection de l'enfance — une exigence constitutionnelle — et le respect des libertés fondamentales du mineur (liberté d'expression, d'information) et des droits des parents.

      Sur ce plan, le Conseil juge le dispositif initial déséquilibré et disproportionné pour trois raisons principales :

      1. Caractère général et absolu : L'interdiction s'appliquerait à tous les "réseaux sociaux" sans distinction, y compris ceux ne présentant aucun risque avéré (plateformes collaboratives, éducatives), ce qui est jugé excessif.

      2. Absence de discernement et de rôle parental : Le mécanisme initial ignore le degré de maturité de l'enfant et écarte totalement les parents de leur rôle d'accompagnement, en contradiction avec le Code civil et la Convention relative aux droits de l’enfant.

      3. Manque de justification du couvre-feu : Les bornes horaires du couvre-feu pour les 15-18 ans (22h-8h) sont jugées insuffisamment documentées et donc disproportionnées.

      Pour rééquilibrer le texte, le Conseil d'État propose une refonte qui incarne un changement de philosophie réglementaire : passer d'une interdiction étatique, brute et centrée sur la plateforme, à un système nuancé, responsabilisant les parents et centré sur le terminal. Ce mécanisme alternatif repose sur deux volets :

      Volet 1 - Interdiction Ciblée Le Gouvernement pourrait, par décret en Conseil d’État pris après avis de l’Arcom, interdire l'accès aux mineurs de moins de 15 ans à des réseaux sociaux spécifiquement identifiés comme dangereux en raison de leurs systèmes de recommandation.

      L'État utilise ici son pouvoir de prohibition de manière ciblée, là où le danger est avéré.

      Volet 2 - Autorisation Parentale Généralisée Pour tous les autres réseaux sociaux, l'accès serait interdit sauf autorisation expresse d'un parent.

      Réalisée via des dispositifs installés sur les systèmes d’exploitation des équipements terminaux distribués par les fournisseurs d’accès à l’internet (à l'instar des mécanismes de contrôle parental existants), cette autorisation serait révocable et pourrait préciser une durée d'usage.

      L'État délègue ici à une autorité parentale guidée le soin d'évaluer le risque.

      Cette approche duale résout le problème de proportionnalité, transformant une interdiction fragile en un système de régulation juridiquement beaucoup plus solide.

      4. Recommandations et Points de Vigilance sur les Autres Articles

      Le Conseil d'État a également examiné les autres articles de la proposition de loi, formulant des recommandations d'ajustement ou des réserves importantes.

      Interdiction des téléphones dans les lycées (Art. 6) : La mesure est jugée nécessaire et proportionnée.

      Le Conseil recommande d'exclure explicitement de son champ les formations de l'enseignement supérieur et de différer son entrée en vigueur à la rentrée scolaire 2026.

      Formation scolaire (Art. 4) : Jugée conforme, la mesure est cependant qualifiée de potentiellement redondante avec des dispositions déjà existantes.

      Une entrée en vigueur différée à la rentrée 2026 est également suggérée pour permettre l'adaptation des enseignants.

      Délit de négligence numérique (Art. 7) : Le Conseil exprime de fortes réserves.

      À titre principal, il estime que le droit pénal existant est suffisant.

      À titre subsidiaire, si le délit était maintenu, ses termes ("usage excessif", "outils numériques") sont jugés trop vagues et contraires au principe constitutionnel de légalité des délits et des peines.

      Publicité et emballages (Art. 3) : Ces dispositions devront être notifiées à la Commission européenne au titre de la directive "TRIS", une étape procédurale cruciale destinée à prévenir la création de barrières techniques inopinées au sein du marché unique.

      Rapport au Parlement (Art. 5) : Il est suggéré de restreindre le champ du rapport pour le concentrer sur le respect par les plateformes de leurs obligations spécifiques envers les mineurs dans le cadre du DSA.

      Ces ajustements visent à garantir la sécurité juridique et l'applicabilité concrète de l'ensemble du texte.

      5. Conclusion : Synthèse Stratégique pour la Décision

      L'avis du Conseil d'État valide sans équivoque la nécessité d'agir face aux dangers documentés que les réseaux sociaux font peser sur les mineurs et reconnaît la pertinence de l'objectif poursuivi par le législateur.

      Cependant, cette validation de l'objectif s'accompagne d'une censure quasi totale du dispositif initialement proposé. Celui-ci est jugé doublement fragile :

      1. Incompatible avec le droit de l'Union européenne, en raison de la violation du principe d'harmonisation maximale du DSA.

      2. Déséquilibré au regard des droits fondamentaux, car l'interdiction générale et le couvre-feu sont jugés disproportionnés et écartent indûment l'autorité parentale.

      En définitive, les amendements du Conseil d'État ne sont pas de simples ajustements.

      Ils constituent une refondation juridique et une véritable feuille de route stratégique et législative offerte au Parlement. Ils transforment un projet juridiquement précaire en une loi conforme, proportionnée et, par conséquent, viable et réellement efficace pour protéger les mineurs dans l'espace numérique.

    1. Reviewer #1 (Public review):

      Summary:

      Here Bansal et al., present a study on the fundamental blood and nectar feeding behaviors of the critical disease vector, Anopheles stephensi. The study encompasses not just the fundamental changes in blood feeding behaviors of the crucially understudied vector, but then use a transcriptomic approach to identify candidate neuromodulation path ways which influence blood feeding behavior in this mosquito species. The authors then provide evidence through RNAi knockdown of candidate pathways that the neuromodulators sNPF and Rya modulate feeding either via their physiological activity in the brain alone or through joint physiological activity along the brain-gut axis (but critically not the gut alone). Overall, I found this study to be built on tractable, well-designed behavioral experiments.

      Their study begins with a well-structured experiment to assess how the feeding behaviors of A. stephensi changes over the course of its life history and in response to its age, mating and oviposition status. The authors are careful and validate their experimental paradigm in the more well-studied Ae. aegypti, and are able to recapitulate the results of prior studies which show that mating is pre-requisite for blood feeding behaviors in Ae. aegypt. Here they find A. stephensi like another Anopheline mosquitoes has a more nuanced regulation of its blood and nectar feeding behaviors.

      The authors then go on to show in a Y- maze olfactometer that to some degree, changes in blood feeding status depend on behavioral modulation to host-cues, and this is not likely to be a simple change to the biting behaviors alone. I was especially struck by the swap in valence of the host-cues for the blood-fed and mated individuals which had not yet oviposited. This indicates that there is a change in behavior that is not simply desensitization to host-cues while navigating in flight, but something much more exciting happening.

      The authors then use a transcriptomic approach to identify candidate genes in the blood feeding stages of the mosquito's life cycle to identify a list of 9 candidates which have a role in regulating the host-seeking status of A. stephensi. Then through investigations of gene knockdown of candidates they identify the dual action of RYa and sNPF and candidate neuromodulators of host-seeking in this species. Overrall, I found the experiments to be well-designed. I found the molecular approach to be sound. While I do not think the molecular approach is necessarily an all-encompassing mechanism identification (owing mostly to the fact that genetic resources are not yet available in A. stephensi as they are in other dipteran models), I think it sets up a rich lines of research questions for the neurobiology of mosquito behavioral plasticity and comparative evolution of neuromodulator action.

      Strengths:

      I am especially impressed by the authors' attention to small details in the course of this article. As I read and evaluated this article I continued to think how many crucial details I may have missed if I were the scientist conducting these experiments. That attention to detail paid off in spades and allowed the authors to carefully tease apart molecular candidates of blood-seeking stages. The authors top down approach to identifying RYamide and sNPF starting from first principles behavioral experiments is especially comprehensive. The results from both the behavioral and molecular target studies will have broad implications for the vectorial capacity of this species and comparative evolution of neural circuit modulation.

      I believe the authors have adequately addressed all of my concerns; however, I think an accompanying figure to match the explained methods of the tissue-specific knockdown would help readers. The methods are now explicitly written for the timing and concentrations required to achieve tissue-specific knockdown, but seeing the data as a supplement would be especially reassuring given the critical nature of tissue-specific knockdown to the final interpretations of this paper.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      Bansal et al. present a study on the fundamental blood and nectar feeding behaviors of the critical disease vector, Anopheles stephensi. The study encompasses not just the fundamental changes in blood feeding behaviors of the crucially understudied vector, but then uses a transcriptomic approach to identify candidate neuromodulation pathways which influence blood feeding behavior in this mosquito species. The authors then provide evidence through RNAi knockdown of candidate pathways that the neuromodulators sNPF and Rya modulate feeding either via their physiological activity in the brain alone or through joint physiological activity along the brain-gut axis (but critically not the gut alone). Overall, I found this study to be built on tractable, well-designed behavioral experiments.

      Their study begins with a well-structured experiment to assess how the feeding behaviors of A. stephensi change over the course of its life history and in response to its age, mating, and oviposition status. The authors are careful and validate their experimental paradigm in the more well-studied Ae. aegypti, and are able to recapitulate the results of prior studies, which show that mating is a prerequisite for blood feeding behaviors in Ae. aegypt. Here they find A. Stephensi, like other Anopheline mosquitoes, has a more nuanced regulation of its blood and nectar feeding behaviors.

      The authors then go on to show in a Y-maze olfactometer that ,to some degree, changes in blood feeding status depend on behavioral modulation to host cues, and this is not likely to be a simple change to the biting behaviors alone. I was especially struck by the swap in valence of the host cues for the blood-fed and mated individuals, which had not yet oviposited. This indicates that there is a change in behavior that is not simply desensitization to host cues while navigating in flight, but something much more exciting is happening.

      The authors then use a transcriptomic approach to identify candidate genes in the blood-feeding stages of the mosquito's life cycle to identify a list of 9 candidates that have a role in regulating the host-seeking status of A. stephensi. Then, through investigations of gene knockdown of candidates, they identify the dual action of RYa and sNPF and candidate neuromodulators of host-seeking in this species. Overall, I found the experiments to be well-designed. I found the molecular approach to be sound. While I do not think the molecular approach is necessarily an all-encompassing mechanism identification (owing mostly to the fact that genetic resources are not yet available in A. stephensi as they are in other dipteran models), I think it sets up a rich line of research questions for the neurobiology of mosquito behavioral plasticity and comparative evolution of neuromodulator action.

      We appreciate the reviewer’s detailed summary of our work. We thank them for their positive comments and agree with them on the shortcomings of our approach.

      Strengths:

      I am especially impressed by the authors' attention to small details in the course of this article. As I read and evaluated this article, I continued to think about how many crucial details could potentially have been missed if this had not been the approach. The attention to detail paid off in spades and allowed the authors to carefully tease apart molecular candidates of blood-seeking stages. The authors' top-down approach to identifying RYamide and sNPF starting from first principles behavioral experiments is especially comprehensive. The results from both the behavioral and molecular target studies will have broad implications for the vectorial capacity of this species and comparative evolution of neural circuit modulation.

      We really appreciate that the reviewer has recognised the attention to detail we have tried to put, thank you!

      Weaknesses:

      There are a few elements of data visualizations and methodological reporting that I found confusing on a first few read-throughs. Figure 1F, for example, was initially confusing as it made it seem as though there were multiple 2-choice assays for each of the conditions. I would recommend removing the "X" marker from the x-axis to indicate the mosquitoes did not feed from either nectar, blood, or neither in order to make it clear that there was one assay in which mosquitoes had access to both food sources, and the data quantify if they took both meals, one meal, or no meals.

      We thank the reviewer for flagging the schematic in figure 1F. As suggested, we have removed the “X” markers from the x-axis and revised the axis label from “choice of food” to “choice made” to better reflect what food the mosquitoes chose in the assay. For clarity, we have now also plotted the same data as stacked graphs at the bottom of Fig. 1F, which clearly shows the proportion of mosquitoes fed on each particular choice. We avoid the stacked graph as the sole representation of this data, as it does not capture the variability in the data.

      I would also like to know more about how the authors achieved tissue-specific knockdown for RNAi experiments. I think this is an intriguing methodology, but I could not figure out from the methods why injections either had whole-body or abdomen-specific knockdown.

      The tissue-specific knockdown (abdomen only or abdomen+head) emerged from initial standardisations where we were unable to achieve knockdown in the head unless we used higher concentrations of dsRNA and did the injections in older females. We realised that this gave us the opportunity to isolate the neuronal contribution of these neuropeptides in the phenotype produced. Further optimisations revealed that injecting dsRNA into 0-10h old females produced abdomen-specific knockdowns without affecting head expression, whereas injections into 4 days old females resulted in knockdowns in both tissues. Moreover, head knockdowns in older females required higher dsRNA concentrations, with knockdown efficiency correlating with the amount injected. In contrast, abdominal knockdowns in younger females could be achieved even with lower dsRNA amounts.

      We have mentioned the knockdown conditions- time of injection and the amount dsRNA injected- for tissue-specific knockdowns in methods but realise now that it does not explain this well enough. We have now edited it to state our methodology more clearly (see lines 932-948).

      I also found some interpretations of the transcriptomic to be overly broad for what transcriptomes can actually tell us about the organism's state. For example, the authors mention, "Interestingly, we found that after a blood meal, glucose is neither spent nor stored, and that the female brain goes into a state of metabolic 'sugar rest', while actively processing proteins (Figure S2B, S3)".

      This would require a physiological measurement to actually know. It certainly suggests that there are changes in carbohydrate metabolism, but there are too many alternative interpretations to make this broad claim from transcriptomic data alone.

      We thank the reviewer for pointing this out and agree with them. We have now edited our statement to read:

      “Instead, our data suggests altered carbohydrate metabolism after a blood meal, with the female brain potentially entering a state of metabolic 'sugar rest' while actively processing proteins (Figure S2B, S3). However, physiological measurements of carbohydrate and protein metabolism will be required to confirm whether glucose is indeed neither spent nor stored during this period.” See lines 271-277.

      Reviewer #2 (Public review):

      Summary:

      In this manuscript, Bansal et al examine and characterize feeding behaviour in Anopheles stephensi mosquitoes. While sharing some similarities to the well-studied Aedes aegypti mosquito, the authors demonstrate that mated females, but not unmated (virgin) females, exhibit suppression in their bloodfeeding behaviour. Using brain transcriptomic analysis comparing sugar-fed, blood-fed, and starved mosquitoes, several candidate genes potentially responsible for influencing blood-feeding behaviour were identified, including two neuropeptides (short NPF and RYamide) that are known to modulate feeding behaviour in other mosquito species. Using molecular tools, including in situ hybridization, the authors map the distribution of cells producing these neuropeptides in the nervous system and in the gut. Further, by implementing systemic RNA interference (RNAi), the study suggests that both neuropeptides appear to promote blood-feeding (but do not impact sugar feeding), although the impact was observed only after both neuropeptide genes underwent knockdown.

      Strengths and/or weaknesses:

      Overall, the manuscript was well-written; however, the authors should review carefully, as some sections would benefit from restructuring to improve clarity. Some statements need to be rectified as they are factually inaccurate.

      Below are specific concerns and clarifications needed in the opinion of this reviewer:

      (1) What does "central brains" refer to in abstract and in other sections of the manuscript (including methods and results)? This term is ambiguous, and the authors should more clearly define what specific components of the central nervous system was/were used in their study.

      Central brain, or mid brain, is a commonly used term to refer to brain structures/neuropils without the optic lobes (For example: https://www.nature.com/articles/s41586-024-07686-5). In this study we have focused our analysis on the central brain circuits involved in modulating blood-feeding behaviour and have therefore excluded the optic lobes. As optic lobes account for nearly half of all the neurons in the mosquito brain (https://pmc.ncbi.nlm.nih.gov/articles/PMC8121336/), including them would have disproportionately skewed our transcriptomic data toward visual processing pathways. 

      We have indicated this in figure 3A and in the methods (see lines 800-801, 812). We have now also clarified it in the results section for neurotranscriptomics to avoid confusion (see lines 236-237).

      (2) The abstract states that two neuropeptides, sNPF and RYamide are working together, but no evidence is summarized for the latter in this section.

      We thank the reviewer for pointing this out. We have now added a statement “This occurs in the context of the action of RYa in the brain” to end of the abstract, for a complete summary of our proposed model. 

      (3) Figure 1

      Panel A: This should include mating events in the reproductive cycle to demonstrate differences in the feeding behavior of Ae. aegypti.

      Our data suggest that mating can occur at any time between eclosion and oviposition in An. stephensi and between eclosion and blood feeding in Ae. aegypti. Adding these into (already busy) 1A, would cloud the purpose of the schematic, which is to indicate the time points used in the behavioural assays and transcriptomics.

      Panel F: In treatments where insects were not provided either blood or sugar, how is it that some females and males had fed? Also, it is unclear why the y-axis label is % fed when the caption indicates this is a choice assay. Also, it is interesting that sugar-starved females did not increase sugar intake. Is there any explanation for this (was it expected)?

      We apologise for the confusion. The experiment is indeed a choice assay in which sugar-starved or sugar-sated females, co-housed with males, were provided simultaneous access to both blood and sugar, and were assessed for the choice made (indicated on the x-axis): both blood and sugar, blood only, sugar only, or neither. The x-axis indicates the choice made by the mosquitoes, not the choice provided in the assay, and the y-axis indicates the percentage of males or females that made each particular choice. We have now removed the “X” markers from the x-axis and revised the axis label from “choice of food” to “choice made” to better reflect what food the mosquitoes chose to take.

      In this assay, we scored females only for the presence or absence of each meal type (blood or sugar) and are therefore unable to comment on whether sugar-starved females consumed more sugar than sugarsated females. However, when sugar-starved, a higher proportion of females consumed both blood and sugar, while fewer fed on blood alone.

      For clarity, we have now also plotted the same data as stacked graphs at the bottom of Fig. 1F, which clearly shows the proportion of mosquitoes fed on each particular choice. We avoid the stacked graph as the sole representation of this data as it does not capture the variability in the data.

      (4) Figure 3

      In the neurotranscriptome analysis of the (central) brain involving the two types of comparisons, can the authors clarify what "excluded in males" refers to? Does this imply that only genes not expressed in males were considered in the analysis? If so, what about co-expressed genes that have a specific function in female feeding behaviour?

      This is indeed correct. We reasoned that since blood feeding is exclusive to females, we should focus our analysis on genes that were specifically upregulated in them. As the reviewer points out, it is very likely that genes commonly upregulated in males and females may also promote blood feeding and we will miss out on any such candidates based on our selection criteria. 

      (5) Figure 4

      The authors state that there is more efficient knockdown in the head of unfed females; however, this is not accurate since they only get knockdown in unfed animals, and no evidence of any knockdown in fed animals (panel D). This point should be revised in the results test as well.

      Perhaps we do not understand the reviewer’s point or there has been a misunderstanding. In figure 4D, we show that while there is more robust gene knockdown in unfed females, blood-fed females also showed modest but measurable knockdowns ranging from 5-40% for RYamide and 2-21% for sNPF. 

      Relatedly, blood-feeding is decreased when both neuropeptide transcripts are targeted compared to uninjected (panel C) but not compared to dsGFP injected (panel E). Why is this the case if authors showed earlier in this figure (panel B) that dsGFP does not impact blood feeding?

      We realise this concern stems from our representation of the data. Since we had earlier determined that dsGFP-injected females fed similarly to uninjected females (fig 4B), we used these controls interchangeably in subsequent experiments. To avoid confusion, we have now only used the label ‘control’ in figure 4 (and supplementary figure S9) and specified which control was used for each experiment in the legend.

      In addition to this, we wanted to clarify that fig 4C and 4E are independent experiments. 4C is the behaviour corresponding to when the neuropeptides were knocked down in both heads and abdomens. 4E is the behaviour corresponding to when the neuropeptides were knocked down in only the abdomens. We have now added a schematic in the plots to make this clearer.

      In addition, do the uninjected and dsGFP-injected relative mRNA expression data reflect combined RYa and sNPF levels? Why is there no variation in these data,…

      In these qPCRs, we calculated relative mRNA expression using the delta-delta Ct method (see line 975). For each neuropeptide its respective control was used. For simplicity, we combined the RYa and sNPF control data into a single representation. The value of this control is invariant because this method sets the control baseline to a value of 1.

      …and how do transcript levels of RYa and sNPF compare in the brain versus the abdomen (the presentation of data doesn't make this relationship clear).

      The reviewer is correct in pointing out that we have not clarified this relationship in our current presentation. While we have not performed absolute mRNA quantifications, we extracted relative mRNA levels from qPCR data of 96h old unmanipulated control females. We observed that both sNPF and RYa transcripts are expressed at much lower levels in the abdomens, as compared to those in the heads, as shown in Author response Image 1 below. 

      Author response image 1.

      (6) As an overall comment, the figure captions are far too long and include redundant text presented in the methods and results sections.

      We thank the reviewer for flagging this and have now edited the legends to remove redundancy.  

      (7) Criteria used for identifying neuropeptides promoting blood-feeding: statement that reads "all neuropeptides, since these are known to regulate feeding behaviours". This is not accurate since not all neuropeptides govern feeding behaviors, while certainly a subset do play a role.

      We agree with the reviewer that not all neuropeptides regulate feeding behaviours. Our statement refers to the screening approach we used: in our shortlist of candidates, we chose to validate all neuropeptides.

      (8) In the section beginning with "Two neuropeptides - sNPF and RYa - showed about 25% and 40% reduced mRNA levels...", the authors state that there was no change in blood-feeding and later state the opposite. The wording should be clarified as it is unclear.

      Thank you for pointing this out. We were referring to an unchanged proportion of the blood fed females. We have now edited the text to the following: 

      “Two neuropeptides - sNPF and RYa - showed about 25% and 40% reduced mRNA levels in the heads but the proportion of females that took blood meals remained unchanged”. See lines 338-340.

      (9) Just before the conclusions section, the statement that "neuropeptide receptors are often ligandpromiscuous" is unjustified. Indeed, many studies have shown in heterologous systems that high concentrations of structurally related peptides, which are not physiologically relevant, might cross-react and activate a receptor belonging to a different peptide family; however, the natural ligand is often many times more potent (in most cases, orders of magnitude) than structurally related peptides. This is certainly the case for various RYamide and sNPF receptors characterized in various insect species.

      We agree with the reviewer and apologise for the mistake. We have now removed the statement.

      (10) Methods

      In the dsRNA-mediated gene knockdown section, the authors could more clearly describe how much dsRNA was injected per target. At the moment, the reader must carry out calculations based on the concentrations provided and the injected volume range provided later in this section.

      We have now edited the section to reflect the amount of dsRNA injected per target. Please see lines 921-931.

      It is also unclear how tissue-specific knockdown was achieved by performing injection on different days/times. The authors need to explain/support, and justify how temporal differences in injection lead to changes in tissue-specific expression. Does the blood-brain barrier limit knockdown in the brain instead, while leaving expression in the peripheral organs susceptible?

      To achieve tissue-specific knockdowns of sNPF and RYa, we optimised both the time of injection as well as the dsRNA concentration to be injected. Injecting dsRNA into 0-10h females produced abdomen-specific knockdowns without affecting head expression, whereas injections into 96h old females resulted in knockdowns in both tissues. Head knockdowns in older females required higher dsRNA concentrations, with knockdown efficiency correlating with the amount injected. In contrast, abdominal knockdowns in younger females could be achieved even with lower dsRNA amounts, reflecting the lower baseline expression of sNPF in abdomens compared to heads and the age-dependent increase in head expression (as confirmed by qPCR). It is possible that the blood-brain barrier also limits the dsRNA entering the brain, thereby requiring higher amounts to be injected for head knockdowns. 

      We have now edited this section to state our methodology more clearly (see lines 932-948).

      For example, in Figure 4, the data support that knockdown in the head/brain is only effective in unfed animals compared to uninjected animals, while there is no evidence of knockdown in the brain relative to dsGFP-injected animals. Comparatively, evidence appears to show stronger evidence of abdominal knockdown mostly for the RYa transcript (>90%) while still significantly for the sNPF transcript (>60%).

      As we explained earlier, this concern likely stems from our representation of the data. Since we had earlier determined that dsGFP-injected females fed similarly to uninjected females (fig 4B), we used these controls interchangeably in subsequent experiments. To avoid confusion, we have now only used the label ‘control’ in figure 4 (and supplementary figure S9) and specified which control was used for each experiment in the legend.

      In addition to this, we wanted to clarify that fig 4C and 4E are independent experiments. 4C is the behaviour corresponding to when the neuropeptides were knocked down in both heads and abdomens.  4E is the behaviour corresponding to when the neuropeptides were knocked down in only the abdomen. We have now added a schematic in the plots to make this clearer.

      Reviewer #3 (Public review):

      Summary:

      This manuscript investigates the regulation of host-seeking behavior in Anopheles stephensi females across different life stages and mating states. Through transcriptomic profiling, the authors identify differential gene expression between "blood-hungry" and "blood-sated" states. Two neuropeptides, sNPF and RYamide, are highlighted as potential mediators of host-seeking behavior. RNAi knockdown of these peptides alters host-seeking activity, and their expression is anatomically mapped in the mosquito brain (sNPF and RYamide) and midgut (sNPF only).

      Strengths:

      (1) The study addresses an important question in mosquito biology, with relevance to vector control and disease transmission.

      (2) Transcriptomic profiling is used to uncover gene expression changes linked to behavioral states.

      (3) The identification of sNPF and RYamide as candidate regulators provides a clear focus for downstream mechanistic work.

      (4) RNAi experiments demonstrate that these neuropeptides are necessary for normal host-seeking behavior.

      (5) Anatomical localization of neuropeptide expression adds depth to the functional findings.

      Weaknesses:

      (1) The title implies that the neuropeptides promote host-seeking, but sufficiency is not demonstrated (for example, with peptide injection or overexpression experiments).

      Demonstrating sufficiency would require injecting sNPF peptide or its agonist. To date, no small-molecule agonists (or antagonists) that selectively mimic sNPF or RYa neuropeptides have been identified in insects. An NPY analogue, TM30335, has been reported to activate the Aedes aegypti NPY-like receptor 7 (NPYLR7; Duvall et al., 2019), which is also activated by sNPF peptides at higher doses (Liesch et al., 2013). Unfortunately, the compound is no longer available because its manufacturer, 7TM Pharma, has ceased operations. Synthesising the peptides is a possibility that we will explore in the future.

      (2) The proposed model regarding central versus peripheral (gut) peptide action is inconsistently presented and lacks strong experimental support.

      The best way to address this would be to conduct tissue-specific manipulations, the tools for which are not available in this species. Our approach to achieve head+abdomen and abdomen only knockdown was the closest we could get to achieving tissue specificity and allowed us to confirm that knockdown in the head was necessary for the phenotype. However, as the reviewer points out, this did not allow us to rule out any involvement of the abdomen. This point has been addressed in lines 364-371.

      (3) Some conclusions appear premature based on the current data and would benefit from additional functional validation.

      The most definitive way of demonstrating necessity of sNPF and RYa in blood feeding would be to generate mutant lines. While we are pursuing this line of experiments, they lie beyond the scope of a revision. In its absence, we relied on the knockdown of the genes using dsRNA. We would like to posit that despite only partial knockdown, mosquitoes do display defects in blood-feeding behaviour, without affecting sugar-feeding. We think this reflects the importance of sNPF in promoting blood feeding.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Overall, I found this manuscript to be well-prepared, visually the figures are great and clearly were carefully thought out and curated, and the research is impactful. It was a wonderful read from start to finish. I have the following recommendations:

      Thank you very much, we are very pleased to hear that you enjoyed reading our manuscript!

      (1) For future manuscripts, it would make things significantly easier on the reviewer side to submit a format that uses line numbers.

      We sincerely apologise for the oversight. We have now incorporated line numbers in the revised manuscript.

      (2) There are a few statements in the text that I think may need clarification or might be outside the bounds of what was actually studied here. For example, in the introduction "However, mating is dispensable in Anophelines even under conditions of nutritional satiety". I am uncertain what is meant by this statement - please clarify.

      We apologise for the lack of clarity in the statement and have now deleted it since we felt it was not necessary.

      (3) Typo/Grammatical minutiae:

      (a) A small idiosyncrasy of using hyphens in compound words should also be fixed throughout. Typically, you don't hyphenate if the words are being used as a noun, as in the case: e.g. "Age affects blood feeding.". However, you would hyphenate if the two words are used as a compound adjective "Age affects blood-feeding behavior". This may not be an all-inclusive list, but here are some examples where hyphens need to either be removed or added. Some examples:

      "Nutritional state also influences other internal state outputs on blood-feeding": blood-feeding -> blood feeding

      "... the modulation of blood-feeding": blood-feeding -> blood feeding

      "For example, whether virgin females take blood-meals...": blood-meals -> blood meals

      ".... how internal and external cues shape meal-choice"-> meal choice

      "blood-meal" is often used throughout the text, but is correctly "blood meal" in the figures.

      There are many more examples throughout.

      We apologise for these errors and appreciate the reviewer’s keen eye. We have now fixed them throughout the manuscript.  

      (b) Figure 1 Caption has a typo: "co-housed males were accessed for sugar-feeding" should be "co-housed males were assessed for sugar feeding"

      We apologise for the typo and thank the reviewer for spotting it. We have now corrected this.  

      (c) It would be helpful in some other figure captions to more clearly label which statement is relevant to which part of the text. For example, in Figure 4's caption.

      "C,D. Blood-feeding and sugar-feeding behaviour of females when both RYa and sNPF are knocked down in the head (C). Relative mRNA expressions of RYa and sNPF in the heads of dsRYa+dssNPF - injected blood-fed and unfed females, as compared to that in uninjected females, analysed via qPCR (D)."

      I found re-referencing C and D at the end of their statements makes it look as thought C precedes the "Relative mRNA expression" and on a first read through, I thought the figure captions were backwards. I'd recommend reformatting here and throughout consistently to only have the figure letter precede its relevant caption information, e.g.:

      "C. Blood-feeding and sugar-feeding behaviour of females when both RYa and sNPF are knocked down in the head. D. Relative mRNA expressions of RYa and sNPF in the heads of dsRYa+dssNPF - injected bloodfed and unfed females, as compared to that in uninjected females, analysed via qPCR."

      We have now edited the legends as suggested.

      Reviewer #2 (Recommendations for the authors):

      Separately from the clarifications and limitations listed above, the authors could strengthen their study and the conclusions drawn if they could rescue the behavioural phenotype observed following knockdown of sNPF and RYamide. This could be achieved by injection of either sNPF or RYa peptide independently or combined following knockdown to validate the role of these peptides in promoting blood-feeding in An. stephensi. Additionally, the apparent (but unclear) regionalized (or tissue-specific) knockdown of sNPF and RYamide transcripts could be visualized and verified by implementing HCR in situ hyb in knockdown animals (or immunohistochemistry using antibodies specific for these two neuropeptides). 

      In a follow up of this work, we are generating mutants and peptides for these candidates and are planning to conduct exactly the experiments the reviewer suggests.

      Reviewer #3 (Recommendations for the authors):

      The loss-of-function data suggest necessity but not sufficiency. Synthetic peptide injection in non-hostseeking (blood-fed mated or juvenile) mosquitoes would provide direct evidence for peptide-induced behavioral activation. The lack of these experiments weakens the central claim of the paper that these neuropeptides directly promote blood feeding.

      As noted above, we plan to synthesise the peptide to test rescue in a mutant background and sufficiency.  

      Some of the claims about knockdown efficiency and interpretation are conflicting; the authors dismiss Hairy and Prp as candidates due to 30-35% knockdown, yet base major conclusions on sNPF and RYamide knockdowns with comparable efficiencies (25-40%). This inconsistency should be addressed, or the justification for different thresholds should be clearly stated.

      We have not defined any specific knockdown efficacy thresholds in the manuscript, as these can vary considerably between genes, and in some cases, even modest reductions can be sufficient to produce detectable phenotypes. For example, knockdown efficiencies of even as low as about 25% - 40% gave us observable phenotypes for sNPF and RYa RNAi (Figure S9B-G).

      No such phenotypes were observed for Hairy (30%) or Prp (35%) knockdowns. Either these genes are not involved in blood feeding, or the knockdown was not sufficient for these specific genes to induce phenotypes. We cannot distinguish between these scenarios. 

      The observation that knockdown animals take smaller blood meals is interesting and could reflect a downstream effect of altered host-seeking or an independent physiological change. The relationship between meal size and host-seeking behavior should be clarified.

      We agree with the reviewer that the reduced meal size observed in sNPF and RYa knockdown animals could result from their inability to seek a host or due to an independent effect on blood meal intake. Unfortunately, we did not measure host-seeking in these animals. We plan to distinguish between these possibilities using mutants in future work.

      Several figures are difficult to interpret due to cluttered labeling and poorly distinguishable color schemes. Simplifying these and improving contrast (especially for co-housed vs. virgin conditions) would enhance readability. 

      We regret that the reviewer found the figures difficult to follow. We have now revised our annotations throughout the manuscript for enhanced readability. For example, “D1<sup>B”</sup> is now “D1<sup>PBM”</sup> (post-bloodmeal) and “D1<sup>O”</sup> is now “D1<sup>PO”</sup> (post-oviposition). Wherever mated females were used, we have now appended “(m)” to the annotations and consistently depicted these females with striped abdomens in all the schematics. We believe these changes will improve clarity and readability.

      The manuscript does not clearly justify the use of whole-brain RNA sequencing to identify peptides involved in metabolic or peripheral processes. Given that anticipatory feeding signals are often peripheral, the logic for brain transcriptomics should be explained.

      The reviewer is correct in pointing out that feeding signals could also emerge from peripheral tissues. Signals from these tissues – in response to both changing nutritional and reproductive states – are then integrated by the central brain to modulate feeding choices. For example, in Drosophila, increased protein intake is mediated by central brain circuitry including those in the SEZ and central complex (Munch et al., 2022; Liu et al., 2017; Goldschmidt et al., 202ti). In the context of mating, male-derived sex peptide further increases protein feeding by acting on a dedicated central brain circuitry (Walker et al., 2015). We, therefore focused on the central brain for our studies.

      The proposed model suggests brain-derived peptides initiate feeding, while gut peptides provide feedback. However, gut-specific knockdowns had no effect, undermining this hypothesis. Conversely, the authors also suggest abdominal involvement based on RNAi results. These contradictions need to be resolved into a consistent model.

      We thank the reviewer for raising this point and recognise their concern. Our reasons for invoking an involvement of the gut were two-fold:

      (1) We find increased sNPF transcript expression in the entero-endocrine cells of the midgut in blood-hungry females, which returns to baseline after a blood-meal (Fig. 4L, M).

      (2) While the abdomen-only knockdowns did not affect blood feeding, every effective head knockdown that affected blood feeding also abolished abdominal transcript levels (Fig. S9C, F). (Achieving a head-only reduction proved impossible because (i) systemic dsRNA delivery inevitably reaches the abdomen and (ii) abdominal expression of both peptides is low, leaving little dynamic range for selective manipulation.) Consequently, we can only conclude the following: 1) that brain expression is required for the behaviour, 2) that we cannot exclude a contributory role for gut-derived sNPF. We have discussed this in lines 364-371.

      The identification of candidate receptors is promising, but the manuscript would be significantly strengthened by testing whether receptor knockdowns phenocopy peptide knockdowns. Without this, it is difficult to conclude that the identified receptors mediate the behavioral effects.

      We agree that functional validation of the receptors would strengthen the evidence for sNPF and RYa-mediated control of blood feeding in An. stephensi. We selected these receptors based on sequence homology. A possibility remains that sNPF neuropeptides activate more than one receptor, each modulating a distinct circuit, as shown in the case of Drosophila Tachykinin (https://pmc.ncbi.nlm.nih.gov/articles/PMC10184743/). This will mean a systematic characterisation and knockdown of each of them to confirm their role. We are planning these experiments in the future.  

      The authors compared the percentage changes in sugar-fed and blood-fed animals under sugar-sated or sugar-starved conditions. Figure 1F should reflect what was discussed in the results.

      Perhaps this concern stems from our representation of the data in figure 1F? We have now edited the xaxis and revised its label from “choice of food” to “choice made” to better reflect what food the mosquitoes chose to take.

      For clarity, we have now also plotted the same data as stacked graphs at the bottom of Fig. 1F, which clearly shows the proportion of mosquitoes fed on each particular choice. We avoid the stacked graph as the sole representation of this data because it does not capture the variability in the data.

      Minor issues:

      (1) The authors used mosquitoes with belly stripes to indicate mated females. To be consistent, the post-oviposition females should also have belly stripes.

      We thank the reviewer for pointing this out. We have now edited all the figures as suggested.

      (2) In the first paragraph on the right column of the second page, the authors state, "Since females took blood-meals regardless of their prior sugar-feeding status and only sugar-feeding was selectively suppressed by prior sugar access." Just because the well-fed animals ate less than the starved animals does not mean their feeding behavior was suppressed.

      Perhaps there has been a misunderstanding in the experimental setup of figure 1F, probably stemming from our data representation. The experiment is a choice assay in which sugar-starved or sugar-sated females, co-housed with males, were provided simultaneous access to both blood and sugar, and were assessed for the choice made (indicated on the x-axis): both blood and sugar, blood only, sugar only, or neither. We scored females only for the presence or absence of each meal type (blood or sugar) and did not quantify the amount consumed.

      (3) The figure legend for Figure 1A and the naming convention for different experimental groups are difficult to follow. A simplified or consistently abbreviated scheme would help readers navigate the figures and text.

      We regret that the reviewer found the figure difficult to follow. We have now revised our annotations throughout the manuscript for enhanced readability. For example, “D1<sup>B”</sup> is now “D1<sup>PBM”</sup> (post-bloodmeal) and “D1<sup>O”</sup> is now “D1<sup>PO”</sup> (post-oviposition).

      (4) In the last paragraph of the Y-maze olfactory assay for host-seeking behaviour in An. stephensi in Methods, the authors state, "When testing blood-fed females, aged-matched sugar-fed females (bloodhungry) were included as positive controls where ever possible, with satisfactory results." The authors should explicitly describe what the criteria are for "satisfactory results".

      We apologise for the lack of clarity. We have now edited the statement to read:

      “When testing blood-fed females, age-matched sugar-fed females (blood-hungry) were included wherever possible as positive controls. These females consistently showed attraction to host cues, as expected.” See lines 786-790.

      (5) In the first paragraph of the dsRNA-mediated gene knockdown section in Methods, dsRNA against GFP is used as a negative control for the injection itself, but not for the potential off-target effect.

      We agree with the reviewer that dsGFP injections act as controls only for injection-related behavioural changes, and not for off-target effects of RNAi. We have now corrected the statement. See lines 919-920.

      To control for off-target effects, we could have designed multiple dsRNAs targeting different parts of a given gene. We regret not including these controls for potential off-target effects of dsRNAs injected. 

      (6) References numbers 48, 89, and 90 are not complete citations.

      We thank the reviewer for spotting these. We have now corrected these citations.

    1. Author response:

      The following is the authors’ response to the original reviews.

      We thank the editor and reviewers for their constructive questions, valuable feedback, and for approving our manuscript. We truly appreciate the opportunity to improve our work based on their insightful comments. Before addressing the editor’s and each referee’s remarks individually, we provide below a point-by-point response summarizing the revisions made.

      Duplication of control groups across experiments

      We appreciate the reviewers’ concern regarding the potential duplication of control groups. In the revised manuscript, we have explicitly clarified that independent groups of control mice were used for each experiment. These details are now clearly indicated in the Materials and Methods section to avoid any ambiguity and to reinforce the rigor of our experimental design (Page 15, Line 453-455): “Furthermore, knockout animals and those treated with pharmacological inhibitors or neutralizing antibodies shared the same control groups (chow and HFCD), as required by the animal ethics committee.”

      Validation of the MASLD model

      To strengthen the metabolic characterization of our MASLD model, we have now included additional parameters, including liver weight, Picrosirius staining and blood glucose measurements. These data are presented as new graphs in the revised manuscript and support the metabolic relevance of the HFCD diet model (Figure Suplementary S1). The corresponding description has been added to the Results section (Page 5, Lines 116-117) as follows: “Mice fed HFCD showed no increase in liver weight and collagen deposition as evidenced by Picrosirius staining (Fig. S1A and Fig. S1C)”

      Assessment of liver injury in RagKO and anti-NK1.1 mice

      We fully agree that assessment of liver injury is essential for these models. For mice treated with antiNK1.1, ALT levels are shown in Figure 4G, confirming increased liver injury after treatment. Regarding Rag⁻/⁻ mice, the animals exhibit exacerbation of liver injury when fed a HFCD diet and challenged with LPS (Page 7, Lines 183–184). The corresponding description has been added to the Results section (Page 7, Lines 175-176) as follows: “Interestingly, Rag1-deficient animals under the HFCD remained susceptible to the LPS challenge (Fig. 4C) with exacerbation of liver injury (Fig. 4D) ”

      Discussion of limitations

      We have expanded the Discussion section to provide a more comprehensive and balanced perspective on the limitations of our model and experimental approach (Page 13-14, Lines 401–414) “Our study presents several limitations that should be acknowledged and discussed. First, we cannot entirely rule out the possibility that our mice deficient in pro-inflammatory components exhibit reduced responsiveness to LPS. However, our ex vivo analyses using splenocytes from these animals revealed a preserved cytokine production following LPS stimulation. These results suggest that the in vivo differences observed are primarily driven by the MAFLD condition rather than by intrinsic defects in LPS sensitivity. Second, the absence of publicly available single-cell RNA-seq datasets from MAFLD subjects under endotoxemic or septic conditions limited our ability to perform direct translational comparisons. To overcome this, we analyzed existing MAFLD patients and experimental MAFLD datasets, which consistently demonstrated upregulation of IFN-y and TNF-α inflammatory pathways in MALFD. In line with these findings, our murine model revealed TNF-α⁺ myeloid and IFN-y⁺ NK cell populations, thereby reinforcing the validity and translational relevance of our results.”. This revision highlights the constraints of the MASLD model, the inherent variability among in vivo experiments, and the interpretative limitations related to immunodeficient mouse strains.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      (1) In Figure 4 the authors are showing the number of IFN+ positive CD4, CD8, and NK 1.1+ cells. Could they show from total IFNg production, how much it goes specifically on NK cells and how much on other cell populations since NK1.1 is NK but also NKT and gamma delta T cell marker? Also, in Figure 2E the authors see a substantial increase in IFNg signal in T cells.

      While we did not specifically assess IFNγ production in NKT cells or other minor populations, our data indicate that the NK1.1+CD3+ cells (NKT cells) cited in Page 7, Lines  188-192 were essentially absent in the liver tissue of LPS-challenged animals, as shown in Supplementary Figures 3C and S10. The corresponding description has been added to the Results section (Page 7, Lines 188-192) as follows: “We observed that the number of NK cells increased in the liver tissue of PBS-treated MAFLD mice compared with mice fed a control diet (Fig. 4E). LPS challenge increased the accumulation of NK1.1+CD3− NK cells in the liver tissue of MAFLD mice and the absence of NK1.1+CD3+ NKT cells (Fig. S3C and 4E)”.

      This absence was consistent across all experimental conditions, corroborating our focus on NK1.1+CD3− cells as the primary source of NK1.1-associated IFNγ production. Furthermore, data demonstrated in Figure 2E illustrate the presence of IFNγ primarily in NK cells. Therefore, the observed IFNγ signal, attributed to NK1.1+ cells, predominantly reflects conventional NK cells, with minimal contribution from NKT or γδ T cells.

      (2) In Figure 4C, the authors state that the results suggest that T and B cells do not contribute to susceptibility to LPS challenge. However, they observe a drop in survival compared to chow+LPS. Are the authors certain there is no statistical significance there?

      The observed decrease in survival is consistent with our expectations, as T and B cells are not the primary source of interferon-gamma (IFNγ) in this context. Even in their absence, animals remain susceptible to LPS challenge due to the presence of other IFNγ-producing cells that drive the observed lethality. We have carefully re-examined the statistical analysis and confirm that it was correctly performed.  

      (3) Since the survival curve and rate are exactly the same (60%) in Figures 3F, 3G, 4C, 4F, 5G, and 5H I would just like to double-check that the authors used different controls for each experiment.

      The number of mice used in each experiment was carefully determined to ensure sufficient statistical power while fully complying with the limits established by our institutional Animal Ethics Committee. To minimize animal use, the same control group was shared across multiple survival experiments. Despite using shared controls, the total number of animals per experimental group was adequate to produce robust and reproducible survival outcomes. All groups were properly randomized, and the shared control data were rigorously incorporated into statistical analyses. This strategy allowed us to maintain both ethical standards and the scientific rigor of our findings.

      (4) In Figure 5 the authors are saying that it is neutrophils but not monocytes mediate susceptibility of animals with NAFLD to endotoxemia. However, CXCR2i depletion and CCR2 knock out mice affect both monocytes/macrophages and neutrophils. And in Figures 5E, 5G, and 5H they see that a) LPS+CXCR2i decreases liver damage more than LPS+anti Ly6G, b) HFCD mice challenged with LPS and treated with anti-LY6G do not rescue survival to levels of CHOW LPS and c) anti Ly6G treatment helps less than CXCR2i. Therefore, from both knock out mice and depletion experiments the authors can conclude that most likely monocytes (but potentially also other cells) together with neutrophils are substantial for the development of endotoxemic shock in choline-deficient high-fat diet model.

      While neutrophils express CCR2, our data clearly show that CCR2 deficiency does not impair neutrophil migration, as demonstrated in Supplemental Figures 5A and 5B (added to the manuscript, page 8, lines 213–217). The corresponding description has been added to the Results section (Page 8, Lines 213217) as follows: ``Interestingly, animals deficient in monocyte migration (CCR2-/-) showed a high mortality rate compared to wild type after LPS challenge and neutrophil migration is not altered (Fig. 5SA and Fig. 5SB)``, In contrast, CCR2 deficiency primarily affects monocyte recruitment, yet in our experimental conditions, monocyte depletion or CCR2 knockout did not significantly alter the severity of endotoxemic shock, indicating that monocytes play a minimal role in mediating susceptibility in HFCD-fed mice.

      To specifically investigate neutrophils, we used pharmacological blockade of CXCR2 to inhibit migration and antibody-mediated neutrophil depletion. Both approaches have consistently demonstrated that neutrophils are critical factors in endotoxemic shock.

      These findings support our conclusion that neutrophils are the primary cellular contributors to susceptibility in HFCD-fed mice during endotoxemia, with monocytes making a negligible contribution under the tested conditions.

      (6) In Figure 6A (but also others with PD-L1) did the authors do isotype control? And can they show how much of PD1+ population goes on neutrophils, and how much on all the other populations?

      To address this issue, we performed additional analyses to assess the distribution of PD-L1 expression on CD45+CD11B+ leukocytes. These new results, detailed on Page 9, lines 245-250, and now presented in Supplemental Figure 6, demonstrate that PD-L1 expression is predominantly enriched in neutrophils compared to other immune subsets. This observation further reinforces our conclusion that neutrophils represent a major source of PD-L1 in our experimental model.

      To ensure the robustness of these findings, we also included FMO controls for PD-L1 staining in the newly added Supplemental Figure S6. These controls validate the specificity of our gating strategy and confirm the reliability of the detected PD-L1 signal. The corresponding description has been added to the Results section (Page 9, Lines 245-250) as follows: ``First, we observed that only the MAFLD diet caused a significant increase in PD-L1 expression in CD45+CD11b+ leukocytes after LPS challenge (Fig. S6C). We observed that within this population, neutrophils predominate in their expression when compared to monocytes (Fig. 6SA, Fig. 6SB, and Fig. 6SD). Furthermore, PD-L+1 neutrophils showed an exacerbated migration of PD-L1+ neutrophils towards the liver (Fig. 6A and 6B)”

      (7) In Figure 6D it is interesting that there is not an increase in PD-L1+ neutrophils in LPS HFCD IFNg+/+ mice in comparison to LPS chow IFNg+/+ mice, since those should be like WT mice (Figure 6A going from 50% to 97%) and so an increase should be seen?

      The apparent difference between Figures 6A and 6D likely reflects inter-experimental variability rather than a biological discrepancy. Although the absolute percentages of PD-L1⁺ neutrophils varied slightly among independent experiments, the overall phenotype and trend were consistently maintained namely, that PD-L1 expression on neutrophils is enhanced in response to LPS stimulation and modulated by IFNγ signaling. Thus, the data shown in Figure 6D are representative of this consistent phenotype despite minor quantitative variation.

      (8) In Figure 7 do the authors have isotype control for TNFa because gating seems a bit random so an isotype control graph would help a lot as supplementary information, in order to make the figure more persuasive

      To address the concern regarding gating in Figure 7, we have included the FMO showing TNFα as a histogram Supplementary Figure 8gG. These control reaffirm the accuracy and reliability of our gating strategy for TNFα, further supporting the robustness of our data. The corresponding description has been added to the Results section (Page 9, Lines 272-274) as follows:`` We observed an exacerbated TNF-α expression by PD-L1+ neutrophils from MAFLD when compared to control chow animals (Fig. 7A, Fig. 7B, Fig. 7D, and Fig8SG).

      (9) Figure 6C IFNg+/+ mice on CHOW +LPS is same as Figure 8E mice chow +LPS but just with different numbers. Can the authors explain this?

      Although the data points in Figures 6C and 8E may appear similar, we confirm that they originate from entirely independent experiments and represent distinct datasets. To enhance clarity and avoid any potential confusion, we have adjusted the figure presentation and sizing in the revised manuscript. These changes make it clear that the datasets, while comparable, are derived from separate experimental replicates.

      (10) Figure 1E chow B6+LPS is the same as Figure 5D B6+LPS but should they be different since those should be two different experiments?

      We confirm that Figures 1E and 5D correspond to data obtained from independent experiments. Although the experimental conditions were similar, each dataset was generated and analyzed separately to ensure the reproducibility and robustness of our results.

      Reviewer #2 (Recommendations for the authors):

      (1) Why did you look at kidney injury in Figure 1D? I think this should be explained a little.

      We assessed kidney injury alongside ALT, a marker of liver damage, because both the liver and kidneys are among the primary organs affected during sepsis and endotoxemia. This rationale has been added to the manuscript (page 5, lines 129–131): “Remarkably, compared to the Chow group, HFCD mice exposed to LPS did not show greater changes in other organs commonly affected by endotoxemia, such as the kidneys (Figure 1D).” By evaluating markers of injury in both organs, we aimed to determine whether our physiopathological condition was liver-specific or indicative of broader systemic injury.

      (2) I know Figure 2C isn't your data, but why are there so few NK cells, considering NK cells are a resident liver cell type? Doesn't that also bring into question some of your data if there are so few NK cells? And the IFNG expression (2E) looks to mostly come from T-cells (CD8?).

      The data shown in Figure 2C were reanalyzed from a separate NAFLD model based on a 60% high-fat diet. Although this model differs from ours, the observed low number of NK cells is consistent with expectations for animals subjected solely to a hyperlipidic diet, which primarily provides an inflammatory stimulus that promotes recruitment rather than maintaining high baseline NK cell numbers.

      In our experimental model, these observations align with published data. Specifically, liver tissue from NAFLD animals typically exhibits low baseline NK cell numbers, but upon LPS challenge, there is a marked increase in NK cell recruitment to the liver. This dynamic illustrates the interplay between dietinduced inflammation and immune cell recruitment in our experimental context and supports the interpretation of our IFNγ data.

      (3) In your methods, I think you didn't explain something. You said LPS was administered to 56 week old mice, but that HFCD diet was started in 5-6 week old mice and lasted 2 weeks, then LPS was administered. So LPS administration happened when the mice were 7-8 weeks old, right?

      We thank the reviewer for pointing out this inconsistency in our Methods section. The reviewer is correct: the HFCD diet was initiated in 5–6-week-old mice, and LPS was administered after 2 weeks on the diet, such that LPS challenge occurred when the mice were 7–8 weeks old.

      We have revised the Methods section (add page 15-16, lines 474–480).  to clarify this timeline and ensure it is accurately described in the manuscript. The corresponding description has been added to the Materials and Methods section (Page 14, Lines 436-442) as follows: “Lipopolysaccharide (LPS; Escherichia coli (O111:B4), L2630, Sigma-Aldrich, St. Louis, MO, USA) was administered intraperitoneally (i.p.; 10 mg/kg) in C57BL/6, CCR2 -/-, IFN-/-, and TNFR1R2 -/- mice. The HFCD was initiated in 5–6 week-old mice, and LPS was administered after 2 weeks on the diet, meaning that LPS administration occurred when the mice were 7–8 weeks old, with body weights ranging from 22 to 26 g. LPS was previously solubilized in sterile saline and frozen at -70°C. The animals were euthanized 6 hours after LPS administration”.

      (4) Throughout the manuscript, I would consider changing the term NAFLD to something else. I think HFCD diet is a closer model to NASH, so there needs to be some discussion on that. And the field is changing these terms, so NAFLD is now MASLD and NASH is now MASH.

      We appreciate the reviewer’s comment regarding the terminology and disease classification. In our experimental conditions, the animals were subjected to a high-fat, choline-deficient (HFCD) diet for only two weeks, a period considered very early in the progression of diet-induced liver disease. At this stage, histological analysis revealed lipid accumulation in hepatocytes without evidence of hepatocellular injury, inflammation, or fibrosis. Therefore, our model more closely resembles the metabolic-associated fatty liver disease (MAFLD, formerly NAFLD) stage rather than the more advanced metabolic-associated steatohepatitis (MASH, formerly NASH).

      Indeed, prolonged exposure to HFCD diets, typically 8 to 16 weeks, is required to induce the inflammatory and fibrotic features characteristic of MASH. Since our objective was to study the initial metabolic and immune alterations preceding overt liver injury, we believe that using the term MAFLD more accurately reflects the pathological stage represented in our model. Accordingly, we have revised the text to align with the updated nomenclature and disease context.

      (6) I am concerned about over interpretation of the publicly available RNA-seq data in Figure 2. This data comes from human NAFLD patients with unknown endotoxemia and mouse models using a traditional high-fat diet model. So it is hard to compare these very disparate datasets to yours. Also, if these datasets have elevated IFNG, why does your model require LPS injection?

      We thank the reviewer for their thoughtful comments regarding the interpretation of the RNA-seq data presented in Figure 2. We would like to clarify that the human NAFLD datasets referenced in our study do not specifically include patients with endotoxemia; rather, they focus on individuals with NAFLD alone.

      Comparing data from human and murine MAFLD models, we observed that NK cells, T cells, and neutrophils are present and contribute to the hepatic inflammatory environment. Our reanalysis indicates that the elevations of IFNγ and TNF in NAFLD are primarily derived from NK cells, T cells, and myeloid cells, respectively.

      In our experimental model, LPS administration was used to evaluate whether these immune populations particularly NK cells are further potentiated under a hyperinflammatory state, leading to exacerbated IFNγ production. This approach allows us to determine whether increased IFNγ contributes to worsening outcomes in NAFLD, providing mechanistic insights that cannot be obtained from static human or traditional mouse datasets alone.

      (7) The zoom-ins for the histology (for example, Figure 1E) don't look right compared to the dotted square. The shape and area expanded don't match. And the cells in the zoom-in don't look exactly the same either.

      We have thoroughly re-examined the histological sections and the corresponding zoom-ins, including the example in Figure 1E. Upon verification, we confirm that the zoom-ins accurately represent the highlighted areas indicated by the dotted squares. The apparent discrepancies in shape or cellular appearance are likely due to minor differences in orientation or cropping during figure preparation. Nevertheless, the content and regions depicted are consistent with the original sections.  

      (8) Did the authors measure myeloid infiltration in the CCR2-/- mice? Did you measure Neutrophil infiltration in the TNF-Receptor KO mice?

      Analysis of CD45+ cell migration in CCR2 knockout mice, as shown in Supplemental Figure 5C and 5D, demonstrates that the absence of CCR2 does not impair overall leukocyte migration. Similarly, assessment of neutrophil migration in TNF receptor (TNFR1/2) knockout mice, presented in Supplemental Figure 8A, shows that neutrophil trafficking is not affected in these animals. These results indicate that the respective knockouts do not compromise the migration of the analyzed immune populations, supporting the interpretations presented in our study.

      (9) Regarding Methods for RNA-seq Analysis. Was the Mitochondrial percentage cutoff 0.8%, because that seems low. And was there not a Padj or FDR cutoff for the differential expression?

      The mitochondrial percentage in our scRNA-seq analysis reflects the proportion of mitochondrial gene expression per cell, which serves as a quality control metric. A low mitochondrial gene expression percentage, such as the 0.8% cutoff used here, is indicative of highly viable cells.

      For differential gene expression analysis, we employed the FindMarkers function in Seurat with standard parameters: adjusted p-value (Padj) < 0.05 and log2 fold change > 0.25 for upregulated genes, and adjusted p-value < 0.05 with log2 fold change < -0.25 for downregulated genes. These thresholds ensure robust identification of differentially expressed genes while balancing sensitivity and specificity.

      (10) Regarding Methods for Flow Cytometry. How were IFNG and TNF staining performed? Was this an intracellular stain? Did you need to block secretion? TNF and IFNG antibodies have the same fluorophore (PE), so were these stainings and analyses performed separately?

      Six hours after LPS challenge, non-parenchymal liver cells were isolated using Percoll gradient centrifugation. Because the animals were in a hyperinflammatory state induced by LPS, no in vitro stimulation was performed; all staining was carried out immediately after cell isolation. Detection of IFNγ and TNF was performed via intracellular staining using the Foxp3 staining kit (eBioscience). Due to both antibodies being conjugated to PE, IFN-γ and TNF-α staining and analyses were conducted in separate experiments. These distinct staining protocols and analyses are detailed in Supplemental Figures 10 and 11. The corresponding description has been added to the Materials and Methods section (Page 16, Lines 490-493) as follows: ``As animals were already in a hyperinflammatory state, no additional in vitro stimulation was required. Intracellular detection of IFN-γ and TNF-α was conducted using the Foxp3 staining kit (eBioscience). Since both antibodies were conjugated to PE, staining and analyses were performed in separate experiments``

      Reviewer #3 (Recommendations for the authors):

      (1) Achieving an NAFLD model/disease is the starting point of this study. I understand that a two-week HFCD diet period was applied due to the decrease in lymphocyte numbers. Was it enough to initiate NAFLD then? Or is it a milder metabolic disease? Which parameters have been evaluated to accept this model as a NAFLD model?

      Indeed, the two-week HFCD diet induces an early-stage form of NAFLD, characterized by initial fat accumulation in the liver without significant hepatic injury. While this represents a milder metabolic phenotype, it is sufficient to study the inflammatory and immune responses associated with NAFLD. To validate this model, we assessed multiple parameters: liver weight, blood glucose levels, and collagen deposition. These measurements confirmed the presence of early-stage NAFLD features in the animals, providing a relevant and reliable context for investigating susceptibility to endotoxemia and immune cell dynamics. They are shown in Figure Suplementary 1 and the text was included in the manuscript (Page 5, Lines 116-117): “Mice fed HFCD showed no increase in liver weight and collagen deposition as evidenced by Picrosirius staining (Fig. S1A and Fig. S1C) ”.

      (2) It is true that the CD274 gene (encoding PD-L1) and the IFNGR2 gene, corresponding to the IFNγ receptor, are among the upregulated genes when authors analyzed the publicly available RNAseq data but they are not the most significantly elevated genes. What is the reasoning behind this cherrypicking? Why are other high DEGs not analyzed but these two are analyzed?

      We highlighted the expression of the IFN-γ receptor (IFNGR2) and CD274 (encoding PD-L1) in the publicly available RNA-seq data to align and corroborate these findings with the key results observed later in our study. To avoid redundancy, we chose to present these genes in the initial figures as they are directly relevant to the subsequent analyses. Regarding the broader analysis of human RNA-seq data, our primary objective was to identify enriched biological processes and pathways, which served as a foundation for the focus and direction of this study.

      (3) Figures 3C-3G: I understand that IFNg-/- and NFR1R2a-/- mice are not showing elevated liver damage but it may simply be because of the non-responsiveness to the LPS challenge. I suggest using a different challenge or recovery experiments with the cytokines to show that the challenge is successful and results are caused by NAFLD, truly. The same goes for Figure 6: Looking at Figure 6D one may think that IFNg deficiency alters the LPS response independent of the diet condition (or NAFLD condition).

      We appreciate the reviewer’s insightful comment and fully understand the concern regarding the potential non-responsiveness of IFN-γ⁻/⁻ and TNFR1R2a⁻/⁻ mice to the LPS challenge. To address this point and confirm that these knockout animals are indeed responsive to LPS stimulation, we conducted an additional set of ex vivo experiments.

      Specifically, WT and cytokine-deficient (IFN-γ⁻/⁻) mice were fed either Chow or HFCD for two weeks, after which spleens were collected, and splenocytes were challenged in vitro with LPS. We then quantified TNF, IFN, and IL-6 production to confirm that these mice are capable of mounting cytokine responses upon LPS stimulation.

      Due to current breeding limitations and a temporary issue in colony maintenance of TNF-deficient mice, we were unable to include TNFR1R2a⁻/⁻ animals in this additional experiment. Nevertheless, we prioritized performing the analysis with the available knockout line to avoid leaving this important point unaddressed.

      These additional data demonstrate that IFN-γ-deficient mice remain responsive to LPS, reinforcing that the differences observed in vivo are related to the NAFLD condition rather than a lack of LPS responsiveness.

      (4) Figure 1 vs Figure 4: Rag-/- mice seem more susceptible to LPS-derived death even after normal conditions. But If I compare the survival data between Figure 1 and Figure 4, Rag-/- HFCD diet mice seem to be doing better than wt mice after LPS treatment. (1 day survival vs 2 days survival). How do you explain these different outcomes?

      We thank the reviewer for this insightful question regarding the survival data in Figures 1 and 4. Although there is a one-day difference in survival outcomes, Rag-/- mice consistently exhibit increased susceptibility to LPS-induced mortality can influence the exact survival timing. Nonetheless, across all experiments, Rag-/- mice display a reproducible phenotype of heightened sensitivity to LPS challenge, which is supported by multiple independent observations in our study.

      (5) How do you explain Figure 4J in connection to the observation presented with Figure 7: TNFa tissue levels, even though significant, seem very similar between the conditions?

      We would like to clarify that the animals in this study are in a metabolic syndrome state, with early-stage NAFLD characterized by hepatic fat accumulation without significant tissue injury, as shown in Figure 1C.

      Under these conditions, the LPS challenge triggers an exacerbated inflammatory response, leading to increased secretion of IFN-γ and TNF-α, primarily from NK cells and neutrophils. While TNFα levels may appear visually similar across conditions, the HFCD mice exhibit a heightened predisposition for an amplified immune response compared to chow-fed mice. This difference is consistent with the functional outcomes observed in our study and highlights the diet-specific sensitization of the immune system.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews: 

      Reviewer #1 (Public review):

      We thank Reviewer #1 for its thoughtful and constructive feedback. We found the suggestions particularly helpful in refining the conceptual framework and clarifying key aspects of our interpretations.

      Summary:

      This paper investigates the potential link between amygdala volume and social tolerance in multiple macaque species. Through a comparative lens, the authors considered tolerance grade, species, age, sex, and other factors that may contribute to differing brain volumes. They found that amygdala, but not hippocampal, volume differed across tolerance grades, such that hightolerance species showed larger amygdala than low-tolerance species of macaques. They also found that less tolerant species exhibited increases in amygdala volume with age, while more tolerant species showed the opposite. Given their wide range of species with varied biological and ecological factors, the authors' findings provide new evidence for changes in amygdala volume in relation to social tolerance grades. Contributions from these findings will greatly benefit future efforts in the field to characterize brain regions critical for social and emotional processing across species.

      Strengths:

      (1) This study demonstrates a concerted and impressive effort to comparatively examine neuroanatomical contributions to sociality in monkeys. The authors impressively collected samples from 12 macaque species with multiple datapoints across species age, sex, and ecological factors. Species from all four social tolerance grades were present. Further, the age range of the animals is noteworthy, particularly the inclusion of individuals over 20 years old - an age that is rare in the wild but more common in captive settings. 

      (2) This work is the first to report neuroanatomical correlates of social tolerance grade in macaques in one coherent study. Given the prevalence of macaques as a model of social neuroscience, considerations of how socio-cognitive demands are impacted by the amygdala are highly important. The authors' findings will certainly inform future studies on this topic.

      (3) The methodology and supplemental figures for acquiring brain MRI images are well detailed. Clear information on these parameters is crucial for future comparative interpretations of sociality and brain volume, and the authors do an excellent job of describing this process in full.

      Weaknesses:

      (1) The nature vs. nurture distinction is an important one, but it may be difficult to draw conclusions about "nature" in this case, given that only two data points (from grades 3 and 4) come from animals under one year of age (Method Figure 1D). Most brains were collected after substantial social exposure-typically post age 1 or 1.5-so the data may better reflect developmental changes due to early life experience rather than innate wiring. It might be helpful to frame the findings more clearly in terms of how early experiences shape development over time, rather than as a nature vs. nurture dichotomy.

      We agree with the reviewer that presenting our findings through a strict nature vs. nurture dichotomy was potentially misleading. We have revised the introduction and the discussion (e.g. lines 85-95 and 363-365) to clarify that we examined how neurodevelopmental trajectories differ across social grades with the caveat of related to the absence of very young individuals in our samples.  We now explicitly mention that our results may reflect both early species-typical biases and experience-dependent maturation.

      We positioned our study on social tolerance in a comparative neuroscience framework and introduced a tentative working model that articulates behavioral traits, cognitive dimensions, and their potential subcortical neural substrates

      Drawing upon 18 behavioral traits identified in Thierry’s comparative analyses (Thierry, 2021, 2007), we organize these traits into three core dimensions: socio-cognitive demands, behavioral inhibition, and the predictability of the social environment (Table 1). This conceptualization does not aim to redefine social tolerance itself, but rather to provide a structured basis for testing neuroanatomical hypotheses related to social style variability. It echoes recent efforts to bridge behavioral ecology and cognitive neuroscience by linking specific mental abilities – such as executive functions or metacognition – with distinct prefrontal regions shaped by social and ecological pressures (Bouret et al., 2024).

      “Cross-fostering experiments (De Waal and Johanowicz, 1993), along with our own results, suggest that social tolerance grades reflect both early, possibly innate predispositions and later environmental shaping”.

      (2) It would be valuable to clarify how the older individuals, especially those 20+ years old, may have influenced the observed age-related correlations (e.g., positive in grades 1-2, negative in grades 3-4). Since primates show well-documented signs of aging, some discussion of the potential contribution of advanced age to the results could strengthen the interpretation.

      We thank the reviewer for highlighting this important point. In our dataset, younger and older subjects are underrepresented, but they are distributed across all subgroups. Therefore, we do not think that it could drive the interaction effect we are reporting. In our sample, amygdala volume tended to increase with age in intolerant species and decrease in tolerant species. We included a new analysis (Figure 4) that allows providing a clearer assessment of when social grades 1 vs 4 differed in terms of amygdala and hippocampus volume. While our model accounts for age continuously, we agree that age-related variation deserves cautious interpretation and require longitudinal designs in future studies.

      We also added the following statements in the discussion (lines 386-391)

      “Due to a limited sample size of our study, this crossing trend, already accounted for by our continuous age model, should be further investigated. These results call for cautious interpretation of age-related variation and further emphasize the importance of longitudinal studies integrating both behavioral, cognitive and anatomical data in non-human primates, which would help to better understand the link between social environment and brain development (Song et al., 2021)”.

      (3) The authors categorize the behavioral traits previously described in Thierry (2021) into 3 selfdefined cognitive requirements, however, they do not discuss under what conditions specific traits were assigned to categories or justify why these cognitive requirements were chosen. It is not fully clear from Thierry (2021) alone how each trait would align with the authors' categories. Given that these traits/categories are drawn on for their neuroanatomical hypotheses, it is important that the authors clarify this. It would be helpful to include a table with all behavioral traits with their respective categories, and explain their reasoning for selecting each cognitive requirement category.

      Thank you for this important suggestion. We have extensively revised the introduction to explain how we derived from the scientific literature the three cognitive dimensions—socio-cognitive demands, behavioral inhibition, and predictability of the social environment—. We now provide a complete overview of the 18 behavioral traits described in Thierry’s framework and their cognitive classification in a dedicated table , along with hypothesized neural correlates. We have also mentioned traits that were not classified in our framework along with short justification of this classification. We believe this addition significantly improves the transparency and intelligibility of our conceptual approach.

      “The concept of social tolerance, central to this comparative approach, has sometimes been used in a vague or unidimensional way. As Bernard Thierry (2021) pointed out, the notion was initially constructed around variations in agonistic relationships – dominance, aggressiveness, appeasement or reconciliation behaviors – before being expanded to include affiliative behaviors, allomaternal care or male–male interactions (Thierry, 2021). These traits do not necessarily align along a single hierarchical axis but rather reflect a multidimensional complexity of social style, in which each trait may have co-evolved with others (Thierry, 2021, 2000; Thierry et al., 2004). Moreover, the lack of a standardized scientific definition has sometimes led to labeling species as “tolerant” or “intolerant” without explicit criteria (Gumert and Ho, 2008; Patzelt et al., 2014). These behavioral differences are characterized by different styles of dominance (Balasubramaniam et al., 2012), severity of agonistic interactions (Duboscq et al., 2014), nepotism (Berman and Thierry, 2010; Duboscq et al., 2013; Sueur et al., 2011) and submission signals (De Waal and Luttrell, 1985; Rincon et al., 2023), among the 18 covariant behavioral traits described in Thierry's classification of social tolerance (Thierry, 2021, 2017, 2000)”.

      “To ground the investigation of social tolerance in a comparative neuroanatomical framework, we introduce a tentative working model that articulates behavioral traits, cognitive dimensions, and their potential subcortical neural substrates. Drawing upon 18 behavioral traits identified in Thierry’s comparative analyses (Thierry, 2021, 2007), we organized these traits into three core dimensions: socio-cognitive demands, behavioral inhibition, and the predictability of the social environment (Table 1). This conceptualization does not aim to redefine social tolerance itself, but rather to provide a structured basis for testing neuroanatomical hypotheses related to social style variability. It echoes recent efforts to bridge behavioral ecology and cognitive neuroscience by linking specific mental abilities – such as executive functions or metacognition – with distinct prefrontal regions shaped by social and ecological pressures (Bouret et al., 2024; Testard 2022)”.

      (4) One of the main distinctions the authors make between high social tolerance species and low tolerance species is the level of complex socio-cognitive demands, with more tolerant species experiencing the highest demands. However, socio-cognitive demands can also be very complex for less tolerant species because they need to strategically balance behaviors in the presence of others. The relationships between socio-cognitive demands and social tolerance grades should be viewed in a more nuanced and context-specific manner. 

      We fully agree and we did not mean that intolerant species lives in a ‘simple’ social environment but that the ones of more tolerant species is markedly more demanding. Evidence supporting this statement include their more efficient social networks (Sueur et al., 2011) and more complex communicative skills (e.g. tolerant macaques displayed higher levels of vocal diversity and flexibility than intolerant macaques in social situation with high uncertainty (Rebout et al., 2020).

      In the revised version (lines 106-122), we now highlight that socio-cognitive challenges arise across the tolerance spectrum, including in less tolerant species where strategic navigation of rigid hierarchies and risk-prone interactions is required. We hope that this addition offers a more balanced and nuanced framing of socio-cognitive demands across macaque societies

      “The first category, socio-cognitive demands, refers to the cognitive resources needed to process, monitor, and flexibly adapt to complex social environments. Linking those parameters to neurological data is at the core of the social brain theory to explain the expansion of the neocortex in primates (Dunbar). Macaques social systems require advanced abilities in social memory, perspective-taking, and partner evaluation (Freeberg et al., 2012). This is particularly true in tolerant species, where the increased frequency and diversity of interactions may amplify the demands on cognitive tracking and flexibility. Tolerant macaque species typically live in larger groups with high interaction frequencies, low nepotism, and a wider range of affiliative and cooperative behaviors, including reconciliation, coalition-building, and signal flexibility (REF). Tolerant macaque species also exhibit a more diverse and flexible vocal and facial repertoire than intolerants ones which may help reduce ambiguity and facilitate coordination in dense social networks (Rincon et al., 2023; Scopa and Palagi, 2016; Rebout 2020). Experimental studies further show that macaques can use facial expressions to anticipate the likely outcomes of social interactions, suggesting a predictive function of facial signals in managing uncertainty (Micheletta et al., 2012; Waller et al., 2016). Even within less tolerant species, like M. mulatta, individual variation in facial expressivity has been linked to increased centrality in social networks and greater group cohesion, pointing to the adaptive value of expressive signaling across social styles (Whitehouse et al., 2024)”.

      (5) While the limitations section touches on species-related considerations, the issue of individual variability within species remains important. Given that amygdala volume can be influenced by factors such as social rank and broader life experience, it might be useful to further emphasize that these factors could introduce meaningful variation across individuals. This doesn't detract from the current findings but highlights the importance of considering life history and context when interpreting subcortical volumes-particularly in future studies.

      We have now emphasized this point in the limitations section (lines 441-456). While our current dataset does not allow us to fully control for individual-level variables across all collection centers, we recognize that factors such as rank, social exposure, and individual life history may influence subcortical volumes

      “Although we explained some interspecies variability, adding subjects to our database will increase statistical power and will help addressing potential confounding factors such as age or sex in future studies. One will benefit from additional information about each subject. While considered in our modelling, the social living and husbandry conditions of the individuals in our dataset remain poorly documented. The living environment has been considered, and the size of social groups for certain individuals, particularly for individuals from the CdP, have been recorded. However, these social characteristics have not been determined for all individuals in the dataset. As previously stated, the social environment has a significant impact on the volumetry of certain regions. Furthermore, there is a lack of data regarding the hierarchy of the subjects under study and the stress they experience in accordance with their hierarchical rank and predictability of social outcomes position (McCowan et al., 2022)”. 

      Reviewer #2 (Public review):

      We thank Reviewer #2 for its thoughtful remarks and for acknowledging the value of our comparative approach despite its inherent constraints.

      Summary:

      This comparative study of macaque species and the type of social interaction is both ambitious and inevitably comes with a lot of caveats. The overall conclusion is that more intolerant species have a larger amygdala. There are also opposing development profiles regarding amygdala volume depending on whether it is a tolerant or intolerant species.

      To achieve any sort of power, they have combined data from 4 centres, which have all used different scanning methods, and there are some resolution differences. The authors have also had to group species into 4 classifications - again to assist with any generalisations and power. They have focused on the volumes of two structures, the amygdala and the hippocampus, which seems appropriate. Neither structure is homogeneous and so it may well be that a targeted focus on specific nuclei or subfields would help (the authors may well do this next) - but as the variables would only increase further along with the number of potential comparisons, alongside small group numbers, it seems only prudent to treat these findings are preliminary. That said, it is highly unlikely that large numbers of macaque brains will become available in the near future.

      This introduction is by way of saying that the study achieves what it sets out to do, but there are many reasons to see this study as preliminary. The main message seems to be twofold: (1) that more intolerant species have relatively larger amygdalae, and (2) that with development, there is an opposite pattern of volume change (increasing with age in intolerant species and decreasing with age in tolerant species). Finding 1 is the opposite of that predicted in Table 1 - this is fine, but it should be made clearer in the Discussion that this is the case, otherwise the reader may feel confused. As I read it, the authors have switched their prediction in the Discussion, which feels uncomfortable. 

      We thank the reviewer for this important observation. In the original version, Table 1 presented simplified direct predictions linking social tolerance grades to amygdala and hippocampus volumes. We recognize that this formulation may have created confusion In the revised manuscript, we have thoroughly restructured the table and its accompanying rationale. Table 1 now better reflects our conceptual framework grounded in three cognitive dimensions—sociocognitive demands, behavioral inhibition, and social predictability—each linked to behavioral traits and associated neural hypotheses based on published literature. This updated framework, detailed in lines 144-169 of the introduction, provides a more nuanced basis for interpreting our results and avoids the inconsistencies previously noted. The Discussion was also revised accordingly (lines 329-255) to clarify where our findings diverge from the original predictions and to explore alternative explanations based on social complexity. Rather than directly predicting amygdala size from social tolerance grades, we propose that variation in volume emerges from differing combinations of cognitive pressures across species.

      It is inevitable that the data in a study of this complexity are all too prone to post hoc considerations, to which the authors indulge. In the case of Grade 1 species, the individuals have a lot to learn, especially if they are not top of the hierarchy, but at the same time, there are fewer individuals in the troop, making predictions very tricky. As noted above, I am concerned by the seemingly opposite predictions in Table 1 and those in the Discussion regarding tolerance and amygdala volume. (It may be that the predictions in Table 1 are the opposite of how I read them, in which case the Table and preceding text need to align.)

      In order to facilitate the interpretation of our Bayesian modelling, we have selected a more focused ROI in our automatic segmentation procedure of the Hippocampus (from Hippocampal Formation to Hippocampus) and have added to the new analysis (Figure 4) that helps to properly test whether the hippocampus significantly differs between species from social grade 1 vs 4. The present analysis found that this is the case in adult monkeys. This is therefore consistent with our hypothesis that amygdala volumes are principally explained by heightened sociocognitive demands in more tolerant species.

      We also acknowledge the reviewer’s concerns about the limited generalizability due to our sample. The challenges of comparative neuroimaging in non-human primates—especially when using post-mortem datasets—are substantial. Given the ethical constraints and the rarity of available specimens, increasing the number of individuals or species is not feasible in the short term. However, we have made all data and code publicly available and clearly stated the limitations of our sample in the manuscript. Despite these constraints, we believe our dataset offers an unprecedented comparative perspective, particularly due to the inclusion of rare and tolerant species such as M. tonkeana, M. nigra, and M. thibetana, which have never been included in structural MRI studies before. We hope this effort will serve as a foundation for future collaborative initiatives in primate comparative neuroscience.

      Reviewer #3 (Public review):

      We thank Reviewer #3 for their thoughtful and detailed review. Their comments helped us refine both the conceptual and interpretative aspects of the manuscript. We respond point by point below.

      Summary:

      In this study, the authors were looking at neurocorrelates of behavioural differences within the genus Macaca. To do so, they engaged in real-world dissection of dead animals (unconnected to the present study) coming from a range of different institutions. They subsequently compare different brain areas, here the amygdala and the hippocampus, across species. Crucially, these species have been sorted according to different levels of social tolerance grades (from 1 to 4). 12 species are represented across 42 individuals. The sampling process has weaknesses ("only half" of the species contained by the genus, and Macaca mulatta, the rhesus macaque, representing 13 of the total number of individuals), but also strengths (the species are decently well represented across the 4 grades) for the given purpose and for the amount of work required here. I will not judge the dissection process as I am not a neuroanatomist, and I will assume that the different interventions do not alter volume in any significant ways / or that the different conditions in which the bodies were kept led to the documented differences across species. 

      25 brains were extracted by the authors themselves who are highly with this procedure. Overall, we believe that dissection protocols did not alter the total brain volume. Despite our expertise, we experienced some difficulties to not damage the cerebellum. Therefore, this region was not included in our analysis. We also noted that this brain region was also damaged or absent from the Prime-DE dataset.

      Several protocols were used to prepare and store tissue. It could have impacted the total brain volume.

      We agree that differences in tissue preparation and storage could potentially affect total brain volume. Therefore, we explicitly included the main sample preparation variable — whether brains had been previously frozen — as a covariate in our model. This factor did not explain our results. Moreover, Figures 1D and 1I display the frozen status and its correlation with the amygdala and hippocampus ratios, respectively. Figure 2 shows the parameters of the model and the posterior distributions for the frozen status and total brain volume effects.

      There are two main results of the study. First, in line with their predictions, the authors find that more tolerant macaque species have larger amygdala, compared to the hippocampus, which remains undifferentiated across species. Second, they also identify developmental effects, although with different trends: in tolerant species, the amygdala relative volume decreases across the lifespan, while in intolerant species, the contrary occurs. The results look quite strong, although the authors could bring up some more clarity in their replies regarding the data they are working with. From one figure to the other, we switch from model-calculated ratio to modelpredicted volume. Note that if one was to sample a brain at age 20 in all the grades according to the model-predicted volumes, it would not seem that the difference for amygdala would differ much across grades, mostly driven with Grade 1 being smaller (in line with the main result), but then with Grade 2 bigger than Grade 3, and then Grade 4 bigger once again, but not that different from Grade 2.

      Overall, despite this, I think the results are pretty strong, the correlations are not to be contested, but I also wonder about their real meaning and implications. This can be seen under 3 possible aspects:

      (1)  Classification of the social grade

      While it may be familiar to readers of Thierry and collaborators, or to researchers of the macaque world, there is no list included of the 18 behavioral traits used to define the three main cognitive requirements (socio-cognitive demands, predictability of the environment, inhibitory control). It would be important to know which of the different traits correspond to what, whether they overlap, and crucially, how they are realized in the 12 study species, as there could be drastic differences from one species to the next. For now, we can only see from Table S1 where the species align to, but it would be a good addition to have them individually matched to, if not the 18 behavioral traits, at least the 3 different broad categories of cognitive requirements.

      We fully agree with this observation. In the revised version of the manuscript, we now include a detailed conceptual table listing all 18 behavioral traits from Thierry’s framework. For each trait, we provide its underlying social implications, its associated cognitive dimension (when applicable), and the hypothesized neural correlate. 

      While some traits may could have been arguably classified in several cognitive dimensions (e.g. reconciliation rate), we preferred to assign each to a unique dimension for clarity. Additionally, the introduction (lines 95-169 + Table1) now explains how each trait was evaluated based on existing literature and assigned to one of the three proposed cognitive categories: socio-cognitive demands, behavioral inhibition, or social unpredictability. This structure offers a clearer and more transparent basis for the neuroanatomical hypotheses tested in the study.

      “Navigating social life in primate societies requires substantial cognitive resources: individuals must not only track multiple relationships, but also regulate their own behavior, anticipate others’ reactions, and adapt flexibly to changing social contexts. Taken advantage of databases of magnetic resonance imaging (MRI) structural scans, we conducted the first comparative study integrating neuroanatomical data and social behavioral data from closely related primate species of the same genus to address the following questions: To what extent can differences in volumes of subcortical brain structures be correlated with varying degrees of social tolerance? Additionally, we explored whether these dispositions reflect primarily innate features, shaped by evolutionary processes, or acquired through socialization within more or less tolerant social environments”.

      “The first category, socio-cognitive demands, refers to the cognitive resources needed to process, monitor, and flexibly adapt to complex social environments. Linking those parameters to neurological data is at the core of the social brain theory to explain the expansion of the neocortex in primates (Dunbar). Macaques social systems require advanced abilities in social memory, perspective-taking, and partner evaluation (Freeberg et al., 2012). This is particularly true in tolerant species, where the increased frequency and diversity of interactions may amplify the demands on cognitive tracking and flexibility. Tolerant macaque species typically live in larger groups with high interaction frequencies, low nepotism, and a wider range of affiliative and cooperative behaviors, including reconciliation, coalition-building, and signal flexibility (REF). Tolerant macaque species also exhibit a more diverse and flexible vocal and facial repertoire than intolerants ones which may help reduce ambiguity and facilitate coordination in dense social networks (Rincon et al., 2023; Scopa and Palagi, 2016; Rebout 2020). Experimental studies further show that macaques can use facial expressions to anticipate the likely outcomes of social interactions, suggesting a predictive function of facial signals in managing uncertainty (Micheletta et al., 2012; Waller et al., 2016). Even within less tolerant species, like M. mulatta, individual variation in facial expressivity has been linked to increased centrality in social networks and greater group cohesion, pointing to the adaptive value of expressive signaling across social styles (Whitehouse et al., 2024)”.

      “The second category, inhibitory control, includes traits that involve regulating impulsivity, aggression, or inappropriate responses during social interactions. Tolerant macaques have been shown to perform better in tasks requiring behavioral inhibition and also express lower aggression and emotional reactivity in both experimental and natural contexts (Joly et al., 2017; Loyant et al., 2023). These features point to stronger self-regulation capacities in species with egalitarian or less rigid hierarchies. More broadly, inhibition – especially in its strategic form (self-control) – has been proposed to play a key role in the cohesion of stable social groups. Comparative analyses across mammals suggest that this capacity has evolved primarily in anthropoid primates, where social bonds require individuals to suppress immediate impulses in favour of longer-term group stability (Dunbar and Shultz, 2025). This view echoes the conjecture of Passingham and Wise (2012), who proposed that the emergence of prefrontal area BA10 in anthropoids enabled the kind of behavioural flexibility needed to navigate complex social environments (Passingham et al., 2012)”.

      “The third category, social environment predictability, reflects how structured and foreseeable social interactions are within a given society. In tolerant species, social interactions are more fluid and less kin-biased, leading to greater contextual variation and role flexibility, which likely imply a sustained level of social awareness. In fact, as suggested by recent research, such social uncertainty and prolonged incentives are reflected by stress-related physiology : tolerant macaques such as M. tonkeana display higher basal cortisol levels, which may be indicative of a chronic mobilization of attentional and regulatory resources to navigate less predictable social environments (Sadoughi et al., 2021)”.

      “Each behavioral trait was individually evaluated based on existing empirical literature regarding the types of cognitive operations it likely involves. When a primary cognitive dimension could be identified, the trait was assigned accordingly. However, some behaviors – such as maternal protection, allomaternal care, or delayed male dispersal – do not map neatly onto a single cognitive process. These traits likely emerge from complex configurations of affective and socialmotivational systems, and may be better understood through frameworks such as attachment theory (Suomi, 2008), which emphasizes the integration of social bonding, emotional regulation, and contextual plasticity. While these dimensions fall beyond the scope of the present framework, they offer promising directions for future research, particularly in relation to the hypothalamic and limbic substrates of social and reproductive behavior”.

      “Rather than forcing these traits into potentially misleading categories, we chose to leave them unclassified within our current cognitive framework. This decision reflects both a commitment to conceptual clarity and the recognition that some behaviors emerge from a convergence of cognitive demands that cannot be neatly isolated. This tripartite framework, leaving aside reproductive-related traits, provides a structured lens through which to link behavioral diversity to specific cognitive processes and generate neuroanatomical predictions”.

      (2) Issue of nature vs nurture

      Another way to look at the debate between nature vs nurture is to look at phylogeny. For now, there is no phylogenetic tree that shows where the different grades are realized. For example, it would be illuminating to know whether more related species, independently of grades, have similar amygdala or hippocampus sizes. Then the question will go to the details, and whether the grades are realized in particular phylogenetic subdivisions. This would go in line with the general point of the authors that there could be general species differences.

      As pointed out by Thierry and collaborators, the social tolerance concept is already grounded in a phylogenetic framework as social tolerance matches the phylogenetical tree of these macaque species, suggesting a biological ground of these behavioral observations. Given the modest sample size and uneven species representation, we opted not to adopt tools such as Phylogenetic Generalized Least Squares (PGLS) in our analysis. Our primary aim in this study was to explore neuroanatomical variation as a function of social traits, not to perform a phylogenetic comparative analysis per see. That said, we now explicitly acknowledge this limitation in the Discussion and indicate that future work using larger datasets and phylogenetic methods will be essential to disentangle social effects from evolutionary relatedness. We hope that making our dataset openly available will facilitate such futures analyses.

      With respect to nurture, it is likely more complicated: one needs to take into account the idiosyncrasies of the life of the individual. For example, some of the cited literature in humans or macaques suggests that the bigger the social network, the bigger the brain structure considered. Right, but this finding is at the individual level with a documented life history. Do we have any of this information for any of the individuals considered (this is likely out of the scope of this paper to look at this, especially for individuals that did not originate from CdP)?

      We appreciate this insightful observation. Indeed, findings from studies in humans and nonhuman primates showing associations between brain structure and social network size typically rely on detailed life history and behavioral data at the individual level. Unfortunately, such finegrained information was not consistently available across our entire sample. While some individuals from the Centre de Primatologie (CdP) were housed in known group compositions and social settings, we did not have access to longitudinal social data—such as rank, grooming rates, or network centrality—that would allow for robust individual-level analyses. We now acknowledge this limitation more clearly in the Discussion (lines 436-443), and we fully agree that future work combining neuroimaging with systematic behavioral monitoring will be necessary to explore how species-level effects interact with individual social experience.

      (3) Issue of the discussion of the amygdala's function

      The entire discussion/goal of the paper, states that the amygdala is connected to social life. Yet, before being a "social center", the amygdala has been connected to the emotional life of humans and non-humans alike. The authors state L333/34 that "These findings challenge conventional expectations of the amygdala's primary involvement in emotional processes and highlight the complexity of the amygdala's role in social cognition". First, there is no dichotomy between social cognition and emotion. Emotion is part of social cognition (unless we and macaques are robots). Second, there is nowhere in the paper a demonstration that the differences highlighted here are connected to social cognition differences per se. For example, the authors have not tested, say, if grade 4 species are more afraid of snakes than grade 1 species. If so, one could predict they would also have a bigger amygdala, and they would probably also find it in the model. My point is not that the authors should try to correlate any kind of potential aspect that has been connected to the amygdala in the literature with their data (see for example the nice review by DomínguezBorràs and Vuilleumier, https://doi.org/10.1016/B978-0-12-823493-8.00015-8), but they should refrain from saying they have challenged a particular aspect if they have not even tested it. I would rather engage the authors to try and discuss the amygdala as a multipurpose center, that includes social cognition and emotion.

      We thank the reviewer for this important and nuanced point. We have revised the manuscript to adopt a more cautious and integrative tone regarding the function of the amygdala. In the revised Discussion (lines 341-355), we now explicitly state that the amygdala is involved in a broad range of processes—emotional, social, and affective—and that these domains are deeply intertwined. Rather than proposing a strict dissociation, we now suggest that the amygdala supports integrated socio-emotional functions that are mobilized differently across social tolerance styles. We also cite recent relevant literature (e.g., Domínguez-Borràs & Vuilleumier, 2021) to support this view and have removed any claim suggesting we challenge the emotional function of the amygdala per se. Our aim is to contribute to a richer understanding of how affective and social processes co-construct structural variation in this region.

      Strengths:

      Methods & breadth of species tested.

      Weaknesses:

      Interpretation, which can be described as 'oriented' and should rather offer additional views.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Private Comments:

      (1) Table 1 should be formatted for clarity i.e., bolded table headers, text realignment, and spacing. It was not clear at first glance how information was organized. It may also be helpful to place behavioral traits as the first column, seeing that these traits feed into the author's defined cognitive requirements.

      We have reformatted Table 1 to improve clarity and readability. Behavioral traits now appear in the first column, followed by cognitive dimensions and hypothesized neural correlates. Column headers have been bolded and alignment has been standardized.

      (2) Figures could include more detail to help with interpretations. For example, Figure 3 should define values included on the x-axis in the figure caption, and Figure 4 should explain the use of line, light color, and dark color. Figure 1 does not have a y-axis title.

      The figures have been revised and legends completed to ensure more clarity.

      (3) Please proofread for typos throughout.

      The manuscript has been carefully proofread, and all typographical and grammatical errors have been corrected. These changes are visible in the tracked version.

      Reviewer #2 (Recommendations for the authors):

      Specific comments:

      (1) Given all of the variability would it not be a good idea to just compare (eg in the supplemental) the macaque data from just the Strasbourg centre for m mulatta and m toneanna. I appreciate the ns will be lower, but other matters are more standardized.

      We fully understand the reviewer’s suggestion to restrict the comparison to data collected at a single site in order to minimize inter-site variability. However, as noted, such an analysis would come at the cost of statistical power, as the number of individuals per species within a single center is small. For example, while M. tonkeana is well represented at the Strasbourg centre, only one individual of M. mulatta is available from the same site. Thus, a restricted comparison would severely limit the interpretability of results, particularly for age-related trajectories. To address variability, we included acquisition site and brain preservation method as covariates or predictors where appropriate, and we have been cautious in our interpretations. We also now emphasize in the Methods and Discussion the value of future datasets with more standardized acquisition protocols across species and centers. We hope that by openly sharing our data and workflow, we can contribute to this broader goal.

      (2) I have various minor edits:

      (a) L 25 abstract - Specify what is meant by 'opposite trend'; the reader cannot infer what this is.

      Modified in line 25-28: “Unexpectedly, tolerant species exhibited a decrease in relative amygdala volume across the lifespan, contrasting with the age-related increase observed in intolerant species—a developmental pattern previously undescribed in primates.”

      (b) L67 - The reference 'Manyprimates' needs fixing as it does in the references section.

      After double checking, Manyprimates studies are international collaborative efforts that are supposed to be cite this way (https://manyprimates.github.io/#pubs).

      (c) L74 - Taking not Taken.

      This typo has been corrected.

      (d) L129 - It says 'total volume', but this is corrected total volume?

      We have clarified in the figures legends that the “total brain volume” used in our analyses excludes the cerebellum and the myelencephalon, as specified in our image preprocessing protocol. This ensures consistency across individuals and institutions.

      (e) L138 - Suddenly mentions 'frozen condition' without any prior explanation - this needs explaining in the legend - also L144.

      We have added an explanation of the ‘frozen condition’ variable in in the relevant figure legend.

      (f) L166 - Results - it would be helpful to remind readers what Grade 1 signifies, ie intolerant species.

      We now include a brief reminder in the Results section that Grade 1 corresponds to socially intolerant species, to help readers unfamiliar with the classification (Lines 240-251).

      (g)Figure 4 - Provide the ns for each of the 4 grades to help appreciate the meaningfulness of the curves, etc.

      The number of subjects has been added to the Figure and a novel analysis helps in the revised ms help to appreciate the meaningfulness of some of these curves.

      (h) L235 - 'we had assumed that species of high social tolerance grade would have presented a smaller amygdala in size compared to grade 1'. But surely this is the exact opposite of what is predicted in Table 1 - ie, the authors did not predict this as I read the paper (Unless Table l is misleading/ambiguous and needs clarification).

      As discussed in our response to Reviewer #2 and #3, we have restructured both Table 1 and the Discussion to ensure consistency. We now explicitly state that the findings diverge from our initial inhibitory-control-based prediction and propose alternative interpretations based on sociocognitive demands.

      (i) L270 - 'This observation' which?? Specify.

      We have replaced ‘this observation’ with a precise reference to the observed developmental decrease in amygdala volume in tolerant species.

      (j) L327 - 'groundbreaking' is just hype given that there are so many caveats - I personally do not like the word - novel is good enough.

      We have replaced the word ‘groundbreaking’ with ‘novel’ to adopt a more measured and appropriate tone in the discussion.

      (3) I might add that I am happy with the ethics regarding this study. 

      Thanks, we are also happy that we were able to study macaque brains from different species using opportunistic samplings along with already available data. We are collectively making progress on this!

      (4) Finally, I should commend the authors on all the additional information that they provide re gender/age/species. Given that there are 2xs are many females as males, it would be good to know if this affects the findings. I am not a primatologist, so I don't know, for example, if the females in Grade 1 monkeys are just as intolerant as the males?

      We thank the reviewer for this thoughtful comment. We now explicitly mention the female-biased sex ratio in the Methods section and report in the Results (Figure 2, Figure 3) that sex was included as a covariate in our Bayesian models. While a small effect of sex was found for hippocampal volume, no effect was observed for the amygdala. Given the strong imbalance in our dataset (2:1 female-to-male ratio), we refrained from drawing any conclusion about sex-specific patterns, as these would require larger and more balanced samples. Although we did not test for sex-by-grade interactions, we agree that this question—especially regarding whether females and males express social style differences similarly across grades—represents an important direction for future comparative work.

      Reviewer #3 (Recommendations for the authors):

      I found the article well-written, and very easy to follow, so I have little ways to propose improvements to the article to the authors, besides addressing the various major points when it comes to interpretation of the data.

      One list I found myself wanting was in fact the list of the social tolerance grades, and the process by which they got selected into 3 main bags of socio-cognitive skills. Then it would become interesting to see how each of the 12 species compares within both the 18 grades (maybe once again out of the scope of this paper, there are likely reviews out there that already do that, but then the authors should explicitly mention so in the paper: X, 19XX have compared 15 out of 18 traits in YY number of macaque species); and within the 3 major subcognitive requirements delineated by the authors, maybe as an annex?

      We thank the reviewer for this thoughtful suggestion. In the revised manuscript, we now include a detailed table (Table 1) that lists the 18 behavioral traits derived from Thierry’s framework, along with their associated cognitive dimension and hypothesized neuroanatomical correlate. While we did not create a matrix mapping each of the 12 species across all 18 traits due to space and data availability constraints, we agree this is an important direction that should be tackled by primatologist. We now include a sentence (line 87-90) in the manuscript to guide readers to previous comparative reviews (e.g., Thierry, 2000; Thierry et al., 2004, 2021) that document the expression of these traits across macaque species. We also clarify that our three cognitive categories are conceptual tools intended to structure neuroanatomical predictions, and not formal clusters derived from quantitative analyses.

      In the annex, it would also be good to have a general summarizing excel/R file for the raw data, with important information like age, sex, and the relevant calculated volumes for each individual. The folders available following the links do not make it an easy task for a reader to find the raw data in one place.

      We fully agree with the reviewer on the importance of data accessibility. We have now uploaded an additional supplementary file in .csv format on our OSF repository, which includes individuallevel metadata for all 42 macaques: species, sex, age, social grade, total brain volume, amygdala volume, and hippocampus volume. The link to this file is now explicitly mentioned in the Data Availability section. We hope this will facilitate comparisons with other datasets and improve usability for the community. In addition, we provide in a supplementary table the raw data that were used for our Bayesian modelling (see below).

      The availability of the raw data would also clear up one issue, which I believe results from the modelling process: it looks odd on Figure 2, that volume ratios, defined as the given brain area volume divided by the total brain volume, give values above 1 (especially for the hippocampus). As such, the authors should either modify the legend or the figure. In general, it would be nicer to have the "real values" somewhere easily accessible, so that they can be compared more broadly with: 1) other macaques species to address questions relevant to the species; 2) other primates to address other questions that are surely going to arise from this very interesting work!

      We thank the reviewer for pointing this out. The ratio values in Figure 1 correspond to the proportion of the regional volume (amygdala or hippocampus) relative to the total brain volume, excluding the cerebellum and myelencephalon. As such, values above 0.01 (i.e., above 1% of the brain volume) are expected for these structures and do not indicate an error. We have updated the figure legend to clarify this point explicitly. In addition, we have now made a cleaned .csv file available via OSF, containing all raw volumetric data and metadata in a format that facilitates cross-species or cross-study comparisons. This replaces the previous folder-based structure, which may have been less accessible.

      Typos:

      L233: delete 'in'

      L430: insert space in 'NMT template(Jung et al., 2021).'

    1. L'Implication Affective des Enseignants : Synthèse des Recherches de Maël Virat

      Résumé Exécutif

      Ce document de synthèse analyse les travaux de Maël Virat sur l'implication affective des enseignants et son impact sur les élèves.

      La thèse centrale est que la relation affective enseignant-élève, loin d'être un simple supplément à la pédagogie, est un moteur fondamental de l'apprentissage et du développement de l'élève.

      Cette dynamique s'ancre dans la théorie de l'attachement, où la sécurité affective fournie par l'enseignant libère les capacités d'exploration de l'élève.

      Les points clés sont les suivants :

      1. Sécurité et Exploration : La relation enseignant-élève est gouvernée par la même dynamique "sécurité-exploration" que celle observée entre un parent et son enfant.

      Un enseignant perçu comme une "base de sécurité" permet à l'élève, notamment celui de style d'attachement anxieux, de persévérer face aux difficultés scolaires.

      2. L'Engagement comme Médiateur : Des méta-analyses à grande échelle confirment le lien entre la qualité de la relation affective et la réussite scolaire.

      Cet effet est principalement médiatisé par l'engagement de l'élève : une relation sécurisante favorise la motivation et l'implication, qui à leur tour améliorent les résultats.

      3. L'Amour Compassionnel : Pour caractériser l'implication affective de l'enseignant, Maël Virat propose le concept d'« amour compassionnel ».

      Il s'agit d'un sentiment altruiste, centré sur le bien-être de l'autre, qui se distingue de l'amour romantique ou amical.

      Cet amour se manifeste par l'attention, le soutien comportemental et une sensibilité émotionnelle aux réussites et aux difficultés de l'élève.

      4. Les Facteurs d'Influence : L'implication de l'enseignant n'est pas un trait de personnalité immuable mais dépend fortement du contexte. Les facteurs déterminants incluent :

      Le soutien institutionnel : Le soutien perçu de la part des collègues et de la hiérarchie est directement corrélé à la capacité de l'enseignant à s'investir affectivement auprès de ses élèves. 

      Les croyances professionnelles : L'intention d'un enseignant de fournir un soutien émotionnel est principalement prédite par son attitude (les bénéfices qu'il en retire personnellement en termes de plaisir au travail et de relations), son sentiment de contrôle (se sentir formé, avoir le temps, considérer que cela fait partie de son rôle) et, dans une moindre mesure, par les normes sociales perçues.  

      Le contexte systémique : La taille de l'établissement, la culture professionnelle, et la formation initiale jouent un rôle crucial dans la facilitation ou l'inhibition de ces relations.

      En conclusion, améliorer l'engagement et la réussite des élèves passe par la reconnaissance et la valorisation du rôle affectif des enseignants.

      Cela nécessite des interventions qui ne se limitent pas à l'individu, mais qui agissent sur le système : la formation, la culture d'établissement et le soutien offert aux professionnels de l'éducation.

      --------------------------------------------------------------------------------

      1. Introduction à la Recherche de Maël Virat

      Maël Virat, chercheur en psychologie, concentre une partie significative de ses travaux sur la relation enseignant-élève, bien que ses recherches s'étendent également aux besoins sociaux des adolescents et au vécu des professionnels du travail social, notamment dans la protection de l'enfance.

      Ses travaux mobilisent la théorie de l'attachement comme cadre théorique principal pour analyser les dynamiques relationnelles en milieu scolaire.

      Il est membre d'un groupe de recherche francophone (FREE) qui s'intéresse à la manière de prendre en compte la dimension relationnelle dans la formation, initiale et continue, des enseignants.

      2. La Dynamique d'Attachement dans l'Apprentissage

      2.1. Fondements Théoriques : Sécurité et Exploration

      La théorie de l'attachement, développée par John Bowlby, établit un lien fondamental entre la sécurité affective et le comportement d'exploration.

      Les Expériences de Harlow : Les travaux de Harry Harlow avec des bébés singes ont démontré que le besoin de sécurité affective est primordial.

      Privés de leur mère mais en présence de substituts maternels (l'un en fil de fer nourrissant, l'autre en tissu doux), les singes privilégiaient le contact réconfortant.

      Ce manque de sécurité affective réduisait significativement leurs comportements exploratoires dans un nouvel environnement.

      Une Théorie pour toute la Vie : Cette dynamique n'est pas limitée à la petite enfance.

      Une étude sur des couples mariés a montré que lorsqu'un homme était confronté à une tâche impossible (résoudre des puzzles insolubles), la présence de sa partenaire agissant comme une base de sécurité (encouragements, attention, absence d'interférence) augmentait sa persistance dans la tâche.

      La figure d'attachement principale à l'âge adulte est souvent le partenaire amoureux, suivi par la mère.

      2.2. Application au Contexte Scolaire

      Plusieurs études expérimentales transposent cette dynamique à la relation enseignant-élève, démontrant que l'enseignant peut fonctionner comme une "base de sécurité" qui favorise l'apprentissage.

      Étude 1 : Soutien Émotionnel et Comportements Exploratoires

      Une étude basée sur l'observation de duos enseignant-élève a établi une chaîne causale claire :

      1. Soutien de l'enseignant : Plus l'enseignant manifeste de comportements de soutien émotionnel (temps d'attention, regards, encouragements).

      2. Sécurité de l'élève : Plus l'élève montre des signes de sécurité affective (détente, absence de stress, concentration).

      3. Exploration : Et plus il met en œuvre des comportements exploratoires (persistance face à la difficulté, concentration accrue).

      Étude 2 : L'Amorçage Subliminal par la Photo de l'Enseignant

      Des chercheurs allemands et autrichiens ont mené une expérience où des élèves devaient résoudre des tests psychotechniques.

      Protocole : Avant chaque test, la photo de leur enseignant était projetée de manière subliminale (20 à 40 millisecondes), un temps trop court pour une perception consciente.

      Pour le groupe contrôle, une image brouillée ayant les mêmes propriétés lumineuses était utilisée.

      Condition : Au préalable, les enseignants avaient évalué la qualité de leur relation avec chaque élève via une échelle mesurant la proximité et la chaleur, un outil fortement corrélé aux mesures d'attachement.

      Résultats : La présentation subliminale de la photo de l'enseignant améliorait les performances des élèves uniquement lorsque l'enseignant avait décrit sa relation avec cet élève comme étant chaleureuse, affective et sécurisante.

      Étude 3 : La Persistance des Adolescents face à l'Échec

      Une étude menée en Israël par Mario Mikuliner, spécialiste de l'attachement, a examiné la persistance scolaire chez des adolescents.

      | Variable mesurée | Méthode | | --- | --- | | Style d'attachement de l'élève | Questionnaire évaluant le niveau de sécurité ou d'anxiété dans les relations. | | Perception de l'enseignant comme "base de sécurité" | Questionnaire demandant aux élèves s'ils perçoivent leur professeur principal comme disponible, accueillant et non rejetant. | | Condition expérimentale (3 semaines plus tard) | Groupe expérimental : Exercice de visualisation demandant à l'élève de penser intensément à son professeur principal. <br> Groupe contrôle : Exercice de visualisation demandant de penser à un voisin neutre. | | Mesure de la persistance | Tâche d'association de mots contenant 4 items impossibles à résoudre. La persistance est mesurée par le temps passé sur ces items impossibles avant d'abandonner, comparativement au temps de réponse moyen de l'élève. |

      Résultats principaux :

      • Dans le groupe contrôle (pensée neutre), les élèves au style d'attachement anxieux montrent une persistance significativement plus faible que les autres.

      • Dans le groupe expérimental, le fait de penser à un enseignant perçu comme une base de sécurité compense totalement le déficit de persistance des élèves anxieux. Leur performance devient indiscernable de celle des élèves sécures.

      Conclusion de cette partie : Ces travaux démontrent expérimentalement que la perception d'un enseignant comme une figure sécurisante a un effet direct et mesurable sur les capacités cognitives et la persévérance des élèves, en particulier pour ceux qui sont les plus vulnérables sur le plan affectif.

      3. Impact Global et Nuances

      3.1. La Méta-analyse de Roorda (2017)

      Une méta-analyse majeure réalisée par Débora Roorda, portant sur 189 études et un total de près de 250 000 élèves du primaire et du secondaire, confirme l'importance de la relation affective.

      Lien avec la réussite et l'engagement : Il existe un lien statistique modéré mais robuste et constant entre la qualité de la relation affective enseignant-élève et à la fois l'engagement scolaire et la réussite scolaire.

      Le rôle médiateur de l'engagement : Le principal mécanisme par lequel la relation affective influence la réussite est l'engagement. Une relation positive renforce la motivation et l'implication de l'élève dans les tâches scolaires.

      Ordre de grandeur de l'effet : La relation positive avec les enseignants peut expliquer environ 10% de la variance de l'engagement des élèves.

      Dans le domaine de la psychologie, où il est rare d'expliquer plus de 50% d'un phénomène complexe, ce chiffre est considéré comme important.

      3.2. Qui sont les Figures Sécurisantes à l'École ?

      Une enquête menée par Maël Virat auprès de collégiens via le questionnaire "Who To ?" (Vers qui te tournes-tu en cas de problème ?) apporte des nuances importantes.

      Diversité des figures d'attachement : Si les enseignants sont fréquemment cités comme personnes ressources, les assistants d'éducation (AED) apparaissent également comme des figures sécurisantes majeures.

      Un constat préoccupant : Dans un premier échantillon, 50% des élèves n'ont nommé aucune personne au sein de leur établissement vers qui se tourner.

      Corrélation : Le nombre de personnes sécurisantes citées par un élève est positivement corrélé à sa motivation, son engagement scolaire et son sentiment d'appartenance à l'école.

      4. L'Implication Affective de l'Enseignant

      Face à l'abondance de littérature sur les effets de la relation, Maël Virat a orienté ses recherches sur une question moins explorée : qu'est-ce que l'implication affective du côté de l'enseignant ?

      Son postulat est qu'un élève ne peut se sentir en sécurité affective avec une personne qui n'est pas elle-même impliquée affectivement.

      4.1. La Quête du Bon Concept : L'Amour Compassionnel

      Après avoir écarté des concepts jugés inadéquats :

      La bienveillance : Trop général, pouvant s'appliquer à un voisin dans un train et pas nécessairement doté d'une dimension affective spécifique à la relation pédagogique.

      L'empathie : Décrit davantage une compétence cognitive et émotionnelle mobilisable dans divers contextes (y compris la vente) qu'un engagement relationnel durable.

      Il s'est arrêté sur le concept d'amour compassionnel.

      Définition de l'Amour Compassionnel : C'est une forme d'amour altruiste, centrée sur le bien et le développement de l'autre.

      Dans la théorie de l'attachement, c'est le sentiment éprouvé par la figure de soin (le caregiver) en réponse à l'attachement de l'enfant. Il se construit dans la durée et ne disparaît pas avec la fin de la relation.

      Cet amour se compose de trois dimensions :

      1. Cognitive : Une attention soutenue à l'autre, des efforts pour comprendre sa perspective.

      2. Comportementale : Des actes concrets d'aide, de soutien et de dévouement.

      3. Affective : Une sensibilité à l'état de l'autre, se traduisant par :

      ◦ Des émotions positives (plaisir au contact de l'élève, joie face à ses réussites).  

      ◦ Des émotions négatives (tristesse, peine, lorsque l'élève est en difficulté).  

      Note : Des études par questionnaire montrent que les enseignants reconnaissent plus facilement les émotions positives que les négatives, possiblement en raison de normes professionnelles.

      4.2. L'Interprétation Affective des Pratiques Pédagogiques

      Une hypothèse centrale est que de nombreuses actions perçues comme purement pédagogiques par l'enseignant sont interprétées par l'élève comme des signes d'implication affective.

      Une étude sur des élèves de 4ème en mathématiques a testé cette hypothèse :

      Variable indépendante : La perception par les élèves du "climat de classe" (structure de but), soit centré sur la maîtrise (chacun progresse à son rythme), soit sur la performance (comparaison et classement entre élèves).

      Variable médiatrice : La perception par l'élève de l'amour compassionnel de son enseignant de mathématiques à son égard.

      Variable dépendante : L'engagement affectif de l'élève pour les mathématiques ("j'aime les maths").

      Résultat : Un climat de classe centré sur la maîtrise est positivement lié à l'engagement de l'élève parce qu'il est interprété par ce dernier comme un signe que l'enseignant se soucie de lui et l'aime (amour compassionnel).

      L'efficacité du choix pédagogique passe par sa signification affective.

      5. Les Déterminants de l'Implication Enseignante

      L'amour compassionnel n'est pas une émotion arbitraire ("l'amour ne se commande pas"). Il peut être cultivé et dépend fortement de facteurs contextuels et personnels.

      5.1. Facteurs d'Influence sur la Relation Enseignant-Élève

      | Catégorie de Facteurs | Exemples | | --- | --- | | Facteurs Externes | Taille de l'école et de la classe (plus c'est petit, meilleures sont les relations), type de management du chef d'établissement, culture d'établissement valorisant les relations. | | Facteurs liés à l'Élève | Compétences sociales et scolaires, sexe (très léger effet en faveur des filles). Le facteur le plus puissant est la présence de problèmes de comportement. | | Facteurs liés à l'Enseignant | Quantité et qualité de la formation, état de stress, compétences émotionnelles et sociales, style d'attachement (les enseignants "sécures" ont des relations légèrement meilleures), sentiment d'efficacité, croyances sur leur rôle. |

      5.2. Le Soutien des Pairs comme Catalyseur

      Une étude montre que plus les enseignants déclarent recevoir de soutien de la part de leurs collègues, plus ils rapportent ressentir de l'amour compassionnel pour leurs élèves.

      Cela s'explique par le fait que le système de caregiving (prendre soin) de l'enseignant est d'autant plus actif que son propre système d'attachement est sécurisé par son environnement professionnel.

      5.3. Les Croyances qui Prédisent l'Intention de Soutenir Émotionnellement

      Une étude récente basée sur la théorie du comportement planifié a cherché à identifier les croyances spécifiques qui prédisent l'intention d'un enseignant de s'impliquer dans le soutien émotionnel.

      Le modèle testé explique 68% de la variance de cette intention, un score très élevé.

      Voici les croyances les plus déterminantes, qui constituent des cibles d'action pour la formation :

      1. L'Attitude (ce que l'enseignant pense du soutien émotionnel) L'intention est plus forte quand l'enseignant croit que le soutien émotionnel est bénéfique... pour lui-même.

      • Il améliore ses relations avec les élèves.

      • Il augmente son plaisir au travail.

      • Il renforce son sentiment d'utilité. (Argumenter sur les seuls bénéfices pour l'élève serait donc moins efficace pour motiver les enseignants).

      2. Le Contrôle Comportemental Perçu (se sentir capable) L'intention est plus forte quand l'enseignant :

      • Pense que le soutien émotionnel fait partie intégrante de son travail (et n'est pas "en plus").

      • Pense qu'il a suffisamment de temps pour cela.

      • Se sent formé à cette dimension du métier.

      3. Les Normes Sociales (ce qui est attendu, ce que font les autres)

      Cet aspect a un effet moins fort.

      L'intention est plus forte quand l'enseignant croit que ses collègues investis et compétents fournissent ce type de soutien, et non que seuls ceux qui "ne veulent pas en faire plus" s'en abstiennent.

      6. Conclusion et Perspectives

      La recherche de Maël Virat démontre que l'implication affective de l'enseignant est un pilier de la réussite et du bien-être de l'élève, avec des effets qui s'étendent bien au-delà des apprentissages scolaires (bien-être, symptômes dépressifs, rapport à l'autorité).

      Cette implication, conceptualisée comme de l'amour compassionnel, n'est pas une simple inclination personnelle mais le résultat d'un écosystème complexe.

      Pour la favoriser, il est essentiel d'agir à plusieurs niveaux :

      La formation : Intégrer la dimension relationnelle comme une compétence professionnelle à part entière.

      La culture d'établissement : Promouvoir une culture qui valorise les relations et reconnaît le soutien émotionnel comme partie intégrante du rôle enseignant.

      Le soutien aux professionnels : Assurer que les enseignants eux-mêmes se sentent soutenus par leurs pairs et leur hiérarchie, afin qu'ils puissent à leur tour devenir une base de sécurité pour leurs élèves.

    1. Synthèse sur le Parrainage de Proximité et le Soutien aux Enfants Protégés

      Résumé Exécutif

      Ce document de synthèse analyse le concept du parrainage de proximité comme un levier essentiel de mobilisation de la société civile dans le champ de la protection de l'enfance.

      Basé sur des témoignages et des expertises, il met en lumière comment des citoyens non professionnels peuvent jouer un rôle déterminant dans le parcours de vie d'enfants protégés en créant des liens d'attachement durables.

      Le cas central examiné est celui de Florian, un enseignant devenu le parrain de Dylan, son ancien élève de CP placé en famille d'accueil, illustrant la transformation d'une relation scolaire forte en un engagement personnel et structuré.

      L'analyse détaille le cadre opérationnel proposé par l'association France Parrainage, qui organise ce soutien.

      Le processus, rigoureux et sécurisé, comprend une évaluation des candidats parrains, la vérification des conditions d'accueil, et l'obtention indispensable du consentement de l'enfant et de ses parents.

      Le parrainage se distingue par sa flexibilité, offrant des modalités adaptées comme le "parrainage ciblé" (pour des personnes qui se connaissent déjà) et le "parrainage classique".

      Enfin, le document replace le parrainage dans un contexte plus large d'évolution des solutions d'accueil en protection de l'enfance, aux côtés du mentorat ou de l'accueil par des "tiers dignes de confiance".

      Ces dispositifs, plus souples et "poreux", visent à offrir aux enfants une expérience de vie normalisée et des relations affectives stables, complémentaires à l'accompagnement professionnel.

      --------------------------------------------------------------------------------

      1. Le Parrainage comme Mobilisation de la Société Civile

      Le parrainage de proximité est présenté comme une manifestation concrète de l'engagement de la société civile dans le domaine de la protection de l'enfance.

      L'Observatoire Départemental de la Protection de l'Enfance et de lutte contre les violences intrafamiliales (ODPE) du département du Nord souligne l'importance de ce sujet, qui vise à impliquer des personnes qui ne sont pas nécessairement des professionnels du secteur pour jouer un rôle significatif dans la vie des enfants protégés.

      Le témoignage central de la session, celui de Florian Merlin et de son engagement auprès de Dylan, est positionné comme une illustration de "l'histoire d'une rencontre" et de la création de "liens d'attachement à l'école" qui transcendent le cadre professionnel.

      Ces "savoirs issus de l'expérience" sont considérés comme un complément essentiel aux savoirs scientifiques et professionnels, apportant un éclairage différent et fondamental pour comprendre les enjeux humains du parrainage.

      2. Le Témoignage Central : La Rencontre entre Florian et Dylan

      Le Lien d'Attachement à l'École

      Florian Merlin, professeur des écoles depuis 10 ans, a eu Dylan, un enfant placé en famille d'accueil, dans sa classe de CP durant l'année scolaire 2023-2024.

      Il décrit la naissance d'un lien d'attachement "très naturel et très rapidement".

      Relation Spontanée : Le contact est passé "rapidement, facilement". Dylan venait lui faire un câlin tous les jours et lui tenait la main lors des sorties scolaires.

      Dépassement du Cadre Enseignant-Élève : Florian Merlin a ressenti que ce lien était "plus que ça".

      Un souvenir marquant est celui d'une sortie au cinéma où Dylan, devant le stand de confiseries, a compris de lui-même qu'il ne pouvait rien demander dans le cadre scolaire, illustrant une maturité et une nature particulière de leur relation.

      Soutien Émotionnel et Pédagogique : Dylan demandait beaucoup d'attention. Florian devait parfois s'isoler avec lui pour accueillir ses émotions et le conseiller avant qu'il puisse retourner aux apprentissages.

      De la Relation Enseignant-Élève au Projet de Parrainage

      À la fin de l'année scolaire, la famille d'accueil de Dylan a annoncé son départ, signifiant un changement de lieu de vie pour l'enfant.

      Le Refus de la Rupture : Pour Florian, il était "impensable de ne plus avoir de ses nouvelles".

      La Prise de Contact : En août 2024, il contacte la Maison Départementale de la Solidarité (MDS) de Calais pour prendre des nouvelles. Une interlocutrice lui suggère le parrainage et lui donne les coordonnées de France Parrainage.

      La Période d'Hésitation : Par crainte de créer une situation "compliquée" dans son couple, Florian met le projet de côté jusqu'en janvier. Il continue cependant à penser souvent à Dylan.

      Le Déclencheur : Le jour de l'anniversaire de Dylan, le 15 janvier 2024, le sentiment de ne pas pouvoir "laisser ce petit comme ça" le pousse à contacter définitivement France Parrainage. Les démarches administratives ont débuté en mars.

      3. France Parrainage : Cadre et Modalités du Parrainage de Proximité

      Rachel Lerou, éducatrice spécialisée et référente chez France Parrainage, a détaillé le fonctionnement de l'association, qui existe depuis 1947.

      Définition et Objectifs

      Deux Pôles d'Activité : L'association dispose d'un pôle international (soutien financier) et d'un pôle de parrainage de proximité, qui est au cœur du sujet.

      Mission Principale : Le parrainage de proximité consiste à "soutenir un enfant dans la création de liens" durables.

      L'objectif est de faire comprendre à l'enfant "qu'il compte pour quelqu'un".

      Pour certains enfants, notamment les pupilles de l'État, les parrains et marraines sont les "seules personnes hors professionnel qui sont dans leur vie".

      Public et Durée : L'accompagnement concerne les enfants de 2 à 18 ans, avec une possibilité de poursuite jusqu'à 21 ans. L'association souligne : "on sait à quel moment on commence, on sait pas à quel moment on finira".

      Le Processus de Validation des Parrains

      Le parcours pour devenir parrain ou marraine est structuré en plusieurs étapes, d'une durée d'environ deux mois.

      1. Réunion d'Information : Première étape pour présenter le dispositif.

      2. Formulaire de Demande : Formalisation de la candidature.

      3. Première Évaluation : Un entretien pour explorer les motivations et le sens du projet pour le candidat.

      4. Deuxième Évaluation à Domicile : Une visite pour vérifier que l'enfant sera accueilli "dans de bonnes conditions". La validation des lieux est effectuée même si des nuitées ne sont pas prévues initialement.

      5. Commission de Validation : Échange final sur le projet et validation de la candidature.

      Parrainage Ciblé vs. Parrainage Classique

      Parrainage Classique : La majorité des candidats ("les trois quarts de nos parrains/marraines") souhaitent passer du temps avec un enfant qu'ils ne connaissent pas. L'association se charge alors de trouver une correspondance.

      Parrainage Ciblé : Le cas de Florian et Dylan est un "parrainage ciblé", où deux personnes qui se connaissent déjà souhaitent formaliser et encadrer leur relation dans un autre cadre.

      Le Rôle Crucial du Consentement

      Le parrainage ne peut se mettre en place sans l'accord de toutes les parties.

      L'Avis de l'Enfant : La parole de l'enfant est sollicitée. Dans le cas de Dylan, une rencontre a été organisée à l'antenne d'Arras.

      Il a "très très vite compris qu'il allait revoir Florian et il était très content et très impatient". Si l'enfant refuse, le projet n'aboutit pas.

      L'Accord des Parents : L'accord des détenteurs de l'autorité parentale est également obligatoire. La mère de Dylan ne s'est pas opposée.

      L'association travaille à rassurer les parents "frileux" en leur expliquant qu'ils ne "perdent pas leur place de parents".

      4. La Mise en Œuvre du Parrainage : Le Quotidien de Florian et Dylan

      Le parrainage de Dylan par Florian est effectif depuis septembre.

      Rythme et Nature des Rencontres

      Fréquence : Dylan est accueilli environ deux week-ends par mois ("à peu près deux fois par mois").

      Phase d'Essai : Les trois premiers mois constituent une phase de test, initialement avec des journées sans nuitée (sauf une nuitée "exceptionnelle"). Un bilan est prévu le 10 décembre pour officialiser la poursuite du parrainage, qui inclura alors des nuitées régulières et des vacances.

      Intégration Familiale : Dylan s'est intégré "très naturellement" dans la vie de famille de Florian, rencontrant sa famille, sa belle-famille et ses amis. Il apprécie également les moments plus calmes "rien qu'à trois à la maison".

      La Collaboration avec les Acteurs

      La réussite du parrainage repose sur une bonne coordination entre les différentes personnes qui entourent l'enfant.

      Famille d'Accueil : Les relations avec la nouvelle famille d'accueil sont excellentes. Ils sont décrits comme "très ouverts" et favorisant le parrainage. Des temps d'échange de 15-20 minutes ont lieu à chaque fois.

      Services Sociaux : La collaboration avec la référente Aide Sociale à l'Enfance (ASE) de Dylan à la MDS est très bonne, ce qui a facilité la mise en place du projet.

      La Distinction des Rôles

      Un point essentiel est la transition du rôle d'enseignant à celui de parrain.

      Le Cadre Scolaire : Florian a clairement expliqué à Dylan qu'il n'était "pas là pour lui faire faire les devoirs". L'école reste importante, mais le temps de parrainage est dédié à d'autres activités.

      Spontanéité : Dylan a bien intégré ce nouveau cadre, appelant Florian "parfois Florian, parfois Parrain". Il lui arrive de réciter spontanément ses poésies, mais ce n'est pas une attente formelle.

      Prévention des Amalgames : France Parrainage favorise une fréquence d'accueil régulière (un week-end sur deux) pour que l'enfant ne se projette pas sur un accueil à long terme chez son parrain, son lieu de vie principal demeurant la famille d'accueil.

      5. Perspectives et Enjeux du Parrainage

      La discussion a élargi le sujet à des considérations plus générales sur le parrainage en protection de l'enfance.

      Profil des Parrains et Marraines

      Il a été noté qu'un nombre significatif de parrains et marraines sont des enseignants et des travailleurs sociaux.

      Ce constat suggère que les professionnels qui développent des liens particuliers dans le cadre de leur travail peuvent être amenés à "franchir un autre pas" vers un engagement personnel.

      Sécurité et Évolution des Dispositifs

      Vérifications de Sécurité : Un point important a été soulevé : le processus de recrutement des parrains inclut toutes les "sécurités qui sont vérifiées" pour ne pas confier un enfant à un adulte qui pourrait lui nuire davantage.

      "Porosité" des Solutions d'Accueil : Le parrainage s'inscrit dans un mouvement vers des solutions plus souples et diversifiées.

      Il existe aujourd'hui une "porosité beaucoup plus importante des possibilités d'accueil" qu'il y a 10 ans. Des dispositifs comme l'accueil bénévole durable ou l'accueil par un tiers digne de confiance (TDC) se développent. Parfois, un parrainage peut évoluer vers un statut de TDC.

      Normalisation de l'Expérience de l'Enfant : Ces solutions permettent de "remettre l'enfant dans des choses qui relèvent un peu de la normalité", comme passer du temps simple en famille, aller au bowling, etc., des activités qui ne sont pas toujours possibles dans les structures d'accueil traditionnelles.

      Résultats à Long Terme

      Bien que l'antenne du Pas-de-Calais n'ait que 5 ans d'existence, des antennes plus anciennes comme celle de Picardie (30 ans) rapportent des retours d'expérience très positifs.

      De nombreux parrainages se poursuivent à l'âge adulte sous la forme d'une relation "d'adulte à adulte", avec des échanges de nouvelles et des présentations de famille.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      The authors present a novel usage of fluorescence lifetime imaging microscopy (FLIM) to measure NAD(P)H autofluorescence in the Drosophila brain, as a proxy for cellular metabolic/redox states. This new method relies on the fact that both NADH and NADPH are autofluorescent, with a different excitation lifetime depending on whether they are free (indicating glycolysis) or protein-bound (indicating oxidative phosphorylation). The authors successfully use this method in Drosophila to measure changes in metabolic activity across different areas of the fly brain, with a particular focus on the main center for associative memory: the mushroom body.

      Strengths:

      The authors have made a commendable effort to explain the technical aspects of the method in accessible language. This clarity will benefit both non-experts seeking to understand the methodology and researchers interested in applying FLIM to Drosophila in other contexts.

      Weaknesses:

      (1) Despite being statistically significant, the learning-induced change in f-free in α/β Kenyon cells is minimal (a decrease from 0.76 to 0.73, with a high variability). The authors should provide justification for why they believe this small effect represents a meaningful shift in neuronal metabolic state.

      We agree with the reviewer that the observed f_free shift averaged per individual, while statistically significant, is small. However, to our knowledge, this is the first study to investigate a physiological (i.e., not pharmacologically induced) variation in neuronal metabolism using FLIM. As such, there are no established expectations regarding the amplitude of the effect. In the revised manuscript, we have included an additional experiment involving the knockdown of ALAT in α/β Kenyon cells, which further supports our findings. We have also expanded the discussion to expose two potential reasons why this effect may appear modest.

      (2) The lack of experiments examining the effects of long-term memory (after spaced or massed conditioning) seems like a missed opportunity. Such experiments could likely reveal more drastic changes in the metabolic profiles of KCs, as a consequence of memory consolidation processes.

      We agree with the reviewer that investigating the effects of long-term memory on metabolism represent a valuable future path of investigation. An intrinsic caveat of autofluorescence measurement, however, is to identify the cellular origin of the observed changes. To this respect, long-term memory formation is not an ideal case study as its essential feature is expected to be a metabolic activation localized to Kenyon cells’ axons in the mushroom body vertical lobes (as shown in Comyn et al., 2024), where many different neuron subtypes send intricate processes. This is why we chose to first focus on middle-term memory, where changes at the level of the cell bodies could be expected from our previous work (Rabah et al., 2022). But our pioneer exploration of the applicability of NAD(P)H FLIM to brain metabolism monitoring in vivo now paves the way to extending it to the effect of other forms of memory.

      (3) The discussion is mostly just a summary of the findings. It would be useful if the authors could discuss potential future applications of their method and new research questions that it could help address.

      The discussion has been expanded by adding interpretations of the findings and remaining challenges.

      Reviewer #2 (Public review):

      This manuscript presents a compelling application of NAD(P)H fluorescence lifetime imaging (FLIM) to study metabolic activity in the Drosophila brain. The authors reveal regional differences in oxidative and glycolytic metabolism, with a particular focus on the mushroom body, a key structure involved in associative learning and memory. In particular, they identify metabolic shifts in α/β Kenyon cells following classical conditioning, consistent with their established role in energy-demanding middle- and long-term memories.

      These results highlight the potential of label-free FLIM for in-vivo neural circuit studies, providing a powerful complement to genetically encoded sensors. This study is well-conducted and employs rigorous analysis, including careful curve fitting and well-designed controls, to ensure the robustness of its findings. It should serve as a valuable technical reference for researchers interested in using FLIM to study neural metabolism in vivo. Overall, this work represents an important step in the application of FLIM to study the interactions between metabolic processes, neural activity, and cognitive function.

      Reviewer #3 (Public review):

      This study investigates the characteristics of the autofluorescence signal excited by 740 nm 2-photon excitation, in the range of 420-500 nm, across the Drosophila brain. The fluorescence lifetime (FL) appears bi-exponential, with a short 0.4 ns time constant followed by a longer decay. The lifetime decay and the resulting parameter fits vary across the brain. The resulting maps reveal anatomical landmarks, which simultaneous imaging of genetically encoded fluorescent proteins helps to identify. Past work has shown that the autofluorescence decay time course reflects the balance of the redox enzyme NAD(P)H vs. its protein-bound form. The ratio of free-to-bound NADPH is thought to indicate relative glycolysis vs. oxidative phosphorylation, and thus shifts in the free-to-bound ratio may indicate shifts in metabolic pathways. The basics of this measure have been demonstrated in other organisms, and this study is the first to use the FLIM module of the STELLARIS 8 FALCON microscope from Leica to measure autofluorescence lifetime in the brain of the fly. Methods include registering the brains of different flies to a common template and masking out anatomical regions of interest using fluorescence proteins.

      The analysis relies on fitting an FL decay model with two free parameters, f_free and t_bound. F_free is the fraction of the normalized curve contributed by a decaying exponential with a time constant of 0.4 ns, thought to represent the FL of free NADPH or NADH, which apparently cannot be distinguished. T_bound is the time constant of the second exponential, with scalar amplitude = (1-f_free). The T_bound fit is thought to represent the decay time constant of protein-bound NADPH but can differ depending on the protein. The study shows that across the brain, T_bound can range from 0 to >5 ns, whereas f_free can range from 0.5 to 0.9 (Figure 1a). These methods appear to be solid, the full range of fits are reported, including maximum likelihood quality parameters, and can be benchmarks for future studies.

      The authors measure the properties of NADPH-related autofluorescence of Kenyon Cells(KCs) of the fly mushroom body. The results from the three main figures are:

      (1) Somata and calyx of mushroom bodies have a longer average tau_bound than other regions (Figure 1e);

      (2) The f_free fit is higher for the calyx (input synapses) region than for KC somata (Figure 2b);

      (3) The average across flies of average f_free fits in alpha/beta KC somata decreases from 0.734 to 0.718. Based on the first two findings, an accurate title would be "Autofluorecense lifetime imaging reveals regional differences in NADPH state in Drosophila mushroom bodies."

      The third finding is the basis for the title of the paper and the support for this claim is unconvincing. First, the difference in alpha/beta f_free (p-value of 4.98E-2) is small compared to the measured difference in f_free between somas and calyces. It's smaller even than the difference in average soma f_free across datasets (Figure 2b vs c). The metric is also quite derived; first, the model is fit to each (binned) voxel, then the distribution across voxels is averaged and then averaged across flies. If the voxel distributions of f_free are similar to those shown in Supplementary Figure 2, then the actual f_free fits could range between 0.6-0.8. A more convincing statistical test might be to compare the distributions across voxels between alpha/beta vs alpha'/beta' vs. gamma KCs, perhaps with bootstrapping and including appropriate controls for multiple comparisons.

      The difference observed is indeed modest relative to the variability of f_free measurements in other contexts. The fact that the difference observed between the somata region and the calyx is larger is not necessarily surprising. Indeed, these areas have different anatomical compositions that may result in different basal metabolic profiles. This is suggested by Figure 1b which shows that the cortex and neuropile have different metabolic signatures. Differences in average f_free values in the somata region can indeed be observed between naive and conditioned flies. However, all comparisons in the article were performed between groups of flies imaged within the same experimental batches, ensuring that external factors were largely controlled for. This absence of control makes it difficult to extract meaningful information from the comparison between naive and conditioned flies.

      We agree with the reviewer that the choice of the metric was indeed not well justified in the first manuscript. In the new manuscript, we have tried to illustrate the reasons for this choice with the example of the comparison of f_free in alpha/beta neurons between unpaired and paired conditioning (Dataset 8). First, the idea of averaging across voxels is supported by the fact that the distributions of decay parameters within a single image are predominantly unimodal. Examples for Dataset 8 are now provided in the new Sup. Figure 14. Second, an interpretable comparison between multiple groups of distributions is, to our knowledge, not straightforward to implement. It is now discussed in Supplementary information. To measure interpretable differences in the shapes of the distributions we computed the first three moments of distributions of f_free for Dataset 8 and compared the values obtained between conditions (see Supplementary information and new Sup. Figure 15). Third, averaging across individuals allows to give each experimental subject the same weight in the comparisons.

      I recommend the authors address two concerns. First, what degree of fluctuation in autofluorescence decay can we expect over time, e.g. over circadian cycles? That would be helpful in evaluating the magnitude of changes following conditioning. And second, if the authors think that metabolism shifts to OXPHOS over glycolosis, are there further genetic manipulations they could make? They test LDH knockdown in gamma KCs, why not knock it down in alpha/beta neurons? The prediction might be that if it prevents the shift to OXPHOS, the shift in f_free distribution in alpha/beta KCs would be attenuated. The extensive library of genetic reagents is an advantage of working with flies, but it comes with a higher standard for corroborating claims.

      In the present study, we used control groups to account for broad fluctuations induced by external factors such as the circadian cycle. We agree with the reviewer that a detailed characterization of circadian variations in the decay parameters would be valuable for assessing the magnitude of conditioning-induced shifts. We have integrated this relevant suggestion in the Discussion. Conducting such an investigation lies unfortunately beyond the scope and means of the current project.

      In line with the suggestion of the reviewer, we have included a new experiment to test the influence of the knockdown of ALAT on the conditioning-induced shift measured in alpha/beta neurons. This choice is motivated in the new manuscript. The obtained result shows that no shift is detected in the mutant flies, in accordance with our hypothesis.

      FLIM as a method is not yet widely prevalent in fly neuroscience, but recent demonstrations of its potential are likely to increase its use. Future efforts will benefit from the description of the properties of the autofluorescence signal to evaluate how autofluorescence may impact measures of FL of genetically engineered indicators.

      Recommendations for the authors

      Reviewer #1 (Recommendations for the authors):

      (1) Y axes in Figures 1e, 2c, 3b,c are misleading. They must start at 0.

      Although we agree that making the Y axes start at 0 is preferable, in our case it makes it difficult to observe the dispersion of the data at the same time (your next suggestion). To make it clearer to the reader that the axes do not start at 0, a broken Y-axis is now displayed in every concerned figure.

      (2) These same plots should have individual data points represented, for increased clarity and transparency.

      Individual data points were added on all boxplots.

      Reviewer #2 (Recommendations for the authors):

      I am evaluating this paper as a fly neuroscientist with experience in neurophysiology, including calcium imaging. I have little experience with FLIM but anticipate its use growing as more microscopes and killer apps are developed. From this perspective, I value the opportunity to dig into FLIM and try to understand this autofluorescence signal. I think the effort to show each piece of the analysis pipeline is valuable. The figures are quite beautiful and easy to follow. My main suggestion is to consider moving some of the supplemental data to the main figures. eLife allows unlimited figures, moving key pieces of the pipeline to the main figures would make for smoother reading and emphasize the technical care taken in this study.

      We thank the reviewer for their feedback. Following their advice we have moved panels from the supplementary figures to the main text (see new Figure 2).

      Unfortunately, the scientific questions and biological data do not rise to the typical standard in the field to support the claims in the title, "In vivo autofluorescence lifetime imaging of the Drosophila brain captures metabolic shifts associated with memory formation". The authors also clearly state what the next steps are: "hypothesis-driven approaches that rely on metabolite-specific sensors" (Intro). The advantage of fly neuroscience is the extensive library of genetic reagents that enable perturbations. The key manipulation in this study is the electric shock conditioning paradigm that subtly shifts the distribution of a parameter fit to an exponential decay in the somas of alpha/beta KCs vs others. This feels like an initial finding that deserves follow-up; but is it a large enough result to motivate a future student to pick this project up? The larger effect appears to be the gradients in f_free across KCs overall (Figure 2b). How does this change with conditioning?

      We acknowledge that the observed metabolic shift is modest relative to the variability of f_free and agree that additional corroborating experiments would further strengthen this result. Nevertheless, we believe it remains a valid and valuable finding that will be of interest to researchers in the field. The reviewer is right in pointing out that the gradient across KCs is higher in magnitude, however, the fact that this technique can also report experience-dependent changes, in addition to innate heterogeneities across different cell types, is a major incentive for people who could be interested in applying NAD(P)H FLIM in the future. For this reason, we consider it appropriate to retain mention of the memory-induced shift in the title, while making it less assertive and adding a reference to the structural heterogeneities of f_free revealed in the study. We have also rephrased the abstract to adopt a more cautious tone and expanded the discussion to clarify why a low-magnitude shift in f_free can still carry biological significance in this context. Finally, we have added the results of a new set of data involving the knockdown of ALAT in Kenyon cells, to further support the relevance of our observation relative to memory formation, despite its small magnitude. We believe that these elements together form a good basis for future investigations and that the manuscript merits publication in its present form.

      Together, I would recommend reshaping the paper as a methods paper that asks the question, what are the spatial properties of NADPH FL across the brain? The importance of this question is clear in the context of other work on energy metabolism in the MBs. 2P FLIM will likely always have to account for autofluorescence, so this will be of interest. The careful technical work that is the strength of the manuscript could be featured, and whether conditioning shifts f_free could be a curio that might entice future work.

      By transferring panels of the supplementary figures to the main text (see new Figure 2) as suggested by Reviewer 2, we have reinforced the methodological part of the manuscript. For the reasons explained above, we however still mention the ‘biological’ findings in the title and abstract.

      Minor recommendations on science:

      Figure 2C. Plotting either individual data points or distributions would be more convincing.

      Individual data points were added on all boxplots.

      There are a few mentions of glia. What are the authors' expectations for metabolic pathways in glia vs. neurons? Are glia expected to use one more than the other? The work by Rabah suggests it should be different and perhaps complementary to neurons. Can a glial marker be used in addition to KC markers? This seems crucial to being able to distinguish metabolic changes in KC somata from those in glia.

      Drosophila cortex glia are thought to play a similar role as astrocytes in vertebrates (see Introduction). In that perspective, we expect cortex glia to display a higher level of glycolysis than neurons. The work by Rabah et al. is coherent with this hypothesis. Reviewer 2 is right in pointing out that using a glial marker would be interesting. However, current technical limitations make such experiments challenging. These limitations are now exposed in the discussion.

      The question of whether KC somata positions are stereotyped can probably be answered in other ways as well. For example, the KCs are in the FAFB connectomic data set and the hemibrain. How do the somata positions compare?

      The reviewer’s suggestion is indeed interesting. However, the FAFB and hemibrain connectomic datasets are based on only two individual flies, which probably limits their suitability for assessing the stereotypy of KC subtype distributions. In addition, aligning our data with the FAFB dataset would represent substantial additional work.

      The free parameter tau_bound is mysterious if it can be influenced by the identity of the protein. Are there candidate NADPH binding partners that have a spatial distribution in confocal images that could explain the difference between somas and calyx?

      There are indeed dozens of NADH- or NADPH-binding proteins. For this reason, in all studies implementing exponential fitting of metabolic FLIM data, tau_bound is considered a complex combination of the contributions from many different proteins. In addition, one should keep in mind that the number of cell types contributing to the autofluorescence signal in the mushroom body calyx (Kenyon cells, astrocyte-like and ensheathing glia, APL neurons, olfactory projection neurons, dopamine neurons) is much higher than in the somas (only Kenyon cells and cortex glia). This could also participate in the observed difference. Hence, focusing on intracellular heterogeneities of potential NAD(P)H binding partners seems premature at that stage.

      The phrase "noticeable but not statistically significant" is misleading.

      We agree with the reviewer and have removed “noticeable but” from the sentence in the new version of the manuscript.

      Minor recommendations on presentation:

      The Introduction can be streamlined.

      We agree that some parts of the Introduction can seem a bit long for experts of a particular field. However, we think that this level of detail makes the article easily accessible for neuroscientists working on Drosophila and other animal models but not necessarily with FLIM, as well as for experts in energy metabolism that may be familiar with FLIM but not with Drosophila neuroscience.

    1. Attachement Fragilisé : Enjeux et Stratégies pour le Parcours Scolaire des Jeunes Protégés

      Résumé Exécutif

      Ce document de synthèse analyse les enjeux complexes liés à l'attachement fragilisé chez les jeunes relevant de la protection de l'enfance et de la protection judiciaire de la jeunesse, en s'appuyant sur les témoignages de professionnels du secteur.

      Il ressort que ces jeunes, souvent issus de systèmes familiaux extrêmement dégradés sur les plans économique, sanitaire et social, présentent des difficultés multiples qui impactent directement leur disponibilité pour les apprentissages.

      Les points critiques sont les suivants :

      Le Contexte Socio-économique : La réalité des familles est marquée par une précarité extrême (chômage, incarcération, addictions), loin des vignettes cliniques classiques.

      La Disponibilité Psychique Limitée : Bien que beaucoup de jeunes parviennent à se conformer aux normes scolaires durant la journée, leur énergie psychique s'épuise.

      Le soir, en institution, les angoisses (abandon, manque) resurgissent, rendant le travail scolaire presque impossible.

      Le Rôle Ambivalent de l'École : L'école est perçue à la fois comme un lieu de normalité essentiel, où l'enfant peut être "juste un élève", et une source de stress intense pour ceux dont la scolarité devient une stratégie de survie.

      La Posture Professionnelle : La clé de l'accompagnement réside dans une posture juste et prévisible.

      Les professionnels (éducateurs, assistants familiaux) doivent construire un lien de confiance dans la durée, en restant à leur place, sans se substituer aux parents ou aux enseignants.

      La Collaboration Interinstitutionnelle : Bien qu'indispensable, la collaboration entre les services de protection de l'enfance et l'Éducation Nationale se heurte à des freins structurels (rythmes de travail divergents) et à un débat sur le niveau d'information à partager concernant le parcours de l'enfant.

      En conclusion, la réussite de ces jeunes dépend d'une approche coordonnée et bienveillante, axée sur la valorisation de leurs compétences, la création d'un sentiment de sécurité et d'appartenance, et une communication fluide et préventive entre tous les acteurs impliqués.

      --------------------------------------------------------------------------------

      1. Profil et Manifestations de l'Attachement Fragilisé

      La table ronde met en lumière les caractéristiques et les défis quotidiens des jeunes protégés, dont le parcours est marqué par un attachement insécure ou fragilisé.

      1.1. Un Contexte Familial et Social Sévèrement Dégradé

      Pascal Abdakovi, directeur d'une Maison d'Enfants à Caractère Social (MECS), souligne un décalage majeur entre les vignettes cliniques théoriques et la réalité du terrain.

      Contrairement aux exemples de parents insérés professionnellement, la sociologie des familles accompagnées dans le Pas-de-Calais est marquée par une précarité extrême.

      Absence d'Insertion Professionnelle : Sur 280 parents suivis, "une dizaine de parents qui travaillent tout au plus".

      Problématiques Lourdes : Un nombre significativement plus élevé de parents est "incarcérés ou hospitalisés" que de parents en activité professionnelle.

      Facteurs Multiples : Les systèmes familiaux sont "très très fortement dégradés sur le plan économique, sur le plan de la santé mentale, sur le plan des addictions".

      1.2. Témoignages des Assistantes Familiales sur le Quotidien

      Les observations recueillies par Lidy Poevin auprès de deux assistantes familiales, Caroline de Velter et Sandrine Belligas, décrivent les manifestations concrètes de cet attachement fragilisé :

      Difficultés d'Apprentissage et Troubles Associés : Les enfants présentent souvent des retards et des troubles du sommeil, de l'alimentation et de la motricité. Les plus grands montrent un manque d'assiduité, de motivation et d'intérêt pour l'école.

      Insécurité et Conflit de Loyauté : Les contacts "en montagne russes" avec les parents biologiques génèrent un "grand sentiment d'insécurité, conflit de loyauté et une autoprotection envers l'attachement".

      Mise à l'Épreuve Constante : Les enfants testent la capacité des adultes "à tenir et à être toujours là quoi qu'il fasse", cherchant une attention exclusive.

      Comportements "Chronophages" : Ils sont décrits comme des "enfants chronophages" qui monopolisent l'attention, parfois par des bêtises, car "ils savent que c'est un moyen de mobiliser le plus de personnes possibles".

      Impact des Visites Parentales : Les troubles du comportement sont particulièrement marqués "la veille et les lendemains de visite", surtout si celle-ci se passe mal ou est annulée.

      Leur vécu familial est "toujours présent à chaque visite, à chaque appel".

      2. Conséquences sur la Scolarité et la Vie en Collectivité

      L'attachement fragilisé a des répercussions directes et profondes sur la capacité des jeunes à s'investir dans les apprentissages et à interagir au sein de leurs différents lieux de vie.

      2.1. Le Phénomène de la Double Disponibilité : École vs Institution

      Pascal Abdakovi décrit un phénomène courant chez les jeunes qui vont "plutôt bien".

      Adaptation en Milieu Scolaire : Pendant la journée, à l'école, ces enfants fonctionnent bien.

      Ils répondent à leur "envie de normalité" dans un environnement où ils sont face à "des adultes qui n'entravent rien à la protection de l'enfance".

      Ils peuvent encore avoir un "œil assez pétillant" et un "désir d'apprendre".

      Épuisement Psychique le Soir : Le retour en MECS le soir marque une rupture.

      Le jour décline, "les angoisses remontent : angoisses d'abandon, le manque des parents". L'enfant redevient un "enfant placé".

      Indisponibilité pour le Travail Scolaire : En fin de journée, la disponibilité psychique pour les devoirs est "souvent absente".

      Comme l'exprime Pascal Abdakovi : "n'en jetez plus la cour est pleine et ils sont complètement inaccessibles". Cette indisponibilité n'est pas une question de moyens mais de saturation psychique.

      2.2. Le Cas Spécifique des Adolescents Incarcérés

      Sophie Nicolas, responsable en Établissement Pénitentiaire pour Mineurs (EPM), décrit des jeunes "extrêmement abîmés" par des parcours institutionnels lourds et des ruptures de liens familiaux.

      | Comportement Observé | Analyse et Cause | | --- | --- | | Troubles Relationnels Extrêmes | Demande d'attention extrême ("collé à la jambe de l'éducateur") ou, à l'inverse, une mise à distance radicale avec l'adulte. | | Test Constant du Lien | Les jeunes cherchent à voir "jusqu'où l'adulte tiendra avec lui" et s'il vivra un "énième abandon". | | Estime de Soi Dégradée | Ils se dévalorisent fortement et ne comprennent pas quand un adulte pose un regard positif sur eux. | | Indisponibilité pour les Apprentissages | Malgré une scolarité obligatoire, ils sont focalisés sur d'autres inquiétudes, notamment familiales. L'exemple est donné d'un jeune angoissé pour sa mère, incapable de s'investir scolairement. |

      3. Stratégies d'Accompagnement et Postures Professionnelles

      Face à ces défis, les intervenants proposent des postures et des stratégies concrètes visant à créer un environnement sécurisant et propice au développement.

      3.1. Créer un Cadre Sécurisant et Prévisible

      Pascal Abdakovi insiste sur la nécessité de construire le lien "dans la durée" en organisant la prévisibilité.

      Rendre l'Environnement Lisible : "Leur permettre de savoir qui vont être les adultes présents le matin au lever, qui vont être les adultes présents au retour de l'école".

      Adopter une Posture Juste : Chaque professionnel doit "parler de la bonne place".

      L'éducateur n'est "pas le parent, pas l'enseignant, pas le juge". Ce respect des rôles est essentiel pour l'enfant, qui a un "fort besoin de normalité".

      3.2. Travailler sur l'Appartenance et la Valorisation

      Nadine Musinski, pilote de projet au service adoption, met en avant l'importance de la notion d'appartenance pour les pupilles de l'État, des enfants qui ont un "sentiment d'exister pour personne".

      Redonner une Place : Le fait de se réunir autour de l'enfant, de prendre son avis en compte, lui permet de "commencer à compter pour quelqu'un".

      Diluer la Culpabilité : Il est crucial d'aider l'enfant à comprendre les raisons de son placement pour qu'il ne se sente pas responsable.

      "Si l'adulte ne l'aide pas à diluer les responsabilités [...], il est persuadé que c'est lui qui est [...] victime de ce qui a causé ce délaissement".

      Appuyer sur les Compétences : Plutôt que de pointer les échecs, il faut "appuyer la compétence".

      Pointer ce que l'enfant ne sait pas faire "vient renforcer l'idéologie qu'ils ne sont bons à rien et qu'ils ne sont pas aimables".

      Éviter le Rapport de Force : Face à des enfants habitués à l'adversité, l'autorité punitive est inefficace.

      La négociation et la recherche d'adhésion permettent de leur montrer "un autre monde" que celui du rapport de force.

      3.3. L'Importance Cruciale du Partenariat avec l'École

      Les témoignages des assistantes familiales soulignent le rôle déterminant d'une collaboration positive avec l'équipe enseignante.

      Le Rôle de l'Enseignant : "Il y a cette rencontre, cet enseignant qui sans le savoir, par une approche bienveillante, par des paroles valorisantes, des encouragements malgré les faibles résultats, va appuyer notre discours et soulager l'enfant d'un poids".

      Activités Alternatives : Le sport ou les clubs au sein de l'établissement permettent de "les mettre en valeur dans d'autres domaines que la scolarité", ce qui est "non négligeable pour leur donner une bonne image à l'école".

      4. La Collaboration Interinstitutionnelle : Freins et Leviers

      La coordination entre les services de protection et l'Éducation Nationale est un facteur de réussite essentiel, mais elle rencontre des obstacles concrets.

      4.1. Les Freins Pratiques à la Communication

      Pascal Abdakovi identifie des difficultés structurelles :

      Différences de Rythmes : Le personnel éducatif travaillant en 3x8 n'est pas disponible aux mêmes horaires que le personnel enseignant.

      Le créneau de fin de journée (16h-17h30), idéal pour une rencontre, est "le moment où nous on a 140 enfants qui débarquent".

      Turnover du Personnel : L'instabilité des équipes peut également compliquer le suivi et la transmission d'informations.

      4.2. Les Leviers pour une Meilleure Collaboration

      Des solutions sont mises en œuvre pour surmonter ces obstacles :

      Aménagement du Temps Scolaire : Il est fréquent de proposer rapidement d'aménager les emplois du temps, notamment en sortant les enfants "de la cantine" ou de la "garderie" pour protéger les zones et moments les plus sensibles.

      Lignes de Communication Directes : Mettre en place des canaux de communication directs entre les cadres des institutions (ex: "les portables professionnels des cadres de chez nous") permet de "régler les problèmes avant de ne plus se supporter".

      Chartes Partenariales : Un CPE dans l'audience souligne l'efficacité des chartes partenariales qui, sans tout résoudre, "impulsent des nouvelles dynamiques et des liens" et permettent des "avancées concrètes sur l'orientation, le bien-être, la gestion des émotions".

      4.3. Le Débat sur le Partage d'Informations

      Une tension émerge entre le besoin de l'école d'avoir des informations pour mieux comprendre et accompagner l'élève, et la volonté des services sociaux de préserver la "normalité" de l'enfant en tant qu'élève.

      Le Point de Vue de l'Éducation Nationale : Un intervenant de l'école exprime le besoin de connaître les "grandes lignes" de l'histoire de l'élève (placement long, ruptures multiples) non par "curiosité malsaine", mais pour contextualiser des comportements (ex: un élève qui n'a pas ses affaires car "sa seule maison en fait c'est le collège") et gérer des procédures administratives complexes (signatures parentales).

      Le Point de Vue de la Protection de l'Enfance : Pascal Abdakovi met en garde contre le "fantasme" que connaître l'histoire de l'enfant donnera des solutions.

      Il soutient que cela peut "rompre le contrat" qui permet à l'enfant, pendant 7-8 heures par jour, de n'être "plus un enfant placé héritier d'une histoire sordide" mais "juste un élève".

    1. Briefing : L'Attachement à l'École et ses Implications

      Synthèse

      Ce document de synthèse analyse les concepts fondamentaux présentés par le Docteur Anne Raynaud concernant la théorie de l'attachement et son application cruciale dans le contexte scolaire et la protection de l'enfance.

      L'analyse met en lumière une crise systémique où l'école est devenue le réceptacle des difficultés sociétales, confrontée à une augmentation alarmante de la détresse psychologique chez les enfants, y compris des idées suicidaires dès l'âge de 4-5 ans.

      La théorie de l'attachement de John Bowlby est proposée comme une grille de lecture essentielle et une "culture commune" pour tous les professionnels de l'enfance.

      Elle offre des clés de décodage pour comprendre les comportements des enfants, qui sont souvent mal interprétés. Les points critiques sont :

      1. Sécurité comme Prérequis à l'Apprentissage : L'activation du système d'attachement (déclenché par la peur ou le sentiment de menace) désactive biologiquement et obligatoirement le système d'exploration, qui est nécessaire à la curiosité, la motivation et les apprentissages. Un enfant en insécurité ne peut pas apprendre.

      2. Décoder les Comportements "Aversifs" : L'agressivité, l'opposition et la provocation ne sont pas des signes de malveillance mais des "comportements d'attachement aversifs".

      Ce sont des signaux de détresse envoyés par un enfant dont le système d'attachement est activé et qui cherche désespérément protection et réconfort.

      3. L'Urgence de la Collaboration : Le fonctionnement en "couloirs de nage" des institutions (école, soin, protection de l'enfance) génère de l'insécurité et est délétère.

      Une collaboration basée sur une culture partagée, la confiance et une responsabilité collective est indispensable pour créer une "chaîne de sécurité" autour de l'enfant.

      4. Le Rôle des Professionnels : Les enseignants sont des figures d'attachement majeures ("porte-avions"), dont la posture et la propre sécurité émotionnelle ont un impact direct sur la scolarité de l'enfant.

      La manière d'interagir avec les parents est également déterminante : il est impératif de sécuriser les parents ("confetti positif") pour permettre une coéducation efficace, plutôt que de les menacer, ce qui active leur propre système d'attachement et bloque toute collaboration.

      Constat : Une Crise Systémique Affectant l'École et l'Enfance

      L'École comme Réceptacle des Difficultés Sociétales

      Le Docteur Raynaud constate que l'école est aujourd'hui un "espace réceptacle de toutes les difficultés des familles".

      De nouvelles missions s'y accumulent sans cesse (questions de genre, laïcité, enjeux sociaux et sociétaux), créant un "mille-feuilles" de responsabilités.

      Les enseignants se retrouvent à l'interface entre des "prescrits" nationaux exigeants (programmes, plans) et la réalité de plus en plus complexe du terrain, ce qui les place dans des injonctions paradoxales.

      L'école est le lieu où se déposent les conflits parentaux, les violences et le harcèlement, bousculant sa mission première d'apprentissage et de bien-être.

      La Souffrance Croissante des Enfants

      Le constat sur la santé mentale des enfants est qualifié de "très préoccupant".

      Augmentation des Idées Suicidaires : Une hausse sans précédent des idées suicidaires est observée chez de très jeunes enfants (4-5 ans), qui expriment un désir "d'être en paix" face à une pression qu'ils ressentent (évaluations, cris des adultes).

      Pression Académique Précoce : Le plan maternel, qui rend l'école obligatoire à 3 ans, promeut des attendus sur les fondamentaux à un âge où les enfants n'ont pas la maturité émotionnelle ou cérébrale pour y répondre. Cela les met en situation de menace constante.

      Augmentation des Signalements : Le nombre d'informations préoccupantes (IP) émanant des écoles, notamment maternelles, flambe, ce qui témoigne d'une détresse généralisée.

      Le Cloisonnement des Institutions

      Un frein majeur au système français est le fonctionnement cloisonné des différentes institutions (école, soin, médico-social, justice).

      "Couloirs de Nage" : Chaque institution opère dans son propre couloir, avec une méconnaissance mutuelle et peu de liens nourris, ce qui génère de la méfiance.

      "Causalité Externe" : Face aux difficultés, la tendance est de blâmer les autres institutions ("c'est la faute du juge", "l'enseignant n'a pas fait son job").

      Cette attitude empêche une remise en question et une évolution collective.

      Besoin d'une Culture Commune : Pour sortir de cette impasse, il est urgent de construire une culture partagée et d'adopter un langage commun pour observer et comprendre les enfants.

      La théorie de l'attachement est proposée comme ce socle commun.

      La Théorie de l'Attachement : Une Grille de Lecture Essentielle

      Fondements de la Théorie (John Bowlby)

      Contrairement à une croyance répandue, l'attachement au sens de Bowlby n'est pas synonyme d'amour ou d'affection. Il s'agit d'un besoin biologique et universel de sécurité.

      La théorie repose sur trois systèmes motivationnels interdépendants.

      | Système | Activation | Fonction | | --- | --- | --- | | Système d'Attachement | Peur, menace, détresse perçue (manque de cohérence, prévisibilité, stabilité). | Activer des comportements visant à obtenir protection et réconfort auprès d'une figure d'attachement. | | Système d'Exploration | Sentiment de sécurité, système d'attachement apaisé. | Développer la curiosité, la motivation, les apprentissages, la socialisation, le langage, etc. | | Système de Caregiving | Perception de la vulnérabilité et de la détresse d'autrui. | Apporter protection et réconfort, répondre au besoin de sécurité de l'autre (fonction parentale et professionnelle). |

      L'Interaction Cruciale entre Attachement et Exploration

      La découverte majeure de la théorie de l'attachement est l'incompatibilité biologique entre le système d'attachement et le système d'exploration.

      Principe de la Balance : "Quand j'active mon système d'attachement, quand j'ai peur... de fait biologiquement et de manière obligatoire je vais désactiver mon système d'exploration."

      Conséquence Directe : Un enfant dont le besoin de sécurité n'est pas satisfait ne peut pas se rendre disponible pour les apprentissages.

      Son énergie et son attention sont entièrement focalisées sur la régulation de sa peur. Sans sécurité, il ne peut y avoir d'apprentissage.

      Les Comportements d'Attachement : Décoder les Signaux de l'Enfant

      Lorsqu'un enfant active son système d'attachement, il envoie des signaux (comportements d'attachement) pour obtenir du réconfort. Le drame provient de la méconnaissance des signaux les plus difficiles :

      Comportements de Signalisation : Sourires, tentatives de rapprochement.

      Comportements Actifs : Suivre, s'agripper.

      Comportements Aversifs : C'est la catégorie la plus mal comprise.

      L'enfant, submergé par la peur, exprime son besoin de proximité par des comportements qui, paradoxalement, provoquent le rejet : agressivité, opposition, provocation, agitation, refus.

      Ces comportements aversifs sont l'expression d'un vécu émotionnel intense et le symptôme d'une insécurité profonde. Il est crucial de regarder sous la surface de l'iceberg : le comportement visible n'est que la pointe, cachant la peur et les besoins non satisfaits.

      Les Stratégies d'Attachement et leurs Manifestations à l'École

      En fonction de la disponibilité et de la sensibilité de ses figures d'attachement (le "porte-avions"), l'enfant (le "petit avion") développe différentes stratégies pour gérer sa peur et maintenir un lien.

      | Stratégie | Description de la Relation au "Porte-Avion" | Comportements Typiques à l'École | | --- | --- | --- | | Sécure | La base de sécurité est fiable et disponible. L'enfant explore sereinement et sait qu'il peut revenir chercher du réconfort si besoin. | Curieux, motivé, entre facilement en relation avec les pairs et les adultes, demande de l'aide en cas de difficulté. (Environ 60-65% de la population générale) | | Insécure Évitant | Le porte-avions est indisponible ou rejetant. L'enfant apprend à ne pas solliciter d'aide, à s'autonomiser et à réprimer l'expression de ses besoins. | Apparaît "trop sage", en retrait, isolé. Peut avoir un retard de langage ou un mutisme sélectif. S'intéresse plus aux objets qu'aux relations. Peut mimer des traits autistiques. | | Insécure Anxieux ("Attachiants") | La disponibilité du porte-avions est imprévisible. L'enfant ne sait jamais s'il obtiendra du réconfort et maximise donc ses signaux d'attachement. | Agité, provocateur, agressif, très exigeant sur le plan relationnel, difficile à apaiser. Son exploration est morcelée. Peut mimer un trouble de l'attention avec hyperactivité (TDAH). | | Désorganisé | Le porte-avions est à la fois la source de réconfort et la source de la peur (ex: humiliation, violence). L'enfant est pris dans un paradoxe insoluble. | Comportements contradictoires et "sans solution". Peut se manifester par un contrôle punitif (domination), un contrôle attentif (parentification) ou une hypersexualisation de la relation. Évolue souvent vers des psychopathologies (addictions, troubles de la personnalité). |

      Étude de Cas : Olivier, 7 ans

      Le cas d'Olivier illustre l'application concrète de cette grille de lecture.

      Contexte : Olivier arrive dans un nouvel établissement (ITEP) au moment où son père, atteint d'une pathologie psychiatrique, est hospitalisé. Ce cumul de facteurs de stress active massivement son système d'attachement.

      Comportements : Il présente une grande agitation, de l'opposition et de la provocation. Son système d'exploration est désactivé (il ne s'intéresse plus aux apprentissages).

      Interprétation via l'Attachement : Ses comportements ne sont pas des troubles en soi, mais des signaux de détresse aversifs témoignant de son insécurité. Il utilise une stratégie de type anxieux ("attachiants") pour tenter de faire face.

      Réponse du Système : L'équipe de l'ITEP, se sentant elle-même menacée et ne sachant pas décoder son comportement, rédige une information préoccupante "pour se protéger". Cette action, au lieu de sécuriser, a réactivé le traumatisme familial du placement et a majoré l'insécurité de tous.

      Implications pour les Professionnels et le Système

      Le Rôle de l'Enseignant comme Figure d'Attachement

      Les enseignants, en particulier en maternelle et en primaire, sont des figures d'attachement fondamentales et des "porte-avions" pour les élèves.

      L'Impact de l'Attachement du Professionnel : La propre stratégie d'attachement de l'enseignant (sécure, évitant, anxieux) influence sa capacité à percevoir les besoins de l'enfant et à y répondre de manière ajustée. La relation est une "histoire de rencontre" entre la stratégie de l'enfant et celle de l'adulte.

      Le Besoin de Formation : Il y a une sous-estimation de l'importance des relations interpersonnelles dans la fonction d'enseignant et un manque de formation sur cette dimension relationnelle, alors qu'ils sont confrontés à des enfants de plus en plus en difficulté.

      Vers une Coéducation et une Collaboration Efficaces

      Pour travailler efficacement avec les familles, surtout les plus vulnérables, il est impératif de ne pas activer leur système d'attachement.

      Le "Confetti Positif" : Commencer systématiquement par valoriser ce qui fonctionne, même si la situation est difficile. Pointer uniquement les dysfonctionnements met le parent sur la défensive, active sa peur et le rend incapable d'explorer des solutions ou de collaborer.

      Construire une "Chaîne de Sécurité" : La solution réside dans la création d'un réseau de sécurité autour de l'enfant, où tous les acteurs (enseignants, direction, soignants, éducateurs, parents) communiquent, se font confiance et partagent la même grille de lecture.

      Recommandations Stratégiques

      1. Former tous les acteurs du champ de l'enfance (enseignants, magistrats, travailleurs sociaux, etc.) à la théorie de l'attachement pour créer une culture et un langage communs.

      2. Repenser les pratiques institutionnelles pour prioriser la sécurité émotionnelle.

      Par exemple, revoir la rédaction des bulletins scolaires pour commencer par les réussites, ou organiser les équipes éducatives en s'assurant de la présence des figures d'attachement clés pour la famille.

      3. Sortir de la sur-pathologisation. Avant de poser un diagnostic (TDAH, autisme), il faut systématiquement questionner le niveau de sécurité de l'enfant. Anticiper des dossiers MDPH pour des enfants de 3-4 ans risque de les enfermer dans une pathologie qu'ils n'ont pas.

      4. Assumer une responsabilité collective et individuelle. Plutôt que de pointer les manquements des autres, chaque professionnel doit s'interroger sur sa part de responsabilité dans la création ou la rupture de la sécurité de l'enfant et de sa famille.

    1. Synthèse : L'Autorité Éducative Soutenue par la Confiance Interpersonnelle

      Résumé Exécutif

      Cette note de synthèse résume les principaux arguments de Marie Beretti concernant la relation intrinsèque entre l'autorité éducative et la confiance interpersonnelle, basés sur sa thèse de 2019.

      L'analyse met en lumière quatre points critiques :

      1. L'Autorité comme Relation Éducative : L'autorité n'est pas un pouvoir de contrainte, mais une relation professionnelle nécessaire et asymétrique, fondée sur la responsabilité de l'enseignant envers les besoins de l'élève.

      Son exercice est légitime à condition qu'il vise l'adhésion volontaire de l'élève et non sa soumission, en respectant sa dignité.

      2. Le Cercle Vertueux de la Confiance et de l'Autorité : L'argument central est l'existence d'un renforcement mutuel. Une autorité stable et bienveillante sécurise les élèves, ce qui nourrit leur confiance.

      En retour, la confiance des élèves facilite l'exercice de l'autorité, car elle engendre une tendance naturelle à l'adhésion et à la coopération, rendant la relation moins "coûteuse" en énergie pour l'enseignant.

      3. Les Trois Domaines de la Confiance : L'enquête de terrain révèle que la confiance des élèves envers leur enseignant n'est pas un concept monolithique.

      Elle se manifeste dans trois domaines distincts et coexistants :

      Confiance Élémentaire : Liée à la relation de personne à personne.  

      Confiance Juridique : Liée au rôle de l'enseignant comme garant du cadre collectif et des règles.  

      Confiance Éducative : Liée à la relation d'apprentissage et d'étayage intellectuel.

      4. Inspirer Confiance comme Compétence Professionnelle : La capacité à inspirer confiance n'est pas innée mais constitue une compétence professionnelle qui peut être développée.

      Elle repose sur l'adoption de postures spécifiques (fiabilité, loyauté, bienveillance), la mise en place d'expériences positives répondant aux besoins fondamentaux des élèves (reconnaissance, sécurité, appartenance) et une démarche compréhensive et empathique.

      --------------------------------------------------------------------------------

      1. Contexte de la Recherche

      L'intervention de Marie Beretti se fonde sur son travail de thèse soutenu en 2019, intitulé "La relation d'autorité éducative au prisme de la confiance".

      La recherche a été motivée par le besoin de comprendre les mécanismes relationnels de l'obéissance et de la désobéissance des élèves.

      Pour ce faire, une enquête de terrain approfondie a été menée pendant une année scolaire complète.

      Échantillon : Trois classes de cycle 3 (CE2, CM1, CM2, selon la définition de l'époque).

      Enseignants : Deux hommes et une femme, ayant tous entre 8 et 10 ans d'ancienneté.

      Ce choix visait à observer des praticiens ayant une posture d'autorité établie mais n'étant pas encore "trop aguerris ou fatigués".

      Méthodologie :

      ◦ Observations continues dans les classes, du premier au dernier jour de l'année.  

      ◦ Entretiens longs et approfondis avec les trois enseignants.  

      ◦ Entretiens avec les élèves, menés en petits groupes.

      C'est en analysant la confrontation des discours des enseignants et des élèves sur l'autorité que le concept de confiance a émergé comme un facteur explicatif central et inattendu.

      2. La Nature de l'Autorité Éducative

      Avant d'aborder la confiance, il est essentiel de définir l'autorité éducative comme une relation professionnelle nécessaire mais conditionnée.

      2.1. Le "Pourquoi" : Une Nécessité Inhérente à l'Éducation

      L'autorité est inévitable et nécessaire dans toute relation éducative.

      Relation Asymétrique : Elle découle de la verticalité de la relation enseignant-élève. Les places ne sont "ni niables ni interchangeables".

      Réponse aux Besoins de l'Élève : L'enfant est "constitutivement vulnérable" car en construction.

      L'autorité de l'éducateur est légitime car elle répond aux besoins fondamentaux de l'élève : être accompagné, encadré, enseigné.

      Responsabilité Professionnelle : Assumer cette autorité est une compétence attendue et une responsabilité. Nier la question de l'autorité reviendrait à "nier l'idée même d'éducation".

      2.2. Le "Comment" : Les Conditions d'un Exercice Éducatif

      Si l'autorité est nécessaire, elle ne doit pas être un pouvoir sans limites.

      Responsabilité vs. Pouvoir : L'autorité de l'enseignant est "plus une responsabilité envers les élèves qu'un pouvoir sur les élèves".

      Elle ne doit pas les écraser ou les nier en tant que personnes.

      Respect de la Dignité : L'élève, bien qu'en position "basse", est un "égal en droit", un "semblable".

      L'autorité doit s'exercer dans le respect de sa dignité.

      De la Soumission à l'Adhésion : L'objectif n'est pas de soumettre les élèves, mais de les amener à reconnaître la légitimité de l'autorité exercée sur eux, puis à y adhérer volontairement.

      Une autorité est véritablement éducative lorsque l'élève choisit "librement" d'obéir.

      En se référant aux travaux de Bruno Robe, Beretti distingue deux écueils :

      L'autorité évacuée : Un manque professionnel qui met l'élève en péril.

      L'autorité autoritariste : Une autorité imposée qui verse dans l'autoritarisme et ne permet pas à l'élève de "bien grandir".

      3. Le Lien de Renforcement Mutuel entre Autorité et Confiance

      Le résultat principal de la recherche est l'identification d'un cercle vertueux entre la relation d'autorité et la confiance interpersonnelle.

      De l'Autorité à la Confiance : Une relation d'autorité stable, contenante et bienveillante sécurise les élèves.

      Se sentant soutenus et encadrés, ils développent un sentiment de confiance envers leur enseignant.

      De la Confiance à l'Autorité : La confiance, en retour, facilite l'exercice de l'autorité.

      Un des effets majeurs de la confiance est qu'elle génère une "tendance à l'adhésion".

      Les élèves qui ont confiance sont plus enclins à coopérer et à obéir volontairement.

      Cette dynamique rend l'exercice de l'autorité beaucoup moins "coûteux" physiquement et psychiquement pour l'enseignant.

      Les classes où la confiance était forte étaient celles où l'autorité s'exerçait avec le plus de fluidité.

      4. Les Trois Domaines de la Confiance de l'Élève

      L'analyse des discours des élèves a permis de distinguer trois types de confiance qu'ils peuvent nourrir simultanément envers leur enseignant.

      Ces domaines sont distincts : un enseignant peut inspirer une forte confiance dans un domaine et une confiance faible dans un autre.

      | Domaine de Confiance | Description | Registre de la Relation | | --- | --- | --- | | Confiance Élémentaire | Confiance en l'enseignant en tant que personne fiable, loyale et bienveillante dans les échanges interpersonnels. | Échange de personne à personne | | Confiance Juridique | Confiance en l'enseignant comme garant juste et impartial du cadre collectif, des règles et du vivre-ensemble. | Échange collectif | | Confiance Éducative | Confiance en l'enseignant en tant qu'expert capable de guider l'apprentissage et de favoriser le développement. | Échange intellectuel et culturel |

      Un domaine de confiance fragilisé peut impacter négativement la relation de confiance globale et, par conséquent, la relation d'autorité. Il est donc crucial pour un enseignant de prêter attention à ces trois domaines de manière spécifique.

      5. Développer la Confiance : Une Compétence Professionnelle

      La capacité à inspirer confiance n'est pas un don, mais une compétence qui se travaille.

      Le processus de construction de la confiance suit un schéma précis.

      5.1. Le Mécanisme de Naissance de la Confiance

      1. Besoins Fondamentaux de l'Élève : Tout élève a des besoins de reconnaissance, de sécurité et d'appartenance.

      2. Expériences Positives : Lorsque l'enseignant, par ses actions et ses dispositifs, permet à l'élève de vivre des expériences positives (se sentir considéré, en sécurité, intégré), ces besoins sont comblés.

      3. Attribution à l'Enseignant : L'élève attribue ce bien-être à l'enseignant. Il a le sentiment que "c'est grâce à l'enseignant qu'il se sent bien".

      4. Génération de la Confiance : Ce sentiment positif nourrit la confiance envers l'enseignant, ce qui déclenche le mécanisme d'adhésion et de coopération.

      Inversement, des expériences négatives (sentiment de mépris, d'insécurité, de rejet) attribuées à l'enseignant génèrent de la méfiance ou de la défiance, ce qui rend la relation d'autorité conflictuelle et coûteuse.

      5.2. Postures et Attitudes Favorisant la Confiance

      Plusieurs postures transversales, relevant de compétences relationnelles et communicationnelles, ont été identifiées chez les enseignants qui inspirent confiance.

      Être une Personne Fiable, Loyale et Animée de Bonnes Intentions :

      Fiabilité : Se montrer stable, constant dans ses attitudes et solide dans ses compétences.   

      Loyauté : Faire preuve de transparence en rendant explicites les objectifs, les règles et les décisions.  

      Bonnes Intentions : Démontrer que toutes les actions, même les contraintes, visent le bien des élèves et non un intérêt personnel.

      Donner des Gages et des Preuves :

      Gages : Annoncer ce qui va se passer, faire des promesses, anticiper.   

      Preuves : Assurer une congruence totale entre les discours et les actes. "Faire ce qu'on dit et dire ce qu'on va faire."

      Témoigner de sa Confiance envers les Élèves :

      ◦ La confiance se nourrit de la confiance. Il faut postuler a priori que les élèves sont dignes de confiance, plutôt que d'attendre qu'ils le prouvent.

      Fédérer le Groupe et se Positionner en Leader :

      ◦ Créer un sentiment d'appartenance en donnant une identité au groupe-classe.  

      ◦ L'enseignant doit se positionner comme un membre du groupe ("nous", "on"), tout en en étant le guide.

      Adopter une Attitude Compréhensive et Empathique :

      ◦ Partir du point de vue des élèves pour évaluer la pertinence de ses propres attitudes et dispositifs.   

      ◦ Faire verbaliser les élèves sur leur ressenti et leur interprétation des situations, de manière inconditionnelle.

      Travailler en Équipe :

      ◦ Un engagement collectif de l'équipe éducative sur la voie de la confiance renforce les chances de succès, car il crée un climat global et modélise des relations de confiance pour les élèves.

      6. Points Clés de la Session de Questions-Réponses

      Conscience des Enseignants : Les enseignants ont une conscience intuitive du lien entre confiance et autorité ("c'est plus facile quand on a la confiance des élèves").

      Cependant, les mécanismes précis de construction de cette confiance sont souvent un "impensé".

      Influence de l'Âge : La confiance est plus spontanée chez les plus jeunes enfants, car elle est liée à une question de survie et de réponse aux besoins vitaux.

      Plus l'élève grandit, plus la confiance devient une construction rationnelle et doit être activement travaillée.

      Profils d'Attachement et Confiance Généralisée : L'histoire personnelle de l'élève (relation à l'adulte, profil sécure/insécure) constitue sa "confiance généralisée".

      C'est un déterminant qui ne dépend pas de l'enseignant mais qui influence la capacité de l'élève à faire confiance.

      L'enseignant doit en avoir conscience pour ajuster ses attentes et ses précautions, tout en sachant que le résultat n'est jamais garanti.

      Rôle de la Famille : La confiance des parents envers l'école et l'enseignant peut "étayer" la confiance de l'enfant.

      Il est donc important de travailler également à inspirer confiance aux parents.

      Cependant, une défiance institutionnelle profonde de la part des familles est très difficile à surmonter à l'échelle d'un seul enseignant.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews

      Reviewer #1 (Public review):

      Summary:

      In this paper, the authors investigate the effects of Miro1 on VSMC biology after injury. Using conditional knockout animals, they provide the important observation that Miro1 is required for neointima formation. They also confirm that Miro1 is expressed in human coronary arteries. Specifically, in conditions of coronary diseases, it is localized in both media and neointima, and, in atherosclerotic plaque, Miro1 is expressed in proliferating cells.

      However, the role of Miro1 in VSMC in CV diseases is poorly studied, and the data available are limited; therefore, the authors decided to deepen this aspect. The evidence that Miro-/- VSMCs show impaired proliferation and an arrest in S phase is solid and further sustained by restoring Miro1 to control levels, normalizing proliferation. Miro1 also affects mitochondrial distribution, which is strikingly changed after Miro1 deletion. Both effects are associated with impaired energy metabolism due to the ability of Miro1 to participate in MICOS/MIB complex assembly, influencing mitochondrial cristae folding. Interestingly, the authors also show the interaction of Miro1 with NDUFA9, globally affecting super complex 2 assembly and complex I activity.

      Finally, these important findings also apply to human cells and can be partially replicated using a pharmacological approach, proposing Miro1 as a target for vasoproliferative diseases.

      Strengths:

      The discovery of Miro1 relevance in neointima information is compelling, as well as the evidence in VSMC that MIRO1 loss impairs mitochondrial cristae formation, expanding observations previously obtained in embryonic fibroblasts.

      The identification of MIRO1 interaction with NDUFA9 is novel and adds value to this paper. Similarly, the findings that VSMC proliferation requires mitochondrial ATP support the new idea that these cells do not rely mostly on glycolysis.

      Weaknesses:

      (1) Figure 3:

      I appreciate the system used to assess mitochondrial distribution; however, I believe that time-lapse microscopy to evaluate mitochondrial movements in real time should be mandatory. The experimental timing is compatible with time-lapse imaging, and these experiments will provide a quantitative estimation of the distance travelled by mitochondria and the fraction of mitochondria that change position over time. I also suggest evaluating mitochondrial shape in control and MIRO1-/- VSMC to assess whether MIRO1 absence could impact mitochondrial morphology, altering fission/fusion machinery, since mitochondrial shape could differently influence the mobility.

      Mitochondrial motility experiments. WT and Miro1-/- VSMCs were transiently transfected with mito-ds-red and untargeted GFP adenoviruses to fluorescently label mitochondria and cytosol, respectively. Live-cell fluorescence confocal microscopy was used to acquire mitochondrial images at one-minute intervals over a 25-30-minute period. WT cells exhibited dynamic reorganization of the mitochondrial network, whereas Miro1-/- VSMCs displayed minimal mitochondrial movement, characterized only by limited oscillatory behavior without network remodeling (Supplemental Video 1).

      Mitochondrial shape (form factor) was assessed by confocal microscopy in WT and Miro1-/- VSMCs. Analysis of the mitochondrial form factor (defined as the ratio of mitochondrial length to width) during cell cycle progression revealed morphological changes in wild type (WT) cells, characterized by an increase in form factor. In contrast, Miro1-/- cells exhibited no significant alterations in mitochondrial morphology (Figure 3- Figure supplement 1B).

      (2) Figure 6:

      The evidence of MIRO1 ablation on cristae remodeling is solid; however, considering that the mechanism proposed to explain the finding is the modulation of MICOS/MIB complex, as shown in Figure 6D, I suggest performing EM analysis in each condition. In my mind, Miro1 KK and Miro1 TM should lead to different cristae phenotypes according to the different impact on MICOS/MIB complex assembly. Especially, Miro1 TM should mimic Miro1 -/- condition, while Miro1 KK should drive a less severe phenotype. This would supply a good correlation between Miro1, MICOS/MIB complex formation and cristae folding.

      I also suggest performing supercomplex assembly and complex I activity with each plasmid to correlate MICOS/MIB complex assembly with the respiratory chain efficiency.

      Complex I activity assays revealed that overexpression of MIRO1-WT fully restored enzymatic activity in MIRO1-/- cells, whereas MIRO1-KK provided partial rescue. In contrast, a MIRO1 mutant lacking the transmembrane domain failed to restore activity and resembled the Miro1-/- phenotype (Figure 6- Figure supplement 2).

      The Complex I activity in each Miro1 mutant correlated with the degree of MICOS/MIB complex assembly in pulldown assays, implying a functional link between Miro1 and mitochondrial cristae organization.

      Moreover, an in-gel Complex V activity assay was performed to evaluate the enzymatic activity of mitochondrial ATP synthase in a native gel following electrophoresis. To normalize the activity signal, a Blue Native PAGE of the same samples was probed for the ATP5F1 subunit. A modest, yet statistically significant reduction in Complex V activity was observed in Miro1-/- cells (Figure 6- Figure supplement 1).

      (3) I noticed that none of the in vitro findings have been validated in an in vivo model. I believe this represents a significant gap that would be valuable to address. In your animal model, it should not be too complex to analyze mitochondria by electron microscopy to assess cristae morphology. Additionally, supercomplex assembly and complex I activity could be evaluated in tissue homogenates to corroborate the in vitro observations.

      We appreciate the reviewer’s comment. However, our currently available samples have been processed by light microscopy and are therefore not suitable for embedding for light for electron microscopy.

      (4) I find the results presented in Figure S7 somewhat unclear. The authors employ a pharmacological strategy to reduce Miro1 and validate the findings previously obtained with the genetic knockout model. They report increased mitophagy and a reduction in mitochondrial mass. However, in my opinion, these changes alone could significantly impact cellular metabolism. A lower number of mitochondria would naturally result in decreased ATP production and reduced mitochondrial respiration. This, in turn, weakens the proposed direct link between Miro1 deletion and impaired metabolic function or altered electron transport chain (ETC) activity. I believe this section would benefit from additional experiments and a more in-depth discussion.

      We initially conducted experiments using the MIRO1 reducer to explore the translational potential of our findings. These experiments aimed to provide a foundation in vivo studies. However, despite multiple attempts, we were unable to demonstrate a significant effect of MIRO1reducer, delivered via a Pluronic gel, on the mitochondria of the vascular wall. Of note, he role of MIRO1 in mitophagy has been well-established in several studies (for example, PMID: 34152608), which show that genetic deletion of Miro1 delays the translocation of the E3 ubiquitin ligase Parkin onto damaged mitochondria, thereby reducing mitochondrial clearance in fibroblasts and cultured neurons. Furthermore, loss of Miro1 in the hippocampus and cortex increases mitofusin levels with the appearance of hyperfused mitochondria and activation of the integrated stress response. Thus, MIRO1 deletion in genetic models does not result in a substantial reduction of mitochondria but causes hyperfused mitochondria. The rationale for developing the MIRO1 reducer stems from genetic forms of Parkinson’s disease, where Miro1 is retained in PD cells but degraded in healthy cells following mitochondrial depolarization (PMID: 31564441). Thus, the degradation of mutant MIRO1 by the reducer does not phenocopy the effects of genetic MIRO1 depletion. Thus, we believe the data with the reducer demonstrate that MIRO1 can be acutely targeted in vitro, but the mechanism of action (as the reviewer points out, the reduction of mitochondrial mass may lead to decreased ATP levels, potentially reducing cell proliferation) differs from that of chronic genetic deletion. In fact, we observe somewhat increased mitochondrial length in MIRO1-/- cells. We acknowledge that this is complex and have revised the paragraph to clarify the use of the MIRO1 reducer.

      Reviewer #2 (Public review):

      Summary:

      This study identifies the outer mitochondrial GTPase MIRO1 as a central regulator of vascular smooth muscle cell (VSMC) proliferation and neointima formation after carotid injury in vivo and PDGF-stimulation ex vivo. Using smooth muscle-specific knockout male mice, complementary in vitro murine and human VSMC cell models, and analyses of mitochondrial positioning, cristae architecture, and respirometry, the authors provide solid evidence that MIRO1 couples mitochondrial motility with ATP production to meet the energetic demands of the G1/S cell cycle transition. However, a component of the metabolic analyses is suboptimal and would benefit from more robust methodologies. The work is valuable because it links mitochondrial dynamics to vascular remodeling and suggests MIRO1 as a therapeutic target for vasoproliferative diseases, although whether pharmacological targeting of MIRO1 in vivo can effectively reduce neointima after carotid injury has not been explored. This paper will be of interest to those working on VSMCs and mitochondrial biology.

      Strengths:

      The strength of the study lies in its comprehensive approach, assessing the role of MIRO1 in VSMC proliferation in vivo, ex vivo, and importantly in human cells. The subject provides mechanistic links between MIRO1-mediated regulation of mitochondrial mobility and optimal respiratory chain function to cell cycle progression and proliferation. Finally, the findings are potentially clinically relevant given the presence of MIRO1 in human atherosclerotic plaques and the available small molecule MIRO1.

      Weaknesses:

      (1) There is a consistent lack of reporting across figure legends, including group sizes, n numbers, how many independent experiments were performed, or whether the data is mean +/- SD or SEM, etc. This needs to be corrected.

      These data were added in the revised manuscript.

      (2) The in vivo carotid injury experiments are in male mice fed a high-fat diet; this should be explicitly stated in the abstract, as it's unclear if there are any sex- or diet-dependent differences. Is VSMC proliferation/neointima formation different in chow-fed mice after carotid injury?

      This is an important point, and we appreciate the feedback. In this model, the transgene is located on the Y chromosome. As a result, only male mice can be studied. However, in our previous experiments, we have not observed any sex-dependent changes in neointimal formation. Additionally, please note that smooth muscle cell proliferation in neointimal formation is enhanced in models of cholesterol-fed mice on a high-fat diet.

      (3) The main body of the methods section is thin, and it's unclear why the majority of the methods are in the supplemental file. The authors should consider moving these to the main article, especially in an online-only journal.

      We thank the reviewer for this suggestion. We moved the methods to the main manuscript.

      (4) Certain metabolic analyses are suboptimal, including ATP concentration and Complex I activity measurements. The measurement of ATP/ADP and ATP/AMP ratios for energy charge status (luminometer or mass spectrometry), while high-resolution respirometry (Oroboros) to determine mitochondrial complex I activity in permeabilized VSMCs would be more informative.

      ATP/ADP and ATP/AMP ratios were assessed on samples from WT and Miro1-/- VSMCs using an ATP/ADP/AMP Assay Kit (Cat#: A-125) purchased from Biomedical Research Service, University at Buffalo, New York). Miro1-/- samples exhibited reduced ATP levels accompanied by elevated concentrations of ADP and AMP. As a result, both ATP/ADP and ATP/AMP ratios were significantly lower in MIRO1-/- cells compared to WT, indicating impaired cellular energy homeostasis (Figure 5B, C).

      (5) The statement that 'mitochondrial mobility is not required for optimal ATP production' is poorly supported. XF Seahorse analysis should be performed with nocodazole and also following MIRO1 reconstitution +/- EF hands.

      To evaluate the metabolic effects of Nocodazole, we conducted Seahorse metabolic assays on vascular smooth muscle cells with various conditions (VSMCs). We used WT VSMCs, Miro1-/- VSMCs, and Miro1-/- VSMCs that expressed either MIRO1-WT, KK, or ΔTM mutants.Our results demonstrate that Nocodazole exposure did not compromise mitochondrial respiratory activity. However, Miro1-/- VSMCs displayed a trend toward reduced basal and maximal mitochondrial respiration when compared to WT cells. This deficit was only partially corrected by the expression of the MIRO1-KK mutant. In contrast, reintroducing MIRO1-WT through adenoviral delivery fully restored mitochondrial respiration to normal levels (Figure 5- Figure supplement 1).

      (6) The authors should consider moving MIRO1 small molecule data into the main figures. A lot of value would be added to the study if the authors could demonstrate that therapeutic targeting of MIRO1 could prevent neointima formation in vivo.

      We appreciate the reviewer's comment and attempted the suggested in vivo experiments using the commercially available Miro1 reducer. For these experiments, we used a pluronic gel to deliver the reducer to the adventitial area surrounding the carotid artery. Despite numerous attempts to optimize the experimental conditions, we were unable to reliably detect a significant effect of the reducer on mitochondria in the vascular wall.

      Reviewer #3 (Public review):

      Summary:

      This study addresses the role of MIRO1 in vascular smooth muscle cell proliferation, proposing a link between MIRO1 loss and altered growth due to disrupted mitochondrial dynamics and function. While the findings are potentially useful for understanding the importance of mitochondrial positioning and function in this specific cell type within health and disease contexts, the evidence presented appears incomplete, with key bioenergetic and mechanistic claims lacking adequate support.

      Strengths:

      (1)The study focuses on an important regulatory protein, MIRO1, and its role in vascular smooth muscle cell (VSMC) proliferation, a relatively underexplored context.

      (2) It explores the link between smooth muscle cell growth, mitochondrial dynamics, and bioenergetics, which is a potentially significant area for both basic and translational biology.

      (3) The use of both in vivo and in vitro systems provides a potentially useful experimental framework to interrogate MIRO1 function in this context.

      Weaknesses:

      (1) The central claim that MIRO1 loss impairs mitochondrial bioenergetics is not convincingly demonstrated, with only modest changes in respiratory parameters and no direct evidence of functional respiratory chain deficiency.

      (2) The proposed link between MIRO1 and respiratory supercomplex assembly or function is speculative, lacking mechanistic detail and supported by incomplete or inconsistent biochemical data.

      (3) Key mitochondrial assays are either insufficiently controlled or poorly interpreted, undermining the strength of the conclusions regarding oxidative phosphorylation.

      (4) The study does not adequately assess mitochondrial content or biogenesis, which could confound interpretations of changes in respiratory activity.

      (5) Overall, the evidence for a direct impact of MIRO1 on mitochondrial respiratory function in the experimental setting is weak, and the conclusions overreach the data.

      Recommendations for the authors:

      Reviewer #3 (Recommendations for the authors):

      (1)  Throughout the manuscript, the authors incorrectly use "mobility" to describe the active transport of mitochondria. The appropriate term is "mitochondrial motility," which refers to active, motor-driven movement. "Mobility" implies passive diffusion and is not scientifically accurate in this context.

      (2) "Super complex" should be consistently written as "supercomplex," in line with accepted mitochondrial biology terminology.

      We thank the reviewer for this comment and revised the text accordingly.

      (3) A significant limitation of the in vivo model is the mild phenotype observed, which is expected from an inducible knockout system. The authors should clarify whether a constitutive, tissue-specific knockout was considered and, if not, whether embryonic lethality or another limitation prevented its generation.

      This genetic model was originally developed by Dr. Janet Shaw at the University of Utah. In the original publication, Miro1 was constitutively knocked out in neurons. Germline inactivation of Miro1 was achieved by crossing mice harboring the Miro1F allele with a mouse line expressing Cre recombinase under the control of the hypoxanthine-guanine phosphoribosyltransferase (HPRT) promoter. Mating Miro1+/− mice resulted in Miro1−/− animals, which were cyanotic and died shortly after birth. Due to this outcome, we opted to develop an inducible, smooth muscle-specific model. Additionally, we considered testing whether the acute use of an inhibitor or a knockdown system targeting Miro1 could be evaluated as a potential therapeutic approach.

      (4) In Figure 1A and S1A, the authors use Western blotting to validate the knockout in the aorta and IHC in carotid arteries. The choice of different methods does not seem justified, and qPCR data are shown only for the aorta. IHC appears to be suboptimal for assessing MIRO1 levels in vascular tissue due to high autofluorescence, and IHC in Figure S1A is merely qualitative, with no quantification provided.

      We present complementary approaches to validate the deletion of Miro1. For Western blot analysis, we used the aorta because it provides more material for analysis. The autofluorescence observed via immunofluorescence is characteristic of elastin fibers within the media layer, making our results typical for this technique. As shown in Figure 1- Figure supplement 1, our data demonstrate a significant decrease, if not a complete knockout, of the target protein specifically in smooth muscle cells.

      (5) In Figure 1G, the bottom left panel (magnification) shows a lower green signal than the top left panel, suggesting these may have been collected with different signal intensity. This raises concerns about image consistency and representation.

      Top images in Figure 1G are taken at magnification 63x. Bottom images were made at magnification 20x. The intensity is different between the two magnifications, but similar between genotypes.

      (6) In Figure S3, the sampling is uncontrolled: the healthy subject and the patient differ markedly in age. The claim of colocalization is not substantiated with any quantitative analysis.

      As outlined in the Methods section, our heart samples were obtained from LVAD patients or explanted hearts from transplant recipients. Due to the limited availability of such samples, there is indeed a difference in age between the healthy subject and the patient. While we acknowledge this limitation, the scarcity of samples made it challenging to control for age. Additionally, we determined that performing a quantitative analysis of colocalization would not yield robust or meaningful data given the constraints of our sample size and variability. 

      (7) Figure S4A lacks statistical analysis, which is necessary for interpreting the data shown.

      This appears to be a misunderstanding. In this manuscript, we do present statistically significant differences and focus on those that are biologically meaningful. Specifically, we highlight differences between PDGF treatment versus no treatment within the same genotype, as well as differences between the two genotypes under the same treatment condition (control or PDGF treatment). In this particular case, there is only a statistical difference between WT+PDGF and SM-Miro1-/, but since this is not a meaningful comparison, it is not shown. Please note that this approach applies to all figures in the manuscript. Including all comparisons—whether statistically significant or not, and whether biologically meaningful or not—may appear rigorous but in our opinion, ultimately detracts from the main message of this paper.

      (8) The authors state, "given the generally poor proliferation of VSMCs from SM-MIRO1-/- mice, in later experiments we used VSMCs from MIRO1fl/fl mice and infected them with adenovirus expressing cre." This is not convincing, especially since in vivo cre efficiency is generally lower than in vitro. Moreover, the methods indicate that "VSMCs from littermate controls were subjected to the same procedure with empty vector control adenovirus," yet in Figure 2A, the control appears to be MIRO1fl/fl VSMCs transduced with Ad-EV. The logic and consistency of the controls used need clarification.

      For the initial experiments, cells were explanted from SM-MIRO1-/- mice (Figure 2- Figure supplement 1). In these mice, Cre recombination had occurred in vivo, and the cells exhibited very poor growth. In fact, their growth was so limited that we decided not to pursue this experimental approach after three independent experiments.

      For subsequent experiments, cells were explanted from Miro1fl/fl mice and passaged several times, which allowed us to generate the number of cells required for the experiments (Figure 2B). Once sufficient Miro1fl/fl cells were obtained, they were treated with adenovirus expressing Cre, as described in the Methods section. Control cells were treated with an empty vector adenovirus. To clarify, the control cells are Miro1fl/fl cells infected with an empty vector adenovirus, while the MIRO1-/- cells are Miro1fl/fl cells infected with adenovirus expressing Cre. The statement that “littermate controls were used” is incorrect as in fact, Miro1fl/fl cells from the same preparation were either infected with an empty vector adenovirus, or with adenovirus expressing Cre. As mentioned, the knockdown was confirmed by Western blotting.

      (9) Figure 2C shows a growth delay in MIRO1-/- cells. Have the authors performed additional time points to determine when these cells return to G1 and quantify the duration of the lag?

      This is an excellent suggestion. So far, we have not performed this experiment.

      (10) In the 24 h time point of Figure 2C, MIRO1-/- cells appear to be cycling, yet no cyclin E signal is detected. How do the authors explain this inconsistency? Additionally, in Figure 2H, the quantification of cyclin E is unreliable, given that lanes 3 and 4 show no detectable signal.

      We agree with the reviewer—the inconsistency is driven by the exposure of the immunoblot presented. We revisited the data, reviewed the quantification, and performed an additional experiment. We are now presenting an exposure that demonstrates levels of cyclin E (Figure 2G).

      (11) In Figure 3D, the authors present mitochondrial probability map vs. distance from center curves. How was the "center" defined in this analysis? Were radial distances normalized across cells (e.g., to the cell radius or maximum extent)? If not, variation in cell and/or nucleus size or shape could significantly affect the resulting profiles. No statistical analysis is provided for this assessment, which undermines its quantitative value. Furthermore, the rationale behind the use of mito95 values is not clearly explained.

      The center refers to the center of the microchip's Y-shaped pattern, to which each cell is attached. Since all Y-shapes on the chip are identical in size, normalization is not required. The size of the optimal Y-shapes was tested as recommended by CYTOO. For further context, please refer to the papers by the Kittler group.

      Additionally, a graph demonstrating the percentage of mitochondria localized at specific distances can be produced for any given distance. Notably, the further from the center of the chip, the more pronounced the differences become.

      (12) The authors apply a 72 h oligomycin treatment to assess proliferation and a 16 h treatment to measure ATP levels. This discrepancy in experimental design is not justified in the manuscript. The length of treatment directly impacts the interpretation of the data in Figures 4C, 4D, and 4E, and needs to be addressed.

      Thank you for this comment. We have performed additional experiments to align these time points. In the revised manuscript, we now present proliferation and ATP production measured at the same time point (Figure 4A, B for proliferation and ATP levels).

      (13) The manuscript repeatedly suggests that MIRO1 loss causes a defect in mitochondrial ATP production, yet no direct demonstration of a bioenergetic defect is provided. The claim relies on a modest decrease in supercomplex species (of undefined composition) and a mild reduction in complex I activity that does not support a substantial OXPHOS defect. Notably, the respirometry data in Figure 5I do not align with the BN-PAGE results in Figure 6I. There is increasing evidence that respiratory chain supercomplexes do not confer a catalytic advantage. The authors should directly assess the enzymatic activities of all respiratory complexes. Reported complex I activity in MIRO1-/- cells appears rotenone-like (virtually zero, figure 3K) or ~30% residual (Figure 3L), suggesting a near-total loss of functional complex I, which is not reflected in the BN-PAGE. Additionally, complex I activity has not been normalized to a mitochondrial reference, such as citrate synthase.

      Given that we work in primary cells and are limited by the number of cells we can generate, we concentrated on ETC1 and 5 and performed experiments in cells after expression of MIRO1 WT and MIRO1 mutants (Figure 6- Figure supplement 1). Please note that the addition of Rotenone abolishes the slope of NADH consumptions (Figure 6- Figure supplement 2F).

      While the ETC1 activity is measured in Fig. 6K, the blue native gel shown in Figure 6I is performed without substrate and thus, indicative of protein complex abundance rather than complex activity.

      In additional experiments, we normalized the activity to citrate synthase as requested.

      (14) In the methods section, the complex I activity assay is incorrectly described: complex I is a NADH dehydrogenase, so the assay measures NADH oxidation, not NADPH.

      We thank the reviewer for his comment and revised the manuscript accordingly.

      (15) The authors have not assessed mitochondrial mass, which is a critical omission. Differences in mitochondrial biogenesis or content could underlie several observed phenotypes and should be controlled for.

      A qPCR assay was used to assess mitochondrial DNA copy number in WT and Miro1-/- VSMCs. We determined the abundance of COX1 and MT-RNR1 DNA as mitochondrial gene targets and NDUFV DNA as the nuclear reference gene. While the results in Miro1-/- cells were highly variable, no statistically significant reduction of copy numbers was detected (Figure 3- Figure supplement 1B).

      (16) Complex IV signal is missing in Figure 6I. Its omission is not acknowledged or explained.

      Thank you for this comment. We believe this is due to a technical issue. Complex IV can be challenging to detect consistently, as its visibility is highly dependent on sample preparation conditions. In this specific case, we suspect that the buffer used during the isolation process may have influenced the detection of Complex IV.

      (17) Figure 6D does not appear representative of the quantifications shown. C-MYC signal is visibly reduced in the mutant, consistent with the lower levels of interactors such as Sam50 and NDUFA9. Additionally, the SDHA band is aligned at the bottom of the blot box. The list of antibodies used, and their catalog number is missing, or it was not provided to the reviewers. It seems plausible that the authors used a cocktail antibody set (e.g., Abcam ab110412), which includes anti-NDUFA9. This would contradict the claim of reduced complex I and SC levels, as the steady-state levels of NDUFA9 appear unchanged.

      We acknowledge that the expression of the myc-MIRO1 mutant is lower compared to myc-MIRO1 WT or myc-MIRO1 KK. Achieving identical expression levels when overexpressing multiple MIRO1 constructs is challenging. We agree that the lower expression of this mutant contributes to a reduced pull-down. Our quantification shows a reduction in association, although it is not statistically significant.

      A list of the antibodies was provided in the Methods section.

      We would like to clarify that we did not use an antibody cocktail in our experiments.

      (18) The title of Figure 6, "Loss of Miro1 leads to dysregulation of ETC activity under growth conditions," is vague. The term "dysregulation" should be replaced with a more specific mechanistic descriptor-what specific regulatory defect is meant?

      We thank the reviewer for this suggestion and rephrased the title.

      (19) In the results text for Figure 6, the authors state: "These data demonstrate that MIRO1 associates with MIB/MICOS and that this interaction promotes the formation of mitochondrial super complexes and the activity of ETC complex I." This conclusion is speculative and not mechanistically supported by the data presented.

      We appreciate the reviewer's feedback. We have revised the text to clarify the relationship between MIRO1, MIB/MICOS, supercomplex formation, and ETC activity. The updated text now states: "These data demonstrate that MIRO1 associates with MIB/MICOS. Additionally, MIRO1 promotes the formation of mitochondrial supercomplexes and enhances the activity of ETC complex I.”

      (20) In Figure 7A, it is unclear what the 3x siControl/siMiro1 pairs represent-are these different cell lines or technical replicates of the same line? No loading control is shown. If changes in mitochondrial protein abundance are being evaluated, using COX4 as a loading control is inappropriate. The uneven COX4 signal across samples further complicates interpretation

      Please note that we used primary cells, not cell lines. The three siControl/siMiro1 pairs represent independent cell isolations, each transfected with either siControl or. siMIRO1 mRNA. While the possibility of a difference in mitochondrial mass is an interesting question, the primary objective of this experiment is to demonstrate that the technique effectively results in the knockdown of Miro1, which is exclusively localized to mitochondria and not present in the cytosol. As such, we believe that Cox4 serves as a reasonable loading control. Although Miro1 knockdown may lead to a reduction in mitochondrial mass, the focus of this experiment is not to assess mitochondrial mass but to confirm the reduction in Miro1 protein levels on mitochondria. We also performed anti-VDAC immunoblots on the same membranes as alternative loading control (Author response image 1).

      Author response image 1.

      (21) Figure 7G is difficult to interpret. Why did the authors choose to use a sensor-based method instead of the chemiluminescent assay to measure ATP in these samples?

      Both methods were employed to assess ATP levels in human samples. ATP measurements obtained with luminescent assay are provided.

    1. Reviewer #2 (Public review):

      In the current study, the authors use an odor-guided sequence learning task described as a "figure 8" task to probe neuronal differences in latent state encoding within the orbitofrontal cortex after cocaine (n = 3) vs sucrose (n = 3) self-administration. The task uses six unique odors which are divided into two sequences that run in series. For both sequences, the 2nd and 3rd odors are the same and predict reward is not available at the reward port. The 1st and 4th odors are unique, and are followed by reward. Animals are well-trained before undergoing electrode implant and catheterization, and then retrained for two weeks prior to recording. The hypothesis under test is that cocaine-experienced animals will be less able to use the latent task structure to perform the task, and instead encode information about each unique sequence that is largely irrelevant. Behaviorally, both cocaine and sucrose-experienced rats show high levels of accuracy on task, with some group differences noted. When comparing reaction times and poke latencies between sequences, more variability was observed in the cocaine-treated group, implying animals treated these sequences somewhat differently. Analyses done at the single unit and ensemble level suggests that cocaine self-administration had increased the encoding of sequence-specific information, but decreased generalization across sequences. For example, the ability to decode odor position and sequence from neuronal firing in cocaine-treated animals was greater than controls. This pattern resembles that observed within the OFC of animals that had fewer training sessions. The authors then conducted tensor component analysis (TCA) to enable a more "hypothesis agnostic" evaluation of their data.

      Overall, the paper is well written and the authors do a good job of explaining quite complicated analyses so that the reader can follow their reasoning. I have the following comments.

      While well-written, the introduction mainly summarises the experimental design and results, rather than providing a summary of relevant literature that informed the experimental design. More details regarding the published effects of cocaine self-administration on OFC firing, and on tests of behavioral flexibility across species, would ground the paper more thoroughly in the literature and explain the need for the current experiment.

      For Fig 1F, it is hard to see the magnitude of the group difference with the graph showing 0-100%- can the y axis be adjusted to make this difference more obvious? It looks like the cocaine-treated animals were more accurate at P3- is that right?<br /> The concluding section is quite brief. The authors suggest that the failure to generalize across sequences observed in the current study could explain why people who are addicted to cocaine do not use information learned e.g. in classrooms or treatment programs to curtail their drug use. They do not acknowledge the limitations of their study e.g. use of male rats exclusively, or discuss alternative explanations of their data.

      Is it a problem that neuronal encoding of the "positions" i.e. the specific odors was at or near chance throughout in controls? Could they be using a simpler strategy based on the fact that two successive trials are rewarded, then two successive trials are not rewarded, such that the odors are irrelevant?

      When looking at the RT and poke latency graphs, it seems the cocaine-experienced rats were faster to respond to rewarded odors, and also faster to poke after P3. Does this mean they were more motivated by the reward?

    1. Or a computer program can repeat an action until a condition is met:

      This reminds me of when youtubers post videos of followers doing "day x until y" messages. I never considered the possibility that it was fake until now. If you combine this with the sleep feature and randomize the timeframe of the post, it could look very real. I also wonder if in the near future this could be done with AI to create automated videos.

    1. alleles

      Image

      Image

      Image

      Image

      English — alleles (thorough explanation)

      1) What alleles are (core idea)

      Alleles are different versions of the same gene. They control variations of a trait, such as eye color or flower color.

      Allele = one version of a gene

      Each gene can have two or more alleles, but an individual organism usually carries two alleles per gene (one from each parent).


      2) Where alleles are found

      • Genes are located on chromosomes
      • Alleles sit at the same position (locus) on homologous chromosomes
      • One allele comes from the mother
      • One allele comes from the father

      📌 This is why offspring show traits from both parents.


      3) Example of alleles (simple)

      Trait: Seed color

      • Y = yellow
      • y = green

      Possible allele combinations:

      • YY
      • Yy
      • yy

      These combinations affect the trait that appears.


      4) Dominant vs recessive alleles (Science 10 focus)

      Dominant allele

      • Shown with a capital letter (A)
      • Expressed if at least one copy is present

      Recessive allele

      • Shown with a lowercase letter (a)
      • Expressed only if two copies are present

      📌 Example:

      • Aa → dominant trait shows
      • aa → recessive trait shows

      5) Alleles, genotype, and phenotype (key relationship)

      | Term | Meaning | Example | | --------- | ------------------ | ---------- | | Allele | Version of a gene | A or a | | Genotype | Allele combination | Aa | | Phenotype | Physical trait | Brown eyes |

      Alleles determine the genotype, which determines the phenotype.


      6) Alleles in Punnett squares

      Punnett squares:

      • Show how alleles from parents combine
      • Predict possible offspring genotypes
      • Estimate trait probabilities

      📌 Example: Parents: Aa × Aa

      • Possible offspring: AA, Aa, Aa, aa

      7) Why alleles are important

      Alleles:

      • Explain variation within a species
      • Help predict inheritance patterns
      • Are the basis of genetics and evolution
      • Allow populations to adapt over time

      One-sentence exam summary

      Alleles are different versions of the same gene that determine variations in traits.


      中文 — alleles(等位基因) 详细解释

      1) 什么是等位基因(核心概念)

      等位基因(alleles)是指同一基因的不同版本,决定同一性状的不同表现。

      等位基因 = 同一基因的不同形式


      2) 等位基因在哪里

      • 基因位于染色体
      • 等位基因位于同源染色体的相同位置
      • 一个来自母亲,一个来自父亲

      3) 等位基因举例

      性状:豌豆高度

      • T = 高
      • t = 矮

      组合可能是:

      • TT
      • Tt
      • tt

      4) 显性与隐性等位基因(必考)

      • 显性等位基因:只要有一个就会表现
      • 隐性等位基因:必须两个都有才表现

      📌 Tt → 显性性状 📌 tt → 隐性性状


      5) 等位基因与性状的关系

      • 等位基因 → 基因型
      • 基因型 → 表现型

      一句话考试版总结

      等位基因是控制同一性状的不同基因形式。

      如果你愿意,我可以把 alleles → genotype → phenotype → Punnett squares 整理成 Science 10 中英对照闪卡或互动练习,直接用于复习或教学。


      等位基因(allele) EN: Different versions of the same gene that may produce different forms of a trait. Example: For pea flower colour, one allele codes for purple and another for white. 中文:位于同源染色体相同位置、控制同一性状的基因的不同形式。 例子:例如花色基因可以有紫花等位基因和白花等位基因。

    1. ith these definitions, τ 0Oe takesa positive value, by Ampère’s law,

      Naively, looks like it should take a negative value. H_oe points in -Y by amperes law. for a magnetization with phi=0, then by right hand rule the torque points down.

      However, torque and tau are related by a minus sign, because tau is defined as mhatdot

    1. Los síntomas de la hiperplasia prostática benigna pueden deberse al componente obstructivo de la próstata o a la respuesta secundaria de la vejiga a la resistencia en la salida. El componente obstructivo puede subdividirse en obstrucción mecánica y obstrucción dinámica. ++ Conforme ocurre el crecimiento prostático, la obstrucción mecánica puede deberse a la intrusión en la luz uretral o el cuello vesical, lo que aumenta la resistencia de la salida vesical. El tamaño de la próstata en el DRE tiene una escasa correlación con los síntomas. ++ El componente dinámico de la obstrucción prostática explica la naturaleza variable de los síntomas. El estroma prostático se compone de músculo liso y colágena, y tiene una rica inervación adrenérgica. Por consiguiente, el grado de estimulación autonómica establece un “tono” en la uretra prostática. El tratamiento α-bloqueador reduce este tono, lo que disminuye la resistencia en la salida. ++ Las molestias irritativas durante la micción de la hiperplasia prostática benigna provienen de la respuesta secundaria de la vejiga al aumento de la resistencia a la salida. La obstrucción de la salida vesical causa hipertrofia e hiperplasia del músculo detrusor, así como depósito de colágena. Esto último es el causante más probable de la disminución de la distensibilidad vesical, aunque también hay inestabilidad del detrusor.

      sintomatología

    1. After the primordium forms, SRY—a gene on the Y chromosome discovered in 1990, thanks to the participation of intersex XX males and XY females—might be activated.*
      1. this line speaks to the genre because the genre is scientific. The author uses a lot of research, quotes experts, and uses scientific rhetoric to defend their claims
    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      Here, the authors have addressed the recruitment and firing patterns of motor units (MUs) from the long and lateral heads of the triceps in the mouse. They used their newly developed Myomatrix arrays to record from these muscles during treadmill locomotion at different speeds, and they used template-based spike sorting (Kilosort) to extract units. Between MUs from the two heads, the authors observed differences in their firing rates, recruitment probability, phase of activation within the locomotor cycle, and interspike interval patterning. Examining different walking speeds, the authors find increases in both recruitment probability and firing rates as speed increases. The authors also observed differences in the relation between recruitment and the angle of elbow extension between motor units from each head. These differences indicate meaningful variation between motor units within and across motor pools and may reflect the somewhat distinct joint actions of the two heads of triceps.

      Strengths:

      The extraction of MU spike timing for many individual units is an exciting new method that has great promise for exposing the fine detail in muscle activation and its control by the motor system. In particular, the methods developed by the authors for this purpose seem to be the only way to reliably resolve single MUs in the mouse, as the methods used previously in humans and in monkeys (e.g. Marshall et al. Nature Neuroscience, 2022) do not seem readily adaptable for use in rodents.

      The paper provides a number of interesting observations. There are signs of interesting differences in MU activation profiles for individual muscles here, consistent with those shown by Marshall et al. It is also nice to see fine-scale differences in the activation of different muscle heads, which could relate to their partially distinct functions. The mouse offers greater opportunities for understanding the control of these distinct functions, compared to the other organisms in which functional differences between heads have previously been described.

      The Discussion is very thorough, providing a very nice recounting of a great deal of relevant previous results.

      We thank the Reviewer for these comments.

      Weaknesses:

      The findings are limited to one pair of muscle heads. While an important initial finding, the lack of confirmation from analysis of other muscles acting at other joints leaves the general relevance of these findings unclear.

      The Reviewer raises a fair point. While outside the scope of this paper, future studies should certainly address a wider range of muscles to better characterize motor unit firing patterns across different sets of effectors with varying anatomical locations. Still, the importance of results from the triceps long and lateral heads should not be understated as this paper, to our knowledge, is the first to capture the difference in firing patterns of motor units across any set of muscles in the locomoting mouse.

      While differences between muscle heads with somewhat distinct functions are interesting and relevant to joint control, differences between MUs for individual muscles, like those in Marshall et al., are more striking because they cannot be attributed potentially to differences in each head's function. The present manuscript does show some signs of differences for MUs within individual heads: in Figure 2C, we see what looks like two clusters of motor units within the long head in terms of their recruitment probability. However, a statistical basis for the existence of two distinct subpopulations is not provided, and no subsequent analysis is done to explore the potential for differences among MUs for individual heads.

      We agree with the Reviewer and have revised the manuscript to better examine potential subpopulations of units within each muscle as presented in Figure 2C. We performed Hartigan’s dip test on motor units within each muscle to test for multimodal distributions. For both muscles, p > 0.05, so we cannot reject the null hypothesis that the units in each muscle come from a multimodal distribution. However, Hartigan’s test and similar statistical methods have poor statistical power for the small sample sizes (n=17 and 16 for long and lateral heads, respectively) considered here, so the failure to achieve statistical significance might reflect either the absence of a true difference or a lack of statistical resolution.

      Still, the limited sample size warrants further data collection and analysis since the varying properties across motor units may lead to different activation patterns. Given these results, we have edited the text as follows:

      “A subset of units, primarily in the long head, were recruited in under 50% of the total strides and with lower spike counts (Figure 2C). This distribution of recruitment probabilities might reflect a functionally different subpopulation of units. However, the distribution of recruitment probabilities were not found to be significantly multimodal (p>0.05 in both cases, Hartigan’s dip test; Hartigan, 1985). However, Hartigan’s test and similar statistical methods have poor statistical power for the small sample sizes (n=17 and 16 for long and lateral heads, respectively) considered here, so the failure to achieve statistical significance might reflect either the absence of a true difference or a lack of statistical resolution.”

      The statistical foundation for some claims is lacking. In addition, the description of key statistical analysis in the Methods is too brief and very hard to understand. This leaves several claims hard to validate.

      We thank the Reviewer for these comments and have clarified the text related to key statistical analyses throughout the manuscript, as described in our other responses below.

      Reviewer #2 (Public review):

      The present study, led by Thomas and collaborators, aims to describe the firing activity of individual motor units in mice during locomotion. To achieve this, they implanted small arrays of eight electrodes in two heads of the triceps and performed spike sorting using a custom implementation of Kilosort. Simultaneously, they tracked the positions of the shoulder, elbow, and wrist using a single camera and a markerless motion capture algorithm (DeepLabCut). Repeated one-minute recordings were conducted in six mice at five different speeds, ranging from 10 to 27.5 cm·s<sup>-1</sup>.

      From these data, the authors reported that:

      (1) a significant portion of the identified motor units was not consistently recruited across strides,

      (2) motor units identified from the lateral head of the triceps tended to be recruited later than those from the long head,

      (3) the number of spikes per stride and peak firing rates were correlated in both muscles, and

      (4) the probability of motor unit recruitment and firing rates increased with walking speed.

      The authors conclude that these differences can be attributed to the distinct functions of the muscles and the constraints of the task (i.e., speed).

      Strengths:

      The combination of novel electrode arrays to record intramuscular electromyographic signals from a larger muscle volume with an advanced spike sorting pipeline capable of identifying populations of motor units.

      We thank the Reviewer for this comment.

      Weaknesses:

      (1) There is a lack of information on the number of identified motor units per muscle and per animal.

      The Reviewer is correct that this information was not explicitly provided in the prior submission. We have therefore added Table 1 that quantifies the number of motor units per muscle and per animal.

      (2) All identified motor units are pooled in the analyses, whereas per-animal analyses would have been valuable, as motor units within an individual likely receive common synaptic inputs. Such analyses would fully leverage the potential of identifying populations of motor units.

      Please see our answer to the following point, where we address questions (2) and (3) together.

      (3) The current data do not allow for determining which motor units were sampled from each pool. It remains unclear whether the sample is biased toward high-threshold motor units or representative of the full pool.

      We thank the Reviewer for these comments. To clarify how motor unit responses were distributed across animals and muscle targets, we updated or added the following figures:  

      Figure 2C

      Figure 4–figure supplement 1

      Figure 5–figure supplement 2

      Figure 6–figure supplement 2

      These provide a more complete look at the range of activity within each motor pool, suggesting that we do measure from units with different activation thresholds within the same motor pool, rather than this variation being due to cross-animal differences. For example, Figure 2C illustrates that motor units from the same muscle and animal show a wide variety of recruitment probabilities. However, the limited number of motor units recorded from each individual animal does not allow a statistically rigorous test for examining cross-animal differences.

      (4) The behavioural analysis of the animals relies solely on kinematics (2D estimates of elbow angle and stride timing). Without ground reaction forces or shoulder angle data, drawing functional conclusions from the results is challenging.

      The Reviewer is correct that we did not measure muscular force generation or ground reaction forces in the present study. Although outside the scope of this study, future work might employ buckle force transducers as used in larger animals (Biewener et al., 1988; Karabulut et al., 2020) to examine the complex interplay between neural commands, passive biomechanics, and the complex force-generating properties of muscle tissue.

      Major comments:

      (1) Spike sorting

      The conclusions of the study rely on the accuracy and robustness of the spike sorting algorithm during a highly dynamic task. Although the pipeline was presented in a previous publication (Chung et al., 2023, eLife), a proper validation of the algorithm for identifying motor unit spikes is still lacking. This is particularly important in the present study, as the experimental conditions involve significant dynamic changes. Under such conditions, muscle geometry is altered due to variations in both fibre pennation angles and lengths.

      This issue differs from electrode drift, and it is unclear whether the original implementation of Kilosort includes functions to address it. Could the authors provide more details on the various steps of their pipeline, the strategies they employed to ensure consistent tracking of motor unit action potentials despite potential changes in action potential waveforms, and the methods used for manual inspection of the spike sorting algorithm's output?

      This is an excellent point and we agree that the dynamic behavior used in this investigation creates potential new challenges for spike sorting. In our analysis, Kilosort 2.5 provides key advantages in comparing unit waveforms across multiple channels and in detecting overlapping spikes. We modified this version of Kilosort to construct unit waveform templates using only the channels within the same muscle (Chung et al., 2023), as clarified in the revised Methods section (see “Electromyography (EMG)”):

      “A total of 33 units were identified across all animals. Each unit’s isolation was verified by confirming that no more than 2% of inter-spike intervals violated a 1 ms refractory limit. Additionally, we manually reviewed cross-correlograms to ensure that each waveform was only reported as a single motor unit.”

      The Reviewer is correct that our ability to precisely measure a unit’s activity based on its waveform will depend on the relationship between the embedded electrode and the muscle geometry, which alters over the course of the stride. As a follow-up to the original text, we have included new analyses to characterize the waveform activity throughout the experiment and stride (also in Methods):

      “We further validated spike sorting by quantifying the stability of each unit’s waveform across time (Figure 1–figure supplement 1). First, we calculated the median waveform of each unit across every trial to capture long-term stability of motor unit waveforms. Additionally, we calculated the median waveform through the stride binned in 50 ms increments using spiking from a single trial. This second metric captures the stability of our spike sorting during the rapid changes in joint angles that occur during the burst of an individual motor unit. In doing so, we calculated each motor unit’s waveforms from the single channel in which that unit’s amplitude was largest and did not attempt to remove overlapping spikes from other units before measuring the median waveform from the data. We then calculated the correlation between a unit’s waveform over either trials or bins in which at least 30 spikes were present. The high correlation of a unit waveform over time, despite potential changes in the electrodes’ position relative to muscle geometry over the dynamic task, provides additional confidence in both the stability of our EMG recordings and the accuracy of our spike sorting.”

      (2) Yield of the spike sorting pipeline and analyses per animal/muscle

      A total of 33 motor units were identified from two heads of the triceps in six mice (17 from the long head and 16 from the lateral head). However, precise information on the yield per muscle per animal is not provided. This information is crucial to support the novelty of the study, as the authors claim in the introduction that their electrode arrays enable the identification of populations of motor units. Beyond reporting the number of identified motor units, another way to demonstrate the effectiveness of the spike sorting algorithm would be to compare the recorded EMG signals with the residual signal obtained after subtracting the action potentials of the identified motor units, using a signal-to-residual ratio.

      Furthermore, motor units identified from the same muscle and the same animal are likely not independent due to common synaptic inputs. This dependence should be accounted for in the statistical analyses when comparing changes in motor unit properties across speeds and between muscles.

      We thank the Reviewer for this comment. Regarding motor unit yield, as described above the newly-added Table 1 displays the yield from each animal and muscle.

      Regarding spike sorting, while signal-to-residual is often an excellent metric, it is not ideal for our high-resolution EMG signals since isolated single motor units are typically superimposed on a “bulk” background consisting of the low-amplitude waveforms of other motor units. Because these smaller units typically cannot be sorted, it is challenging to estimate the “true” residual after subtracting (only) the largest motor unit, since subtracting each sorted unit’s waveform typically has a very small effect on the RMS of the total EMG signal. To further address concerns regarding spike sorting quality, we added Figure 1–figure supplement 1 that demonstrates motor units’ consistency over the experiment, highlighting that the waveform maintains its shape within each stride despite muscle/limb dynamics and other possible sources of electrical noise or artifact.

      Finally, the Reviewer is correct that individual motor units in the same muscle are very likely to receive common synaptic inputs. These common inputs may reflect in sparse motor units being recruited in overlapping rather than different strides. Indeed, in the following added to the Results, we identified that motor units are recruited with higher probability when additional units are recruited.

      “Probabilistic recruitment is correlated across motor units

      Our results show that the recruitment of individual motor units is probabilistic even within a single speed quartile (Figure 5A-C) and predicts body movements (Figure 6), raising the question of whether the recruitment of individual motor units are correlated or independent. Correlated recruitment might reflect shared input onto the population of motor units innervating the muscle (De Luca, 1985; De Luca & Erim, 1994; Farina et al., 2014). For example, two motor units, each with low recruitment probabilities, may still fire during the same set of strides. To assess the independence of motor unit recruitment across the recorded population, we compared each unit’s empirical recruitment probability across all strides to its conditional recruitment probability during strides in which another motor unit from the same muscle was recruited (Figure 7). Doing this for all motor unit pairs revealed that motor units in both muscles were biased towards greater recruitment when additional units were active (p<0.001, Wilcoxon signed-rank tests for both the lateral and long heads of triceps). This finding suggests that probabilistic recruitment reflects common synaptic inputs that covary together across locomotor strides.”

      (3) Representativeness of the sample of identified motor units

      However, to draw such conclusions, the authors should exclusively compare motor units from the same pool and systematically track violations of the recruitment order. Alternatively, they could demonstrate that the motor units that are intermittently active across strides correspond to the smallest motor units, based on the assumption that these units should always be recruited due to their low activation thresholds.

      One way to estimate the size of motor units identified within the same muscle would be to compare the amplitude of their action potentials, assuming that all motor units are relatively close to the electrodes (given the selectivity of the recordings) and that motoneurons innervating more muscle fibres generate larger motor unit action potentials.

      We thank the Reviewer for this comment. Below, we provide more detailed analyses of the relationships between motor unit spike amplitude and the recruitment probability as well as latency (relative to stride onset) of activation.

      We generated the below figures to illustrate the relationship between the amplitude of motor units and their firing properties. As suspected, units with larger-amplitude waveforms fired with lower probability and produced their first spikes later in the stride. If we were comfortable assuming that larger spike amplitudes mean higher-force units, then this would be consistent with a key prediction of the size principle (i.e. that higher-force units are recruited later). However, we are hesitant to base any conclusions on this assumption or emphasize this point with a main-text figure, since EMG signal amplitude may also vary due to the physical properties of the electrode and distance from muscle fibers. Thus it is possible that a large motor unit may have a smaller waveform amplitude relative to the rest of the motor pool.

      Author response image 1.

      Relation between motor unit amplitude and (A) recruitment probability and (B) mean first spike time within the stride. Colored lines indicate the outcome of linear regression analyses.

      Currently, the data seem to support the idea that motor units that are alternately recruited across strides have recruitment thresholds close to the level of activation or force produced during slow walking. The fact that recruitment probability monotonically increases with speed suggests that the force required to propel the mouse forward exceeds the recruitment threshold of these "large" motor units. This pattern would primarily reflect spatial recruitment following the size principle rather than flexible motor unit control.

      We thank the Reviewer for this comment. We agree with this interpretation, particularly in relation to the references suggested in later comments, and have added the following text to the Discussion to better reflect this argument:

      “To investigate the neuromuscular control of locomotor speed, we quantified speed-dependent changes in both motor unit recruitment and firing rate. We found that the majority of units were recruited more often and with larger firing rates at faster speeds (Figure 5, Figure5–figure supplement 1). This result may reflect speed-dependent differences in the common input received by populations of motor neurons with varying spiking thresholds (Henneman et al., 1965). In the case of mouse locomotion, faster speeds might reflect a larger common input, increasing the recruitment probability as more neurons, particularly those that are larger and generate more force, exceed threshold for action potentials (Farina et al., 2014).”

      (4) Analysis of recruitment and firing rates

      The authors currently report active duration and peak firing rates based on spike trains convolved with a Gaussian kernel. Why not report the peak of the instantaneous firing rates estimated from the inverse of the inter-spike interval? This approach appears to be more aligned with previous studies conducted to describe motor unit behaviour during fast movements (e.g., Desmedt & Godaux, 1977, J Physiol; Van Cutsem et al., 1998, J Physiol; Del Vecchio et al., 2019, J Physiol).

      We thank the Reviewer for this comment. In the revised Discussion (see ‘Firing rates in mouse locomotion compared to other species’) we reference several examples of previous studies that quantified spike patterns based on the instantaneous firing rate. We chose to report the peak of the smoothed firing rate because that quantification includes strides with zero spikes or only one spike, which occur regularly in our dataset (and for which ISI rate measures, which require two spikes to define an instantaneous firing rate, cannot be computed). Regardless, in the revised Figure 4B, we present an analysis that uses inter-spike intervals as suggested, which yielded similar ranges of firing rates as the primary analysis.

      (5) Additional analyses of behaviour

      The authors currently analyse motor unit recruitment in relation to elbow angle. It would be valuable to include a similar analysis using the angular velocity observed during each stride, re broadly, comparing stride-by-stride changes in firing rates with changes in elbow angular velocity would further strengthen the final analyses presented in the results section.

      We thank the Reviewer for this comment. To address this, we have modified Figure 6 and the associated Supplemental Figures, to show relationships in unit activation with both the range of elbow extension and the range of elbow velocity for each stride. These new Supplemental Figures show that the trends shown in main text Figure 6C and 6E (which show data from all speed quartiles on the same axes) are also apparent in both the slower and faster quartiles individually, although single-quartile statistical tests (with smaller sample size than the main analysis) not reach statistical significance in all cases.

      Reviewer #3 (Public review):

      Summary:

      Using the approach of Myomatrix recording, the authors report that:

      (1) Motor units are recruited differently in the two types of muscles.

      (2) Individual units are probabilistically recruited during the locomotion strides, whereas the population bulk EMG has a more reliable representation of the muscle.

      (3) The recruitment of units was proportional to walking speed.

      Strengths:

      The new technique provides a unique data set, and the data analysis is convincing and well-performed.

      We thank the Reviewer for the comment.

      Weaknesses:

      The implications of "probabilistical recruitment" should be explored, addressed, and analyzed further.

      Comments:

      One of the study's main findings (perhaps the main finding) is that the motor units are "probabilistically" recruited. The authors do not define what they mean by probabilistically recruited, nor do they present an alternative scenario to such recruitment or discuss why this would be interesting or surprising. However, on page 4, they do indicate that the recruitment of units from both muscles was only active in a subset of strides, i.e., they are not reliably active in every step.

      If probabilistic means irregular spiking, this is not new. Variability in spiking has been seen numerous times, for instance in human biceps brachii motor units during isometric contractions (Pascoe, Enoka, Exp physiology 2014) and elsewhere. Perhaps the distinction the authors are seeking is between fluctuation-driven and mean-driven spiking of motor units as previously identified in spinal motor networks (see Petersen and Berg, eLife 2016, and Berg, Frontiers 2017). Here, it was shown that a prominent regime of irregular spiking is present during rhythmic motor activity, which also manifests as a positive skewness in the spike count distribution (i.e., log-normal).

      We thank the Reviewer for this comment and have clarified several passages in response. The Reviewer is of course correct that irregular motor unit spiking has been described previously and may reflect motor neurons’ operating in a high-sensitivity (fluctuation-driven) regime. We now cite these papers in the Discussion (see ‘Firing rates in mouse locomotion compared to other species’). Additionally, the revision clarifies that “probabilistically” - as defined in our paper - refers only to the empirical observation that a motor unit spikes during only a subset of strides, either when all locomotor speeds are considered together (Figure 2) or separately (Figure 5A-C):

      “Motor units in both muscles exhibited this pattern of probabilistic recruitment (defined as a unit’s firing on only a fraction of strides), but with differing distributions of firing properties across the long and lateral heads (Figure 2).”

      “Our findings (Figure 4) highlight that even with the relatively high firing rates observed in mice, there are still significant changes in firing rate and recruitment probability across the spikes within bursts (Figure 4B) and across locomotor speeds (Figure 5F). Future studies should more carefully examine how these rapidly changing spiking patterns derive from both the statistics of synaptic inputs and intrinsic properties of motor neurons (Manuel & Heckman, 2011; Petersen & Berg, 2016; Berg, 2017).”

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      As mentioned above, there are several issues with the statistics that need to be corrected to properly support the claims made in the paper.

      The authors compare the fractions of MUs that show significant variation across locomotor speeds in their firing rate and recruitment probability. However, it is not statistically founded to compare the results of separate statistical tests based on different kinds of measurements and thus have unconstrained differences in statistical power. The comparison of the fractional changes in firing rates and recruitment across speeds that follow is helpful, though in truth, by contemporary standards, one would like to see error bars on these estimates. These could be generated using bootstrapping.

      The Reviewer is correct, and we have revised the manuscript to better clarify which quantities should or should not be compared, including the following passage (see “Motor unit mechanisms of speed control” in Results):

      “Speed-dependent increases in peak firing rate were therefore also present in our dataset, although in a smaller fraction of motor units (22/33) than changes in recruitment probability (31/33). Furthermore, the mean (± SE) magnitude of speed-dependent increases was smaller for spike rates (mean rate<sub>fast</sub>/rate<sub>slow</sub> of 111% ± 20% across all motor units) than for recruitment probabilities (mean p(recruitment) <sub>fast</sub>/p(recruitment) <sub>slow</sub> of 179% ± 3% across all motor units). While fractional changes in rate and recruitment probability are not readily comparable given their different upper limits, these findings could suggest that while both recruitment and peak rate change across speed quartiles, increased recruitment probability may play a larger role in driving changes in locomotor speed.”

      The description in the Methods of the tests for variation in firing rates and recruitment probability across speeds are extremely hard to understand - after reading many times, it is still not clear what was done, or why the method used was chosen. In the main text, the authors quote p-values and then state "bootstrap confidence intervals," which is not a statistical test that yields a p-value. While there are mathematical relationships between confidence intervals and statistical tests such that a one-to-one correspondence between them can exist, the descriptions provided fall short of specifying how they are related in the present instance. For this reason, and those described in what follows, it is not clear what the p-values represent.

      Next, the authors refer to fitting a model ("a Poisson distribution") to the data to estimate firing rate and recruitment probability, that the model results agree with their actual data, and that they then bootstrapped from the model estimates to get confidence intervals and compute p-values. Why do this? Why not just do something much simpler, like use the actual spike counts, and resample from those? I understand that it is hard to distinguish between no recruitment and just no spikes given some low Poisson firing rate, but how does that challenge the ability to test if the firing rates or the number of spiking MUs changes significantly across speeds? I can come up with some reasons why I think the authors might have decided to do this, but reasoning like this really should be made explicit.

      In addition, the authors would provide an unambiguous description of the model, perhaps using an equation and a description of how it was fit. For the bootstrapping, a clear description of how the resampling was done should be included. The focus on peak firing rate instead of mean (or median) firing rate should also be justified. Since peaks are noisier, I would expect the statistical power to be lower compared to using the mean or median.

      We thank the Reviewer for the comments and have revised and expanded our discussion of the statistical tests employed. We expanded and clarified our description of these techniques in the updated Methods section:

      “Joint model of rate and recruitment

      We modeled the recruitment probability and firing rate based on empirical data to best characterize firing statistics within the stride. Particularly, this allowed for multiple solutions to explain why a motor unit would not spike within a stride. From the empirical data alone, strides with zero spikes would have been assumed to have no recruitment of a unit. However, to create a model of motor unit activity that includes both recruitment and rate, it must be possible that a recruited unit can have a firing rate of zero. To quantify the firing statistics that best represent all spiking and non-spiking patterns, we modeled recruitment probability and peak firing rate along the following piecewise function:

      where y denotes the observed peak firing rate on a given stride (determined by convolving motor unit spike times with a Gaussian kernel as described above), p denotes the probability of recruitment, and λ denotes the expected peak firing rate from a Poisson distribution of outcomes. Thus, an inactive unit on a given stride may be the result of either non-recruitment or recruitment with a stochastically zero firing rate. The above equations were fit by minimizing the negative log-likelihood of the parameters given the data.

      “Permutation test for joint model of rate and recruitment and type 2 regression slopes

      To quantify differences in firing patterns across walking speeds, we subdivided each mouse’s total set of strides into speed quartiles and calculated rate (𝜆, Eq. 1 and 2, Fig. 5A-C) and recruitment probability terms (p, Eq. 1 and 2, Fig. 5D-F) for each unit in each speed quartile. Here we calculated the difference in both the rate and recruitment terms across the fastest and slowest speed quartiles (p<sub>fast</sub>-p<sub>slow</sub> and 𝜆<sub>fast</sub>-𝜆<sub>slow</sub>). To test whether these model parameters were significantly different depending on locomotor speed, we developed a null model combining strides from both the fastest and slowest speed quartiles. After pooling strides from both quartiles, we randomly distributed the pooled set of strides into two groups with sample sizes equal to the original slow and fast quartiles. We then calculated the null model parameters for each new group and found the difference between like terms. To estimate the distribution of possible differences, we bootstrapped this result using 1000 random redistributions of the pooled set of strides. Following the permutation test, the 95% confidence interval of this final distribution reflects the null hypothesis of no difference between groups. Thus, the null hypothesis can be rejected if the true difference in rate or recruitment terms exceeds this confidence interval.

      We followed a similar procedure to quantify cross-muscle differences in the relationship between firing parameters. For each muscle, we estimated the slope across firing parameters for each motor unit using type 2 regression. In this case, the true difference was the difference in slopes between muscles. To test the null hypothesis that there was no difference in slopes, the null model reflected the pooled set of units from both muscles. Again, slopes were calculated for 1000 random resamplings of this pooled data to estimate the 95% confidence interval.”

      The argument for delayed activation of the lateral head is interesting, but I am not comfortable saying the nervous system creates a delay just based on observations of the mean time of the first spike, given the potential for differential variability in spike timing across muscles and MUs. One way to make a strong case for a delay would be to show aggregate PSTHs for all the spikes from all the MUs for each of the two heads. That would distinguish between a true delay and more gradual or variable activation between the heads.

      This is a good point and we agree that the claim made about the nervous system is too strong given the results. Even with Author response image 2 below that the Reviewer suggested, there is still not enough evidence to isolate the role of the nervous system in the muscles’ activation.

      Author response image 2.

      Aggregate peristimulus time histogram (PSTH) for all motor unit spike times in the long head (top) and lateral head (bottom) within the stride.

      In the ideal case, we would have more simultaneous recordings from both muscles to make a more direct claim on the delay. Still, within the current scope of the paper, to correct this and better describe the difference in timing of muscle activity, we edited the text to the following:

      “These findings demonstrate that despite the synergistic (extensor) function of the long and lateral heads of the triceps at the elbow, the motor pool for the long head becomes active roughly 100 ms before the motor pool supplying the lateral head during locomotion (Figure 3C).”

      The results from Marshall et al. 2022 suggest that the recruitment of some MUs is not just related to muscle force, but also the frequency of force variation - some of their MUs appear to be recruited only at certain frequencies. Figure 5C could have shown signs of this, but it does not appear to. We do not really know the force or its frequency of variation in the measurements here. I wonder whether there is additional analysis that could address whether frequency-dependent recruitment is present. It may not be addressable with the current data set, but this could be a fruitful direction to explore in the future with MU recordings from mice.

      We agree that this would be a fruitful direction to explore, however the Reviewer is correct that this is not easily addressable with the dataset. As the Reviewer points out, stride frequency increases with increased speed, potentially offering the opportunity to examine how motor unit activity varies with the frequency, phase, and amplitude of locomotor movements. However, given our lack of force data (either joint torques or ground reaction forces), dissociating the frequency/phase/amplitude of skeletal kinematics from the frequency/phase/amplitude of muscle force. Marshall et al. (2022) mitigated these issues by using an isometric force-production task (Marshall et al., 2022). Therefore, while we agree that it would be a major contribution to extend such investigations to whole-body movements like locomotion, given the complexities described above we believe this is a project for the future, and beyond the scope of the present study.

      Minor:

      Page 5: "Units often displayed no recruitment in a greater proportion of strides than for any particular spike count when recruited (Figures 2A, B)," - I had to read this several times to understand it. I suggest rephrasing for clarity.

      We have changed the text to read:

      “Units demonstrated a variety of firing patterns, with some units producing 0 spikes more frequently than any non-zero spike count (Figure 2A, B),...”

      Figure 3 legend: "Mean phase ({plus minus} SE) of motor unit burst duration across all strides.": It is unclear what this means - durations are not usually described as having a phase. Do we mean the onset phase?

      We have changed the text to read:

      “Mean phase ± SE of motor unit burst activity within each stride”

      Page 9: "suggesting that the recruitment of individual motor units in the lateral and long heads might have significant (and opposite) effects on elbow angle in strides of similar speed (see Discussion)." I wouldn't say "opposite" here - that makes it sound like the authors are calling the long head a flexor. The authors should rephrase or clarify the sense in which they are opposite.

      This is a fair point and we agree we should not describe the muscles as ‘opposite’ when both muscles are extensors. We have removed the phrase ‘and opposite’ from the text.

      Page 11: "in these two muscles across in other quadrupedal species" - typo.

      We have corrected this error.

      Page 16: This reviewer cannot decipher after repeated attempts what the first two sentences of the last paragraph mean. - “Future studies might also use perturbations of muscle activity to dissociate the causal properties of each motor unit’s activity from the complex correlation structure of locomotion. Despite the strong correlations observed between motor unit recruitment and limb kinematics (Fig. 6, Supplemental Fig. 3), these results might reflect covariations of both factors with locomotor speed rather than the causal properties of the recorded motor unit.”

      For better clarity, we have changed the text to read:

      “Although strong correlations were observed between motor unit recruitment and limb kinematics during locomotion (Figure 6, Figure 6–figure supplement 1), it remains unclear whether such correlations actually reflect the causal contributions that those units make to limb movement. To resolve this ambiguity, future studies could use electrical or optical perturbations of muscle contraction levels (Kim et al., 2024; Lu et al., 2024; Srivastava et al., 2015, 2017) to test directly how motor unit firing patterns shape locomotor movements. The short-latency effects of patterned motor unit stimulation (Srivastava et al., 2017) could then reveal the sensitivity of behavior to changes in muscle spiking and the extent to which the same behaviors can be performed with many different motor commands.”

      Reviewer #2 (Recommendations for the authors):

      Minor comments:

      Introduction:

      (1) "Although studies in primates, cats, and zebrafish have shown that both the number of active motor units and motor unit firing rates increase at faster locomotor speeds (Grimby, 1984; Hoffer et al., 1981, 1987; Marshall et al., 2022; Menelaou & McLean, 2012)." I would remove Marshall et al. (2022) as their monkeys performed pulling tasks with the upper limb. You can alternatively remove locomotor from the sentence and replace it with contraction speed.

      Thank you for the comment. While we intended to reference this specific paper to highlight the rhythmic activity in muscles, we agree that this deviates from ‘locomotion’ as it is referenced in the other cited papers which study body movement. We have followed the Reviewer’s suggestion to remove the citation to Marshall et al.

      (2) "The capability and need for faster force generation during dynamic behavior could implicate motor unit recruitment as a primary mechanism for modulating force output in mice."

      The authors could add citations to this sentence, of works that showed that recruitment speed is the main determinant of the rate of force development (see for example Dideriksen et al. (2020) J Neurophysiol; J. L. Dideriksen, A. Del Vecchio, D. Farina, Neural and muscular determinants of maximal rate of force development. J Neurophysiol 123, 149-157 (2020)).

      Thank you for pointing out this important reference. We have included this as a citation as recommended.

      Results:

      (3) "Electrode arrays (32-electrode Myomatrix array model RF-4x8-BHS-5) were implanted in the triceps brachii (note that Figure 1D shows the EMG signal from only one of the 16 bipolar recording channels), and the resulting data were used to identify the spike times of individual motor units (Figure 1E) as described previously (Chung et al., 2023)."

      This sentence can be misleading for the reader as the array used by the researchers has 4 threads of 8 electrodes. Would it be possible to specify the number of electrodes implanted per head of interest? I assume 8 per head in most mice (or 4 bipolar channels), even if that's not specifically written in the manuscript.

      Thank you for the suggestion. As described above, we have added Table 1, which includes all array locations, and we edited the statement referenced in the comment as follows:

      “Electrode arrays (32-electrode Myomatrix array model RF-4x8-BHS-5) were implanted in forelimb muscles (note that Figure 1D shows the EMG signal from only one of the 16 bipolar recording channels), and the resulting data were used to identify the spike times of individual motor units in the triceps brachii long and lateral heads (Table 1, Figure 1E) as described previously (Chung et al., 2023).“

      (4) "These findings demonstrate that despite the overlapping biomechanical functions of the long and lateral heads of the triceps, the nervous system creates a consistent, approximately 100 ms delay (Figure 3C) between the activation of the two muscles' motor neuron pools. This timing difference suggests distinct patterns of synaptic input onto motor neurons innervating the lateral and long heads."

      Both muscles don't have fully overlapping biomechanical functions, as one of them also acts on the shoulder joint. Please be more specific in this sentence, saying that both muscles are synergistic at the elbow level rather than "have overlapping biomechanical functions".

      We agree with the above reasoning and that our manuscript should be clearer on this point. We edited the above text in accordance with the Reviewer suggestion as follows:

      "These findings demonstrate that despite the synergistic (extensor) function of the long and lateral heads of the triceps at the elbow, …”  

      (5) "Together with the differences in burst timing shown in Figure 3B, these results again suggest that the motor pools for the lateral and long heads of the triceps receive distinct patterns of synaptic input, although differences in the intrinsic physiological properties of motor neurons innervating the two muscles might also play an important role."

      It is difficult to draw such an affirmative conclusion on the synaptic inputs from the data presented by the authors. The differences in firing rates may solely arise from other factors than distinct synaptic inputs, such as the different intrinsic properties of the motoneurons or the reception of distinct neuromodulatory inputs.

      To better explain our findings, we adjusted the above text in the Results (see “Motor unit firing patterns in the long and lateral heads of the triceps”):

      “Together with the differences in burst timing shown in Figure 3B, these results again suggest that the motor pools for the lateral and long heads of the triceps receive distinct patterns of synaptic input, although differences in the intrinsic physiological properties of motor neurons innervating the two muscles might also play an important role.”

      We also included the following distinction in the Discussion (see “Differences in motor unit activity patterns across two elbow extensors”) to address the other plausible mechanisms mentioned.

      “The large differences in burst timing and spike patterning across the muscle heads suggest that the motor pools for each muscle receive distinct inputs. However, differences in the intrinsic physiological properties of motor units and neuromodulatory inputs across motor pools might also make substantial contributions to the structure of motor unit spike patterns (Martínez-Silva et al., 2018; Miles & Sillar, 2011).”

      (6) "We next examined whether the probabilistic recruitment of individual motor units in the triceps and elbow extensor muscle predicted stride-by-stride variations in elbow angle kinematics."

      I'm not sure that the wording is appropriate here. The analysis does not predict elbow angle variations from parameters extracted from the spiking activity. It rather compares the average elbow angle between two conditions (motor unit active or not active).

      We thank the Reviewer for this comment and agree that the wording could be improved here to better reflect our analysis. To lower the strength of our claim, we replaced usage of the word ‘predict’ with ‘correlates’ in the above text and throughout the paper when discussing this result.

      Methods:

      (7) "Using the four threads on the customizable Myomatrix array (RF-4x8-BHS-5), we implanted a combination of muscles in each mouse, sometimes using multiple threads within the same muscle. [...] Some mice also had threads simultaneously implanted in their ipsilateral or contralateral biceps brachii although no data from the biceps is presented in this study."

      A precise description of the localisation of the array (muscles and the number of arrays per muscle) for each animal would be appreciated.

      (8) "A total of 33 units were identified and manually verified across all animals." A precise description of the number of motor units concurrently identified per muscle and per animal would be appreciated. Moreover, please add details on the manual inspection. Does it involve the manual selection of missing spikes? What are the criteria for considering an identified motor unit as valid?

      As discussed earlier, we added Table 1 to the main text to provide the details mentioned in the above comments.

      Regarding spike sorting, given the very large number of spikes recorded, we did not rely on manual adjusting mislabeled spikes. Instead, as described in the revised Methods section, we verified unit isolation by ensuring units had >98% of spikes outside of 1ms of each other. Moreover, as described above we have added new analyses (Figure 1–figure supplement 1) confirming the stability of motor unit waveforms across both the duration of individual recording sessions (roughly 30 minutes) and across the rapid changes in limb position within individual stride cycles (roughly 250 msec).

      Reviewer #3 (Recommendations for the authors):

      Figure 2 (and supplement) show spike count distributions with strong positive skewness, which is in accordance with the prediction of a fluctuation-driven regime. I suggest plotting these on a logarithmic x-axis (in addition to the linear axis), which should reveal a bell-shaped distribution, maybe even Gaussian, in a majority of the units.

      We thank the Reviewer for the suggestion. We present the requested analysis below, which shows bell-shaped distributions for some (but not all) distributions. However, we believe that investigating why some replotted distributions are Gaussian and others are not falls beyond the scope of this paper, and likely requires a larger dataset than the one we were able to obtain.

      Author response image 3.

      Spike count distributions for each motor unit on a logarithmic x-axis.

      Why not more data? I tried to get an overview of how much data was collected.

      Supplemental Figure 1 has all the isolated units, which amounts to 38 (are the colors the two muscle types?). Given there are 16 leads in each myomatrix, in two muscles, of six mice, this seems like a low yield. Could the authors comment on the reasons for this low yield?

      Regarding motor unit yield, even with multiple electrodes per muscle and a robust sorting algorithm, we often isolated only a few units per muscle. This yield likely reflects two factors. First, because of the highly dynamic nature of locomotion and high levels of muscle contraction, isolating individual spikes reliably across different locomotor speeds is inherently challenging, regardless of the algorithm being employed. Second, because the results of spike-train analyses can be highly sensitive to sorting errors, we have only included the motor units that we can sort with the highest possible confidence across thousands of strides.

      Minor:

      Figure captions especially Figure 6: The text is excessively long. Can the text be shortened?

      We thank the Reviewer for this comment. Generally, we seek to include a description of the methods and results within the figure captions, but we concede that we can condense the information in some cases. In a number of cases, we have moved some of the descriptive text from the caption to the Methods section.

      References

      Berg, R. W. (2017). Neuronal Population Activity in Spinal Motor Circuits: Greater Than the Sum of Its Parts. Frontiers in Neural Circuits, 11. https://doi.org/10.3389/fncir.2017.00103

      Biewener, A. A., Blickhan, R., Perry, A. K., Heglund, N. C., & Taylor, C. R. (1988). Muscle Forces During Locomotion in Kangaroo Rats: Force Platform and Tendon Buckle Measurements Compared. Journal of Experimental Biology, 137(1), 191–205. https://doi.org/10.1242/jeb.137.1.191

      Chung, B., Zia, M., Thomas, K. A., Michaels, J. A., Jacob, A., Pack, A., Williams, M. J., Nagapudi, K., Teng, L. H., Arrambide, E., Ouellette, L., Oey, N., Gibbs, R., Anschutz, P., Lu, J., Wu, Y., Kashefi, M., Oya, T., Kersten, R., … Sober, S. J. (2023). Myomatrix arrays for high-definition muscle recording. eLife, 12, RP88551. https://doi.org/10.7554/eLife.88551

      De Luca, C. J. (1985). Control properties of motor units. Journal of Experimental Biology, 115(1), 125–136. https://doi.org/10.1242/jeb.115.1.125

      De Luca, C. J., & Erim, Z. (1994). Common drive of motor units in regulation of muscle force. Trends in Neurosciences, 17(7), 299–305. https://doi.org/10.1016/0166-2236(94)90064-7

      Farina, D., Negro, F., & Dideriksen, J. L. (2014). The effective neural drive to muscles is the common synaptic input to motor neurons. The Journal of Physiology, 592(16), 3427–3441. https://doi.org/10.1113/jphysiol.2014.273581

      Hartigan, P. M. (1985). Algorithm AS 217: Computation of the Dip Statistic to Test for Unimodality. Applied Statistics, 34(3), 320. https://doi.org/10.2307/2347485

      Henneman, E., Somjen, G., & Carpenter, D. O. (1965). FUNCTIONAL SIGNIFICANCE OF CELL SIZE IN SPINAL MOTONEURONS. Journal of Neurophysiology, 28(3), 560–580. https://doi.org/10.1152/jn.1965.28.3.560

      Karabulut, D., Dogru, S. C., Lin, Y.-C., Pandy, M. G., Herzog, W., & Arslan, Y. Z. (2020). Direct Validation of Model-Predicted Muscle Forces in the Cat Hindlimb During Locomotion. Journal of Biomechanical Engineering, 142(5), 051014. https://doi.org/10.1115/1.4045660

      Kim, J. J., Wyche, I. S., Olson, W., Lu, J., Bakir, M. S., Sober, S. J., & O’Connor, D. H. (2024). Myo-optogenetics: Optogenetic stimulation and electrical recording in skeletal muscles. https://doi.org/10.1101/2024.06.21.600113

      Lu, J., Zia, M., Baig, D. A., Yan, G., Kim, J. J., Nagapudi, K., Anschutz, P., Oh, S., O’Connor, D., Sober, S. J., & Bakir, M. S. (2024). Opto-Myomatrix: μLED integrated microelectrode arrays for optogenetic activation and electrical recording in muscle tissue. https://doi.org/10.1101/2024.07.01.601601

      Manuel, M., & Heckman, C. J. (2011). Adult mouse motor units develop almost all of their force in the subprimary range: A new all-or-none strategy for force recruitment? Journal of Neuroscience, 31(42), 15188–15194. https://doi.org/10.1523/JNEUROSCI.2893-11.2011

      Marshall, N. J., Glaser, J. I., Trautmann, E. M., Amematsro, E. A., Perkins, S. M., Shadlen, M. N., Abbott, L. F., Cunningham, J. P., & Churchland, M. M. (2022). Flexible neural control of motor units. Nature Neuroscience, 25(11), 1492–1504. https://doi.org/10.1038/s41593-022-01165-8

      Martínez-Silva, M. de L., Imhoff-Manuel, R. D., Sharma, A., Heckman, C. J., Shneider, N. A., Roselli, F., Zytnicki, D., & Manuel, M. (2018). Hypoexcitability precedes denervation in the large fast-contracting motor units in two unrelated mouse models of ALS. eLife, 7(2007), 1–26. https://doi.org/10.7554/eLife.30955

      Miles, G. B., & Sillar, K. T. (2011). Neuromodulation of Vertebrate Locomotor Control Networks. Physiology, 26(6), 393–411. https://doi.org/10.1152/physiol.00013.2011

      Petersen, P. C., & Berg, R. W. (2016). Lognormal firing rate distribution reveals prominent fluctuation–driven regime in spinal motor networks. eLife, 5. https://doi.org/10.7554/elife.18805

      Srivastava, K. H., Elemans, C. P. H., & Sober, S. J. (2015). Multifunctional and Context-Dependent Control of Vocal Acoustics by Individual Muscles. The Journal of Neuroscience, 35(42), 14183–14194. https://doi.org/10.1523/JNEUROSCI.3610-14.2015

      Srivastava, K. H., Holmes, C. M., Vellema, M., Pack, A. R., Elemans, C. P. H., Nemenman, I., & Sober, S. J. (2017). Motor control by precisely timed spike patterns. Proceedings of the National Academy of Sciences of the United States of America, 114(5), 1171–1176. https://doi.org/10.1073/pnas.1611734114